WO2019232821A1 - Method for processing risk control data, device, computer apparatus, and storage medium - Google Patents

Method for processing risk control data, device, computer apparatus, and storage medium Download PDF

Info

Publication number
WO2019232821A1
WO2019232821A1 PCT/CN2018/092327 CN2018092327W WO2019232821A1 WO 2019232821 A1 WO2019232821 A1 WO 2019232821A1 CN 2018092327 W CN2018092327 W CN 2018092327W WO 2019232821 A1 WO2019232821 A1 WO 2019232821A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
event
scene
information
wind control
Prior art date
Application number
PCT/CN2018/092327
Other languages
French (fr)
Chinese (zh)
Inventor
张小敏
李云利
李辉
石宇
张文君
朱小冬
陈晶
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019232821A1 publication Critical patent/WO2019232821A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities

Definitions

  • the present application relates to the field of computer technology, and in particular, to a method, device, computer equipment, and storage medium for risk control data processing.
  • a risk control data processing method includes:
  • the wind control data includes scene information, scene events corresponding to each of the scene information, and rules containing wind control rules corresponding to each of the scene events A set, and a wind control hit rate corresponding to each of the wind control rules;
  • each of the target data calculate an information similarity between the scene information in the target data and the scene information in each of the basic data, and if the information similarity is greater than or equal to a preset similarity threshold, Then obtain the basic data where the scene information corresponding to the information similarity is located, and use the basic data as the data to be recommended;
  • a wind control data processing device includes:
  • a data acquisition module is configured to acquire wind control data of each professional company in the wind control database, where the wind control data includes scene information, a scene event corresponding to each of the scene information, and a corresponding event of each of the scene events.
  • a rule set including a risk control rule, and a risk control hit rate corresponding to each of the risk control rules;
  • a data screening module configured to select, from the wind control data, wind control data in which the rule set is empty as target data, and use the wind control data in which the rule set is not empty as basic data;
  • An information calculation module is configured to calculate, for each of the target data, information similarity between scene information in the target data and scene information in each of the basic data, and if the information similarity is greater than or equal to If the similarity threshold is set, the basic data where the scene information corresponding to the information similarity is located is used as the data to be recommended;
  • An event calculation module configured to calculate an event similarity between a scene event in each of the target data and a scene event in each of the to-be-recommended data
  • the rule recommendation module is configured to obtain the data to be recommended corresponding to the event similarity with the largest value among the event similarities, and recommend the risk control rule in the data to be recommended to the professional company corresponding to the target data.
  • a computer device includes a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, and the processor implements the above-mentioned wind control data processing when the processor executes the computer-readable instructions Method steps.
  • One or more non-volatile readable storage media storing computer-readable instructions, which, when executed by one or more processors, cause the one or more processors to implement the above-mentioned Control the steps of the data processing method.
  • FIG. 1 is a schematic diagram of an application environment of a wind control data processing method according to an embodiment of the present application
  • FIG. 2 is a flowchart of a wind control data processing method according to an embodiment of the present application.
  • FIG. 3 is a flowchart of acquiring wind control data in a wind control data processing method according to an embodiment of the present application
  • step S5 is an implementation flowchart of step S5 in the wind control data processing method according to an embodiment of the present application
  • FIG. 5 is a flowchart of updating a wind control database in a wind control data processing method according to an embodiment of the present application
  • FIG. 6 is a schematic diagram of a wind control data processing device according to an embodiment of the present application.
  • FIG. 7 is a schematic diagram of a computer device in an embodiment of the present application.
  • FIG. 1 shows an application environment provided by an embodiment of the present application.
  • the application environment includes a server and a client.
  • the server and the client are connected through a network, and the client is used to receive a recommendation request sent by a professional company. And send the received recommendation request from the professional company to the server.
  • the client can specifically but not limited to various personal computers, laptops, smartphones, tablets and portable wearable devices; the server is used to carry out risk control data.
  • the server can be implemented by an independent server or a server cluster composed of multiple servers.
  • the risk control data processing method provided in the embodiment of the present application is applied to a server.
  • FIG. 2 illustrates an implementation process of a risk control data processing method provided by this embodiment. Details are as follows:
  • S1 Obtain the wind control data of each professional company in the wind control database, where the wind control data includes scene information, scene events corresponding to each scene information, a rule set containing wind control rules corresponding to each scene event, and each The risk control hit rate corresponding to each risk control rule.
  • the method for obtaining the wind control data may be specifically obtained from the wind control database in real time, or may be obtained from the wind control database at predetermined intervals, and may be specifically set according to actual application requirements.
  • the wind control data includes scene information, scene events, rule sets, and wind control hit ratios.
  • the correspondence between the scene information and the scene events can be a one-to-one correspondence or a one-to-many relationship.
  • the rule set is a one-to-one relationship; the rule set can be empty or it can contain one or more risk control rules; each risk control rule has a one-to-one correspondence with the risk control hit rate, and the risk control hit rate is The number of times a scene event hits its corresponding risk control rule as a percentage of the total number of occurrences of the scene event is used to indicate the hit degree of each scene event.
  • scene information "login” corresponds to a scene event "login transaction interface of user IP”
  • a rule set corresponding to this scene event has a wind
  • the control rule is "the number of consecutive logins of a user IP at the same time, less than or equal to the preset number, otherwise the user IP login is restricted.”
  • the risk control data with the empty rule set is selected from the risk control data as the target data, and the wind control data with the non-empty rule set is used as the basic data.
  • the target data is wind control data without a wind control rule configured
  • the basic data is wind control data with a wind control rule configured.
  • the rule set in the wind control data obtained in step S1 is traversed. If the rule set is retrieved as empty, the risk control data in which the rule set is located is marked as the target data. If the rule set retrieved is not empty, then The risk control data where the rule set is located is marked as the basic data.
  • S3 For each target data, calculate the information similarity between the scene information in the target data and the scene information in each basic data. If the information similarity is greater than or equal to a preset similarity threshold, obtain the information The basic data where the scene information corresponding to the similarity is located, and the basic data is used as the data to be recommended.
  • the information similarity can be specifically obtained by calculating a ratio between the number of times the words in the scene information in the base data appear in the scene information in the target data and the total number of words in the scene information in the target data.
  • the process of calculating information similarity is detailed as follows:
  • the scene information of a basic data is "discount offer”
  • the vocabulary set of the basic scene information obtained by segmenting this scene information is unit1: discount / offer / activity
  • the scene information in a target data is "promotional big reward event” "
  • the target scene information vocabulary set after segmenting the scene information in the target data is unit2: promotion / discount / big reward / activity
  • the total number of times the word in unit1 appears in unit2 is 2
  • the target scene information vocabulary set The total number of words is 4, and the ratio of the total number of times to the total number of words in the target scene information vocabulary set is 50%, that is, the information similarity between the scene information and the scene information of the target data is 50%.
  • the calculated information similarity is greater than or equal to a preset similarity threshold, the basic data where the scene information corresponding to the information similarity is located is used as the data to be recommended.
  • the preset similarity threshold is 50%.
  • the similarity between the information of the basic data's scene information is "discount offers" and the target data's scene information is "promotional offers and big reward events”. If the similarity of the information is equal to a preset similarity threshold, the basic data where the scene information is located is obtained, and the basic data is used as the data to be recommended.
  • the word segmentation processing can use a tool with a word segmentation function, which is not limited here.
  • the preset similarity threshold may be specifically set according to actual service requirements, and is not limited here. Calculating information similarity can also be calculated using Jaccard's similarity formula, or other similarity calculation methods, which are not limited here.
  • the target scene event vocabulary set is deduplicated and merged with each basic scene event vocabulary set to obtain the scene event in the target data and the total event vocabulary set after each basic scene event is merged. For example, suppose the target data A contains 1 10 scene events, according to the target data A obtained in step S3, there are a total of 10 scene events to be recommended, and then 10 total event vocabulary sets can be obtained;
  • a basic scenario event is "login transaction interface of user IP”
  • the basic scenario event vocabulary set obtained by segmenting this basic scenario event is unit3: user / IP // login / transaction / interface, a scenario in the target data
  • the event is "users use the account and password to log in to the trading interface”
  • the target event event vocabulary set after segmenting the scene events in the target data is unit4: user / use / account / password / login / transaction / interface, go to unit3 and unit4 Repeat the merge to get the target scene event vocabulary set and the basic scene event vocabulary set
  • the combined total event vocabulary set unit5 user / IP // login / transaction / interface / use / account / password / login, calculate the words in unit3 Word frequency in unit5: User 1, IP1, 1, Login 1, Transaction 1, Interface 1, Use 0, Account 0, Password 0, Login 0, Calculate the word frequency of words in unit4 in unit5: User 1, IP0, 0, login 0, transaction 1, interface 1, use 1, account 1, password 1, login 1, the word frequency vector of the basic scene
  • cosine similarity formula is Among them, x i represents the component of the word frequency vector of the basic scene event vocabulary set in its corresponding total event vocabulary set, y i represents the component of the word frequency vector of the target scene event vocabulary set in each total event vocabulary set, and p represents the cosine Similarity value, n represents the total number of words in the total event vocabulary set.
  • simhash mass similarity algorithm to calculate, or other similarity calculation methods, which are not limited here.
  • S5 Obtain the data to be recommended corresponding to the event similarity with the largest value in the event similarity, and recommend the risk control rules in the data to be recommended to the professional company corresponding to the target data.
  • step S4 from the event similarity between the scene event in the target data and the scene event of each to-be-recommended data calculated in step S4, the event similarity with the largest value is selected to obtain the to-be-recommended data corresponding to the event similarity And recommend the risk control rules in the data to be recommended to the professional company corresponding to the target data.
  • the recommended method may specifically be email or instant messaging, or other methods, and the specific method may be set according to actual application requirements, and is not limited here.
  • the wind control data with an empty rule set is selected, and the target data without the risk control rule is determined, and the rule set is not empty.
  • the risk control data is used as the basic data, which is beneficial to the subsequent targeted search for an adapted risk control rule for the target data.
  • the scene information in each target data and each basic data are calculated. The information similarity between the scene information of the. If the information similarity is greater than or equal to a preset similarity threshold, the basic data where the scene information corresponding to the information similarity is located is used as the data to be recommended.
  • the data to be recommended can find the scope and direction of the risk control rules adapted to the target data, which is conducive to subsequent selection of the best recommendation results from the data to be recommended.
  • the method of calculating similarity is simple, fast, and easy to operate. Further, Calculate the event similarity between the scene events in each target data and the scene events in each to-be-recommended data to obtain the number
  • the data to be recommended corresponding to the event similarity with the largest value, and the risk control rules in the data to be recommended are recommended to the professional company corresponding to the target data.
  • the risk control data processing method further includes the following steps:
  • a risk control recommendation request from a professional company whose risk control data is not included in the risk control database, obtains the risk control data of the professional company, and saves the risk control data as updated target data to the risk control. Database.
  • a risk control recommendation request is received from a professional company whose risk control data is not included in the risk control database.
  • an adapted company can be found in the risk control database for the professional company. Risk control rules, and recommend the adapted risk control rules to the professional company, reflecting practicality and broad applicability.
  • the method for processing wind control data further includes the following steps:
  • S7 Monitor each scene event of each professional company. If it is detected that the scene event is triggered and the risk control rule in the rule set corresponding to the scene event is detected, then the hit information of the scene event is recorded, where the hit information includes the The time when a scene event occurs and the risk control rules that the scene event hits.
  • a scenario event hits a risk control rule in a rule set corresponding to the scenario event, which means that a situation occurring in the scenario event violates a constraint condition in the configured risk control rule, or exceeds a constraint condition in the risk control rule.
  • a scenario event is "Users can receive merchant coupons during coupon discount activities" and the configured risk control rule is "A user IP can only receive coupons once, otherwise, the use of that IP is restricted”.
  • the specific process of the event is that the user logs in the merchant software and clicks the button to receive the coupon. If it is detected that the same user IP has clicked to receive the coupon 100 times at the same time, it may be that the user used the fraud software to conduct a coupon to the merchant. Unreasonable possession, confirm that it violates the risk control rules configured for this scenario event, and will limit the use of the user IP in the merchant software, that is, the scenario event hits the risk control rule in the rule set corresponding to the scenario event.
  • each scene event of each professional company monitors each scene event of each professional company, record the occurrence time of the detected scene event when it is triggered, and when the scene event is triggered, whether to hit the risk control rule in the rule set corresponding to the scene event, and record
  • the wind control log in the wind control database can directly reflect the hit situation of each scene event, which is beneficial to the subsequent analysis and calculation of wind control data.
  • S8 Count the data amount of each scene event in a preset time period according to the hit information every predetermined time interval, where the data amount includes the number of scene event occurrences and each risk control in the rule set corresponding to the scene event The number of times the rule was hit.
  • every predetermined time interval according to the hit information recorded in the wind control log in step S7, the number of occurrences of each scene event within a preset period of time and a rule set corresponding to the scene event are counted. The number of times each risk control rule was hit.
  • step S7 For example, assuming that the preset time period is "1st to 15th in January”, continue to use the example in step S7. For example, if the statistics are obtained from “1st to 15th in January", the scenario event "Users can pick up coupons” In the discount campaign, the number of times of "Get Merchant Coupon” occurred 500 times, and the number of hits to the corresponding risk control rules was 5 times.
  • predetermined time interval and the preset time period may be specifically set according to actual service requirements, and are not limited here.
  • the wind control hit rate of each scene event is the number of times each wind control rule is hit in the rule set corresponding to the scene event within a preset time period of each scene event and the occurrence of the scene event.
  • step S8 the scenario event "users can collect merchant coupons during coupon collection discounts” has a risk control hit rate of 1% during the period of "1st to 15th of January".
  • each scene event of each professional company is monitored, the time when the detected scene event is triggered, and when the scene event is triggered, whether the risk control rule in the rule set corresponding to the scene event is hit.
  • the wind control log recorded in the wind control database can intuitively reflect the hit situation of each scene event, which is conducive to subsequent analysis and calculation of wind control data, so as to count the scene of each scene event within a preset time period.
  • the number of event occurrences and the number of times each risk control rule in the rule set corresponding to the scenario event is calculated can calculate the risk control hit rate of each scenario event, which can fully reflect the degree of risk of each scenario event.
  • step S5 the data to be recommended corresponding to the event similarity with the largest value in the event similarity is obtained, and the risk control rule in the data to be recommended is recommended to the target data.
  • the corresponding professional company includes the following steps:
  • S501 Acquire to-be-recommended data corresponding to the event similarity with the largest numerical value in the event similarity as standard data.
  • step S4 from the event similarity between the scene event of each target data and the scene event of each to-be-recommended data obtained in step S4, the event similarity with the largest numerical value is selected, and the wind corresponding to the event similarity corresponds to the wind.
  • Control data is used as standard data to be recommended to professional companies.
  • S502 From the standard data, filter out the wind control hit ratio with the largest value, and recommend the wind control rule corresponding to the wind control hit ratio to the professional company corresponding to the target data.
  • the wind control hit rate with the largest value is selected, and the wind control rule corresponding to the wind control hit rate is selected.
  • a risk control rule most suitable for the target data it is recommended to the professional company corresponding to the target data.
  • the risk control with the largest value is selected from the standard data.
  • Hit rate, and the risk control rule corresponding to the risk control hit rate is used as the risk control rule most suitable for the target data. It is recommended to the professional company corresponding to the target data.
  • the risk control hit rate can intuitively reflect its corresponding scene events. The degree of risk occurrence is conducive to improving the level of risk prevention of scene events of the target data.
  • step S5 the method for processing wind control data further includes the following steps:
  • S10 Obtain the target risk control rule confirmed by the professional company in the confirmation information according to the confirmation information fed back by the professional company corresponding to the target data.
  • the confirmation information fed back by the professional company corresponding to the target data is received, and the confirmation information includes the recommended risk control rule confirmed for use by the professional company, and the confirmed risk control rule is used as the target risk control rule.
  • S11 In the risk control database, configure the target risk control rule into a rule set corresponding to the target data.
  • the wind control data in the wind control database is updated, that is, the target wind control rule is configured into a rule set corresponding to the target data.
  • the target risk control rule confirmed by the professional company in the confirmation information is obtained, and the target risk control rule is configured in the risk control database to the target data
  • updating the wind control data can fully reflect the hit situation of the target data configured with the wind control rules.
  • the wind control data can be used to have similar or same scenarios with the target data. For other professional companies that do not have risk control rules, recommending risk control rules will help improve the level of risk prevention of each professional company.
  • a wind control data processing device corresponds to the wind control data processing method in the above embodiment in a one-to-one correspondence.
  • the risk control data processing device includes a data acquisition module 601, a data screening module 602, an information calculation module 603, an event calculation module 604, and a rule recommendation module 605.
  • the detailed description of each function module is as follows:
  • a data acquisition module 601 is used to acquire wind control data of each professional company in the wind control database, where the wind control data includes scene information, scene events corresponding to each scene information, and each scene event corresponding to a wind control rule. Rule set, and the risk control hit rate corresponding to each risk control rule;
  • the data screening module 602 is configured to screen out wind control data from which the rule set is empty as target data, and use the wind control data whose rule set is not empty as basic data;
  • An information calculation module 603 is configured to calculate, for each target data, the information similarity between the scene information in the target data and the scene information in each basic data, if the information similarity is greater than or equal to a preset similarity threshold , Obtain the basic data where the scene information corresponding to the information similarity is located, and use the basic data as the data to be recommended;
  • An event calculation module 604 configured to calculate an event similarity between a scene event in each target data and a scene event in each to-be-recommended data
  • the rule recommendation module 605 is configured to obtain the data to be recommended corresponding to the event similarity with the largest value in the event similarity, and recommend the risk control rule in the data to be recommended to the professional company corresponding to the target data.
  • the risk control data processing device further includes:
  • the request receiving module 606 is configured to update the target data according to the risk control recommendation request if a risk control recommendation request from a professional company is received.
  • the risk control data processing device further includes:
  • a hit recording module 607 is used to monitor each scene event of each professional company. If it is detected that a scene event is triggered and the risk control rule in the rule set corresponding to the scene event is detected, the hit information of the scene event is recorded, where The hit information includes the occurrence time of the scene event and the risk control rule of the scene event hit;
  • a data statistics module 608 is configured to count the data amount of each scene event in a preset time period according to the hit information at a predetermined time interval, where the data amount includes the number of scene event occurrences and a rule set corresponding to the scene event The number of times each risk control rule was hit;
  • the hit calculation module 609 is configured to calculate a wind control hit rate of each scene event according to the amount of data.
  • rule recommendation module 605 includes:
  • a data obtaining unit 6051 is configured to obtain data to be recommended corresponding to the event similarity with the largest numerical value among the event similarities as standard data;
  • a rule recommending unit 6052 is used to filter out the wind control hit ratio with the largest value from the standard data, and recommend the wind control rule corresponding to the wind control hit ratio to the professional company corresponding to the target data.
  • the risk control data processing device further includes:
  • An information feedback module 610 is configured to obtain the target risk control rule confirmed by the professional company in the confirmation information according to the confirmation information fed back by the professional company corresponding to the target data;
  • a rule configuration module 611 is configured to configure a target risk control rule in a risk control database to a rule set corresponding to the target data.
  • Each module in the above-mentioned wind control data processing device may be implemented in whole or in part by software, hardware, and a combination thereof.
  • the above-mentioned modules may be embedded in the hardware in or independent of the processor in the computer device, or may be stored in the memory of the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
  • a computer device is provided.
  • the computer device may be a server, and its internal structure diagram may be as shown in FIG. 7.
  • the computer device includes a processor, a memory, a network interface, and a database connected through a system bus.
  • the processor of the computer device is used to provide computing and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system, computer-readable instructions, and a database.
  • the internal memory provides an environment for the operation of the operating system and computer-readable instructions in a non-volatile storage medium.
  • the database of the computer equipment is used to store risk control data.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • the computer-readable instructions are executed by a processor to implement a method for processing wind control data.
  • a computer device including a memory, a processor, and computer-readable instructions stored on the memory and executable on the processor.
  • the processor executes the computer-readable instructions
  • the wind control of the foregoing embodiment is implemented.
  • the steps of the data processing method are, for example, steps S1 to S5 shown in FIG. 2.
  • the processor executes the computer-readable instructions
  • the functions of the modules / units of the wind control data processing device in the foregoing embodiment are implemented, for example, the functions of modules 601 to 605 shown in FIG. 6. To avoid repetition, we will not repeat them here.
  • a non-volatile storage medium on which computer-readable instructions are stored, and when the computer-readable instructions are executed by a processor, the wind control data processing method in the foregoing method embodiment is implemented, or the computer
  • the readable instructions are executed by the processor, the functions of each module / unit in the wind control data processing device in the above device embodiment are realized. To avoid repetition, we will not repeat them here.
  • Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory can include random access memory (RAM) or external cache memory.
  • RAM is available in various forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), dual data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain (Synchlink), DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).

Landscapes

  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Operations Research (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Educational Administration (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present application relates to the technical field of computers, and provides a method for processing risk control data, a device, a computer apparatus, and a storage medium. The method comprises: acquiring risk control data items, using risk control data items having a null rule set as target data items, and using risk control data items having a non-empty rule set as basic data items; calculating an information similarity level between scenario information in the target data items and scenario information in each of the basic data items, and acquiring the basic data item that contains the scenario information corresponding to the information similarity level as a data item to be recommended; calculating an event similarity level between a scenario event in each of the target data items and a scenario event in each data item to be recommended; and acquiring a data item to be recommended corresponding to an event similarity level having the largest numerical value, and recommending a risk control rule in the data item to be recommended to a professional company corresponding to the target data items. The invention achieves, at a low cost, timely recommendation of a risk control rule effectively preventing risks to a professional company having no configured risk control rules, thereby improving risk prevention of the professional company.

Description

风控数据处理方法、装置、计算机设备及存储介质Risk control data processing method, device, computer equipment and storage medium
本申请以2018年06月08日提交的申请号为201810585015.7,名称为“风控数据处理方法、装置、计算机设备及存储介质”的中国发明专利申请为基础,并要求其优先权。This application is based on a Chinese invention patent application filed on June 08, 2018 with application number 201810585015.7 and entitled "Risk Control Data Processing Method, Apparatus, Computer Equipment, and Storage Medium", and claims its priority.
技术领域Technical field
本申请涉及计算机技术领域,尤其涉及一种风控数据处理方法、装置、计算机设备及存储介质。The present application relates to the field of computer technology, and in particular, to a method, device, computer equipment, and storage medium for risk control data processing.
背景技术Background technique
目前,在许多网络购物、网络交易等业务场景进行的过程中,容易出现各种漏洞、风险或经济犯罪现象,往往需要针对各类业务场景类型,配置适配的风控规则,对容易高频发生风险的业务场景的情况,其更需要配置风控规则来保护业务场景安全,预防风险,减少经济损失。At present, in many online shopping, online transaction and other business scenarios, various loopholes, risks, or economic crimes are prone to occur. It is often necessary to configure appropriate risk control rules for various types of business scenarios to prevent high frequencies. In the case of a risky business scenario, it is more necessary to configure risk control rules to protect the security of the business scenario, prevent risks, and reduce economic losses.
传统技术中,通常采用人工方式对每个专业公司的各类业务场景进行针对性分析,开发相应的风控规则,但是,这种方式开发周期长,投入成本高,并且不能及时提供有效预防风险的风控规则。In traditional technology, manual methods are usually used to analyze the various business scenarios of each professional company and develop corresponding risk control rules. However, this method has a long development cycle, high input costs, and cannot provide effective risk prevention in a timely manner. Risk control rules.
发明内容Summary of the Invention
基于此,有必要针对上述技术问题,提供一种可以低成本的将有效预防风险的风控规则及时推荐给没有配置风控规则的专业公司,提高专业公司的风险预防水平的风控数据处理方法、装置、计算机设备及存储介质。Based on this, it is necessary to provide a risk control data processing method that can timely recommend risk control rules that effectively prevent risks to professional companies that are not equipped with risk control rules and improve the risk prevention level of professional companies in response to the above technical problems. , Devices, computer equipment and storage media.
一种风控数据处理方法,包括:A risk control data processing method includes:
获取风控数据库中每个专业公司的风控数据,其中,所述风控数据包括场景信息,每个所述场景信息对应的场景事件,每个所述场景事件对应的包含风控规则的规则集合,以及每个所述风控规则对应的风控命中率;Obtain the wind control data of each professional company in the wind control database, where the wind control data includes scene information, scene events corresponding to each of the scene information, and rules containing wind control rules corresponding to each of the scene events A set, and a wind control hit rate corresponding to each of the wind control rules;
从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据;Selecting, from the risk control data, risk control data in which the rule set is empty as target data, and using the risk control data in which the rule set is not empty as basic data;
针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景 信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据;For each of the target data, calculate an information similarity between the scene information in the target data and the scene information in each of the basic data, and if the information similarity is greater than or equal to a preset similarity threshold, Then obtain the basic data where the scene information corresponding to the information similarity is located, and use the basic data as the data to be recommended;
计算每个所述目标数据中的场景事件与每个所述待推荐数据中的场景事件之间的事件相似度;Calculating an event similarity between a scene event in each of the target data and a scene event in each of the to-be-recommended data;
获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司。Acquire the to-be-recommended data corresponding to the event similarity with the largest value among the event similarities, and recommend the risk control rules in the to-be-recommended data to the professional company corresponding to the target data.
一种风控数据处理装置,包括:A wind control data processing device includes:
数据获取模块,用于获取风控数据库中每个专业公司的风控数据,其中,所述风控数据包括场景信息,每个所述场景信息对应的场景事件,每个所述场景事件对应的包含风控规则的规则集合,以及每个所述风控规则对应的风控命中率;A data acquisition module is configured to acquire wind control data of each professional company in the wind control database, where the wind control data includes scene information, a scene event corresponding to each of the scene information, and a corresponding event of each of the scene events. A rule set including a risk control rule, and a risk control hit rate corresponding to each of the risk control rules;
数据筛选模块,用于从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据;A data screening module, configured to select, from the wind control data, wind control data in which the rule set is empty as target data, and use the wind control data in which the rule set is not empty as basic data;
信息计算模块,用于针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据;An information calculation module is configured to calculate, for each of the target data, information similarity between scene information in the target data and scene information in each of the basic data, and if the information similarity is greater than or equal to If the similarity threshold is set, the basic data where the scene information corresponding to the information similarity is located is used as the data to be recommended;
事件计算模块,用于计算每个所述目标数据中的场景事件与每个所述待推荐数据中的场景事件之间的事件相似度;An event calculation module, configured to calculate an event similarity between a scene event in each of the target data and a scene event in each of the to-be-recommended data;
规则推荐模块,用于获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司。The rule recommendation module is configured to obtain the data to be recommended corresponding to the event similarity with the largest value among the event similarities, and recommend the risk control rule in the data to be recommended to the professional company corresponding to the target data.
一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现上述风控数据处理方法的步骤。A computer device includes a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, and the processor implements the above-mentioned wind control data processing when the processor executes the computer-readable instructions Method steps.
一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行时实现上述风控数据处理方法的步骤。One or more non-volatile readable storage media storing computer-readable instructions, which, when executed by one or more processors, cause the one or more processors to implement the above-mentioned Control the steps of the data processing method.
本申请的一个或多个实施例的细节在下面的附图和描述中提出,本申请的其他特征和优点将从说明书、附图以及权利要求变得明显。Details of one or more embodiments of the present application are set forth in the accompanying drawings and description below, and other features and advantages of the present application will become apparent from the description, the drawings, and the claims.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例的描述中所需要 使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the drawings used in the description of the embodiments of the application will be briefly introduced below. Obviously, the drawings in the following description are just some embodiments of the application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without paying creative labor.
图1是本申请一实施例中风控数据处理方法的一应用环境示意图;FIG. 1 is a schematic diagram of an application environment of a wind control data processing method according to an embodiment of the present application;
图2是本申请一实施例中风控数据处理方法的一流程图;2 is a flowchart of a wind control data processing method according to an embodiment of the present application;
图3是本申请一实施例中风控数据处理方法中对风控数据进行获取的一流程图;3 is a flowchart of acquiring wind control data in a wind control data processing method according to an embodiment of the present application;
图4是本申请一实施例中风控数据处理方法中步骤S5的一实现流程图;4 is an implementation flowchart of step S5 in the wind control data processing method according to an embodiment of the present application;
图5是本申请一实施例中风控数据处理方法中对风控数据库进行更新的一流程图;5 is a flowchart of updating a wind control database in a wind control data processing method according to an embodiment of the present application;
图6是本申请一实施例中风控数据处理装置的一示意图;6 is a schematic diagram of a wind control data processing device according to an embodiment of the present application;
图7是本申请一实施例中计算机设备的一示意图。FIG. 7 is a schematic diagram of a computer device in an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In the following, the technical solutions in the embodiments of the present application will be clearly and completely described with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of this application.
图1示出了本申请实施例提供的应用环境,该应用环境包括服务端和客户端,其中,服务端和客户端之间通过网络进行连接,客户端用于接收专业公司发送的推荐请求,并且将接收到的专业公司的推荐请求发送到服务端,客户端具体可以但不限于各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备;服务端用于对风控数据进行处理,服务端具体可以用独立的服务器或者多个服务器组成的服务器集群实现。本申请实施例提供的风控数据处理方法应用于服务端。FIG. 1 shows an application environment provided by an embodiment of the present application. The application environment includes a server and a client. The server and the client are connected through a network, and the client is used to receive a recommendation request sent by a professional company. And send the received recommendation request from the professional company to the server. The client can specifically but not limited to various personal computers, laptops, smartphones, tablets and portable wearable devices; the server is used to carry out risk control data. For processing, the server can be implemented by an independent server or a server cluster composed of multiple servers. The risk control data processing method provided in the embodiment of the present application is applied to a server.
请参阅图2,图2示出本实施例提供的风控数据处理方法的实现流程。详述如下:Please refer to FIG. 2, which illustrates an implementation process of a risk control data processing method provided by this embodiment. Details are as follows:
S1:获取风控数据库中每个专业公司的风控数据,其中,风控数据包括场景信息,每个场景信息对应的场景事件,每个场景事件对应的包含风控规则的规则集合,以及每个风控规则对应的风控命中率。S1: Obtain the wind control data of each professional company in the wind control database, where the wind control data includes scene information, scene events corresponding to each scene information, a rule set containing wind control rules corresponding to each scene event, and each The risk control hit rate corresponding to each risk control rule.
在本实施例中,获取风控数据的方式具体可以是实时从风控数据库中获取,也可以是每隔预定的时间从风控数据库中获取,具体可以根据实际应用的需要进行设置,此处不做限制。In this embodiment, the method for obtaining the wind control data may be specifically obtained from the wind control database in real time, or may be obtained from the wind control database at predetermined intervals, and may be specifically set according to actual application requirements. Here, No restrictions.
具体地,风控数据包括场景信息、场景事件、规则集合和风控命中率,其中,场景信息与场景事件的对应关系可以是一一对应的关系,也可以是一对多的关系;场景事件与规 则集合是一一对应的关系;规则集合可以为空,也可以包含一个或多个风控规则;每个风控规则与风控命中率是一一对应的关系,风控命中率是每个场景事件命中其对应的风控规则的次数占该场景事件发生的总次数的百分比,用于表示每个场景事件的命中程度。Specifically, the wind control data includes scene information, scene events, rule sets, and wind control hit ratios. The correspondence between the scene information and the scene events can be a one-to-one correspondence or a one-to-many relationship. The rule set is a one-to-one relationship; the rule set can be empty or it can contain one or more risk control rules; each risk control rule has a one-to-one correspondence with the risk control hit rate, and the risk control hit rate is The number of times a scene event hits its corresponding risk control rule as a percentage of the total number of occurrences of the scene event is used to indicate the hit degree of each scene event.
例如,专业公司A的风控数据中,有一个场景信息“登录”,该场景信息“登录”对应有一个场景事件“用户IP的登录交易界面”,该场景事件对应的规则集合中有一个风控规则“一个用户IP,在同一时间可以连续登录的次数,小于等于预设次数,否则限制该用户IP登录”。For example, in the risk control data of professional company A, there is scene information "login", and the scene information "login" corresponds to a scene event "login transaction interface of user IP", and a rule set corresponding to this scene event has a wind The control rule is "the number of consecutive logins of a user IP at the same time, less than or equal to the preset number, otherwise the user IP login is restricted."
S2:从风控数据中筛选出规则集合为空的风控数据,作为目标数据,并将规则集合不为空的风控数据作为基础数据。S2: The risk control data with the empty rule set is selected from the risk control data as the target data, and the wind control data with the non-empty rule set is used as the basic data.
在本实施例中,目标数据是没有配置风控规则的风控数据,基础数据是已经配置有风控规则的风控数据。In this embodiment, the target data is wind control data without a wind control rule configured, and the basic data is wind control data with a wind control rule configured.
具体地,遍历步骤S1中获取的风控数据中的规则集合,若检索到规则集合为空,则将该规则集合所在的风控数据标记为目标数据,若检索到规则集合不为空,则将该规则集合所在的风控数据标记为基础数据。Specifically, the rule set in the wind control data obtained in step S1 is traversed. If the rule set is retrieved as empty, the risk control data in which the rule set is located is marked as the target data. If the rule set retrieved is not empty, then The risk control data where the rule set is located is marked as the basic data.
从获取到的大量风控数据中,明确区分出没有配置风控规则的目标数据,和已经配置了风控规则基础数据,有利于后续在基础数据中,寻找目标数据适配的风控规则。From the obtained large amount of risk control data, it is clearly distinguished between target data that has not been configured with risk control rules, and basic data that has been configured with risk control rules, which is conducive to the subsequent search of target data to adapt to the risk control rules in the basic data.
S3:针对每个目标数据,计算该目标数据中的场景信息与每个基础数据中的场景信息之间的信息相似度,若信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据。S3: For each target data, calculate the information similarity between the scene information in the target data and the scene information in each basic data. If the information similarity is greater than or equal to a preset similarity threshold, obtain the information The basic data where the scene information corresponding to the similarity is located, and the basic data is used as the data to be recommended.
在本实施例中,信息相似度具体可以通过计算基础数据中的场景信息中的单词在目标数据中的场景信息中出现的次数与目标数据中的场景信息的单词总个数之间的比值得到,计算信息相似度的过程详述如下:In this embodiment, the information similarity can be specifically obtained by calculating a ratio between the number of times the words in the scene information in the base data appear in the scene information in the target data and the total number of words in the scene information in the target data. The process of calculating information similarity is detailed as follows:
分别对目标数据中的场景信息和每个基础数据中的场景信息进行分词处理,得到目标场景信息词汇集合和每个基础场景信息词汇集合,若基础场景信息词汇集合中的单词在目标场景信息词汇集合中出现一次,则标记为“1”,出现两次则标记为“2”,同理,出现N次则标记为“N”,没有出现,则标记为“0”,最终获取每个基础场景信息词汇集合中的单词在目标场景信息词汇集合中出现的总次数,并计算每个总次数与目标场景信息词汇集合中单词总个数的比值。示例如下:Perform segmentation processing on the scene information in the target data and the scene information in each basic data to obtain the target scene information vocabulary set and each basic scene information vocabulary set. If the words in the basic scene information vocabulary set are in the target scene information vocabulary If it appears once in the set, it will be marked as "1", if it appears twice, it will be marked as "2". Similarly, if it appears N times, it will be marked as "N". If it does not appear, it will be marked as "0". The total number of times a word in the scene information vocabulary appears in the target scene information vocabulary set, and the ratio of each total number of times to the total number of words in the target scene information vocabulary set is calculated. Examples are as follows:
假设一个基础数据的场景信息为“折扣优惠活动”,对这个场景信息进行分词得到的基础场景信息词汇集合为unit1:折扣/优惠/活动,一个目标数据中的场景信息为“促销优 惠大酬宾活动”,对目标数据中的场景信息分词后的目标场景信息词汇集合为unit2:促销/优惠/大酬宾/活动,则unit1中的单词在unit2中出现的总次数为2,目标场景信息词汇集合中单词总个数为4,该总次数与目标场景信息词汇集合中单词总个数的比值为50%,即该场景信息与目标数据的场景信息之间的信息相似度为50%。Assume that the scene information of a basic data is "discount offer", and the vocabulary set of the basic scene information obtained by segmenting this scene information is unit1: discount / offer / activity, and the scene information in a target data is "promotional big reward event" ", The target scene information vocabulary set after segmenting the scene information in the target data is unit2: promotion / discount / big reward / activity, then the total number of times the word in unit1 appears in unit2 is 2, and the target scene information vocabulary set The total number of words is 4, and the ratio of the total number of times to the total number of words in the target scene information vocabulary set is 50%, that is, the information similarity between the scene information and the scene information of the target data is 50%.
具体地,若计算得到的信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据。Specifically, if the calculated information similarity is greater than or equal to a preset similarity threshold, the basic data where the scene information corresponding to the information similarity is located is used as the data to be recommended.
例如,假设预设的相似度阈值为50%,沿用上述示例,基础数据的场景信息为“折扣优惠活动”与目标数据的场景信息为“促销优惠大酬宾活动”的信息相似度为50%,该信息相似度等于预设的相似度阈值,则获取该场景信息所在的基础数据,并将该基础数据作为待推荐数据。For example, suppose the preset similarity threshold is 50%. Following the above example, the similarity between the information of the basic data's scene information is "discount offers" and the target data's scene information is "promotional offers and big reward events". If the similarity of the information is equal to a preset similarity threshold, the basic data where the scene information is located is obtained, and the basic data is used as the data to be recommended.
需要说明的是,分词处理可以采用具有分词功能的工具,此处不做限制。预设的相似度阈值具体可以根据实际业务需求进行设置,此处不做限制。计算信息相似度还可以采用杰卡德相似度公式进行计算,或是其他相似度计算方式,此处不做限制。It should be noted that the word segmentation processing can use a tool with a word segmentation function, which is not limited here. The preset similarity threshold may be specifically set according to actual service requirements, and is not limited here. Calculating information similarity can also be calculated using Jaccard's similarity formula, or other similarity calculation methods, which are not limited here.
S4:计算每个目标数据中的场景事件与每个待推荐数据中的场景事件之间的事件相似度。S4: Calculate the event similarity between the scene event in each target data and the scene event in each to-be-recommended data.
在本实施例中,计算事件相似度的过程详述如下:In this embodiment, the process of calculating the event similarity is detailed as follows:
对目标数据中的场景事件和每个基础数据中的场景事件进行分词处理,得到目标场景事件词汇集合和每个基础场景事件词汇集合;Perform word segmentation on the scene events in the target data and the scene events in each basic data to obtain the target scene event vocabulary set and each basic scene event vocabulary set;
将目标场景事件词汇集合分别和每个基础场景事件词汇集合,进行去重合并,得到目标数据中的场景事件和每个基础场景事件合并后的总事件词汇集合,例如,假设目标数据A包含1个场景事件,根据步骤S3得到的目标数据A对应的待推荐数据的场景事件共有10个,则可得到10个总事件词汇集合;The target scene event vocabulary set is deduplicated and merged with each basic scene event vocabulary set to obtain the scene event in the target data and the total event vocabulary set after each basic scene event is merged. For example, suppose the target data A contains 1 10 scene events, according to the target data A obtained in step S3, there are a total of 10 scene events to be recommended, and then 10 total event vocabulary sets can be obtained;
计算目标场景事件词汇集合中的每个单词,在每个总事件词汇集合中的词频,得到目标场景事件词汇集合在每个总事件词汇集合中的词频向量,以及计算每个基础场景事件词汇集合中的每个单词,在其对应的总事件词汇集合中的词频,得到每个基础场景事件词汇集合各自的词频向量;Calculate each word in the target scene event vocabulary set, the word frequency in each total event vocabulary set, get the word frequency vector of the target scene event vocabulary set in each total event vocabulary set, and calculate the basic scene event vocabulary set For each word in the set, the word frequency in its corresponding total event vocabulary set is used to obtain the word frequency vector of each basic scene event vocabulary set;
计算目标场景事件词汇集合的词频向量和每个基础场景事件词汇集合的词频向量之间的余弦相似度,余弦相似度的值越大则表示该值对应的场景事件与目标数据中的场景事件越相似,该余弦相似度即事件相似度。Calculate the cosine similarity between the word frequency vector of the target scene event vocabulary set and the word frequency vector of each base scene event vocabulary set. The larger the value of the cosine similarity, the more the scene event corresponding to the value and the scene event in the target data are. Similar, the cosine similarity is the event similarity.
为了更好地理解本步骤,现举例说明如下:To better understand this step, here is an example:
假设一个基础场景事件为“用户IP的登录交易界面”,对这个基础场景事件进行分词得到的基础场景事件词汇集合为unit3:用户/IP/的/登录/交易/界面,一个目标数据中的场景事件为“用户使用账号密码登入交易界面”,对目标数据中的场景事件分词后的目标场景事件词汇集合为unit4:用户/使用/账号/密码/登入/交易/界面,将unit3和unit4进行去重合并,得到目标场景事件词汇集合和基础场景事件词汇集合,合并后的总事件词汇集合unit5:用户/IP/的/登录/交易/界面/使用/账号/密码/登入,计算unit3中的单词在unit5的词频:用户1,IP1,的1,登录1,交易1,界面1,使用0,账号0,密码0,登入0,计算unit4中的单词在unit5的词频:用户1,IP0,的0,登录0,交易1,界面1,使用1,账号1,密码1,登入1,可以得到基础场景事件词汇集合的词频向量为(1,1,1,1,1,1,0,0,0,0),目标场景事件词汇集合的词频向量为(1,0,0,0,1,1,1,1,1,1),根据余弦相似度公式,计算目标场景事件词汇集合的词频向量和基础场景事件词汇集合的词频向量间的余弦相似度,得到的余弦值为0.463,即该场景事件与该目标数据中的场景事件之间的事件相似度为0.463。Assume that a basic scenario event is "login transaction interface of user IP", and the basic scenario event vocabulary set obtained by segmenting this basic scenario event is unit3: user / IP // login / transaction / interface, a scenario in the target data The event is "users use the account and password to log in to the trading interface", and the target event event vocabulary set after segmenting the scene events in the target data is unit4: user / use / account / password / login / transaction / interface, go to unit3 and unit4 Repeat the merge to get the target scene event vocabulary set and the basic scene event vocabulary set, and the combined total event vocabulary set unit5: user / IP // login / transaction / interface / use / account / password / login, calculate the words in unit3 Word frequency in unit5: User 1, IP1, 1, Login 1, Transaction 1, Interface 1, Use 0, Account 0, Password 0, Login 0, Calculate the word frequency of words in unit4 in unit5: User 1, IP0, 0, login 0, transaction 1, interface 1, use 1, account 1, password 1, login 1, the word frequency vector of the basic scene event vocabulary set is (1,1,1,1,1,1,0, 0,0,0), the word frequency vector of the target scene event vocabulary set is (1,0,0,0,1,1,1,1,1,1), and the target scene event vocabulary set is calculated according to the cosine similarity formula The cosine similarity between the word frequency vector of the vector and the word frequency vector of the base scene event vocabulary set, and the cosine value obtained is 0.463, that is, the event similarity between the scene event and the scene event in the target data is 0.463.
需要说明的是,余弦相似度公式为
Figure PCTCN2018092327-appb-000001
其中,x i表示基础场景事件词汇集合在其对应的总事件词汇集合中的词频向量的分量,y i表示目标场景事件词汇集合在每个总事件词汇集合中的词频向量的分量,p表示余弦相似度的值,n表示总事件词汇集合中的单词的总个数,计算事件相似度还可以采用simhash海量相似度算法进行计算,或是其他相似度计算方式,此处不做限制。
It should be noted that the cosine similarity formula is
Figure PCTCN2018092327-appb-000001
Among them, x i represents the component of the word frequency vector of the basic scene event vocabulary set in its corresponding total event vocabulary set, y i represents the component of the word frequency vector of the target scene event vocabulary set in each total event vocabulary set, and p represents the cosine Similarity value, n represents the total number of words in the total event vocabulary set. To calculate the event similarity, you can also use the simhash mass similarity algorithm to calculate, or other similarity calculation methods, which are not limited here.
S5:获取事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给目标数据对应的专业公司。S5: Obtain the data to be recommended corresponding to the event similarity with the largest value in the event similarity, and recommend the risk control rules in the data to be recommended to the professional company corresponding to the target data.
具体地,从步骤S4中计算得到目标数据中的场景事件与每个待推荐数据的场景事件之间的事件相似度中,选择数值最大的事件相似度,获取该事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给目标数据对应的专业公司。Specifically, from the event similarity between the scene event in the target data and the scene event of each to-be-recommended data calculated in step S4, the event similarity with the largest value is selected to obtain the to-be-recommended data corresponding to the event similarity And recommend the risk control rules in the data to be recommended to the professional company corresponding to the target data.
需要说明的是,推荐的方式具体可以是邮件或即时消息,也可以是其他方式,具体可以根据实际应用的需要进行设置,此处不做限制。It should be noted that the recommended method may specifically be email or instant messaging, or other methods, and the specific method may be set according to actual application requirements, and is not limited here.
本实施例中,通过获取风控数据库中每个专业公司的风控数据,筛选出规则集合为空的风控数据,确定为没有配置风控规则的目标数据,并将规则集合不为空的风控数据作为基础数据,从而有利于后续有针对性的为该目标数据寻找适配的风控规则,同时,针对每 个目标数据,计算每个目标数据中的场景信息与每个基础数据中的场景信息之间的信息相似度,若信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据,通过待推荐数据能够找到目标数据适配的风控规则的范围和方向,有利于后续从待推荐数据中筛选出最优的推荐结果,且计算相似度的方法,简单快捷,易于操作,进一步地,计算每个目标数据中的场景事件与每个待推荐数据中的场景事件之间的事件相似度,获取数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给目标数据对应的专业公司,针对目标数据中场景事件,找到适用于该场景事件的风控规则,低成本地实现为没有配置风控规则的目标数据对应的专业公司及时推荐适配的风控规则,有利于提高专业公司的风险预防水平。In this embodiment, by acquiring the wind control data of each professional company in the wind control database, the wind control data with an empty rule set is selected, and the target data without the risk control rule is determined, and the rule set is not empty. The risk control data is used as the basic data, which is beneficial to the subsequent targeted search for an adapted risk control rule for the target data. At the same time, for each target data, the scene information in each target data and each basic data are calculated. The information similarity between the scene information of the. If the information similarity is greater than or equal to a preset similarity threshold, the basic data where the scene information corresponding to the information similarity is located is used as the data to be recommended. The data to be recommended can find the scope and direction of the risk control rules adapted to the target data, which is conducive to subsequent selection of the best recommendation results from the data to be recommended. The method of calculating similarity is simple, fast, and easy to operate. Further, Calculate the event similarity between the scene events in each target data and the scene events in each to-be-recommended data to obtain the number The data to be recommended corresponding to the event similarity with the largest value, and the risk control rules in the data to be recommended are recommended to the professional company corresponding to the target data. For the scene events in the target data, find the risk control rules applicable to the scene events It is a low-cost implementation for professional companies corresponding to target data that has not been equipped with risk control rules to recommend adapted risk control rules in a timely manner, which is conducive to improving the risk prevention level of professional companies.
在一实施例中,在步骤S2之后,并且在步骤S3之前,该风控数据处理方法还包括如下步骤:In an embodiment, after step S2 and before step S3, the risk control data processing method further includes the following steps:
S6:若接收到专业公司的风控推荐请求,则根据风控推荐请求更新目标数据。S6: If a risk control recommendation request from a professional company is received, the target data is updated according to the risk control recommendation request.
具体地,接收风控数据未被收录在风控数据库中的专业公司的风控推荐请求,获取该专业公司的风控数据,并将该风控数据作为更新后的目标数据,保存到风控数据库中。Specifically, it receives a risk control recommendation request from a professional company whose risk control data is not included in the risk control database, obtains the risk control data of the professional company, and saves the risk control data as updated target data to the risk control. Database.
在本实施例中,接收到风控数据未被收录在风控数据库中的专业公司的风控推荐请求,根据该风控推荐请求,可以在风控数据库中,为该专业公司查找适配的风控规则,并将找到的适配的风控规则推荐给该专业公司,体现出实用性和广泛适用性。In this embodiment, a risk control recommendation request is received from a professional company whose risk control data is not included in the risk control database. According to the risk control recommendation request, an adapted company can be found in the risk control database for the professional company. Risk control rules, and recommend the adapted risk control rules to the professional company, reflecting practicality and broad applicability.
在一实施例中,如图3所示,在步骤S1之前,该风控数据处理方法还包括如下步骤:In an embodiment, as shown in FIG. 3, before step S1, the method for processing wind control data further includes the following steps:
S7:监控每个专业公司的每个场景事件,若检测到场景事件被触发并且命中该场景事件对应的规则集合中的风控规则,则记录该场景事件的命中信息,其中,命中信息包括该场景事件的发生时间,以及该场景事件命中的风控规则。S7: Monitor each scene event of each professional company. If it is detected that the scene event is triggered and the risk control rule in the rule set corresponding to the scene event is detected, then the hit information of the scene event is recorded, where the hit information includes the The time when a scene event occurs and the risk control rules that the scene event hits.
在本实施例中,场景事件命中该场景事件对应的规则集合中的风控规则,是指该场景事件中发生的情形违背了配置的风控规则中的约束条件,或超过了风控规则中的限制条件。In this embodiment, a scenario event hits a risk control rule in a rule set corresponding to the scenario event, which means that a situation occurring in the scenario event violates a constraint condition in the configured risk control rule, or exceeds a constraint condition in the risk control rule. Restrictions.
例如,假设有一场景事件为“用户可在领券折扣活动中,领取商家优惠券”,配置的风控规则为“一个用户IP只能领取一次优惠券,否则,限制该IP的使用”,场景事件具体发生过程为用户在商家软件中登录,点击领取优惠券按钮,若检测到同一用户IP在同一时间内,点击领取了100次优惠券,则可能是有用户通过欺诈软件对商家进行优惠券的不合理占有,确认与该场景事件配置的风控规则相违背,并将限制该用户IP在商家软件中的使用,即场景事件命中该场景事件对应的规则集合中的风控规则。For example, suppose a scenario event is "Users can receive merchant coupons during coupon discount activities" and the configured risk control rule is "A user IP can only receive coupons once, otherwise, the use of that IP is restricted". The specific process of the event is that the user logs in the merchant software and clicks the button to receive the coupon. If it is detected that the same user IP has clicked to receive the coupon 100 times at the same time, it may be that the user used the fraud software to conduct a coupon to the merchant. Unreasonable possession, confirm that it violates the risk control rules configured for this scenario event, and will limit the use of the user IP in the merchant software, that is, the scenario event hits the risk control rule in the rule set corresponding to the scenario event.
具体地,监控每个专业公司的每个场景事件,将检测到的场景事件被触发时的发生时间,以及场景事件在触发时,是否命中该场景事件对应的规则集合中的风控规则,记录在风控数据库的风控日志中,可以直观反映每个场景事件的命中情况,有利于后续对风控数据的分析和计算。Specifically, monitor each scene event of each professional company, record the occurrence time of the detected scene event when it is triggered, and when the scene event is triggered, whether to hit the risk control rule in the rule set corresponding to the scene event, and record The wind control log in the wind control database can directly reflect the hit situation of each scene event, which is beneficial to the subsequent analysis and calculation of wind control data.
S8:每隔预定的时间间隔,根据命中信息,统计每个场景事件在预设时间段的数据量,其中,数据量包括场景事件发生的次数和该场景事件对应的规则集合中每个风控规则被命中的次数。S8: Count the data amount of each scene event in a preset time period according to the hit information every predetermined time interval, where the data amount includes the number of scene event occurrences and each risk control in the rule set corresponding to the scene event The number of times the rule was hit.
具体地,每隔预定的时间间隔,根据步骤S7中记录在风控日志中的命中信息,统计每个场景事件在预设时间段内的该场景事件发生的次数和该场景事件对应的规则集合中每个风控规则被命中的次数。Specifically, every predetermined time interval, according to the hit information recorded in the wind control log in step S7, the number of occurrences of each scene event within a preset period of time and a rule set corresponding to the scene event are counted. The number of times each risk control rule was hit.
例如,假设预设时间段为“1月份的1号至15号”,继续使用步骤S7中的示例,如统计得到“1月份的1号至15号”中,场景事件“用户可在领券折扣活动中,领取商家优惠券”发生的次数为500次,命中其对应的风控规则的次数为5次。For example, assuming that the preset time period is "1st to 15th in January", continue to use the example in step S7. For example, if the statistics are obtained from "1st to 15th in January", the scenario event "Users can pick up coupons" In the discount campaign, the number of times of "Get Merchant Coupon" occurred 500 times, and the number of hits to the corresponding risk control rules was 5 times.
需要说明的是,预定的时间间隔和预设时间段,具体可以根据实际业务需求进行设置,此处不做限制。It should be noted that the predetermined time interval and the preset time period may be specifically set according to actual service requirements, and are not limited here.
S9:根据数据量,计算每个场景事件的风控命中率。S9: Calculate the wind control hit rate of each scene event according to the amount of data.
在本实施例中,每个场景事件的风控命中率是每个场景事件在预设时间段内的该场景事件对应的规则集合中每个风控规则被命中的次数和该场景事件发生的次数的比值,命中率越高,则表示该场景事件越容易发生风险。In this embodiment, the wind control hit rate of each scene event is the number of times each wind control rule is hit in the rule set corresponding to the scene event within a preset time period of each scene event and the occurrence of the scene event. The ratio of the number of times, the higher the hit rate, it means that the scene event is more prone to risk.
继续使用步骤S8中的示例,如场景事件“用户可在领券折扣活动中,领取商家优惠券”在“1月份的1号至15号”的时间段内的风控命中率为1%。Continue to use the example in step S8, for example, the scenario event "users can collect merchant coupons during coupon collection discounts" has a risk control hit rate of 1% during the period of "1st to 15th of January".
在本实施例中,监控每个专业公司的每个场景事件,将检测到的场景事件被触发时的发生时间,以及场景事件触发时,是否命中该场景事件对应的规则集合中的风控规则,记录风控数据库的风控日志中,可以直观反映每个场景事件的命中情况,有利于后续对风控数据的分析和计算,从而通过统计每个场景事件在预设时间段内的该场景事件发生的次数和该场景事件对应的规则集合中每个风控规则被命中的次数,可以计算出每个场景事件的风控命中率,可以充分体现每个场景事件的易发生风险的程度。In this embodiment, each scene event of each professional company is monitored, the time when the detected scene event is triggered, and when the scene event is triggered, whether the risk control rule in the rule set corresponding to the scene event is hit. The wind control log recorded in the wind control database can intuitively reflect the hit situation of each scene event, which is conducive to subsequent analysis and calculation of wind control data, so as to count the scene of each scene event within a preset time period. The number of event occurrences and the number of times each risk control rule in the rule set corresponding to the scenario event is calculated can calculate the risk control hit rate of each scenario event, which can fully reflect the degree of risk of each scenario event.
在一实施例中,如图4所示,步骤S5中,即获取事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给目标数据对应的专业公司具体包括如下步骤:In an embodiment, as shown in FIG. 4, in step S5, the data to be recommended corresponding to the event similarity with the largest value in the event similarity is obtained, and the risk control rule in the data to be recommended is recommended to the target data. The corresponding professional company includes the following steps:
S501:获取事件相似度中数值最大的事件相似度对应的待推荐数据,作为标准数据。S501: Acquire to-be-recommended data corresponding to the event similarity with the largest numerical value in the event similarity as standard data.
具体地,从步骤S4中得到的每个目标数据的场景事件和每个待推荐数据的场景事件之间的事件相似度中,选取数值最大的事件相似度,并将该事件相似度对应的风控数据作为待推荐给专业公司的标准数据。Specifically, from the event similarity between the scene event of each target data and the scene event of each to-be-recommended data obtained in step S4, the event similarity with the largest numerical value is selected, and the wind corresponding to the event similarity corresponds to the wind. Control data is used as standard data to be recommended to professional companies.
S502:从标准数据中,筛选出数值最大的风控命中率,并将该风控命中率对应的风控规则,推荐给目标数据对应的专业公司。S502: From the standard data, filter out the wind control hit ratio with the largest value, and recommend the wind control rule corresponding to the wind control hit ratio to the professional company corresponding to the target data.
具体地,根据步骤S9中经统计和计算到的每个场景事件的风控命中率,在标准数据中,筛选出数值最大的风控命中率,并将该风控命中率对应的风控规则,作为最适配目标数据的风控规则,推荐给目标数据对应的专业公司。Specifically, according to the wind control hit rate of each scene event calculated and calculated in step S9, in the standard data, the wind control hit rate with the largest value is selected, and the wind control rule corresponding to the wind control hit rate is selected. As a risk control rule most suitable for the target data, it is recommended to the professional company corresponding to the target data.
例如,在标准数据中,有一场景事件“在购物平台,进行购物交易”,该场景事件对应有风控规则一“每笔交易金额不能超过2万元,否则,停止交易”,且风控命中率为5%,有风控规则二“每天的交易次数不得超过20次,否则,停止交易”,且风控命中率为2%,则选取风控命中率为5%对应的风控规则一,推荐给目标数据对应的专业公司。For example, in the standard data, there is a scenario event "On a shopping platform, a shopping transaction is performed." This scenario event corresponds to the risk control rule-"The amount of each transaction cannot exceed 20,000 yuan, otherwise, the transaction is stopped." The rate is 5%. There is risk control rule two. “The number of transactions per day must not exceed 20, otherwise, the transaction is stopped.” If the risk control hit rate is 2%, then the risk control rule 1 corresponding to the risk control hit rate of 5% is selected. , Recommended to the professional company corresponding to the target data.
在本实施例中,通过选取数值最大的事件相似度,并将该事件相似度对应的风控数据作为待推荐给专业公司的标准数据,同时,从标准数据中,筛选出数值最大的风控命中率,并将该风控命中率对应的风控规则,作为最适配目标数据的风控规则,推荐给目标数据对应的专业公司,风控命中率可以直观反映出其对应的场景事件易发生风险的程度,有利于提高该目标数据的场景事件预防风险的水平。In this embodiment, by selecting the event similarity with the largest value and using the risk control data corresponding to the event similarity as the standard data to be recommended to a professional company, at the same time, the risk control with the largest value is selected from the standard data. Hit rate, and the risk control rule corresponding to the risk control hit rate is used as the risk control rule most suitable for the target data. It is recommended to the professional company corresponding to the target data. The risk control hit rate can intuitively reflect its corresponding scene events. The degree of risk occurrence is conducive to improving the level of risk prevention of scene events of the target data.
在一实施例中,如图5所示,在步骤S5之后,该风控数据处理方法还包括如下步骤:In an embodiment, as shown in FIG. 5, after step S5, the method for processing wind control data further includes the following steps:
S10:根据目标数据对应的专业公司反馈的确认信息,获取确认信息中该专业公司确认的目标风控规则。S10: Obtain the target risk control rule confirmed by the professional company in the confirmation information according to the confirmation information fed back by the professional company corresponding to the target data.
具体地,接收到目标数据对应的专业公司反馈的确认信息,确认信息中包含了该专业公司确认使用的推荐的风控规则,其中,该被确认使用的风控规则,作为目标风控规则。Specifically, the confirmation information fed back by the professional company corresponding to the target data is received, and the confirmation information includes the recommended risk control rule confirmed for use by the professional company, and the confirmed risk control rule is used as the target risk control rule.
S11:在风控数据库中,将目标风控规则配置到目标数据对应的规则集合中。S11: In the risk control database, configure the target risk control rule into a rule set corresponding to the target data.
具体地,根据步骤S10中获取到的目标风控规则,更新风控数据库中的风控数据,即将目标风控规则配置到目标数据对应的规则集合中。Specifically, according to the target wind control rule obtained in step S10, the wind control data in the wind control database is updated, that is, the target wind control rule is configured into a rule set corresponding to the target data.
在本实施例中,根据目标数据对应的专业公司反馈的确认信息,获取确认信息中该专业公司确认的目标风控规则,并在风控数据库中,将目标风控规则配置到目标数据对应的规则集合中,对风控数据进行更新,一方面可以充分体现配置了风控规则的目标数据的命中情况,另一方面,可以利用风控数据,为与该目标数据有相似或相同场景,却没有配置 风控规则的其他专业公司,进行风控规则推荐,有利于提高各个专业公司预防风险的水平。In this embodiment, according to the confirmation information fed back by the professional company corresponding to the target data, the target risk control rule confirmed by the professional company in the confirmation information is obtained, and the target risk control rule is configured in the risk control database to the target data In the rule set, updating the wind control data can fully reflect the hit situation of the target data configured with the wind control rules. On the other hand, the wind control data can be used to have similar or same scenarios with the target data. For other professional companies that do not have risk control rules, recommending risk control rules will help improve the level of risk prevention of each professional company.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence numbers of the steps in the above embodiments does not mean the order of execution. The execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiments of this application.
在一实施例中,提供一种风控数据处理装置,该风控数据处理装置与上述实施例中风控数据处理方法一一对应。如图6所示,该风控数据处理装置包括数据获取模块601、数据筛选模块602、信息计算模块603、事件计算模块604和规则推荐模块605。各功能模块详细说明如下:In one embodiment, a wind control data processing device is provided. The wind control data processing device corresponds to the wind control data processing method in the above embodiment in a one-to-one correspondence. As shown in FIG. 6, the risk control data processing device includes a data acquisition module 601, a data screening module 602, an information calculation module 603, an event calculation module 604, and a rule recommendation module 605. The detailed description of each function module is as follows:
数据获取模块601,用于获取风控数据库中每个专业公司的风控数据,其中,风控数据包括场景信息,每个场景信息对应的场景事件,每个场景事件对应的包含风控规则的规则集合,以及每个风控规则对应的风控命中率;A data acquisition module 601 is used to acquire wind control data of each professional company in the wind control database, where the wind control data includes scene information, scene events corresponding to each scene information, and each scene event corresponding to a wind control rule. Rule set, and the risk control hit rate corresponding to each risk control rule;
数据筛选模块602,用于从风控数据中筛选出规则集合为空的风控数据,作为目标数据,并将规则集合不为空的风控数据作为基础数据;The data screening module 602 is configured to screen out wind control data from which the rule set is empty as target data, and use the wind control data whose rule set is not empty as basic data;
信息计算模块603,用于针对每个目标数据,计算该目标数据中的场景信息与每个基础数据中的场景信息之间的信息相似度,若信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据;An information calculation module 603 is configured to calculate, for each target data, the information similarity between the scene information in the target data and the scene information in each basic data, if the information similarity is greater than or equal to a preset similarity threshold , Obtain the basic data where the scene information corresponding to the information similarity is located, and use the basic data as the data to be recommended;
事件计算模块604,用于计算每个目标数据中的场景事件与每个待推荐数据中的场景事件之间的事件相似度;An event calculation module 604, configured to calculate an event similarity between a scene event in each target data and a scene event in each to-be-recommended data;
规则推荐模块605,用于获取事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给目标数据对应的专业公司。The rule recommendation module 605 is configured to obtain the data to be recommended corresponding to the event similarity with the largest value in the event similarity, and recommend the risk control rule in the data to be recommended to the professional company corresponding to the target data.
进一步地,该风控数据处理装置还包括:Further, the risk control data processing device further includes:
请求接收模块606,用于若接收到专业公司的风控推荐请求,则根据风控推荐请求更新目标数据。The request receiving module 606 is configured to update the target data according to the risk control recommendation request if a risk control recommendation request from a professional company is received.
进一步地,该风控数据处理装置还包括:Further, the risk control data processing device further includes:
命中记录模块607,用于监控每个专业公司的每个场景事件,若检测到场景事件被触发并且命中该场景事件对应的规则集合中的风控规则,则记录该场景事件的命中信息,其中,命中信息包括该场景事件的发生时间,以及该场景事件命中的风控规则;A hit recording module 607 is used to monitor each scene event of each professional company. If it is detected that a scene event is triggered and the risk control rule in the rule set corresponding to the scene event is detected, the hit information of the scene event is recorded, where The hit information includes the occurrence time of the scene event and the risk control rule of the scene event hit;
数据统计模块608,用于每隔预定的时间间隔,根据命中信息,统计每个场景事件在预设时间段的数据量,其中,数据量包括场景事件发生的次数和该场景事件对应的规则集合中每个风控规则被命中的次数;A data statistics module 608 is configured to count the data amount of each scene event in a preset time period according to the hit information at a predetermined time interval, where the data amount includes the number of scene event occurrences and a rule set corresponding to the scene event The number of times each risk control rule was hit;
命中计算模块609,用于根据数据量,计算每个场景事件的风控命中率。The hit calculation module 609 is configured to calculate a wind control hit rate of each scene event according to the amount of data.
进一步地,规则推荐模块605包括:Further, the rule recommendation module 605 includes:
数据获取单元6051,用于获取事件相似度中数值最大的事件相似度对应的待推荐数据,作为标准数据;A data obtaining unit 6051 is configured to obtain data to be recommended corresponding to the event similarity with the largest numerical value among the event similarities as standard data;
规则推荐单元6052,用于从标准数据中,筛选出数值最大的风控命中率,并将该风控命中率对应的风控规则,推荐给目标数据对应的专业公司。A rule recommending unit 6052 is used to filter out the wind control hit ratio with the largest value from the standard data, and recommend the wind control rule corresponding to the wind control hit ratio to the professional company corresponding to the target data.
进一步地,该风控数据处理装置还包括:Further, the risk control data processing device further includes:
信息反馈模块610,用于根据目标数据对应的专业公司反馈的确认信息,获取确认信息中该专业公司确认的目标风控规则;An information feedback module 610 is configured to obtain the target risk control rule confirmed by the professional company in the confirmation information according to the confirmation information fed back by the professional company corresponding to the target data;
规则配置模块611,用于在风控数据库中,将目标风控规则配置到目标数据对应的规则集合中。A rule configuration module 611 is configured to configure a target risk control rule in a risk control database to a rule set corresponding to the target data.
关于风控数据处理装置的具体限定可以参见上文中对于风控数据处理方法的限定,在此不再赘述。上述风控数据处理装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For the specific limitation of the wind control data processing device, refer to the foregoing limitation on the wind control data processing method, which will not be repeated here. Each module in the above-mentioned wind control data processing device may be implemented in whole or in part by software, hardware, and a combination thereof. The above-mentioned modules may be embedded in the hardware in or independent of the processor in the computer device, or may be stored in the memory of the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图7所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为非易失性存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于保存风控数据。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种风控数据处理方法。In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure diagram may be as shown in FIG. 7. The computer device includes a processor, a memory, a network interface, and a database connected through a system bus. The processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer-readable instructions, and a database. The internal memory provides an environment for the operation of the operating system and computer-readable instructions in a non-volatile storage medium. The database of the computer equipment is used to store risk control data. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer-readable instructions are executed by a processor to implement a method for processing wind control data.
在一个实施例中,提供了一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机可读指令,处理器执行计算机可读指令时实现上述实施例风控数据处理方法的步骤,例如图2所示的步骤S1至步骤S5。或者,处理器执行计算机可读指令时实现上述实施例中风控数据处理装置的各模块/单元的功能,例如图6所示模块601至模块605的功能。为避免重复,这里不再赘述。In one embodiment, a computer device is provided, including a memory, a processor, and computer-readable instructions stored on the memory and executable on the processor. When the processor executes the computer-readable instructions, the wind control of the foregoing embodiment is implemented. The steps of the data processing method are, for example, steps S1 to S5 shown in FIG. 2. Alternatively, when the processor executes the computer-readable instructions, the functions of the modules / units of the wind control data processing device in the foregoing embodiment are implemented, for example, the functions of modules 601 to 605 shown in FIG. 6. To avoid repetition, we will not repeat them here.
在一个实施例中,提供了一种非易失性存储介质,其上存储有计算机可读指令,计算机可读指令被处理器执行时实现上述方法实施例中风控数据处理方法,或者,该计算机可 读指令被处理器执行时实现上述装置实施例中风控数据处理装置中各模块/单元的功能。为避免重复,这里不再赘述。In one embodiment, a non-volatile storage medium is provided, on which computer-readable instructions are stored, and when the computer-readable instructions are executed by a processor, the wind control data processing method in the foregoing method embodiment is implemented, or the computer When the readable instructions are executed by the processor, the functions of each module / unit in the wind control data processing device in the above device embodiment are realized. To avoid repetition, we will not repeat them here.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一非易失性计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)、DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。A person of ordinary skill in the art can understand that all or part of the processes in the methods of the foregoing embodiments can be implemented by using computer-readable instructions to instruct related hardware. The computer-readable instructions can be stored in a non-volatile computer. In the readable storage medium, the computer-readable instructions, when executed, may include the processes of the embodiments of the methods described above. Wherein, any reference to the storage, storage, database, or other media used in the embodiments provided in this application may include non-volatile and / or volatile storage. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in various forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), dual data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain (Synchlink), DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。Those skilled in the art can clearly understand that, for the convenience and brevity of the description, only the above-mentioned division of functional units and modules is used as an example. In practical applications, the above functions can be assigned by different functional units, Module completion, that is, dividing the internal structure of the device into different functional units or modules to complete all or part of the functions described above.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解,其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to describe the technical solution of the present application, but are not limited thereto. Although the present application is described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still implement the foregoing implementations. The technical solutions described in the examples are modified, or some of the technical features are equivalently replaced; and these modifications or replacements do not deviate the essence of the corresponding technical solutions from the spirit and scope of the technical solutions of the embodiments of the application, and should be included in Within the scope of this application.

Claims (20)

  1. 一种风控数据处理方法,其特征在于,所述风控数据处理方法包括:A wind control data processing method, characterized in that the wind control data processing method includes:
    获取风控数据库中每个专业公司的风控数据,其中,所述风控数据包括场景信息,每个所述场景信息对应的场景事件,每个所述场景事件对应的包含风控规则的规则集合,以及每个所述风控规则对应的风控命中率;Obtain the wind control data of each professional company in the wind control database, where the wind control data includes scene information, scene events corresponding to each of the scene information, and rules containing wind control rules corresponding to each of the scene events A set, and a wind control hit rate corresponding to each of the wind control rules;
    从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据;Selecting, from the risk control data, risk control data in which the rule set is empty as target data, and using the risk control data in which the rule set is not empty as basic data;
    针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据;For each of the target data, calculate an information similarity between the scene information in the target data and the scene information in each of the basic data, and if the information similarity is greater than or equal to a preset similarity threshold, Then obtain the basic data where the scene information corresponding to the information similarity is located, and use the basic data as the data to be recommended;
    计算每个所述目标数据中的场景事件与每个所述待推荐数据中的场景事件之间的事件相似度;Calculating an event similarity between a scene event in each of the target data and a scene event in each of the to-be-recommended data;
    获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司。Acquire the to-be-recommended data corresponding to the event similarity with the largest value among the event similarities, and recommend the risk control rules in the to-be-recommended data to the professional company corresponding to the target data.
  2. 如权利要求1所述的风控数据处理方法,其特征在于,在从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据之后,并且在所述针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据之前,所述风控数据处理方法还包括:The method for processing risk control data according to claim 1, wherein the risk control data in which the rule set is empty is selected from the risk control data as target data, and the rule set is not After the empty wind control data is used as the basic data, and for each of the target data, the information similarity between the scene information in the target data and the scene information in each of the basic data is calculated. Before the information similarity is greater than or equal to a preset similarity threshold, the basic data where the scene information corresponding to the information similarity is located and before the basic data is used as the data to be recommended, the method for processing wind control data further includes:
    若接收到专业公司的风控推荐请求,则根据所述风控推荐请求更新所述目标数据。If a risk control recommendation request from a professional company is received, the target data is updated according to the risk control recommendation request.
  3. 如权利要求1或2所述的风控数据处理方法,其特征在于,在所述获取风控数据库中每个专业公司的风控数据之前,所述风控数据处理方法还包括:The wind control data processing method according to claim 1 or 2, wherein before the acquiring wind control data of each professional company in the wind control database, the wind control data processing method further comprises:
    监控每个所述专业公司的每个所述场景事件,若检测到所述场景事件被触发并且命中该场景事件对应的规则集合中的风控规则,则记录该场景事件的命中信息,其中,所述命中信息包括该场景事件的发生时间,以及该场景事件命中的风控规则;Monitoring each of the scene events of each of the professional companies, and if it is detected that the scene event is triggered and hits a wind control rule in a rule set corresponding to the scene event, then the hit information of the scene event is recorded, wherein, The hit information includes the occurrence time of the scene event and the risk control rule of the scene event hit;
    每隔预定的时间间隔,根据所述命中信息,统计每个所述场景事件在预设时间段的数据量,其中,所述数据量包括所述场景事件发生的次数和该场景事件对应的规则集合中每个风控规则被命中的次数;Every predetermined time interval, according to the hit information, the data amount of each scene event in a preset time period is counted, wherein the data amount includes the number of times the scene event occurs and a rule corresponding to the scene event The number of times each risk control rule in the set was hit;
    根据所述数据量,计算每个所述场景事件的风控命中率。According to the amount of data, a wind control hit rate of each of the scene events is calculated.
  4. 如权利要求1或2所述的风控数据处理方法,其特征在于,所述获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司包括:The method for processing risk control data according to claim 1 or 2, wherein the to-be-recommended data corresponding to the event similarity with the largest value among the event similarities is obtained, and the risk control in the to-be-recommended data is The rules recommended to the professional company corresponding to the target data include:
    获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,作为标准数据;Acquiring to-be-recommended data corresponding to the event similarity with the largest value among the event similarities as standard data;
    从所述标准数据中,筛选出数值最大的所述风控命中率,并将该风控命中率对应的所述风控规则,推荐给所述目标数据对应的专业公司。From the standard data, the wind control hit ratio with the largest value is selected, and the wind control rule corresponding to the wind control hit ratio is recommended to the professional company corresponding to the target data.
  5. 如权利要求1或2所述的风控数据处理方法,其特征在于,在所述获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司之后,所述风控数据处理方法还包括:The method for processing risk control data according to claim 1 or 2, characterized in that: the to-be-recommended data corresponding to the event similarity with the largest value among the event similarities is obtained, and the risk in the to-be-recommended data is obtained. After the control rules are recommended to the professional company corresponding to the target data, the method for processing risk control data further includes:
    根据所述目标数据对应的专业公司反馈的确认信息,获取所述确认信息中该专业公司确认的目标风控规则;Obtaining the target risk control rule confirmed by the professional company in the confirmation information according to the confirmation information fed back by the professional company corresponding to the target data;
    在所述风控数据库中,将所述目标风控规则配置到所述目标数据对应的规则集合中。In the risk control database, the target risk control rule is configured into a rule set corresponding to the target data.
  6. 一种风控数据处理装置,其特征在于,所述风控数据处理装置包括:A wind control data processing device, characterized in that the wind control data processing device includes:
    数据获取模块,用于获取风控数据库中每个专业公司的风控数据,其中,所述风控数据包括场景信息,每个所述场景信息对应的场景事件,每个所述场景事件对应的包含风控规则的规则集合,以及每个所述风控规则对应的风控命中率;A data acquisition module is configured to acquire wind control data of each professional company in the wind control database, where the wind control data includes scene information, a scene event corresponding to each of the scene information, and a corresponding event of each of the scene events. A rule set including a risk control rule, and a risk control hit rate corresponding to each of the risk control rules;
    数据筛选模块,用于从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据;A data screening module, configured to select, from the wind control data, wind control data in which the rule set is empty as target data, and use the wind control data in which the rule set is not empty as basic data;
    信息计算模块,用于针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据;An information calculation module is configured to calculate, for each of the target data, information similarity between scene information in the target data and scene information in each of the basic data, and if the information similarity is greater than or equal to If the similarity threshold is set, the basic data where the scene information corresponding to the information similarity is located is used as the data to be recommended;
    事件计算模块,用于计算每个所述目标数据中的场景事件与每个所述待推荐数据中的场景事件之间的事件相似度;An event calculation module, configured to calculate an event similarity between a scene event in each of the target data and a scene event in each of the to-be-recommended data;
    规则推荐模块,用于获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司。The rule recommendation module is configured to obtain the data to be recommended corresponding to the event similarity with the largest value among the event similarities, and recommend the risk control rule in the data to be recommended to the professional company corresponding to the target data.
  7. 如权利要求6所述的风控数据处理装置,其特征在于,所述风控数据处理装置还包括:The wind control data processing device according to claim 6, wherein the wind control data processing device further comprises:
    请求接收模块,用于若接收到专业公司的风控推荐请求,则根据所述风控推荐请求更新所述目标数据。The request receiving module is configured to update the target data according to the risk control recommendation request if a risk control recommendation request from a professional company is received.
  8. 如权利要求6或7所述的风控数据处理装置,其特征在于,所述风控数据处理装置还包括:The wind control data processing device according to claim 6 or 7, wherein the wind control data processing device further comprises:
    命中记录模块,用于监控每个所述专业公司的每个所述场景事件,若检测到所述场景事件被触发并且命中该场景事件对应的规则集合中的风控规则,则记录该场景事件的命中信息,其中,所述命中信息包括该场景事件的发生时间,以及该场景事件命中的风控规则;The hit recording module is configured to monitor each of the scene events of each of the professional companies, and if it is detected that the scene event is triggered and hits a wind control rule in a rule set corresponding to the scene event, the scene event is recorded The hit information, wherein the hit information includes a time of occurrence of the scene event and a risk control rule of the scene event hit;
    数据统计模块,用于每隔预定的时间间隔,根据所述命中信息,统计每个所述场景事件在预设时间段的数据量,其中,所述数据量包括所述场景事件发生的次数和该场景事件对应的规则集合中每个风控规则被命中的次数;A data statistics module is configured to count a data amount of each scene event in a preset time period according to the hit information at a predetermined time interval, where the data amount includes the number of times the scene event occurs and The number of times each risk control rule in the rule set corresponding to the scenario event is hit;
    命中计算模块,用于根据所述数据量,计算每个所述场景事件的风控命中率。A hit calculation module is configured to calculate a wind control hit rate of each of the scene events according to the data amount.
  9. 如权利要求6或7所述的风控数据处理装置,其特征在于,所述规则推荐模块包括:The wind control data processing device according to claim 6 or 7, wherein the rule recommendation module comprises:
    数据获取单元,用于获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,作为标准数据;A data obtaining unit, configured to obtain data to be recommended corresponding to the event similarity with the largest numerical value among the event similarities as standard data;
    规则推荐单元,用于从所述标准数据中,筛选出数值最大的所述风控命中率,并将该风控命中率对应的所述风控规则,推荐给所述目标数据对应的专业公司。A rule recommendation unit is configured to select the wind control hit ratio with the largest value from the standard data, and recommend the wind control rule corresponding to the wind control hit ratio to a professional company corresponding to the target data .
  10. 如权利要求6或7所述的风控数据处理装置,其特征在于,所述风控数据处理装置还包括:The wind control data processing device according to claim 6 or 7, wherein the wind control data processing device further comprises:
    信息反馈模块,用于根据所述目标数据对应的专业公司反馈的确认信息,获取所述确认信息中该专业公司确认的目标风控规则;An information feedback module, configured to obtain target risk control rules confirmed by the professional company in the confirmation information according to the confirmation information fed back by the professional company corresponding to the target data;
    规则配置模块,用于在所述风控数据库中,将所述目标风控规则配置到所述目标数据对应的规则集合中。A rule configuration module is configured to configure the target risk control rule in the risk control database to a rule set corresponding to the target data.
  11. 一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其特征在于,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, and is characterized in that the processor implements the computer-readable instructions as follows step:
    获取风控数据库中每个专业公司的风控数据,其中,所述风控数据包括场景信息,每个所述场景信息对应的场景事件,每个所述场景事件对应的包含风控规则的规则集合,以及每个所述风控规则对应的风控命中率;Obtain the wind control data of each professional company in the wind control database, where the wind control data includes scene information, scene events corresponding to each of the scene information, and rules containing wind control rules corresponding to each of the scene events A set, and a wind control hit rate corresponding to each of the wind control rules;
    从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据;Selecting, from the risk control data, risk control data in which the rule set is empty as target data, and using the risk control data in which the rule set is not empty as basic data;
    针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息 相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据;For each of the target data, calculate an information similarity between the scene information in the target data and the scene information in each of the basic data, and if the information similarity is greater than or equal to a preset similarity threshold, Then obtain the basic data where the scene information corresponding to the information similarity is located, and use the basic data as the data to be recommended;
    计算每个所述目标数据中的场景事件与每个所述待推荐数据中的场景事件之间的事件相似度;Calculating an event similarity between a scene event in each of the target data and a scene event in each of the to-be-recommended data;
    获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司。Acquire the to-be-recommended data corresponding to the event similarity with the largest value among the event similarities, and recommend the risk control rules in the to-be-recommended data to the professional company corresponding to the target data.
  12. 如权利要求11所述的计算机设备,其特征在于,在从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据之后,并且在所述针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据之前,所述处理器执行所述计算机可读指令时还实现如下步骤:The computer device according to claim 11, wherein the wind control data in which the rule set is empty is selected from the wind control data as target data, and the wind in which the rule set is not empty is selected. After controlling data as basic data, and for each of the target data, calculating information similarity between scene information in the target data and scene information in each of the basic data, if the information is similar If the degree is greater than or equal to a preset similarity threshold, the basic data where the scene information corresponding to the similarity of the information is located, and before the basic data is used as the data to be recommended, the processor also executes the computer-readable instructions. To achieve the following steps:
    若接收到专业公司的风控推荐请求,则根据所述风控推荐请求更新所述目标数据。If a risk control recommendation request from a professional company is received, the target data is updated according to the risk control recommendation request.
  13. 如权利要求11或12所述的计算机设备,其特征在于,在所述获取风控数据库中每个专业公司的风控数据之前,所述处理器执行所述计算机可读指令时还实现如下步骤:The computer device according to claim 11 or 12, wherein before the acquiring the wind control data of each professional company in the wind control database, the processor further implements the following steps when executing the computer-readable instructions :
    监控每个所述专业公司的每个所述场景事件,若检测到所述场景事件被触发并且命中该场景事件对应的规则集合中的风控规则,则记录该场景事件的命中信息,其中,所述命中信息包括该场景事件的发生时间,以及该场景事件命中的风控规则;Monitoring each of the scene events of each of the professional companies, and if it is detected that the scene event is triggered and hits a wind control rule in a rule set corresponding to the scene event, then the hit information of the scene event is recorded, wherein, The hit information includes the occurrence time of the scene event and the risk control rule of the scene event hit;
    每隔预定的时间间隔,根据所述命中信息,统计每个所述场景事件在预设时间段的数据量,其中,所述数据量包括所述场景事件发生的次数和该场景事件对应的规则集合中每个风控规则被命中的次数;Every predetermined time interval, according to the hit information, the data amount of each scene event in a preset time period is counted, wherein the data amount includes the number of times the scene event occurs and a rule corresponding to the scene event The number of times each risk control rule in the set was hit;
    根据所述数据量,计算每个所述场景事件的风控命中率。According to the amount of data, a wind control hit rate of each of the scene events is calculated.
  14. 如权利要求11或12所述的计算机设备,其特征在于,所述获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司包括:The computer device according to claim 11 or 12, wherein the to-be-recommended data corresponding to the event similarity with the largest numerical value in the event similarity is obtained, and the risk control rule in the to-be-recommended data is recommended. The professional company corresponding to the target data includes:
    获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,作为标准数据;Acquiring to-be-recommended data corresponding to the event similarity with the largest value among the event similarities as standard data;
    从所述标准数据中,筛选出数值最大的所述风控命中率,并将该风控命中率对应的所述风控规则,推荐给所述目标数据对应的专业公司。From the standard data, the wind control hit ratio with the largest value is selected, and the wind control rule corresponding to the wind control hit ratio is recommended to the professional company corresponding to the target data.
  15. 如权利要求11或12所述的计算机设备,其特征在于,在所述获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司之后,所述处理器执行所述计算机可读指令时还实现如下步 骤:The computer device according to claim 11 or 12, wherein, in the step of acquiring the data to be recommended corresponding to the event similarity having the largest value among the event similarities, the risk control rules in the data to be recommended are obtained, After recommending to the professional company corresponding to the target data, the processor further implements the following steps when executing the computer-readable instructions:
    根据所述目标数据对应的专业公司反馈的确认信息,获取所述确认信息中该专业公司确认的目标风控规则;Obtaining the target risk control rule confirmed by the professional company in the confirmation information according to the confirmation information fed back by the professional company corresponding to the target data;
    在所述风控数据库中,将所述目标风控规则配置到所述目标数据对应的规则集合中。In the risk control database, the target risk control rule is configured into a rule set corresponding to the target data.
  16. 一个或多个存储有计算机可读指令的非易失性可读存储介质,其特征在于,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:One or more non-volatile readable storage media storing computer readable instructions, characterized in that when the computer readable instructions are executed by one or more processors, the one or more processors are caused to execute The following steps:
    获取风控数据库中每个专业公司的风控数据,其中,所述风控数据包括场景信息,每个所述场景信息对应的场景事件,每个所述场景事件对应的包含风控规则的规则集合,以及每个所述风控规则对应的风控命中率;Obtain the wind control data of each professional company in the wind control database, where the wind control data includes scene information, scene events corresponding to each of the scene information, and rules containing wind control rules corresponding to each of the scene events A set, and a wind control hit rate corresponding to each of the wind control rules;
    从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据;Selecting, from the risk control data, risk control data in which the rule set is empty as target data, and using the risk control data in which the rule set is not empty as basic data;
    针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据;For each of the target data, calculate an information similarity between the scene information in the target data and the scene information in each of the basic data, and if the information similarity is greater than or equal to a preset similarity threshold, Then obtain the basic data where the scene information corresponding to the information similarity is located, and use the basic data as the data to be recommended;
    计算每个所述目标数据中的场景事件与每个所述待推荐数据中的场景事件之间的事件相似度;Calculating an event similarity between a scene event in each of the target data and a scene event in each of the to-be-recommended data;
    获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司。Acquire the to-be-recommended data corresponding to the event similarity with the largest value among the event similarities, and recommend the risk control rules in the to-be-recommended data to the professional company corresponding to the target data.
  17. 如权利要求16所述的非易失性可读存储介质,其特征在于,在从所述风控数据中筛选出所述规则集合为空的风控数据,作为目标数据,并将所述规则集合不为空的风控数据作为基础数据之后,并且在所述针对每个所述目标数据,计算该目标数据中的场景信息与每个所述基础数据中的场景信息之间的信息相似度,若所述信息相似度大于或等于预设的相似度阈值,则获取该信息相似度对应的场景信息所在的基础数据,并将该基础数据作为待推荐数据之前,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器还执行如下步骤:The non-volatile readable storage medium according to claim 16, wherein the wind control data with the rule set empty is selected from the wind control data as target data, and the rule is After collecting non-empty risk control data as basic data, and for each of the target data, calculating information similarity between scene information in the target data and scene information in each of the basic data If the similarity of the information is greater than or equal to a preset similarity threshold, then obtain the basic data where the scene information corresponding to the information similarity is located, and before using the basic data as the data to be recommended, the computer-readable instructions are When the one or more processors execute, the one or more processors further perform the following steps:
    若接收到专业公司的风控推荐请求,则根据所述风控推荐请求更新所述目标数据。If a risk control recommendation request from a professional company is received, the target data is updated according to the risk control recommendation request.
  18. 如权利要求16或17所述的非易失性可读存储介质,其特征在于,在所述获取风控数据库中每个专业公司的风控数据之前,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器还执行如下步骤:The non-volatile readable storage medium according to claim 16 or 17, wherein before the obtaining wind control data of each professional company in the wind control database, the computer-readable instructions are read by one or more When the two processors execute, the one or more processors further perform the following steps:
    监控每个所述专业公司的每个所述场景事件,若检测到所述场景事件被触发并且命中 该场景事件对应的规则集合中的风控规则,则记录该场景事件的命中信息,其中,所述命中信息包括该场景事件的发生时间,以及该场景事件命中的风控规则;Monitoring each of the scene events of each of the professional companies, and if it is detected that the scene event is triggered and hits a wind control rule in a rule set corresponding to the scene event, then the hit information of the scene event is recorded, wherein, The hit information includes the occurrence time of the scene event and the risk control rule of the scene event hit;
    每隔预定的时间间隔,根据所述命中信息,统计每个所述场景事件在预设时间段的数据量,其中,所述数据量包括所述场景事件发生的次数和该场景事件对应的规则集合中每个风控规则被命中的次数;Every predetermined time interval, according to the hit information, the data amount of each scene event in a preset time period is counted, wherein the data amount includes the number of times the scene event occurs and a rule corresponding to the scene event The number of times each risk control rule in the set was hit;
    根据所述数据量,计算每个所述场景事件的风控命中率。According to the amount of data, a wind control hit rate of each of the scene events is calculated.
  19. 如权利要求16或17所述的非易失性可读存储介质,其特征在于,所述获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司包括:The non-volatile readable storage medium according to claim 16 or 17, characterized in that, the to-be-recommended data corresponding to the event similarity with the largest value among the event similarities is obtained, and the to-be-recommended data is Risk control rules recommended to professional companies corresponding to the target data include:
    获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,作为标准数据;Acquiring to-be-recommended data corresponding to the event similarity with the largest value among the event similarities as standard data;
    从所述标准数据中,筛选出数值最大的所述风控命中率,并将该风控命中率对应的所述风控规则,推荐给所述目标数据对应的专业公司。From the standard data, the wind control hit ratio with the largest value is selected, and the wind control rule corresponding to the wind control hit ratio is recommended to the professional company corresponding to the target data.
  20. 如权利要求16或17所述的非易失性可读存储介质,其特征在于,在所述获取所述事件相似度中数值最大的事件相似度对应的待推荐数据,并将该待推荐数据中的风控规则,推荐给所述目标数据对应的专业公司之后,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器还执行如下步骤:The non-volatile readable storage medium according to claim 16 or 17, wherein in the step of acquiring the data to be recommended corresponding to the event similarity having the largest value among the event similarities, the data to be recommended After the risk control rules in the recommendation are recommended to the professional company corresponding to the target data, when the computer-readable instructions are executed by one or more processors, the one or more processors further perform the following steps:
    根据所述目标数据对应的专业公司反馈的确认信息,获取所述确认信息中该专业公司确认的目标风控规则;Obtaining the target risk control rule confirmed by the professional company in the confirmation information according to the confirmation information fed back by the professional company corresponding to the target data;
    在所述风控数据库中,将所述目标风控规则配置到所述目标数据对应的规则集合中。In the risk control database, the target risk control rule is configured into a rule set corresponding to the target data.
PCT/CN2018/092327 2018-06-08 2018-06-22 Method for processing risk control data, device, computer apparatus, and storage medium WO2019232821A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810585015.7 2018-06-08
CN201810585015.7A CN108985559A (en) 2018-06-08 2018-06-08 Air control data processing method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2019232821A1 true WO2019232821A1 (en) 2019-12-12

Family

ID=64541033

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/092327 WO2019232821A1 (en) 2018-06-08 2018-06-22 Method for processing risk control data, device, computer apparatus, and storage medium

Country Status (2)

Country Link
CN (1) CN108985559A (en)
WO (1) WO2019232821A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108985559A (en) * 2018-06-08 2018-12-11 平安科技(深圳)有限公司 Air control data processing method, device, computer equipment and storage medium
CN110322139B (en) * 2019-06-28 2023-11-28 创新先进技术有限公司 Policy recommendation method and device
CN113112363A (en) * 2021-04-09 2021-07-13 上海中汇亿达金融信息技术有限公司 Method, apparatus, and medium for facilitating transaction admission auditing
CN114625747B (en) * 2022-05-13 2022-08-12 杭银消费金融股份有限公司 Wind control updating method and system based on information security

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070239495A1 (en) * 2006-04-11 2007-10-11 Bank Of America Corporation Application Risk and Control Assessment Tool
CN107291922A (en) * 2017-06-29 2017-10-24 深圳市佰仟金融服务有限公司 A kind of data distributing method and device
CN107424069A (en) * 2017-08-17 2017-12-01 阿里巴巴集团控股有限公司 A kind of generation method of air control feature, risk monitoring and control method and apparatus
CN107688948A (en) * 2017-07-24 2018-02-13 平安科技(深圳)有限公司 Claims Resolution data processing method, device, computer equipment and storage medium
CN107767021A (en) * 2017-09-12 2018-03-06 阿里巴巴集团控股有限公司 A kind of risk control method and equipment
CN108985559A (en) * 2018-06-08 2018-12-11 平安科技(深圳)有限公司 Air control data processing method, device, computer equipment and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4957704B2 (en) * 2008-11-04 2012-06-20 富士ゼロックス株式会社 Information processing apparatus and information processing program
CN106897807B (en) * 2015-12-18 2021-01-26 创新先进技术有限公司 Business risk control method and equipment
CN107016473B (en) * 2016-01-27 2022-11-22 创新先进技术有限公司 Risk control method and equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070239495A1 (en) * 2006-04-11 2007-10-11 Bank Of America Corporation Application Risk and Control Assessment Tool
CN107291922A (en) * 2017-06-29 2017-10-24 深圳市佰仟金融服务有限公司 A kind of data distributing method and device
CN107688948A (en) * 2017-07-24 2018-02-13 平安科技(深圳)有限公司 Claims Resolution data processing method, device, computer equipment and storage medium
CN107424069A (en) * 2017-08-17 2017-12-01 阿里巴巴集团控股有限公司 A kind of generation method of air control feature, risk monitoring and control method and apparatus
CN107767021A (en) * 2017-09-12 2018-03-06 阿里巴巴集团控股有限公司 A kind of risk control method and equipment
CN108985559A (en) * 2018-06-08 2018-12-11 平安科技(深圳)有限公司 Air control data processing method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN108985559A (en) 2018-12-11

Similar Documents

Publication Publication Date Title
US20200342097A1 (en) Systems and methods for detecting resources responsible for events
CN108615119B (en) Abnormal user identification method and equipment
US10805328B2 (en) Systems and methods for detecting and scoring anomalies
US10178116B2 (en) Automated computer behavioral analysis system and methods
WO2019232821A1 (en) Method for processing risk control data, device, computer apparatus, and storage medium
KR101757844B1 (en) Methods and systems for deleting requested information
US9407655B2 (en) Monitoring security risks to enterprise corresponding to access rights and access risk calculation
WO2020164269A1 (en) User-group-based message pushing method and apparatus, and computer device
US20220292493A1 (en) Enhancing Merchant Databases Using Crowdsourced Browser Data
US11074652B2 (en) System and method for model-based prediction using a distributed computational graph workflow
CN111669379B (en) Behavior abnormity detection method and device
WO2019024497A1 (en) Method, device, terminal equipment and medium for generating customer return visit event
CN112602084A (en) System and method for identifying data leaks
US11829425B1 (en) Social warning system
CN111026882A (en) Default determination method, device, equipment and storage medium based on knowledge graph
RU2702275C1 (en) Method and system for marking user actions for subsequent analysis and accumulation
US20240089276A1 (en) Bot prevention velocity framework
CN109800099A (en) A kind of restoring method, storage medium and the terminal device of user's operation behavior
US20210409204A1 (en) Encryption of protected data for transmission over a web interface
US20230325841A1 (en) Systems and methods for detecting websites that perpetrate at least one of scams or frauds
RU2693646C1 (en) Method and system for selection of proposals for a user based on analysis of actions thereof
CN117994035A (en) Request processing method and device, equipment, storage medium and program product
CN115499231A (en) Flow detection method and device, electronic equipment and storage medium
CN117668363A (en) Recommendation method, device, equipment and medium
CN117472908A (en) Method, device, electronic equipment and storage medium for preventing repeated service delivery

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18921787

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 12/03/2021)

122 Ep: pct application non-entry in european phase

Ref document number: 18921787

Country of ref document: EP

Kind code of ref document: A1