WO2022121217A1 - Quota prediction method and device, and computer-readable storage medium - Google Patents

Quota prediction method and device, and computer-readable storage medium Download PDF

Info

Publication number
WO2022121217A1
WO2022121217A1 PCT/CN2021/090569 CN2021090569W WO2022121217A1 WO 2022121217 A1 WO2022121217 A1 WO 2022121217A1 CN 2021090569 W CN2021090569 W CN 2021090569W WO 2022121217 A1 WO2022121217 A1 WO 2022121217A1
Authority
WO
WIPO (PCT)
Prior art keywords
insurance
information
target
feature
insurance application
Prior art date
Application number
PCT/CN2021/090569
Other languages
French (fr)
Chinese (zh)
Inventor
郝晓丽
陈吕
张乐婷
洪霞
袁丽乔
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2022121217A1 publication Critical patent/WO2022121217A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/08Insurance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis

Definitions

  • the present application relates to the technical field of artificial intelligence, and in particular, to a quota prediction method, device, and computer-readable storage medium.
  • the main purpose of the present application is to provide an amount prediction method, device and computer-readable storage medium, which aims to solve the technical problem of low accuracy in predicting the insurance amount when adding insurance.
  • the present application provides a quota prediction method
  • the quota prediction method includes: obtaining the insurance application information to be evaluated of the target insurance applicant, and obtaining the previous insurance application information of the target insurance applicant according to the insurance application information to be evaluated; Type of insurance application this time; select a pre-trained risk assessment model corresponding to the type of insurance application this time, use the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combine the text features with
  • the target feature is obtained by combining the digital features, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm; the risk level of the insurance application information to be evaluated is determined according to the target feature, and the When the risk level is lower than high risk, obtain the limit adjustment factor of the insurance application information to be evaluated; combine the previous insurance application information and the limit adjustment factor to predict the insurance addition limit information of the target insurance applicant.
  • the present application also provides a quota prediction device
  • the quota prediction device includes: a past information acquisition module for acquiring the insurance application information to be evaluated of the target insurance applicant, and obtaining the insurance application information to be evaluated according to the to-be-evaluated insurance application information The previous insurance application information of the target applicant and the type of this insurance application; the target feature acquisition module is used to select a pre-trained risk assessment model corresponding to the current insurance application type, and use the risk assessment model to extract the to-be-evaluated The text features and digital features in the insurance application information, and the target features are obtained by combining the text features and digital features, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm; risk level A determination module, configured to determine the risk level of the insurance application information to be evaluated according to the target characteristics, and obtain the quota adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than high risk; add insurance The limit prediction
  • the present application also provides a quota prediction device, the quota prediction device includes a processor, a memory, and a quota prediction program stored on the memory and executable by the processor, wherein the When the above-mentioned quota prediction program is executed by the processor, the following method is implemented: obtaining the insurance application information to be evaluated of the target insurance applicant, and obtaining the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information; Selecting the pre-trained risk assessment model corresponding to the type of insurance this time, using the risk assessment model to extract the text features and numerical features in the insurance application information to be assessed, and combining the text features and numerical features to obtain a target feature, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm; the risk level of the insurance application information to be evaluated is determined according to the target feature, and the risk level is detected when the risk level is detected.
  • the risk assessment model is obtained by using the training
  • the present application also provides a computer-readable storage medium, where a quota prediction program is stored on the computer-readable storage medium, wherein when the quota prediction program is executed by a processor, the following method is implemented: obtaining The insurance application information to be evaluated of the target insurance applicant, and the previous insurance application information and the current insurance application type of the target insurance applicant are obtained according to the to-be-evaluated insurance application information; the pre-trained risk assessment model corresponding to the current insurance application type is selected, Use the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combine the text features and numerical features to obtain target features, wherein the risk assessment model is a training program selected by feature engineering
  • the data set is obtained by training the machine learning algorithm; the risk level of the insurance application information to be evaluated is determined according to the target characteristics, and when it is detected that the risk level is lower than high risk, the quota adjustment of the insurance application information to be evaluated is obtained factor; combining the previous insurance application information and the limit adjustment factor
  • the present application realizes that the insurable amount information consistent with the actual situation can be obtained at the time of applying for insurance, thereby solving the technical problem of low accuracy in predicting the insured amount when applying for insurance. At the same time, it also saves the customer from being disturbed by the insurance salesman who does not know the amount of insurance that can be added to him, such as physical examination and coordination.
  • FIG. 1 is a schematic diagram of the hardware structure of the quota prediction device involved in the solution of the embodiment of the present application.
  • FIG. 2 is a schematic flowchart of the first embodiment of the quota prediction method of the application.
  • FIG. 3 is a schematic diagram of functional modules of the quota prediction device of the present application.
  • the technical solution of the present application relates to the field of artificial intelligence technology, and can be applied to scenarios such as financial technology such as insurance limit prediction, so as to promote the construction of smart cities.
  • the data involved in this application such as insurance application information and/or insurance limit information, may be stored in a database, or may be stored in a blockchain, which is not limited in this application.
  • the quota prediction method involved in the embodiment of the present application is mainly applied to a quota prediction device, and the quota prediction device may be a device with display and processing functions, such as a PC, a portable computer, and a mobile terminal.
  • FIG. 1 is a schematic diagram of the hardware structure of the quota prediction device involved in the solution of the embodiment of the present application.
  • the quota prediction device may include a processor 1001 (for example, a CPU), a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 .
  • the communication bus 1002 is used to realize the connection and communication between these components;
  • the user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard);
  • the network interface 1004 may optionally include a standard wired interface, a wireless interface (such as a WI-FI interface);
  • the memory 1005 can be a high-speed RAM memory, or a stable memory (non-volatile memory), such as a disk memory, and the memory 1005 can optionally be a storage device independent of the aforementioned processor 1001 .
  • FIG. 1 does not constitute a limitation on the quota prediction device, and may include more or less components than the one shown, or combine some components, or arrange different components.
  • the memory 1005 as a computer-readable storage medium in FIG. 1 may include an operating system, a network communication module, and a quota prediction program.
  • the network communication module is mainly used to connect to the server and perform data communication with the server; and the processor 1001 can call the quota prediction program stored in the memory 1005 and execute the quota prediction method provided by the embodiment of the present application.
  • the present application provides a quota prediction method, that is, by obtaining the customer's insurance application information to obtain their previous insurance application information, using a pre-trained risk assessment model to determine the corresponding risk level, and only when the detected risk level is in the Subsequent reinsurance limit estimation is carried out only when it is within a certain range (below the high risk level), which avoids the negative impact that may be caused by customers with higher risk levels continuing to apply for insurance.
  • the evaluation avoids the evaluation error caused by different types of insurance; the risk evaluation model is obtained by using feature engineering and machine learning training, and the target characteristics of the insurance information to be evaluated are obtained by using the risk evaluation model, so that the risk evaluation model finally obtained by the risk evaluation model
  • the results are more accurate; by synthesizing the previous insurance information and the limit adjustment factor, the customer's current insured amount can be calculated intelligently, and the insurable amount information that is consistent with the actual situation can be obtained at the time of the current insurance application, so as to solve the problem. It solves the technical problem of low accuracy in predicting the insurance amount when adding insurance. At the same time, it also saves the customer from being disturbed by the insurance salesman who does not know the amount of insurance that can be added to him, such as physical examination and coordination.
  • FIG. 2 is a schematic flowchart of the first embodiment of the quota prediction method of the present application.
  • the first embodiment of the present application provides a quota prediction method, and the quota prediction method includes the following steps.
  • Step S10 obtaining the insurance application information to be evaluated of the target insurance applicant, and obtaining the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information.
  • the target insurance applicant that is, the insurance application user
  • the insurance application information to be evaluated may include the identity information of the insured user, the type of insurance this time, the amount of this insurance application, the date of application for insurance, and information about personal and family medical history.
  • the previous insurance application information is the relevant information that the insured user has applied for insurance before this application, which may specifically include the historical insurance type, historical insurance amount, historical insurance time, etc.
  • the previous acquisition method of insurance application information can be obtained directly from the information to be evaluated, or can be accessed to a database storing the user's insurance application information according to the personal identity information in the insurance application information to be evaluated.
  • the type of insurance applied for this time can be the actual type of insurance such as life insurance type, critical illness insurance type, etc.
  • the server receives the questionnaire data, uses the questionnaire data as the above-mentioned insurance application information to be evaluated, and then accesses the insurance application within the company.
  • Information database query to obtain the previous insurance information of the insured user and the type of insurance purchased this time.
  • Step S20 select a pre-trained risk assessment model corresponding to the current insurance application type, use the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combine the text features and numerical features.
  • the target features are obtained by combining, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm.
  • a risk assessment model corresponding to common insurance types on the market is pre-trained on the server.
  • the server obtains the current insurance type, the corresponding risk assessment model can be directly determined according to the keyword of the insurance type.
  • Different types of risk assessment models are trained with corresponding types of sample data.
  • the server can collect a large amount of real sample data of related businesses such as insurance business from various channels, add labels to it manually or in other ways, and then use feature engineering to filter out a more effective part of the sample data as the final
  • use machine learning algorithms such as decision trees, clustering, deep neural networks, or XGBoost and other model algorithms to train the selected sample data, and finally obtain a trained risk assessment model.
  • the server takes the insurance application information to be evaluated as the input of the model, and the model first extracts the text information and digital information filled in by the user in the insurance application information to be evaluated.
  • Text information such as family medical history, place of residence, etc.; digital information such as age, income, etc.
  • the model can encode text information to make it a discrete numerical feature (text feature), and then perform some numerical validity processing on the digital information to obtain digital features. After the model obtains the text features and digital features, the features with high correlation are combined, and the combined features and the original features are combined to obtain the final target features.
  • Step S30 Determine the risk level of the insurance application information to be evaluated according to the target feature, and obtain a quota adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than high risk.
  • a scoring criterion for multi-dimensional risk control indicators is set in the preset risk assessment model.
  • Risk levels can be divided according to actual needs, such as low risk level, medium risk level and high risk level. Very high risk level, which is not limited in this embodiment.
  • the quota adjustment factor may be related adjustment parameters, regional development levels, corresponding upward adjustment strategies for lower risk levels, and/or salesperson quality, etc.
  • the risk level determination method can be that the model obtains the risk score corresponding to each risk control index of the current insurance application information to be evaluated, and then determines the specific risk level according to the interval in which the risk score falls.
  • the model maps the currently obtained target characteristics to each risk control index, evaluates the questionnaire data based on each risk control index, and outputs the final risk level of the insured user this time.
  • the server obtains the quota adjustment factor corresponding to the sub-questionnaire data when the risk level of the current insured user is low risk, medium risk and other non-high risk levels and above.
  • Step S40 combining the previous insurance application information and the limit adjustment factor, predict the insurance addition limit information of the target insurance applicant.
  • the added insurance limit information may include the name of the insurance application, the code number, the range of the added insurance limit, and the like.
  • the calculation method of the insurable amount can be obtained by first obtaining the original insurable amount corresponding to the user, then subtracting the insured amount from the original insurable amount, that is, the previous insured amount above, to obtain the remaining insurable amount, and finally adjusting the amount. The factor continues to adjust the remaining insurable amount, and the final value obtained is the maximum insurable amount, and the insured amount range is from 0 to the maximum insurable amount.
  • the method may further include: generating a prompt message including the information on the added insurance amount, and sending the prompt message to the user terminal of the target insurance applicant. After the server calculates the reinsurable limit of the insured user's insurance this time, a notification message can be generated separately or displayed synchronously on the self-check result feedback interface, so that the insured user can know the current reinsurable limit.
  • the risk level output by the model is a very low risk level.
  • the server can first obtain the user's standard maximum life insurance amount in life insurance according to general standards, and then adjust the standard maximum insurance amount according to the regional examination-free standard in the region where the user is located to obtain the user's maximum life insurance amount applicable to the region. Subtract the previous life insurance coverage to get the remaining life insurance coverage that can be insured. After the server multiplies the remaining amount of life insurance that can be insured by the preset adjustment coefficient corresponding to life insurance, and then adds the upward limit of the quality level of the salesperson corresponding to the insurance policy for this order and the upward limit corresponding to the extremely low risk, the final result is obtained.
  • the limit value is the maximum limit of the user's current life insurance coverage.
  • the application obtains the insurance application information to be evaluated of the target insurance applicant, and obtains the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information;
  • the evaluation model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm; the risk level of the insurance application information to be evaluated is determined according to the target feature, and when it is detected that the risk level is a level below high risk, Obtain the limit adjustment factor of the insurance application information to be evaluated; combine the previous insurance application information and the limit adjustment factor to predict the insurance addition limit information of the target insurance applicant.
  • the application obtains the customer's insurance application information by obtaining their previous insurance application information, uses the pre-trained risk assessment model to determine the corresponding risk level, and only detects that the risk level is within a certain range (below the high risk level)
  • the follow-up insurance limit estimation is carried out only at the time of day, which avoids the negative impact that may be caused by customers with higher risk levels continuing to apply for insurance.
  • the risk assessment model is obtained by using feature engineering and machine learning training, and the target characteristics of the insurance information to be assessed are obtained by using the risk assessment model, so that the final risk assessment result obtained by the risk assessment model is more accurate; by synthesizing the previous insurance information It can intelligently calculate the insurance amount that the customer can currently insure with the amount adjustment factor, so that the insurable amount information that is consistent with the actual situation can be obtained when the insurance is applied, thus solving the problem of predicting the insurance amount when adding insurance. technical issues with low accuracy. At the same time, it also saves the customer from being disturbed by the insurance salesman who does not know the amount of insurance that can be added to him, such as physical examination and coordination.
  • step S20 includes: based on the risk assessment model, acquiring text information and numerical information in the insurance application information to be evaluated; processing the text information by one-hot encoding to obtain the text features, and converting the text information into The digital information is subjected to missing value processing and dense processing to obtain the digital feature; the mutual information value of the text feature and the digital feature is obtained, and the text feature and the digital feature are divided into two groups based on the mutual information value.
  • one-hot encoding is One-Hot-coding, also known as one-bit effective encoding.
  • the method is to use an N-bit state register to encode N states, and each state has its independent register bit. And at any time, only one of them is valid.
  • the model performs one-hot encoding on the text information extracted from the insurance application information to be evaluated to convert it into discrete numerical features, and it can continue to be densely processed to further reduce the amount of data.
  • the model then performs missing value processing on the digital information extracted from the insurance information to be evaluated, such as special value filling, average filling, hot card filling, and expected value maximization.
  • the model is densely processed, such as compressing sparse rows and columns, using Principal Component Analysis (PCA, Principal Components Analysis), Singular Value Decomposition (SVD, Singular Value Decomposition) and other methods for dimensionality reduction.
  • PCA Principal Component Analysis
  • SVD Singular Value Decomposition
  • other methods for dimensionality reduction The model needs to calculate the mutual information value between the features, and the mutual information value represents the degree of correlation between the features, and the specific calculation method can refer to the prior art, which will not be repeated here.
  • the model calculates the mutual information value between the features, it can be compared with the preset standard mutual information threshold, and the features exceeding the threshold can be combined to obtain the combined feature, and finally the combined feature and the uncombined feature can be combined. common as the target feature.
  • the step of acquiring the quota adjustment factor of the insurance application information to be assessed includes: when the risk level is below high risk, judging the target Whether the insured has passed the self-check; if the target insured has passed the self-check, obtain the regional medical examination-exemption standard, the insurance salesperson level and the insurance type adjustment coefficient of the insurance application type corresponding to the insurance application information to be evaluated as the Quota adjustment factor.
  • the server can access the relevant government platform in the region to search, or use the specific extracted name as a keyword to search in the database storing the information on the medical examination exemption standard in each region, so as to Obtain the latest information on local exemption standards for medical examinations; for insurance salesperson levels, the server can directly access the information database within the insurance company, and search for the corresponding level information according to the specific insurance salesperson's name, employee number and other information; for insurance type adjustment The server can also directly access the internal information database of the insurance company, and find the corresponding insurance type adjustment coefficient in the database according to the name of the specific insurance type purchased this time.
  • the server detects that the currently obtained risk level is not high risk or above, it further determines whether the insured user has passed the self-check. If the insured user passes the self-check, the server will determine the highest medical-exemption insurance amount in the region according to the region information filled in by the user in the questionnaire, the quality level of the salesperson responsible for the insurance application for this order, and the type of insurance purchased this time; If the user fails to pass the self-check, it will enter the regular insurance application process, and the additional insurance amount will not be automatically calculated and displayed for the insured user.
  • step S30 includes: determining the maximum medical-exemption insurance amount according to the regional medical-exemption standard, and obtaining an initial insurance increase amount in combination with the previous insurance application information; using the insurance type adjustment coefficient to carry out the initial insurance increase amount. Adjustment to obtain the adjusted insurance amount; obtain the level adjustment amount corresponding to the insurance salesperson's level, and use the level adjustment amount to adjust the adjusted insurance amount to obtain the target insurance applicant's insurance amount information , wherein the information on the added insurance amount of the target insured is stored in the blockchain.
  • the following describes the calculation formula of the insurable amount of life insurance and the calculation formula of the insurable amount of critical illness insurance as examples.
  • the server For the calculation of the insured amount of life insurance that can be added, the server needs to call the highest medical examination-free life insurance insurance amount applicable to the insured customer in the region where the user is located.
  • the sum of the sum insured for the physical examination of the life insurance is called C, and the calculation formula for the additional insured amount of the initial life insurance can be set as (A-C)*X1, where X1 is the life insurance coefficient, which can be flexibly set according to actual needs.
  • the server For the calculation of the insured amount of critical illness insurance, the server needs to call the highest medical-exemption critical illness insurance amount applicable to the insured customer in the area where the user is located.
  • the sum of the historical risk insured amount and the critical illness medical insurance amount is called D.
  • the coefficient can be set flexibly according to actual needs, which can be set to be the same as X1 or different.
  • the initial value can be adjusted slightly according to the level of the salesperson responsible for the insurance business of this order to obtain the final reinsurable amount. Relevant personnel can pre-set different increase quotas corresponding to different salesperson levels on the server.
  • the above-mentioned target insurance applicant's insurance coverage information can also be stored in a blockchain node.
  • step S10 includes: when an insurance application instruction is received, acquiring an insurance application questionnaire of the target insurance applicant based on the insurance application instruction as the insurance application information to be evaluated; acquiring the identity filled in by the target insurance applicant in the insurance application questionnaire Information and insurance name information, determine the type of insurance this time according to the insurance name information, and determine whether the target insurance applicant is an insured user according to the identity information; The past insurance application information of the target insurance applicant is found in the information database.
  • an insurance application user usually fills in a user insurance application questionnaire on a personal terminal, a counter terminal of an insurance company, or other terminal equipment.
  • the questionnaire may contain questions about the user's personal information and questions about the setting of risk control indicators set by the insurance company according to actual needs.
  • the identity information may specifically include name, age, ID number, place of residence, etc.
  • the server receives an insurance application instruction sent by the insurance application user, obtains the questionnaire information filled in by the user pointed to in the insurance application instruction, and uses it as the above-mentioned insurance application information to be evaluated.
  • the server After receiving the user's insurance application questionnaire data sent by the user's questionnaire filling terminal, the server extracts the question option data submitted by each questionnaire question user, and extracts the information that can indicate the identity of the insurance applicant as a retrieval keyword in the insurance application information database. Query, inquire whether there is a previous insurance application record of the insured user in the database, and obtain the record when there is a previous insurance application record in the information database. At the same time, the server can extract the name of the insurance that the user has filled in or selected in the questionnaire, and use it as the insurance type of this insurance. In addition, it can also be converted into a unique number for storage for subsequent data processing.
  • the step of judging whether the target insured is an insured user according to the identity information further includes: if the target insured is not an insured user, executing the determination according to the target feature.
  • the risk level of the insurance application information to be evaluated and when it is detected that the risk level is lower than the high risk level, the step of obtaining the quota adjustment factor of the insurance application information to be evaluated; information on the added insurance amount of the target policyholder.
  • the server does not query the previous insurance application information of the current insurance application user in the insurance application information database, that is, the current insurance application user is a new insurance application user, it continues to input the question answer information in the insurance application questionnaire of the new insurance application user.
  • the risk assessment model determines the risk level of the current new insured users through the risk assessment model. If the server determines through the model that the risk level of the current new insured user is below a high risk level, such as a medium risk level, a low risk level, or a very low risk level, the quota adjustment factor corresponding to the questionnaire information is obtained.
  • the server calculates the reinsurable limit for newly insured users.
  • the specific calculation method can be as follows: multiply the maximum amount of medical examination-free life insurance for new insured users by the adjustment coefficient of life insurance, and add the corresponding increase amount corresponding to the level of the salesperson responsible for this insurance, and finally get The result is the life insurance reinsurable limit of the new insured user. If the server determines through the model that the risk level of the current new insured user is high risk level, extremely high risk level, or higher, the subsequent steps will not be performed, and the routine self-checking process will be turned to.
  • the step of determining the risk level of the insurance application information to be evaluated based on the preset risk evaluation model includes: calculating a risk score corresponding to the target feature on the preset risk control index; The rule determines the risk level corresponding to the risk score as the risk level of the insurance application information to be evaluated.
  • the related issues of the risk control indicators may include related indicators of the diseases involved in the insurance product insured by the user this time, and the like.
  • the diseases involved in the insurance product insured by the user this time and the like.
  • physical health index, dietary health, exercise, smoking, drinking, stress and family medical history, etc. can also be flexibly set according to specific risk control needs.
  • Each questionnaire question can be set with multiple answer options, and different answer options correspond to corresponding option scores.
  • the model maps the currently obtained multiple target features to each risk control index one by one, and then marks the risk score of each target feature on the risk control index according to the preset corresponding rules, and then sets the corresponding risk control index according to each risk control index.
  • the weight coefficient of the insured user is calculated to calculate the total risk score of the insured user this time. After the model calculates and obtains the final total risk score, it can locate the risk level of the risk score according to the preset score level threshold.
  • a score of 0-20 is set as a very low risk level; a score of 21-40 is set as a low risk level; a score of 41-60 is set as a medium risk level; a score of 61-80 is set as a high risk level; 81- A score of 100 is set as a very high risk level.
  • step S20 it also includes: collecting an initial sample data set including multiple pieces of sample data, wherein each piece of sample data includes sample features and corresponding sample labels; constructing a feature index according to the sample features and the sample labels set, and calculate the highest discrimination accuracy rate of the feature index set; compare the highest discrimination accuracy rate with the preset standard accuracy rate threshold, and compare the sample corresponding to the highest discrimination accuracy rate not less than the standard accuracy rate threshold
  • the data is used as target sample data to filter out the target sample data set; the target sample data set is trained by using a preset machine learning model to obtain the risk assessment model.
  • the feature engineering may be the maximum correlation minimum redundancy combined with the maximum mutual information coefficient feature selection strategy.
  • the sample label is used to indicate the user's risk score.
  • the server first calculates the maximum mutual information coefficient between the sample feature in each piece of sample data and its sample label, then constructs a feature index set, and calculates the discrimination accuracy rate of one feature index set per day, and selects the item with the highest value and the Preset standard accuracy thresholds for comparison.
  • the server includes the sample data corresponding to the discrimination accuracy rate greater than or equal to the threshold into the target sample data set.
  • the server uses decision tree, clustering, deep neural network, or XGBoost and other model algorithms to train it, and finally trains to obtain the final risk assessment model.
  • the blockchain referred to in this application is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • risk control indicators to quantify the risk scores of insured customers, and then determine their corresponding risk levels, so that the model can quickly assess the risk levels of insured customers; , which reduces the data processing burden of the device and improves the efficiency and accuracy of model training.
  • the present application also provides a limit prediction device
  • the limit prediction device includes: a past information acquisition module 10 for obtaining the insurance application information to be evaluated of the target insured, and according to The to-be-evaluated insurance application information obtains the target insurance applicant's previous insurance application information and the current insurance application type; the target feature acquisition module 20 is used to select a pre-trained risk assessment model corresponding to the current insurance application type, using the The risk assessment model extracts the text features and numerical features in the insurance application information to be assessed, and combines the text features and numerical features to obtain the target features, wherein the risk assessment model uses the training data set screened by feature engineering to pass through.
  • the risk level determination module 30 is configured to determine the risk level of the insurance application information to be evaluated according to the target feature, and obtain the to-be-evaluated risk level when it is detected that the risk level is below high risk
  • the amount adjustment factor of the insurance application information is configured to combine the previous insurance application information and the amount adjustment factor to predict the insurance addition amount information of the target insurance applicant.
  • the present application also provides a quota prediction device.
  • the credit prediction device includes a processor, a memory, and a credit prediction program stored on the memory and executable on the processor, wherein the credit prediction program, when executed by the processor, implements the above-mentioned The steps of the quota prediction method.
  • embodiments of the present application further provide a computer-readable storage medium.
  • a quota prediction program is stored on the computer-readable storage medium of the present application, wherein when the quota prediction program is executed by the processor, the steps of the above quota prediction method are implemented.
  • the storage medium involved in this application such as a computer-readable storage medium, may be non-volatile or volatile.
  • the method of the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course can also be implemented by hardware, but in many cases the former is better implementation.
  • the technical solutions of the present application can be embodied in the form of software products in essence or the parts that make contributions to the prior art.
  • the computer software products are stored in a storage medium (such as ROM/RAM) as described above. , magnetic disk, optical disk), including several instructions to make a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present application.

Abstract

A quota prediction method and device, and a computer-readable storage medium, which relate to classification model and blockchain technology. According to the method, a follow-up insurance quota estimation is performed only when a risk level is detected to be within a certain range, so as to prevent negative impact that can be caused by continually insuring customers of a high risk level; by means of selecting a risk assessment model corresponding to a current insurance type for risk assessment, an assessment error caused by different insurance types is avoided; the risk assessment model is obtained by means of using feature engineering and machine learning training, and target features are obtained by using the risk assessment model, so that risk assessment results finally obtained by the risk assessment model are more accurate; by means of synthesizing previous insurance information and quota adjustment factors to predict a coverage range that customers can currently underwrite, insurance quota information consistent with the actual situation can be obtained at the time of the current insurance, wherein the previous insurance information can be stored in a blockchain.

Description

额度预测方法、设备及计算机可读存储介质Quota prediction method, device and computer-readable storage medium
本申请要求于2020年12月7日提交中国专利局、申请号为202011420120.9,发明名称为“额度预测方法、设备及计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application with an application number of 202011420120.9 and an invention title of "Amount Prediction Method, Device and Computer-readable Storage Medium" filed with the China Patent Office on December 7, 2020, the entire contents of which are incorporated by reference in this application.
技术领域technical field
本申请涉及人工智能技术领域,尤其涉及一种额度预测方法、设备及计算机可读存储介质。The present application relates to the technical field of artificial intelligence, and in particular, to a quota prediction method, device, and computer-readable storage medium.
背景技术Background technique
在保险行业中,针对客户投保,业内的通行做法是,客户填写投保信息后,公司经过审核给予承保或不予承保的决定。但发明人意识到,无论是客户还是负责保险销售的代理人,在本单投保之后,都难以直接获知准确的可继续投保的额度信息。客户本人或是代理人考虑到若是再次投保的额度过大,会面临保险公司退保、拒保的风险,在再次投保时往往会比较保守,投入较小的额度以期能投保成功,但这就容易出现客户再次购买的保额不足以满足其保障需求的情况,从而导致了加保时对于投保额度进行预测的准确性低下的技术问题。In the insurance industry, for customers to apply for insurance, the common practice in the industry is that after the customer fills in the insurance application information, the company makes a decision to underwrite or not to underwrite after review. However, the inventor realized that it is difficult for the customer or the agent in charge of insurance sales to directly obtain the accurate information on the amount of insurance that can continue to be insured after this policy is insured. Considering that if the amount of re-insurance is too large, the customer or the agent will face the risk of surrender or refusal of insurance by the insurance company. When re-insuring, they tend to be more conservative and invest a smaller amount in order to be successful in insurance. It is easy to happen that the amount of insurance that the customer purchases again is not enough to meet their protection needs, which leads to the technical problem of low accuracy in predicting the amount of insurance when adding insurance.
技术问题technical problem
本申请的主要目的在于提供一种额度预测方法、设备及计算机可读存储介质,旨在解决加保时对于投保额度进行预测的准确性低下的技术问题。The main purpose of the present application is to provide an amount prediction method, device and computer-readable storage medium, which aims to solve the technical problem of low accuracy in predicting the insurance amount when adding insurance.
技术解决方案technical solutions
为实现上述目的,本申请提供一种额度预测方法,所述额度预测方法包括:获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。In order to achieve the above purpose, the present application provides a quota prediction method, the quota prediction method includes: obtaining the insurance application information to be evaluated of the target insurance applicant, and obtaining the previous insurance application information of the target insurance applicant according to the insurance application information to be evaluated; Type of insurance application this time; select a pre-trained risk assessment model corresponding to the type of insurance application this time, use the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combine the text features with The target feature is obtained by combining the digital features, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm; the risk level of the insurance application information to be evaluated is determined according to the target feature, and the When the risk level is lower than high risk, obtain the limit adjustment factor of the insurance application information to be evaluated; combine the previous insurance application information and the limit adjustment factor to predict the insurance addition limit information of the target insurance applicant.
此外,为实现上述目的,本申请还提供一种额度预测装置,所述额度预测装置包括:既往信息获取模块,用于获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;目标特征获取模块,用于选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;风险等级确定模块,用于根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;加保额度预测模块,用于结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。In addition, in order to achieve the above purpose, the present application also provides a quota prediction device, the quota prediction device includes: a past information acquisition module for acquiring the insurance application information to be evaluated of the target insurance applicant, and obtaining the insurance application information to be evaluated according to the to-be-evaluated insurance application information The previous insurance application information of the target applicant and the type of this insurance application; the target feature acquisition module is used to select a pre-trained risk assessment model corresponding to the current insurance application type, and use the risk assessment model to extract the to-be-evaluated The text features and digital features in the insurance application information, and the target features are obtained by combining the text features and digital features, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm; risk level A determination module, configured to determine the risk level of the insurance application information to be evaluated according to the target characteristics, and obtain the quota adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than high risk; add insurance The limit prediction module is used for predicting the added insurance limit information of the target insured in combination with the previous insurance application information and the limit adjustment factor.
此外,为实现上述目的,本申请还提供一种额度预测设备,所述额度预测设备包括处理器、存储器、以及存储在所述存储器上并可被所述处理器执行的额度预测程序,其中所述额度预测程序被所述处理器执行时,实现以下方法:获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。In addition, in order to achieve the above object, the present application also provides a quota prediction device, the quota prediction device includes a processor, a memory, and a quota prediction program stored on the memory and executable by the processor, wherein the When the above-mentioned quota prediction program is executed by the processor, the following method is implemented: obtaining the insurance application information to be evaluated of the target insurance applicant, and obtaining the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information; Selecting the pre-trained risk assessment model corresponding to the type of insurance this time, using the risk assessment model to extract the text features and numerical features in the insurance application information to be assessed, and combining the text features and numerical features to obtain a target feature, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm; the risk level of the insurance application information to be evaluated is determined according to the target feature, and the risk level is detected when the risk level is detected. When the level is below high risk, obtain the limit adjustment factor of the insurance application information to be evaluated; combine the previous insurance application information and the limit adjustment factor to predict the insurance addition limit information of the target insured.
此外,为实现上述目的,本申请还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有额度预测程序,其中所述额度预测程序被处理器执行时,实现以下方法:获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。In addition, in order to achieve the above purpose, the present application also provides a computer-readable storage medium, where a quota prediction program is stored on the computer-readable storage medium, wherein when the quota prediction program is executed by a processor, the following method is implemented: obtaining The insurance application information to be evaluated of the target insurance applicant, and the previous insurance application information and the current insurance application type of the target insurance applicant are obtained according to the to-be-evaluated insurance application information; the pre-trained risk assessment model corresponding to the current insurance application type is selected, Use the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combine the text features and numerical features to obtain target features, wherein the risk assessment model is a training program selected by feature engineering The data set is obtained by training the machine learning algorithm; the risk level of the insurance application information to be evaluated is determined according to the target characteristics, and when it is detected that the risk level is lower than high risk, the quota adjustment of the insurance application information to be evaluated is obtained factor; combining the previous insurance application information and the limit adjustment factor, predict the insurance addition limit information of the target insured.
有益效果beneficial effect
本申请实现了在当次投保时就能获取到与实际情况相符的可加保额度信息,从而解决了加保时对于投保额度进行预测的准确性低下的技术问题。同时,也使得客户免于保险业务员由于不知其可加保额度而向其进行的体检、契调等的业务打扰。The present application realizes that the insurable amount information consistent with the actual situation can be obtained at the time of applying for insurance, thereby solving the technical problem of low accuracy in predicting the insured amount when applying for insurance. At the same time, it also saves the customer from being disturbed by the insurance salesman who does not know the amount of insurance that can be added to him, such as physical examination and coordination.
附图说明Description of drawings
图1为本申请实施例方案中涉及的额度预测设备的硬件结构示意图。FIG. 1 is a schematic diagram of the hardware structure of the quota prediction device involved in the solution of the embodiment of the present application.
图2为本申请额度预测方法第一实施例的流程示意图。FIG. 2 is a schematic flowchart of the first embodiment of the quota prediction method of the application.
图3为本申请额度预测装置的功能模块示意图。FIG. 3 is a schematic diagram of functional modules of the quota prediction device of the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics and advantages of the purpose of the present application will be further described with reference to the accompanying drawings in conjunction with the embodiments.
本发明的实施方式Embodiments of the present invention
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.
本申请的技术方案涉及人工智能技术领域,可应用于金融科技如保险额度预测等场景中,以推动智慧城市的建设。可选的,本申请涉及的数据如投保信息和/或加保额度信息等可存储于数据库中,或者可以存储于区块链中,本申请不做限定。The technical solution of the present application relates to the field of artificial intelligence technology, and can be applied to scenarios such as financial technology such as insurance limit prediction, so as to promote the construction of smart cities. Optionally, the data involved in this application, such as insurance application information and/or insurance limit information, may be stored in a database, or may be stored in a blockchain, which is not limited in this application.
本申请实施例涉及的额度预测方法主要应用于额度预测设备,该额度预测设备可以是PC、便携计算机、移动终端等具有显示和处理功能的设备。The quota prediction method involved in the embodiment of the present application is mainly applied to a quota prediction device, and the quota prediction device may be a device with display and processing functions, such as a PC, a portable computer, and a mobile terminal.
参照图1,图1为本申请实施例方案中涉及的额度预测设备的硬件结构示意图。本申请实施例中,额度预测设备可以包括处理器1001(例如CPU),通信总线1002,用户接口1003,网络接口1004,存储器1005。其中,通信总线1002用于实现这些组件之间的连接通信;用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard);网络接口1004可选的可以包括标准的有线接口、无线接口(如WI-FI接口);存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器,存储器1005可选的还可以是独立于前述处理器1001的存储装置。Referring to FIG. 1 , FIG. 1 is a schematic diagram of the hardware structure of the quota prediction device involved in the solution of the embodiment of the present application. In this embodiment of the present application, the quota prediction device may include a processor 1001 (for example, a CPU), a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 . Wherein, the communication bus 1002 is used to realize the connection and communication between these components; the user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard); the network interface 1004 may optionally include a standard wired interface, a wireless interface (such as a WI-FI interface); the memory 1005 can be a high-speed RAM memory, or a stable memory (non-volatile memory), such as a disk memory, and the memory 1005 can optionally be a storage device independent of the aforementioned processor 1001 .
本领域技术人员可以理解,图1中示出的硬件结构并不构成对额度预测设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the hardware structure shown in FIG. 1 does not constitute a limitation on the quota prediction device, and may include more or less components than the one shown, or combine some components, or arrange different components.
继续参照图1,图1中作为一种计算机可读存储介质的存储器1005可以包括操作系统、网络通信模块以及额度预测程序。Continuing to refer to FIG. 1 , the memory 1005 as a computer-readable storage medium in FIG. 1 may include an operating system, a network communication module, and a quota prediction program.
在图1中,网络通信模块主要用于连接服务器,与服务器进行数据通信;而处理器1001可以调用存储器1005中存储的额度预测程序,并执行本申请实施例提供的额度预测方法。In FIG. 1 , the network communication module is mainly used to connect to the server and perform data communication with the server; and the processor 1001 can call the quota prediction program stored in the memory 1005 and execute the quota prediction method provided by the embodiment of the present application.
基于上述硬件结构,提出本申请额度预测方法的各个实施例。Based on the above hardware structure, various embodiments of the quota prediction method of the present application are proposed.
在保险行业中,针对客户投保,业内的通行做法是,客户填写投保信息后,公司经过审核给予承保或不予承保的决定。但客户无法获知自身还可购买的保障额度。而同时在保险销售过程中,代理人也存在类似困惑,对于老客户,不清楚老客户已有多少保障,还可购买多少保障;对于新客户,也不确定该为新客户匹配多少保额。而又由于购买过高的保额又会被核保限额,因此代理人在设计保险计划时,通常都较为保守,这就导致了较多客户最终所购买的保额不足以满足其保障需求。上述种种情况均反映出目前对于加保额度信息的获取效率低下的技术问题。In the insurance industry, for customers to apply for insurance, the common practice in the industry is that after the customer fills in the insurance application information, the company makes a decision to underwrite or not to underwrite after review. However, customers cannot know the amount of coverage they can still purchase. At the same time, in the process of insurance sales, agents are also confused. For old customers, they do not know how much insurance they have and how much insurance they can buy; for new customers, they are not sure how much insurance should be matched for new customers. In addition, because the purchase of an excessively high amount of insurance will be subject to an underwriting limit, agents are usually conservative when designing insurance plans, which leads to the fact that the amount of insurance purchased by many customers is not enough to meet their protection needs. All of the above situations reflect the current technical problem of low efficiency in obtaining information on the added insurance amount.
为解决上述问题,本申请提供一种额度预测方法,即通过获取客户的投保信息得到其既往投保信息,使用了预训练的风险评估模型确定出对应的风险等级,并仅在检测到风险等级在一定范围内(高风险等级以下)时才进行后续的加保额度估算,避免了风险等级较高客户继续投保所可能造成的负面影响;通过选择与本次投保类型相对应的风险评估模型进行风险评估,避免了投保类型不同所造成的评估误差;通过运用特征工程与机器学习训练得到风险评估模型,并利用风险评估模型得到待评估投保信息的目标特征,使得风险评估模型最终得出的风险评估结果更加精准;通过综合既往投保信息与额度调节因子智能运算出客户当前所能承保的保额范围,实现了在当次投保时就能获取到与实际情况相符的可加保额度信息,从而解决了加保时对于投保额度进行预测的准确性低下的技术问题。同时,也使得客户免于保险业务员由于不知其可加保额度而向其进行的体检、契调等的业务打扰。In order to solve the above-mentioned problems, the present application provides a quota prediction method, that is, by obtaining the customer's insurance application information to obtain their previous insurance application information, using a pre-trained risk assessment model to determine the corresponding risk level, and only when the detected risk level is in the Subsequent reinsurance limit estimation is carried out only when it is within a certain range (below the high risk level), which avoids the negative impact that may be caused by customers with higher risk levels continuing to apply for insurance. The evaluation avoids the evaluation error caused by different types of insurance; the risk evaluation model is obtained by using feature engineering and machine learning training, and the target characteristics of the insurance information to be evaluated are obtained by using the risk evaluation model, so that the risk evaluation model finally obtained by the risk evaluation model The results are more accurate; by synthesizing the previous insurance information and the limit adjustment factor, the customer's current insured amount can be calculated intelligently, and the insurable amount information that is consistent with the actual situation can be obtained at the time of the current insurance application, so as to solve the problem. It solves the technical problem of low accuracy in predicting the insurance amount when adding insurance. At the same time, it also saves the customer from being disturbed by the insurance salesman who does not know the amount of insurance that can be added to him, such as physical examination and coordination.
参照图2,图2为本申请额度预测方法第一实施例的流程示意图。Referring to FIG. 2 , FIG. 2 is a schematic flowchart of the first embodiment of the quota prediction method of the present application.
本申请第一实施例提供一种额度预测方法,所述额度预测方法包括以下步骤。The first embodiment of the present application provides a quota prediction method, and the quota prediction method includes the following steps.
步骤S10,获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型。Step S10 , obtaining the insurance application information to be evaluated of the target insurance applicant, and obtaining the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information.
在本实施例中,目标投保人也即是投保用户,可以在个人终端、保险公司的柜台终端或是其他终端设备上填写关于本次投保所需要提供的信息。待评估投保信息中可包括投保用户的身份信息、本次投保种类、本次投保额度、投保申请日期以及关于个人及家族病史等的信息。既往投保信息为投保用户在本次投保之前,已经投保过的相关信息,具体可包括历史投保保险种类、历史投保额度、历史投保时间等。既往投保信息的获取方式可以是从待评估信息中直接获取,也可为根据待评估投保信息中的个人身份信息访问存储有用户投保信息的数据库。本次投保类型可为寿险类型、重疾险类型等实际的险种类型。In this embodiment, the target insurance applicant, that is, the insurance application user, can fill in the information required for this insurance application on a personal terminal, a counter terminal of an insurance company, or other terminal equipment. The insurance application information to be evaluated may include the identity information of the insured user, the type of insurance this time, the amount of this insurance application, the date of application for insurance, and information about personal and family medical history. The previous insurance application information is the relevant information that the insured user has applied for insurance before this application, which may specifically include the historical insurance type, historical insurance amount, historical insurance time, etc. The previous acquisition method of insurance application information can be obtained directly from the information to be evaluated, or can be accessed to a database storing the user's insurance application information according to the personal identity information in the insurance application information to be evaluated. The type of insurance applied for this time can be the actual type of insurance such as life insurance type, critical illness insurance type, etc.
具体地,若当前一用户在保险公司的柜台终端上填写对应的投保问卷,填写完成提交后,服务器接收到该问卷数据,将此问卷数据作为上述待评估投保信息,然后再访问公司内部的投保信息数据库,查询获得该投保用户的既往投保信息与本次所投的保险种类。Specifically, if the current user fills in the corresponding insurance application questionnaire on the counter terminal of the insurance company, after completing the filling and submitting, the server receives the questionnaire data, uses the questionnaire data as the above-mentioned insurance application information to be evaluated, and then accesses the insurance application within the company. Information database, query to obtain the previous insurance information of the insured user and the type of insurance purchased this time.
步骤S20,选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得。Step S20, select a pre-trained risk assessment model corresponding to the current insurance application type, use the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combine the text features and numerical features. The target features are obtained by combining, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm.
在本实施例中,需要说明的是,服务器上预训练有对应与市面上常见保险种类的风险评估模型。在服务器获取到本次投保类型时,可根据投保类型的关键字直接确定到对应的风险评估模型。不同类型的风险评估模型分别采用对应类型的样本数据训练所得。服务器可从各种渠道收集真实的诸如保险业务等相关业务的大量样本数据,并以人工标注或是其他方式为其添加标签,再利用特征工程在其中筛选出更为有效的一部分样本数据作为最终的样本数据,最后使用机器学习算法,例如决策树、聚类、深度神经网络,或是XGBoost等模型算法对筛选出的样本数据进行训练,最终得到训练完成的风险评估模型。In this embodiment, it should be noted that a risk assessment model corresponding to common insurance types on the market is pre-trained on the server. When the server obtains the current insurance type, the corresponding risk assessment model can be directly determined according to the keyword of the insurance type. Different types of risk assessment models are trained with corresponding types of sample data. The server can collect a large amount of real sample data of related businesses such as insurance business from various channels, add labels to it manually or in other ways, and then use feature engineering to filter out a more effective part of the sample data as the final Finally, use machine learning algorithms, such as decision trees, clustering, deep neural networks, or XGBoost and other model algorithms to train the selected sample data, and finally obtain a trained risk assessment model.
服务器将待评估投保信息作为模型的输入,模型先分别提取出待评估投保信息中用户填写的文字信息与数字信息。文字信息例如家族病史、居住地等;数字信息例如年龄、收入等。模型可对文字信息进行编码,使其成为离散的数值特征(文字特征),再将数字信息进行一些数值有效性处理,得到数字特征。模型在得到文字特征和数字特征后,再将相关度高的特征进行组合,结合组合后的特征和原有的特征得到最终的目标特征。The server takes the insurance application information to be evaluated as the input of the model, and the model first extracts the text information and digital information filled in by the user in the insurance application information to be evaluated. Text information such as family medical history, place of residence, etc.; digital information such as age, income, etc. The model can encode text information to make it a discrete numerical feature (text feature), and then perform some numerical validity processing on the digital information to obtain digital features. After the model obtains the text features and digital features, the features with high correlation are combined, and the combined features and the original features are combined to obtain the final target features.
步骤S30,根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子。Step S30: Determine the risk level of the insurance application information to be evaluated according to the target feature, and obtain a quota adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than high risk.
在本实施例中,预设风险评估模型中设置有多维度风控指标的得分判定标准。风险等级可根据实际需求进行划分,例如可划分为低风险等级、中风险等级与高风险等级,也可进一步细化,分为极低风险等级、低风险等级、中风险等级、高风险等级与极高风险等级,本实施例不做限定。额度调节因子可为相关的调节参数、地区发展水平、较低风险等级的对应上调策略和/或业务员品质等。风险等级确定方式可为模型获取当前待评估投保信息在各个风控指标上所对应的风险评分,再根据风险评分所落在的区间确定具体的风险等级。In this embodiment, a scoring criterion for multi-dimensional risk control indicators is set in the preset risk assessment model. Risk levels can be divided according to actual needs, such as low risk level, medium risk level and high risk level. Very high risk level, which is not limited in this embodiment. The quota adjustment factor may be related adjustment parameters, regional development levels, corresponding upward adjustment strategies for lower risk levels, and/or salesperson quality, etc. The risk level determination method can be that the model obtains the risk score corresponding to each risk control index of the current insurance application information to be evaluated, and then determines the specific risk level according to the interval in which the risk score falls.
具体地,模型将当前得出的目标特征对应到各个风控指标上,基于各个风控指标对问卷数据进行评估,输出该投保用户本次投保最终的风险等级。服务器在该当前投保用户本次投保的风险等级为低风险、中风险等非高风险及以上等级时,获取与该分问卷数据对应的额度调节因子。Specifically, the model maps the currently obtained target characteristics to each risk control index, evaluates the questionnaire data based on each risk control index, and outputs the final risk level of the insured user this time. The server obtains the quota adjustment factor corresponding to the sub-questionnaire data when the risk level of the current insured user is low risk, medium risk and other non-high risk levels and above.
步骤S40,结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。Step S40, combining the previous insurance application information and the limit adjustment factor, predict the insurance addition limit information of the target insurance applicant.
在本实施例中,加保额度信息可包括投保保险名称、代号、加保额度范围等。可加保额度的计算方式可为先获取该用户对应的原始可投保额度,然后再用原始可投保额度减去已投保额度也即是上述既往投保额度得到剩余可投保额度,最后再通过额度调节因子对剩余可投保额度继续额度调节,所得到的最终值即为可加保额度的最大值,而加保额度范围即为0至可加保额度的最大值。另外,在S30之后,还可包括:生成包含有所述加保额度信息的提示消息,并将所述提示消息发送至所述目标投保人的用户终端。在服务器计算出该投保用户本次所投保险的可加保额度后,可单独生成通知消息或是在自核结果反馈界面同步予以显示,方便投保用户获知当前还可加保的额度。In this embodiment, the added insurance limit information may include the name of the insurance application, the code number, the range of the added insurance limit, and the like. The calculation method of the insurable amount can be obtained by first obtaining the original insurable amount corresponding to the user, then subtracting the insured amount from the original insurable amount, that is, the previous insured amount above, to obtain the remaining insurable amount, and finally adjusting the amount. The factor continues to adjust the remaining insurable amount, and the final value obtained is the maximum insurable amount, and the insured amount range is from 0 to the maximum insurable amount. In addition, after S30, the method may further include: generating a prompt message including the information on the added insurance amount, and sending the prompt message to the user terminal of the target insurance applicant. After the server calculates the reinsurable limit of the insured user's insurance this time, a notification message can be generated separately or displayed synchronously on the self-check result feedback interface, so that the insured user can know the current reinsurable limit.
具体地,若用户此次投保的是寿险,模型输出的风险等级为极低风险等级。服务器可先根据一般标准得到该用户在寿险上的标准最高寿险保额,然后根据该用户所在地区的地区免体检标准对标准最高保额进行调节得到适用于该地区的用户最高寿险保额,再减去之前投过的寿险保额得到剩余可加保的寿险保额额度。服务器将剩余可加保的寿险保额额度乘上预设的寿险对应的调节系数后,再加上本单投保对应业务员的品质等级的上调额度以及极低风险对应的上调额度,最终得到的额度值即为该用户当前的寿险可加保最高额度。Specifically, if the user purchased life insurance this time, the risk level output by the model is a very low risk level. The server can first obtain the user's standard maximum life insurance amount in life insurance according to general standards, and then adjust the standard maximum insurance amount according to the regional examination-free standard in the region where the user is located to obtain the user's maximum life insurance amount applicable to the region. Subtract the previous life insurance coverage to get the remaining life insurance coverage that can be insured. After the server multiplies the remaining amount of life insurance that can be insured by the preset adjustment coefficient corresponding to life insurance, and then adds the upward limit of the quality level of the salesperson corresponding to the insurance policy for this order and the upward limit corresponding to the extremely low risk, the final result is obtained. The limit value is the maximum limit of the user's current life insurance coverage.
在本实施例中,本申请通过获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。通过上述方式,本申请通过获取客户的投保信息得到其既往投保信息,使用了预训练的风险评估模型确定出对应的风险等级,并仅在检测到风险等级在一定范围内(高风险等级以下)时才进行后续的加保额度估算,避免了风险等级较高客户继续投保所可能造成的负面影响;通过选择与本次投保类型相对应的风险评估模型进行风险评估,避免了投保类型不同所造成的评估误差;通过运用特征工程与机器学习训练得到风险评估模型,并利用风险评估模型得到待评估投保信息的目标特征,使得风险评估模型最终得出的风险评估结果更加精准;通过综合既往投保信息与额度调节因子智能运算出客户当前所能承保的保额范围,实现了在当次投保时就能获取到与实际情况相符的可加保额度信息,从而解决了加保时对于投保额度进行预测的准确性低下的技术问题。同时,也使得客户免于保险业务员由于不知其可加保额度而向其进行的体检、契调等的业务打扰。In this embodiment, the application obtains the insurance application information to be evaluated of the target insurance applicant, and obtains the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information; A pre-trained risk assessment model corresponding to the type, using the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combining the text features and numerical features to obtain target features, wherein the risk The evaluation model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm; the risk level of the insurance application information to be evaluated is determined according to the target feature, and when it is detected that the risk level is a level below high risk, Obtain the limit adjustment factor of the insurance application information to be evaluated; combine the previous insurance application information and the limit adjustment factor to predict the insurance addition limit information of the target insurance applicant. Through the above method, the application obtains the customer's insurance application information by obtaining their previous insurance application information, uses the pre-trained risk assessment model to determine the corresponding risk level, and only detects that the risk level is within a certain range (below the high risk level) The follow-up insurance limit estimation is carried out only at the time of day, which avoids the negative impact that may be caused by customers with higher risk levels continuing to apply for insurance. The risk assessment model is obtained by using feature engineering and machine learning training, and the target characteristics of the insurance information to be assessed are obtained by using the risk assessment model, so that the final risk assessment result obtained by the risk assessment model is more accurate; by synthesizing the previous insurance information It can intelligently calculate the insurance amount that the customer can currently insure with the amount adjustment factor, so that the insurable amount information that is consistent with the actual situation can be obtained when the insurance is applied, thus solving the problem of predicting the insurance amount when adding insurance. technical issues with low accuracy. At the same time, it also saves the customer from being disturbed by the insurance salesman who does not know the amount of insurance that can be added to him, such as physical examination and coordination.
进一步地,基于上述图2所示的第一实施例,提出本申请额度预测方法的第二实施例。本实施例中,步骤S20包括:基于所述风险评估模型,获取所述待评估投保信息中的文字信息与数字信息;将所述文字信息采用独热编码处理得到所述文字特征,并将所述数字信息进行缺失值处理与稠密处理得到所述数字特征;获取所述文字特征与所述数字特征的互信息值,并基于所述互信息值将所述文字特征与所述数字特征分为可组合的第一特征与不可组合的第二特征;将所述第一特征进行组合得到组合特征,将所述组合特征与所述第二特征作为所述目标特征。Further, based on the first embodiment shown in FIG. 2 above, a second embodiment of the quota prediction method of the present application is proposed. In this embodiment, step S20 includes: based on the risk assessment model, acquiring text information and numerical information in the insurance application information to be evaluated; processing the text information by one-hot encoding to obtain the text features, and converting the text information into The digital information is subjected to missing value processing and dense processing to obtain the digital feature; the mutual information value of the text feature and the digital feature is obtained, and the text feature and the digital feature are divided into two groups based on the mutual information value. A first feature that can be combined and a second feature that cannot be combined; a combined feature is obtained by combining the first feature, and the combined feature and the second feature are used as the target feature.
在本实施例中,独热编码即 One-Hot-coding,又称一位有效编码,其方法是使用N位状态寄存器来对N个状态进行编码,每个状态都由他独立的寄存器位,并且在任意时候,其中只有一位有效。模型对待评估投保信息中提取出的文字信息进行独热编码,使其转化为离散型的数值特征,还可继续对其进行稠密处理,以进一步减少数据量。模型再对从待评估投保信息中提取出的数字信息进行缺失值处理,例如采用特殊值填充、平均值填充、热卡填充、期望值最大化等方式。然后模型再对其进行稠密处理,例如压缩稀疏行与列,使用主成分分析(PCA,PrincipalComponents Analysis)),奇异值分解(SVD,Singular Value Decomposition)等方式进行降维。模型需要计算各特征间的互信息值,互信息值表征的是特征间的相关程度,其具体计算方式可参考现有技术,在此不作赘述。模型在计算出个特征间的互信息值后,可将其与预设的标准互信息阈值进行比较,将超出阈值的特征进行组合,得到组合特征,最后即可将组合特征与未组合的特征共同作为目标特征。In this embodiment, one-hot encoding is One-Hot-coding, also known as one-bit effective encoding. The method is to use an N-bit state register to encode N states, and each state has its independent register bit. And at any time, only one of them is valid. The model performs one-hot encoding on the text information extracted from the insurance application information to be evaluated to convert it into discrete numerical features, and it can continue to be densely processed to further reduce the amount of data. The model then performs missing value processing on the digital information extracted from the insurance information to be evaluated, such as special value filling, average filling, hot card filling, and expected value maximization. Then the model is densely processed, such as compressing sparse rows and columns, using Principal Component Analysis (PCA, Principal Components Analysis), Singular Value Decomposition (SVD, Singular Value Decomposition) and other methods for dimensionality reduction. The model needs to calculate the mutual information value between the features, and the mutual information value represents the degree of correlation between the features, and the specific calculation method can refer to the prior art, which will not be repeated here. After the model calculates the mutual information value between the features, it can be compared with the preset standard mutual information threshold, and the features exceeding the threshold can be combined to obtain the combined feature, and finally the combined feature and the uncombined feature can be combined. common as the target feature.
进一步地,所述在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子的步骤包括:在所述风险等级为高风险以下等级时,判断所述目标投保人是否通过自核;若所述目标投保人通过自核,则获取所述待评估投保信息对应的地区免体检标准、保险业务员等级以及所述本次投保类型的险种调节系数作为所述额度调节因子。Further, when it is detected that the risk level is below high risk, the step of acquiring the quota adjustment factor of the insurance application information to be assessed includes: when the risk level is below high risk, judging the target Whether the insured has passed the self-check; if the target insured has passed the self-check, obtain the regional medical examination-exemption standard, the insurance salesperson level and the insurance type adjustment coefficient of the insurance application type corresponding to the insurance application information to be evaluated as the Quota adjustment factor.
在本实施例中,对于地区免体检标准,服务器可以访问该地区的相关政府平台进行搜索,或是以具体的提取名称为关键词在存储有各地区免体检标准信息的数据库中进行查找,以获取到最新的地方免体检标准信息;对于保险业务员等级,服务器可直接访问保险公司内部的信息数据库,根据具体的保险业务员的姓名、员工编号等信息查找其对应的等级信息;对于险种调节系数,服务器亦可直接访问保险公司内部的信息数据库,根据本次投保所投的具体险种名称在库中查找到对应的险种调节系数。In this embodiment, for the regional medical examination exemption standard, the server can access the relevant government platform in the region to search, or use the specific extracted name as a keyword to search in the database storing the information on the medical examination exemption standard in each region, so as to Obtain the latest information on local exemption standards for medical examinations; for insurance salesperson levels, the server can directly access the information database within the insurance company, and search for the corresponding level information according to the specific insurance salesperson's name, employee number and other information; for insurance type adjustment The server can also directly access the internal information database of the insurance company, and find the corresponding insurance type adjustment coefficient in the database according to the name of the specific insurance type purchased this time.
服务器在检测到当前所得到的风险等级不为高风险及以上等级时,则进一步判断该投保用户是否通过自核。若该投保用户通过自核,服务器则根据问卷中用户填写的地区信息确定该地区的最高免体检保额,负责本单投保业务的业务员的品质等级以及本次投保的保险种类;若该投保用户未通过自核,则进入常规投保流程,不为该投保用户自动计算并展示加保额度。When the server detects that the currently obtained risk level is not high risk or above, it further determines whether the insured user has passed the self-check. If the insured user passes the self-check, the server will determine the highest medical-exemption insurance amount in the region according to the region information filled in by the user in the questionnaire, the quality level of the salesperson responsible for the insurance application for this order, and the type of insurance purchased this time; If the user fails to pass the self-check, it will enter the regular insurance application process, and the additional insurance amount will not be automatically calculated and displayed for the insured user.
进一步地,步骤S30包括:根据所述地区免体检标准确定所述最高免体检保额,并结合所述既往投保信息得到初始加保额度;使用所述险种调节系数对所述初始加保额度进行调节,得到调节加保额度;获取与所述保险业务员等级对应的等级调节额度,并使用所述等级调节额度对所述调节加保额度进行调节,得到所述目标投保人的加保额度信息,其中,所述目标投保人的加保额度信息存储于区块链中。Further, step S30 includes: determining the maximum medical-exemption insurance amount according to the regional medical-exemption standard, and obtaining an initial insurance increase amount in combination with the previous insurance application information; using the insurance type adjustment coefficient to carry out the initial insurance increase amount. Adjustment to obtain the adjusted insurance amount; obtain the level adjustment amount corresponding to the insurance salesperson's level, and use the level adjustment amount to adjust the adjusted insurance amount to obtain the target insurance applicant's insurance amount information , wherein the information on the added insurance amount of the target insured is stored in the blockchain.
在本实施例中,下面以寿险可加保保额计算公式与重疾险可加保保额计算公式为例进行说明。In this embodiment, the following describes the calculation formula of the insurable amount of life insurance and the calculation formula of the insurable amount of critical illness insurance as examples.
对于寿险可加保保额的计算,服务器需要调用适用于用户所在地区的该投保客户的最高免体检寿险体检保额,为便于描述以下称之为A,调用该投保用户本单与历史风险保额寿险体检保额之和,称之为C,则初始寿险可加保保额计算公式可设置为(A-C)*X1,其中,X1为寿险系数,可根据实际需求灵活设定。对于重疾险可加保保额的计算,服务器需要调用适用于用户所在地区的该投保客户的最高免体检重疾体检保额,为便于描述以下称之为B,调用该投保用户本单与历史风险保额重疾体检保额之和,称之为D,则初始寿险可加保保额计算公式可设置为寿险可加保保额=(B-D)*X2,其中,X2为重疾险系数,根据实际需求灵活设定,可设置其与X1相同,也可设置为不同。在计算出初始可加保额度之和,即可根据负责本单投保业务的业务员等级,对初始值进行小幅度调节,得到最终的可加保额度。相关人员可预先在服务器上设置不同的业务员等级所对应的不同的上调额度。For the calculation of the insured amount of life insurance that can be added, the server needs to call the highest medical examination-free life insurance insurance amount applicable to the insured customer in the region where the user is located. The sum of the sum insured for the physical examination of the life insurance is called C, and the calculation formula for the additional insured amount of the initial life insurance can be set as (A-C)*X1, where X1 is the life insurance coefficient, which can be flexibly set according to actual needs. For the calculation of the insured amount of critical illness insurance, the server needs to call the highest medical-exemption critical illness insurance amount applicable to the insured customer in the area where the user is located. The sum of the historical risk insured amount and the critical illness medical insurance amount is called D. The calculation formula of the initial life insurance reinsurable amount can be set as life insurance reinsurable amount = (B-D)*X2, where X2 is the critical illness insurance The coefficient can be set flexibly according to actual needs, which can be set to be the same as X1 or different. After the sum of the initial reinsurable amount is calculated, the initial value can be adjusted slightly according to the level of the salesperson responsible for the insurance business of this order to obtain the final reinsurable amount. Relevant personnel can pre-set different increase quotas corresponding to different salesperson levels on the server.
需要强调的是,为进一步保证上述目标投保人的加保额度信息的私密和安全性,上述目标投保人的加保额度信息还可以存储于一区块链的节点中。It should be emphasized that, in order to further ensure the privacy and security of the above-mentioned target applicant's insurance coverage information, the above-mentioned target insurance applicant's insurance coverage information can also be stored in a blockchain node.
进一步地,步骤S10包括:在接收到投保指令时,基于所述投保指令获取所述目标投保人的投保问卷作为所述待评估投保信息;获取所述投保问卷中所述目标投保人填写的身份信息与投保名称信息,根据所述投保名称信息确定所述本次投保类型,根据所述身份信息判断所述目标投保人是否为已投保用户;若所述目标投保人为已投保用户,则在投保信息库中查找到所述目标投保人的既往投保信息。Further, step S10 includes: when an insurance application instruction is received, acquiring an insurance application questionnaire of the target insurance applicant based on the insurance application instruction as the insurance application information to be evaluated; acquiring the identity filled in by the target insurance applicant in the insurance application questionnaire Information and insurance name information, determine the type of insurance this time according to the insurance name information, and determine whether the target insurance applicant is an insured user according to the identity information; The past insurance application information of the target insurance applicant is found in the information database.
在本实施例中,投保用户通常是通过在个人终端、保险公司的柜台终端或是其他终端设备上填写用户投保问卷。该问卷可以包含关于用户个人信息的相关问题与保险公司根据实际需要所设定的风控指标设置的问题。身份信息具体可包括姓名、年龄、身份证号、居住地等。投保用户在填写完成问卷并提交后,服务器接收到该投保用户发出的一投保指令,获取该投保指令中所指向的用户所填写的问卷信息,将其作为上述待评估投保信息。In this embodiment, an insurance application user usually fills in a user insurance application questionnaire on a personal terminal, a counter terminal of an insurance company, or other terminal equipment. The questionnaire may contain questions about the user's personal information and questions about the setting of risk control indicators set by the insurance company according to actual needs. The identity information may specifically include name, age, ID number, place of residence, etc. After the insurance application user completes and submits the questionnaire, the server receives an insurance application instruction sent by the insurance application user, obtains the questionnaire information filled in by the user pointed to in the insurance application instruction, and uses it as the above-mentioned insurance application information to be evaluated.
具体地,服务器接收用户问卷填写终端发送的用户投保问卷数据后,从中提取出各问卷问题用户所提交的问题选项数据,提取出可表明投保人身份的信息作为检索关键词在投保信息库中进行查询,查询库中是否存在该投保用户的既往投保记录,并在信息库中存在既往投保记录时获取该记录。服务器同时可提取用户在问卷中填写或选中的本次所投的险种名称,将其作为本次投保的投保类型,另外,还可将其转化为唯一编号进行存储,以便后续的数据处理。Specifically, after receiving the user's insurance application questionnaire data sent by the user's questionnaire filling terminal, the server extracts the question option data submitted by each questionnaire question user, and extracts the information that can indicate the identity of the insurance applicant as a retrieval keyword in the insurance application information database. Query, inquire whether there is a previous insurance application record of the insured user in the database, and obtain the record when there is a previous insurance application record in the information database. At the same time, the server can extract the name of the insurance that the user has filled in or selected in the questionnaire, and use it as the insurance type of this insurance. In addition, it can also be converted into a unique number for storage for subsequent data processing.
进一步地,所述根据所述身份信息判断所述目标投保人是否为已投保用户的步骤之后,还包括:若所述目标投保人不为已投保用户,则执行根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子的步骤;基于所述额度调节因子,得到不为已投保用户的目标投保人的加保额度信息。Further, after the step of judging whether the target insured is an insured user according to the identity information, the step further includes: if the target insured is not an insured user, executing the determination according to the target feature. The risk level of the insurance application information to be evaluated, and when it is detected that the risk level is lower than the high risk level, the step of obtaining the quota adjustment factor of the insurance application information to be evaluated; information on the added insurance amount of the target policyholder.
在本实施例中,若服务器未在投保信息库中查询到当前投保用户的既往投保信息,也即是当前投保用户为新投保用户,则继续将新投保用户的投保问卷中的问题作答信息输入风险评估模型,通过风险评估模型确定出当前的新投保用户的风险等级。若服务器通过模型判定当前新投保用户的风险等级为中风险等级、低风险等级或是极低风险等级等高风险以下等级,则获取该问卷信息对应的额度调节因子。服务器对于新投保用户的可加保额度计算。只需基于额度调节因子即可,而无需结合既往投保信息进行计算。以寿险为例,具体的计算方式可为:将新投保用户的最高免体检寿险体检保额乘上寿险调节系数,再加上负责本次投保的业务员的等级所对应的上调额度,最终得到的结果即为新投保用户的寿险可加保额度。若服务器通过模型判定当前新投保用户的风险等级为高风险等级、极高风险等级等高风险及以上等级,则不执行后续步骤,转向常规自核流程。In this embodiment, if the server does not query the previous insurance application information of the current insurance application user in the insurance application information database, that is, the current insurance application user is a new insurance application user, it continues to input the question answer information in the insurance application questionnaire of the new insurance application user. The risk assessment model determines the risk level of the current new insured users through the risk assessment model. If the server determines through the model that the risk level of the current new insured user is below a high risk level, such as a medium risk level, a low risk level, or a very low risk level, the quota adjustment factor corresponding to the questionnaire information is obtained. The server calculates the reinsurable limit for newly insured users. It only needs to be based on the adjustment factor of the amount, and does not need to be calculated based on the previous insurance information. Taking life insurance as an example, the specific calculation method can be as follows: multiply the maximum amount of medical examination-free life insurance for new insured users by the adjustment coefficient of life insurance, and add the corresponding increase amount corresponding to the level of the salesperson responsible for this insurance, and finally get The result is the life insurance reinsurable limit of the new insured user. If the server determines through the model that the risk level of the current new insured user is high risk level, extremely high risk level, or higher, the subsequent steps will not be performed, and the routine self-checking process will be turned to.
进一步地,通过提取待评估投保信息的特征并进行特征组合,使得能够深度挖掘出待评估投保信息中的有效信息,进而提升模型评估结果的准确性;通过对风险评估模型评估后的优质客户,实时获取基于多维度计算的可加保保险的保额空间,助力代理人展业,提升客户投保保额,以满足客户保障需求。Further, by extracting the characteristics of the insurance application information to be evaluated and combining the characteristics, it is possible to deeply mine the effective information in the insurance application information to be evaluated, thereby improving the accuracy of the model evaluation results; Real-time access to the insured amount space of insurable insurance based on multi-dimensional calculation, helping agents to expand their business, increasing the insured amount of customers, and meeting customer protection needs.
进一步地,基于上述图2所示的第一实施例,提出本申请额度预测方法的第三实施例。本实施例中,所述基于预设风险评估模型确定所述待评估投保信息的风险等级的步骤包括:计算所述目标特征对应在预设风控指标上的风险得分;根据预设得分等级对应规则确定所述风险得分所对应的风险等级,以作为所述待评估投保信息的风险等级。Further, based on the first embodiment shown in FIG. 2 above, a third embodiment of the quota prediction method of the present application is proposed. In this embodiment, the step of determining the risk level of the insurance application information to be evaluated based on the preset risk evaluation model includes: calculating a risk score corresponding to the target feature on the preset risk control index; The rule determines the risk level corresponding to the risk score as the risk level of the insurance application information to be evaluated.
在本实施例中,风控指标的相关问题可包括用户此次投保的保险产品所涉及的疾病的相关指标等。例如身体健康指数、膳食健康、运动情况、吸烟情况、饮酒情况、压力程度与家族病史情况等,还可以根据具体的风控需求灵活设置。每个问卷问题可以设置有多个答案选项,不同的答案选项对应于相应的选项得分。In this embodiment, the related issues of the risk control indicators may include related indicators of the diseases involved in the insurance product insured by the user this time, and the like. For example, physical health index, dietary health, exercise, smoking, drinking, stress and family medical history, etc., can also be flexibly set according to specific risk control needs. Each questionnaire question can be set with multiple answer options, and different answer options correspond to corresponding option scores.
模型将当前所得的多个目标特征一一对应到各风控指标上,再根据预设的对应规则相应标记各目标特征在风控指标上的风险评分,然后再根据各个风控指标设定相应的权重系数,计算该投保用户本次投保的风险总评分。模型在计算得到最终的风险总评分后,可根据预设的评分等级阈值,来定位该风险评分所在的风险等级。例如,将0-20分设定为极低风险等级;21-40分设定为低风险等级;41-60分设定为中风险等级;61-80分设定为高风险等级;81-100分设定为极高风险等级。The model maps the currently obtained multiple target features to each risk control index one by one, and then marks the risk score of each target feature on the risk control index according to the preset corresponding rules, and then sets the corresponding risk control index according to each risk control index. The weight coefficient of the insured user is calculated to calculate the total risk score of the insured user this time. After the model calculates and obtains the final total risk score, it can locate the risk level of the risk score according to the preset score level threshold. For example, a score of 0-20 is set as a very low risk level; a score of 21-40 is set as a low risk level; a score of 41-60 is set as a medium risk level; a score of 61-80 is set as a high risk level; 81- A score of 100 is set as a very high risk level.
进一步地,步骤S20之前,还包括:收集包含多条样本数据的初始样本数据集,其中,每条样本数据包括样本特征与对应的样本标签;根据所述样本特征与所述样本标签构建特征索引集,并计算所述特征索引集的最高判别准确率;将所述最高判别准确率与预设标准准确率阈值进行比较,并将不小于所述标准准确率阈值的最高判别准确率对应的样本数据作为目标样本数据,以筛选出目标样本数据集;使用预设机器学习模型对所述目标样本数据集进行训练,得到所述风险评估模型。Further, before step S20, it also includes: collecting an initial sample data set including multiple pieces of sample data, wherein each piece of sample data includes sample features and corresponding sample labels; constructing a feature index according to the sample features and the sample labels set, and calculate the highest discrimination accuracy rate of the feature index set; compare the highest discrimination accuracy rate with the preset standard accuracy rate threshold, and compare the sample corresponding to the highest discrimination accuracy rate not less than the standard accuracy rate threshold The data is used as target sample data to filter out the target sample data set; the target sample data set is trained by using a preset machine learning model to obtain the risk assessment model.
在本实施例中,特征工程可为最大相关最小冗余联合最大互信息系数特征选择策略.样本标签用于指示出用户的风险评分。服务器首先计算每一条样本数据中的样本特征与其样本标签之间的最大互信息系数,然后构建特征索引集,并计算每日一特征索引集的判别准确率,从中选出数值最高的一项与预设的标准准确率阈值进行比较。服务器将大于或等于阈值的判别准确率对应的样本数据列入目标样本数据集。最后,服务器使用决策树、聚类、深度神经网络,或是XGBoost等模型算法对其进行训练,最后训练得出最终的风险评估模型。In this embodiment, the feature engineering may be the maximum correlation minimum redundancy combined with the maximum mutual information coefficient feature selection strategy. The sample label is used to indicate the user's risk score. The server first calculates the maximum mutual information coefficient between the sample feature in each piece of sample data and its sample label, then constructs a feature index set, and calculates the discrimination accuracy rate of one feature index set per day, and selects the item with the highest value and the Preset standard accuracy thresholds for comparison. The server includes the sample data corresponding to the discrimination accuracy rate greater than or equal to the threshold into the target sample data set. Finally, the server uses decision tree, clustering, deep neural network, or XGBoost and other model algorithms to train it, and finally trains to obtain the final risk assessment model.
本申请所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The blockchain referred to in this application is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
进一步地,通过预设风控指标以对投保客户的风险评分进行量化,进而确定出其对应的风险等级,使得模型能够快速地对投保客户的风险等级进行评定;通过对初始样本数据进行特征筛选,使得减小设备的数据处理负担,提升模型训练效率与精确性。Further, by presetting risk control indicators to quantify the risk scores of insured customers, and then determine their corresponding risk levels, so that the model can quickly assess the risk levels of insured customers; , which reduces the data processing burden of the device and improves the efficiency and accuracy of model training.
此外,如图3所示,为实现上述目的,本申请还提供一种额度预测装置,所述额度预测装置包括:既往信息获取模块10,用于获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;目标特征获取模块20,用于选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;风险等级确定模块30,用于根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;加保额度预测模块40,用于结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。In addition, as shown in FIG. 3 , in order to achieve the above purpose, the present application also provides a limit prediction device, the limit prediction device includes: a past information acquisition module 10 for obtaining the insurance application information to be evaluated of the target insured, and according to The to-be-evaluated insurance application information obtains the target insurance applicant's previous insurance application information and the current insurance application type; the target feature acquisition module 20 is used to select a pre-trained risk assessment model corresponding to the current insurance application type, using the The risk assessment model extracts the text features and numerical features in the insurance application information to be assessed, and combines the text features and numerical features to obtain the target features, wherein the risk assessment model uses the training data set screened by feature engineering to pass through. Obtained from machine learning algorithm training; the risk level determination module 30 is configured to determine the risk level of the insurance application information to be evaluated according to the target feature, and obtain the to-be-evaluated risk level when it is detected that the risk level is below high risk The amount adjustment factor of the insurance application information; the insurance addition amount prediction module 40 is configured to combine the previous insurance application information and the amount adjustment factor to predict the insurance addition amount information of the target insurance applicant.
本申请还提供一种额度预测设备。The present application also provides a quota prediction device.
所述额度预测设备包括处理器、存储器及存储在所述存储器上并可在所述处理器上运行的额度预测程序,其中所述额度预测程序被所述处理器执行时,实现如上所述的额度预测方法的步骤。The credit prediction device includes a processor, a memory, and a credit prediction program stored on the memory and executable on the processor, wherein the credit prediction program, when executed by the processor, implements the above-mentioned The steps of the quota prediction method.
其中,所述额度预测程序被执行时所实现的方法可参照本申请额度预测方法的各个实施例,此处不再赘述。For the method implemented when the quota prediction program is executed, reference may be made to the various embodiments of the quota prediction method of the present application, which will not be repeated here.
此外,本申请实施例还提供一种计算机可读存储介质。In addition, the embodiments of the present application further provide a computer-readable storage medium.
本申请计算机可读存储介质上存储有额度预测程序,其中所述额度预测程序被处理器执行时,实现如上述的额度预测方法的步骤。A quota prediction program is stored on the computer-readable storage medium of the present application, wherein when the quota prediction program is executed by the processor, the steps of the above quota prediction method are implemented.
可选的,本申请涉及的存储介质如计算机可读存储介质可以是非易失性的,也可以是易失性的。Optionally, the storage medium involved in this application, such as a computer-readable storage medium, may be non-volatile or volatile.
其中,额度预测程序被执行时所实现的方法可参照本申请额度预测方法的各个实施例,此处不再赘述。For the method implemented when the quota prediction program is executed, reference may be made to the various embodiments of the quota prediction method of the present application, which will not be repeated here.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。It should be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or system comprising a series of elements includes not only those elements, It also includes other elements not expressly listed or inherent to such a process, method, article or system. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in the process, method, article or system that includes the element.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The above-mentioned serial numbers of the embodiments of the present application are only for description, and do not represent the advantages or disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在如上所述的一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that the method of the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course can also be implemented by hardware, but in many cases the former is better implementation. Based on this understanding, the technical solutions of the present application can be embodied in the form of software products in essence or the parts that make contributions to the prior art. The computer software products are stored in a storage medium (such as ROM/RAM) as described above. , magnetic disk, optical disk), including several instructions to make a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present application.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only the preferred embodiments of the present application, and are not intended to limit the patent scope of the present application. Any equivalent structure or equivalent process transformation made by using the contents of the description and drawings of the present application, or directly or indirectly applied in other related technical fields , are similarly included within the scope of patent protection of this application.

Claims (20)

  1. 一种额度预测方法,其中,所述额度预测方法包括:A quota prediction method, wherein the quota prediction method includes:
    获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;Obtaining the insurance application information to be evaluated of the target insured, and obtaining the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information;
    选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;Selecting the pre-trained risk assessment model corresponding to the type of insurance this time, using the risk assessment model to extract the text features and numerical features in the insurance application information to be assessed, and combining the text features and numerical features to obtain a target feature, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm;
    根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;Determine the risk level of the insurance application information to be evaluated according to the target feature, and obtain the quota adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than high risk;
    结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。Combined with the previous insurance application information and the limit adjustment factor, the insurance addition limit information of the target insurance applicant is predicted.
  2. 如权利要求1所述的额度预测方法,其中,所述利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征的步骤包括:The limit prediction method according to claim 1, wherein the use of the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combining the text features and numerical features to obtain a Steps include:
    基于所述风险评估模型,获取所述待评估投保信息中的文字信息与数字信息;Based on the risk assessment model, obtain text information and numerical information in the insurance application information to be assessed;
    将所述文字信息采用独热编码处理得到所述文字特征,并将所述数字信息进行缺失值处理与稠密处理得到所述数字特征;The text information is processed by one-hot encoding to obtain the text features, and the digital information is processed with missing values and dense to obtain the digital features;
    获取所述文字特征与所述数字特征的互信息值,并基于所述互信息值将所述文字特征与所述数字特征分为可组合的第一特征与不可组合的第二特征;Obtaining the mutual information value of the text feature and the digital feature, and dividing the text feature and the digital feature into a combinable first feature and an uncombinable second feature based on the mutual information value;
    将所述第一特征进行组合得到组合特征,将所述组合特征与所述第二特征作为所述目标特征。The first feature is combined to obtain a combined feature, and the combined feature and the second feature are used as the target feature.
  3. 如权利要求1所述的额度预测方法,其中,所述在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子的步骤包括:The limit prediction method according to claim 1, wherein, when it is detected that the risk level is lower than high risk, the step of acquiring the limit adjustment factor of the insurance application information to be evaluated comprises:
    在所述风险等级为高风险以下等级时,判断所述目标投保人是否通过自核;When the risk level is below the high risk level, determine whether the target insured has passed the self-check;
    若所述目标投保人通过自核,则获取所述待评估投保信息对应的地区免体检标准、保险业务员等级以及所述本次投保类型的险种调节系数作为所述额度调节因子。If the target insured passes the self-check, the regional medical exemption standard corresponding to the insurance application information to be evaluated, the insurance salesperson level, and the insurance type adjustment coefficient of the current insurance application type are obtained as the quota adjustment factor.
  4. 如权利要求3所述的额度预测方法,其中,所述结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息的步骤包括:The limit prediction method according to claim 3, wherein the step of predicting the added insurance limit information of the target insured by combining the previous insurance application information and the limit adjustment factor comprises:
    根据所述地区免体检标准确定所述最高免体检保额,并结合所述既往投保信息得到初始加保额度;Determine the maximum medical-exemption insurance amount according to the regional medical-exemption standard, and obtain the initial additional insurance amount in combination with the previous insurance information;
    使用所述险种调节系数对所述初始加保额度进行调节,得到调节加保额度;Adjusting the initial insurance coverage amount using the insurance type adjustment coefficient to obtain an adjusted insurance coverage amount;
    获取与所述保险业务员等级对应的等级调节额度,并使用所述等级调节额度对所述调节加保额度进行调节,得到所述目标投保人的加保额度信息,其中,所述目标投保人的加保额度信息存储于区块链中。Obtain the level adjustment limit corresponding to the level of the insurance clerk, and use the level adjustment limit to adjust the adjustment plus insurance limit, and obtain the information on the increase insurance limit of the target insurance applicant, wherein the target insurance applicant The added insurance amount information is stored in the blockchain.
  5. 如权利要求1所述的额度预测方法,其中,所述获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型的步骤包括:The quota prediction method according to claim 1, wherein the step of obtaining the insurance application information to be evaluated of the target insurance applicant, and obtaining the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information include:
    在接收到投保指令时,基于所述投保指令获取所述目标投保人的投保问卷作为所述待评估投保信息;When receiving the insurance application instruction, obtain the insurance application questionnaire of the target applicant based on the insurance application instruction as the insurance application information to be evaluated;
    获取所述投保问卷中所述目标投保人填写的身份信息与投保名称信息,根据所述投保名称信息确定所述本次投保类型;Obtain the identity information and insurance name information filled in by the target insurance applicant in the insurance application questionnaire, and determine the current insurance application type according to the insurance application name information;
    根据所述身份信息判断所述目标投保人是否为已投保用户;Determine whether the target insured is an insured user according to the identity information;
    若所述目标投保人为已投保用户,则在投保信息库中查找到所述目标投保人的既往投保信息。If the target insurance applicant is an insured user, the previous insurance application information of the target insurance applicant is found in the insurance application information database.
  6. 如权利要求5所述的额度预测方法,其中,所述根据所述身份信息判断所述目标投保人是否为已投保用户的步骤之后,还包括:The quota prediction method according to claim 5, wherein after the step of judging whether the target insured is an insured user according to the identity information, the method further comprises:
    若所述目标投保人不为已投保用户,则执行根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子的步骤;If the target insured is not an insured user, the risk level of the insurance application information to be evaluated is determined according to the target feature, and when it is detected that the risk level is lower than high risk, the to-be-evaluated information is obtained. The steps of adjusting the amount of insurance information;
    基于所述额度调节因子,得到不为已投保用户的目标投保人的加保额度信息。Based on the limit adjustment factor, information on the added insurance limit of the target insured who is not an insured user is obtained.
  7. 如权利要求1所述的额度预测方法,其中,所述根据所述目标特征确定所述待评估投保信息的风险等级的步骤包括:The limit prediction method according to claim 1, wherein the step of determining the risk level of the insurance application information to be evaluated according to the target characteristic comprises:
    计算所述目标特征对应在预设风控指标上的风险得分;calculating the risk score corresponding to the target feature on the preset risk control index;
    根据预设得分等级对应规则确定所述风险得分所对应的风险等级,以作为所述待评估投保信息的风险等级。The risk level corresponding to the risk score is determined according to a preset score level correspondence rule, as the risk level of the insurance application information to be evaluated.
  8. 如权利要求1-7中任一项所述的额度预测方法,其中,所述选择与所述本次投保类型对应的预训练的风险评估模型的步骤之前,还包括:The quota prediction method according to any one of claims 1-7, wherein before the step of selecting a pre-trained risk assessment model corresponding to the current insurance application type, the method further comprises:
    收集包含多条样本数据的初始样本数据集,其中,每条样本数据包括样本特征与对应的样本标签;Collect an initial sample data set containing multiple pieces of sample data, wherein each piece of sample data includes sample features and corresponding sample labels;
    根据所述样本特征与所述样本标签构建特征索引集,并计算所述特征索引集的最高判别准确率;Build a feature index set according to the sample feature and the sample label, and calculate the highest discrimination accuracy rate of the feature index set;
    将所述最高判别准确率与预设标准准确率阈值进行比较,并将不小于所述标准准确率阈值的最高判别准确率对应的样本数据作为目标样本数据,以筛选出目标样本数据集;Comparing the highest discrimination accuracy with a preset standard accuracy threshold, and using the sample data corresponding to the highest discrimination accuracy not less than the standard accuracy threshold as the target sample data, to filter out the target sample data set;
    使用预设机器学习模型对所述目标样本数据集进行训练,得到所述风险评估模型。The target sample data set is trained using a preset machine learning model to obtain the risk assessment model.
  9. 一种额度预测设备,其中,所述额度预测设备包括处理器、存储器、以及存储在所述存储器上并可被所述处理器执行的额度预测程序,其中所述额度预测程序被所述处理器执行时,实现以下方法:A credit prediction device, wherein the credit prediction device includes a processor, a memory, and a credit prediction program stored on the memory and executable by the processor, wherein the credit prediction program is executed by the processor When executed, implement the following methods:
    获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;Obtaining the insurance application information to be evaluated of the target insured, and obtaining the previous insurance application information and the current insurance application type of the target insurance applicant according to the to-be-evaluated insurance application information;
    选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;Selecting the pre-trained risk assessment model corresponding to the type of insurance this time, using the risk assessment model to extract the text features and numerical features in the insurance application information to be assessed, and combining the text features and numerical features to obtain a target feature, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm;
    根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;Determine the risk level of the insurance application information to be evaluated according to the target feature, and obtain the quota adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than high risk;
    结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。Combined with the previous insurance application information and the limit adjustment factor, the insurance addition limit information of the target insurance applicant is predicted.
  10. 如权利要求9所述的额度预测设备,其中,执行所述利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征的步骤包括:The limit prediction device according to claim 9, wherein the extraction of text features and numerical features in the insurance application information to be evaluated by using the risk assessment model is performed, and the text features and numerical features are combined to obtain the target feature The steps include:
    基于所述风险评估模型,获取所述待评估投保信息中的文字信息与数字信息;Based on the risk assessment model, obtain text information and numerical information in the insurance application information to be assessed;
    将所述文字信息采用独热编码处理得到所述文字特征,并将所述数字信息进行缺失值处理与稠密处理得到所述数字特征;The text information is processed by one-hot encoding to obtain the text features, and the digital information is processed with missing values and dense to obtain the digital features;
    获取所述文字特征与所述数字特征的互信息值,并基于所述互信息值将所述文字特征与所述数字特征分为可组合的第一特征与不可组合的第二特征;Obtaining the mutual information value of the text feature and the digital feature, and dividing the text feature and the digital feature into a combinable first feature and an uncombinable second feature based on the mutual information value;
    将所述第一特征进行组合得到组合特征,将所述组合特征与所述第二特征作为所述目标特征。The first feature is combined to obtain a combined feature, and the combined feature and the second feature are used as the target feature.
  11. 如权利要求9所述的额度预测设备,其中,执行所述在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子的步骤包括:The limit prediction device according to claim 9, wherein the step of obtaining the limit adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than the high risk level comprises:
    在所述风险等级为高风险以下等级时,判断所述目标投保人是否通过自核;When the risk level is below the high risk level, determine whether the target insured has passed the self-check;
    若所述目标投保人通过自核,则获取所述待评估投保信息对应的地区免体检标准、保险业务员等级以及所述本次投保类型的险种调节系数作为所述额度调节因子。If the target insured passes the self-check, the regional medical exemption standard corresponding to the insurance application information to be evaluated, the insurance salesperson level, and the insurance type adjustment coefficient of the current insurance application type are obtained as the quota adjustment factor.
  12. 如权利要求11所述的额度预测设备,其中,执行所述结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息的步骤包括:The limit prediction device according to claim 11, wherein the step of combining the previous insurance application information and the limit adjustment factor to predict the insurance addition limit information of the target insurance applicant comprises:
    根据所述地区免体检标准确定所述最高免体检保额,并结合所述既往投保信息得到初始加保额度;Determine the maximum medical-exemption insurance amount according to the regional medical-exemption standard, and obtain the initial additional insurance amount in combination with the previous insurance information;
    使用所述险种调节系数对所述初始加保额度进行调节,得到调节加保额度;Using the insurance type adjustment coefficient to adjust the initial insurance coverage amount to obtain an adjusted insurance coverage amount;
    获取与所述保险业务员等级对应的等级调节额度,并使用所述等级调节额度对所述调节加保额度进行调节,得到所述目标投保人的加保额度信息,其中,所述目标投保人的加保额度信息存储于区块链中。Obtain the level adjustment limit corresponding to the level of the insurance clerk, and use the level adjustment limit to adjust the adjustment plus insurance limit, and obtain the information on the increase insurance limit of the target insurance applicant, wherein the target insurance applicant The insurance amount information is stored in the blockchain.
  13. 如权利要求9所述的额度预测设备,其中,执行所述获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型的步骤包括:The limit prediction device according to claim 9, wherein the obtaining of the insurance application information to be assessed of the target insurance applicant is performed, and the previous insurance application information of the target insurance applicant and the information of the current insurance application type are obtained according to the to-be-evaluated insurance application information. Steps include:
    在接收到投保指令时,基于所述投保指令获取所述目标投保人的投保问卷作为所述待评估投保信息;When receiving the insurance application instruction, obtain the insurance application questionnaire of the target applicant based on the insurance application instruction as the insurance application information to be evaluated;
    获取所述投保问卷中所述目标投保人填写的身份信息与投保名称信息,根据所述投保名称信息确定所述本次投保类型;Obtain the identity information and insurance name information filled in by the target insurance applicant in the insurance application questionnaire, and determine the current insurance application type according to the insurance application name information;
    根据所述身份信息判断所述目标投保人是否为已投保用户;Determine whether the target insured is an insured user according to the identity information;
    若所述目标投保人为已投保用户,则在投保信息库中查找到所述目标投保人的既往投保信息。If the target insurance applicant is an insured user, the previous insurance application information of the target insurance applicant is found in the insurance application information database.
  14. 如权利要求9-13中任一项所述的额度预测设备,其中,所述选择与所述本次投保类型对应的预训练的风险评估模型的步骤之前,所述额度预测程序被所述处理器执行时还用于实现:The limit prediction device according to any one of claims 9-13, wherein the limit prediction program is processed by the process before the step of selecting a pre-trained risk assessment model corresponding to the current insurance application type It is also used to implement:
    收集包含多条样本数据的初始样本数据集,其中,每条样本数据包括样本特征与对应的样本标签;Collect an initial sample data set containing multiple pieces of sample data, wherein each piece of sample data includes sample features and corresponding sample labels;
    根据所述样本特征与所述样本标签构建特征索引集,并计算所述特征索引集的最高判别准确率;Build a feature index set according to the sample feature and the sample label, and calculate the highest discrimination accuracy rate of the feature index set;
    将所述最高判别准确率与预设标准准确率阈值进行比较,并将不小于所述标准准确率阈值的最高判别准确率对应的样本数据作为目标样本数据,以筛选出目标样本数据集;Comparing the highest discrimination accuracy with a preset standard accuracy threshold, and using the sample data corresponding to the highest discrimination accuracy not less than the standard accuracy threshold as the target sample data, to filter out the target sample data set;
    使用预设机器学习模型对所述目标样本数据集进行训练,得到所述风险评估模型。The target sample data set is trained using a preset machine learning model to obtain the risk assessment model.
  15. 一种计算机可读存储介质,其中,所述计算机可读存储介质上存储有额度预测程序,其中所述额度预测程序被处理器执行时,实现以下方法:A computer-readable storage medium, wherein a quota prediction program is stored on the computer-readable storage medium, wherein when the quota prediction program is executed by a processor, the following method is implemented:
    获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型;Obtain the insurance application information to be evaluated of the target insurance applicant, and obtain the previous insurance application information and the current insurance application type of the target insurance applicant according to the insurance application information to be evaluated;
    选择与所述本次投保类型对应的预训练的风险评估模型,利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征,其中,所述风险评估模型是利用特征工程筛选后的训练数据集通过机器学习算法训练所得;Selecting a pre-trained risk assessment model corresponding to the type of insurance this time, using the risk assessment model to extract text features and numerical features in the insurance application information to be assessed, and combining the text features and numerical features to obtain a target feature, wherein the risk assessment model is obtained by using the training data set screened by feature engineering to be trained by a machine learning algorithm;
    根据所述目标特征确定所述待评估投保信息的风险等级,并在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子;Determine the risk level of the insurance application information to be evaluated according to the target feature, and obtain the quota adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than high risk;
    结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息。Combined with the previous insurance application information and the limit adjustment factor, the insurance addition limit information of the target insurance applicant is predicted.
  16. 如权利要求15所述的计算机可读存储介质,其中,执行所述利用所述风险评估模型提取所述待评估投保信息中的文字特征与数字特征,并将所述文字特征与数字特征组合得到目标特征的步骤包括:The computer-readable storage medium according to claim 15, wherein the extraction of text features and numerical features in the insurance application information to be assessed by using the risk assessment model is performed, and the text features and numerical features are combined to obtain The steps to target features include:
    基于所述风险评估模型,获取所述待评估投保信息中的文字信息与数字信息;Based on the risk assessment model, obtain text information and numerical information in the insurance application information to be assessed;
    将所述文字信息采用独热编码处理得到所述文字特征,并将所述数字信息进行缺失值处理与稠密处理得到所述数字特征;The text information is processed by one-hot encoding to obtain the text features, and the digital information is processed with missing values and dense to obtain the digital features;
    获取所述文字特征与所述数字特征的互信息值,并基于所述互信息值将所述文字特征与所述数字特征分为可组合的第一特征与不可组合的第二特征;Obtaining the mutual information value of the text feature and the digital feature, and dividing the text feature and the digital feature into a combinable first feature and an uncombinable second feature based on the mutual information value;
    将所述第一特征进行组合得到组合特征,将所述组合特征与所述第二特征作为所述目标特征。The first feature is combined to obtain a combined feature, and the combined feature and the second feature are used as the target feature.
  17. 如权利要求15所述的计算机可读存储介质,其中,执行所述在检测到所述风险等级为高风险以下等级时,获取所述待评估投保信息的额度调节因子的步骤包括:The computer-readable storage medium according to claim 15, wherein the step of obtaining the limit adjustment factor of the insurance application information to be evaluated when it is detected that the risk level is lower than the high risk level comprises:
    在所述风险等级为高风险以下等级时,判断所述目标投保人是否通过自核;When the risk level is below high risk, determine whether the target insured has passed the self-check;
    若所述目标投保人通过自核,则获取所述待评估投保信息对应的地区免体检标准、保险业务员等级以及所述本次投保类型的险种调节系数作为所述额度调节因子。If the target insured passes the self-check, the regional medical exemption standard corresponding to the insurance application information to be evaluated, the insurance salesperson level, and the insurance type adjustment coefficient of the current insurance application type are obtained as the quota adjustment factor.
  18. 如权利要求17所述的计算机可读存储介质,其中,执行所述结合所述既往投保信息与所述额度调节因子,预测所述目标投保人的加保额度信息的步骤包括:The computer-readable storage medium of claim 17, wherein the step of combining the previous insurance application information and the amount adjustment factor to predict the insurance addition amount information of the target applicant comprises:
    根据所述地区免体检标准确定所述最高免体检保额,并结合所述既往投保信息得到初始加保额度;Determine the maximum medical-exemption insurance amount according to the regional medical-exemption standard, and obtain the initial additional insurance amount in combination with the previous insurance information;
    使用所述险种调节系数对所述初始加保额度进行调节,得到调节加保额度;Using the insurance type adjustment coefficient to adjust the initial insurance coverage amount to obtain an adjusted insurance coverage amount;
    获取与所述保险业务员等级对应的等级调节额度,并使用所述等级调节额度对所述调节加保额度进行调节,得到所述目标投保人的加保额度信息,其中,所述目标投保人的加保额度信息存储于区块链中。Obtain the level adjustment limit corresponding to the level of the insurance clerk, and use the level adjustment limit to adjust the adjustment plus insurance limit, and obtain the information on the increase insurance limit of the target insurance applicant, wherein the target insurance applicant The insurance amount information is stored in the blockchain.
  19. 如权利要求15所述的计算机可读存储介质,其中,执行所述获取目标投保人的待评估投保信息,并根据所述待评估投保信息得到所述目标投保人的既往投保信息以及本次投保类型的步骤包括:The computer-readable storage medium according to claim 15, wherein the obtaining of the insurance application information to be evaluated of the target insurance applicant is executed, and the previous insurance application information and the current insurance application of the target insurance applicant are obtained according to the to-be-evaluated insurance application information Types of steps include:
    在接收到投保指令时,基于所述投保指令获取所述目标投保人的投保问卷作为所述待评估投保信息;When receiving the insurance application instruction, obtain the insurance application questionnaire of the target applicant based on the insurance application instruction as the insurance application information to be evaluated;
    获取所述投保问卷中所述目标投保人填写的身份信息与投保名称信息,根据所述投保名称信息确定所述本次投保类型;Obtain the identity information and insurance name information filled in by the target insurance applicant in the insurance application questionnaire, and determine the current insurance application type according to the insurance application name information;
    根据所述身份信息判断所述目标投保人是否为已投保用户;Determine whether the target insured is an insured user according to the identity information;
    若所述目标投保人为已投保用户,则在投保信息库中查找到所述目标投保人的既往投保信息。If the target insurance applicant is an insured user, the previous insurance application information of the target insurance applicant is found in the insurance application information database.
  20. 如权利要求15-19中任一项所述的计算机可读存储介质,其中,所述选择与所述本次投保类型对应的预训练的风险评估模型的步骤之前,所述额度预测程序被处理器执行时还用于实现:The computer-readable storage medium of any one of claims 15-19, wherein the quota prediction program is processed before the step of selecting a pre-trained risk assessment model corresponding to the current insurance type It is also used to implement:
    收集包含多条样本数据的初始样本数据集,其中,每条样本数据包括样本特征与对应的样本标签;Collect an initial sample data set containing multiple pieces of sample data, wherein each piece of sample data includes sample features and corresponding sample labels;
    根据所述样本特征与所述样本标签构建特征索引集,并计算所述特征索引集的最高判别准确率;Build a feature index set according to the sample feature and the sample label, and calculate the highest discrimination accuracy rate of the feature index set;
    将所述最高判别准确率与预设标准准确率阈值进行比较,并将不小于所述标准准确率阈值的最高判别准确率对应的样本数据作为目标样本数据,以筛选出目标样本数据集;Comparing the highest discrimination accuracy with a preset standard accuracy threshold, and using the sample data corresponding to the highest discrimination accuracy not less than the standard accuracy threshold as the target sample data, to filter out the target sample data set;
    使用预设机器学习模型对所述目标样本数据集进行训练,得到所述风险评估模型。The target sample data set is trained using a preset machine learning model to obtain the risk assessment model.
PCT/CN2021/090569 2020-12-07 2021-04-28 Quota prediction method and device, and computer-readable storage medium WO2022121217A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011420120.9 2020-12-07
CN202011420120.9A CN112529716B (en) 2020-12-07 2020-12-07 Method, device and computer readable storage medium for predicting credit

Publications (1)

Publication Number Publication Date
WO2022121217A1 true WO2022121217A1 (en) 2022-06-16

Family

ID=74998044

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/090569 WO2022121217A1 (en) 2020-12-07 2021-04-28 Quota prediction method and device, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN112529716B (en)
WO (1) WO2022121217A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115434870A (en) * 2022-07-29 2022-12-06 国家电投集团云南国际电力投资有限公司 Method, device and equipment for processing vibration data of wind power generation set and storage medium
CN116051296A (en) * 2022-12-28 2023-05-02 中国银行保险信息技术管理有限公司 Customer evaluation analysis method and system based on standardized insurance data
CN116668095A (en) * 2023-05-16 2023-08-29 江苏信创网安数据科技有限公司 Intelligent network security assessment method and system
CN117011074A (en) * 2023-07-25 2023-11-07 明亚保险经纪股份有限公司 Risk early warning method and platform

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112529716B (en) * 2020-12-07 2023-10-24 平安科技(深圳)有限公司 Method, device and computer readable storage medium for predicting credit
CN113159460A (en) * 2021-05-24 2021-07-23 中国平安人寿保险股份有限公司 User data processing method and device, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150039351A1 (en) * 2013-08-02 2015-02-05 Transamerica Corporation Categorizing Life Insurance Applicants to Determine Suitable Life Insurance Products
CN109670974A (en) * 2018-12-14 2019-04-23 中国平安人寿保险股份有限公司 A kind of risk monitoring and control method and device, electric terminal
CN110827157A (en) * 2019-11-05 2020-02-21 泰康保险集团股份有限公司 Data processing method and device, storage medium and electronic equipment
CN111738870A (en) * 2020-07-28 2020-10-02 工保科技(浙江)有限公司 Method and platform for identifying insurance risk of engineering performance guarantee based on characteristic engineering
CN112529716A (en) * 2020-12-07 2021-03-19 平安科技(深圳)有限公司 Quota prediction method, device and computer readable storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20050037720A (en) * 2003-10-20 2005-04-25 쌍용화재해상보험주식회사 Insurance management system and method using a risk estimation
US11151654B2 (en) * 2015-09-30 2021-10-19 Johnson Controls Tyco IP Holdings LLP System and method for determining risk profile, adjusting insurance premiums and automatically collecting premiums based on sensor data
IT201700025082A1 (en) * 2017-03-07 2018-09-07 Frediano Rossi Method and management system for data relating to risks
CN109165840B (en) * 2018-08-20 2022-06-21 平安科技(深圳)有限公司 Risk prediction processing method, risk prediction processing device, computer equipment and medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150039351A1 (en) * 2013-08-02 2015-02-05 Transamerica Corporation Categorizing Life Insurance Applicants to Determine Suitable Life Insurance Products
CN109670974A (en) * 2018-12-14 2019-04-23 中国平安人寿保险股份有限公司 A kind of risk monitoring and control method and device, electric terminal
CN110827157A (en) * 2019-11-05 2020-02-21 泰康保险集团股份有限公司 Data processing method and device, storage medium and electronic equipment
CN111738870A (en) * 2020-07-28 2020-10-02 工保科技(浙江)有限公司 Method and platform for identifying insurance risk of engineering performance guarantee based on characteristic engineering
CN112529716A (en) * 2020-12-07 2021-03-19 平安科技(深圳)有限公司 Quota prediction method, device and computer readable storage medium

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115434870A (en) * 2022-07-29 2022-12-06 国家电投集团云南国际电力投资有限公司 Method, device and equipment for processing vibration data of wind power generation set and storage medium
CN115434870B (en) * 2022-07-29 2023-11-07 云南滇能智慧能源有限公司 Method, device, equipment and storage medium for processing vibration data of wind power generation set
CN116051296A (en) * 2022-12-28 2023-05-02 中国银行保险信息技术管理有限公司 Customer evaluation analysis method and system based on standardized insurance data
CN116051296B (en) * 2022-12-28 2023-09-29 中国银行保险信息技术管理有限公司 Customer evaluation analysis method and system based on standardized insurance data
CN116668095A (en) * 2023-05-16 2023-08-29 江苏信创网安数据科技有限公司 Intelligent network security assessment method and system
CN116668095B (en) * 2023-05-16 2024-03-29 江苏信创网安数据科技有限公司 Intelligent network security assessment method and system
CN117011074A (en) * 2023-07-25 2023-11-07 明亚保险经纪股份有限公司 Risk early warning method and platform

Also Published As

Publication number Publication date
CN112529716A (en) 2021-03-19
CN112529716B (en) 2023-10-24

Similar Documents

Publication Publication Date Title
WO2022121217A1 (en) Quota prediction method and device, and computer-readable storage medium
Richman et al. A neural network extension of the Lee–Carter model to multiple populations
CN108648074B (en) Loan assessment method, device and equipment based on support vector machine
TW530234B (en) Methods and systems for efficiently sampling portfolios for optimal underwriting
US9697469B2 (en) Method and system for generating and aggregating models based on disparate data from insurance, financial services, and public industries
US8515862B2 (en) Computer-implemented systems and methods for integrated model validation for compliance and credit risk
TWI242724B (en) Methods and systems for optimizing return and present value
Van Gestel et al. A process model to develop an internal rating system: Sovereign credit ratings
CN112862298B (en) Credit evaluation method for user portrait
CN109376766B (en) Portrait prediction classification method, device and equipment
CN112990386B (en) User value clustering method and device, computer equipment and storage medium
CN111180065A (en) Insurance user evaluation method and device, electronic equipment and computer readable medium
CN113902037A (en) Abnormal bank account identification method, system, electronic device and storage medium
CN112116103A (en) Method, device and system for evaluating personal qualification based on federal learning and storage medium
CN112990989B (en) Value prediction model input data generation method, device, equipment and medium
Saulsberry et al. The social vulnerability metric (SVM) as a new tool for public health
US20140156544A1 (en) Non-Tangible Assets Valuation Tool
KR102005734B1 (en) Total finance service offering system reflecting quantitative evaluating result of tangible assets based on machine learning
CN113269179B (en) Data processing method, device, equipment and storage medium
Wofford A simulation approach to the appraisal of income producing real estate
CN114757497A (en) Technology trust-increasing method for science and technology enterprise, device, equipment and medium thereof
US20200202444A1 (en) Servicing a plurality of rived longevity-contingent instruments
KR20210126408A (en) Device and method for ai calculating damage using disease big data
CN110942180A (en) Industrial design matching service party prediction method based on xgboost algorithm
Baret et al. Forecasting stability and growth pact compliance using machine learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21901932

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21901932

Country of ref document: EP

Kind code of ref document: A1