WO2020073727A1 - Risk forecast method, device, computer apparatus, and storage medium - Google Patents

Risk forecast method, device, computer apparatus, and storage medium Download PDF

Info

Publication number
WO2020073727A1
WO2020073727A1 PCT/CN2019/099382 CN2019099382W WO2020073727A1 WO 2020073727 A1 WO2020073727 A1 WO 2020073727A1 CN 2019099382 W CN2019099382 W CN 2019099382W WO 2020073727 A1 WO2020073727 A1 WO 2020073727A1
Authority
WO
WIPO (PCT)
Prior art keywords
prediction
sample
risk
enterprise
label
Prior art date
Application number
PCT/CN2019/099382
Other languages
French (fr)
Chinese (zh)
Inventor
于修铭
汪伟
肖京
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2020073727A1 publication Critical patent/WO2020073727A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities

Definitions

  • This application relates to a risk prediction method, device, computer equipment and storage medium.
  • the risk early warning system is based on the characteristics of the research object, by collecting relevant data and information, monitoring the changing trend of risk factors, and evaluating the strength of various risk states deviating from the early warning line, sending early warning signals to the decision-making layer and taking pre-control in advance Countermeasure system. Therefore, to build an early warning system, you must first build an evaluation index system and analyze and process the index categories. Second, based on the early warning model, comprehensive evaluation of the evaluation index system. Finally, the early warning interval is set according to the judgment results, and corresponding countermeasures are taken.
  • a risk prediction method for example, a risk prediction method, apparatus, computer equipment, and storage medium are provided.
  • a risk prediction method includes:
  • the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
  • Extract enterprise indicators from all dimensions of the enterprise include various financial indicators, legal litigation information, public opinion information, and import and export lists;
  • Forecast labels include risk prediction labels and risk-free prediction labels
  • the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag;
  • a risk prediction device includes:
  • the dimension information obtaining module is used to receive the prediction request sent by the terminal, and obtain each dimension information of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise ;
  • the enterprise index extraction module is used to extract enterprise indexes from all dimensions of the enterprise; the enterprise indexes include various financial indexes, legal litigation information, public opinion information, and import and export lists;
  • the tag acquisition module is used to input the enterprise index as a prediction feature into an available prediction model that is pre-trained, and output a prediction tag corresponding to the enterprise index;
  • the available prediction model is based on a prompt corresponding to the training data set and sample data Information training;
  • the prediction labels include risk prediction labels and risk-free prediction labels;
  • the prediction information obtaining module is used to obtain prediction information corresponding to the enterprise index according to the prediction label; the prediction information includes risk prediction information corresponding to the risk prediction label, and corresponding to the risk-free prediction label No risk prediction information; and
  • the sending module is used to send the prediction information to the corresponding terminal.
  • a computer device includes a memory and one or more processors.
  • the memory stores computer-readable instructions.
  • the one or more processors are executed The following steps:
  • the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
  • Extract enterprise indicators from all dimensions of the enterprise include various financial indicators, legal litigation information, public opinion information, and import and export lists;
  • Forecast labels include risk prediction labels and risk-free prediction labels
  • the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag;
  • One or more non-volatile storage media storing computer readable instructions, which when executed by one or more processors, cause the one or more processors to perform the following steps:
  • the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
  • Extract enterprise indicators from all dimensions of the enterprise include various financial indicators, legal litigation information, public opinion information, and import and export lists;
  • Forecast labels include risk prediction labels and risk-free prediction labels
  • the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag;
  • FIG. 1 is an application scenario diagram of a risk prediction method according to one or more embodiments.
  • FIG. 2 is a schematic flowchart of a risk prediction method according to one or more embodiments.
  • FIG. 3 is a schematic flowchart of constructing an initial model according to one or more embodiments.
  • FIG. 4 is a schematic flowchart of using a sample label as a prediction label of an initial model according to one or more embodiments.
  • FIG. 5 is a block diagram of a risk prediction device according to one or more embodiments.
  • Figure 6 is a block diagram of a computer device in accordance with one or more embodiments.
  • the risk prediction method provided by this application can be applied in the application environment shown in FIG. 1.
  • the terminal 102 and the server 104 communicate via the network.
  • the server 104 receives the prediction request sent by the terminal 102 and obtains the information of each dimension of the enterprise corresponding to the prediction request.
  • the prediction request carries the enterprise logo, and the enterprise logo corresponds to the information of each dimension of the enterprise.
  • the server 104 extracts enterprise indicators from various dimensions of the enterprise.
  • the enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists.
  • the enterprise index is used as a prediction feature to input into the available prediction model that is pre-trained, and the prediction label corresponding to the enterprise index is output.
  • the prediction model can be trained according to the prompt information corresponding to the training data set and sample data. Risk prediction label.
  • the server 104 obtains the prediction information corresponding to the enterprise index according to the prediction tag, the prediction information includes the risk prediction information corresponding to the risk prediction tag, and the risk-free prediction information corresponding to the risk-free prediction tag, and sends the prediction information to the corresponding terminal 102.
  • the terminal 102 may be, but not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices.
  • the server 104 may be implemented by an independent server or a server cluster composed of multiple servers.
  • a risk prediction method is provided. Taking the method applied to the server in FIG. 1 as an example for illustration, it includes the following steps:
  • the server receives the prediction request sent by the terminal, and obtains information on each dimension of the enterprise corresponding to the prediction request.
  • the prediction request carries an enterprise ID, and the enterprise ID corresponds to each dimension information of the enterprise.
  • the prediction request carries the enterprise logo, and there is a correspondence between the enterprise logo and each dimension information of the enterprise, and the server can obtain each dimension information of the enterprise corresponding to the enterprise logo according to the correspondence between the enterprise logo and each dimension information of the enterprise .
  • the server receives the prediction request sent by the terminal, and parses the prediction request to obtain the enterprise identification carried in the prediction request. Obtain the correspondence between the enterprise logo and each dimension of the enterprise, and according to the correspondence between the acquired enterprise logo and each dimension of the enterprise, obtain the dimension information of the enterprise corresponding to the enterprise logo from the database.
  • the enterprise management system performs daily operations, and can call the interface to obtain information and data related to the enterprise from other platforms connected to the enterprise management system, based on the acquired enterprise-related information And data to generate information about each dimension of the corresponding enterprise.
  • the server After receiving the prediction request sent by the corresponding terminal where the enterprise is located, the server parses the prediction request sent by the corresponding terminal and obtains the enterprise identification carried in the prediction request. Furthermore, the server can identify the corresponding enterprise according to the enterprise identification, and obtain the information of each dimension of the enterprise corresponding to the enterprise identification from the database.
  • the server extracts enterprise indicators from all dimensions of the enterprise.
  • the enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists.
  • the server acquires the information of each dimension of the enterprise from the database, and extracts the enterprise index from the information of each dimension of the enterprise. Among them, there is a correspondence between each dimension information of the enterprise and the enterprise index.
  • the server can obtain the correspondence between each dimension information of the enterprise and the enterprise index, and according to the correspondence between the dimension information of the enterprise and the enterprise index, Separately extract the enterprise indexes corresponding to the information of each dimension.
  • Enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists.
  • financial indicators refer to the relative indicators for enterprises to summarize and evaluate the financial status and operating results, including: debt solvency indicators, including asset-liability ratio, current ratio, quick ratio; operating ability indicators, including accounts receivable turnover , Inventory turnover rate; profitability indicators, including capital profit rate, sales profit rate (operating income tax rate), cost expense profit rate, etc.
  • the legal litigation information indicates the legal situation of the enterprise's participation in legal affairs.
  • Public opinion is the abbreviation of "public opinion situation", which refers to the occurrence, development and change of intermediary social events in a certain social space, and the public as the subject of social managers, enterprises, individuals and other various organizations as the object
  • Social attitudes generated and held by their political, social, and moral orientations, including positive and negative public opinion can also express the beliefs, attitudes, opinions, and emotions expressed by more people about various phenomena and problems in society Wait for the sum of performance.
  • the import and export list shows the import and export situation of the company's products, including product type, corresponding product quantity, product selling price and purchase price, etc.
  • the server inputs the enterprise index as a prediction feature into the available prediction model that is pre-trained, and outputs the prediction label corresponding to the enterprise index.
  • the available prediction model is trained based on the prompt information corresponding to the training data set and sample data. Labels and risk-free predictive labels.
  • the server inputs the enterprise index as a prediction feature into the available prediction model, and obtains the risk prediction label and the risk-free prediction label corresponding to the enterprise index, respectively. Furthermore, the server can obtain the risk prediction information corresponding to the risk prediction label according to the correspondence between the preset risk prediction label and the risk prediction information, and according to the correspondence between the preset risk-free prediction label and the risk-free prediction information To obtain the risk-free prediction information corresponding to the risk-free prediction label.
  • the server obtains the sample data from the database and extracts the sample label from the sample data, and uses the sample label as the prediction label of the initial model.
  • the server obtains the indexes corresponding to the sample data and uses the indexes corresponding to the sample data as the prediction features of the initial model, and then generates a training data set according to the prediction features and the prediction labels. Use the prompt information corresponding to the training data set and the sample data to train the initial model to obtain an available prediction model.
  • the initial model is an existing risk prediction model, but it is not applicable to all companies that need to make risk predictions. It is necessary to use the sample label, that is, the prediction label, and the prediction feature, that is, the index corresponding to the sample data to generate the training data set.
  • the server can use the prompt information corresponding to the training data set and the sample data to train the initial model, and obtain the available test model corresponding to the enterprise according to the initial model after the training.
  • the server obtains prediction information corresponding to the enterprise index according to the prediction tag.
  • the prediction information includes risk prediction information corresponding to the risk prediction tag and risk-free prediction information corresponding to the risk-free prediction tag.
  • the server obtains the risk prediction information corresponding to the risk prediction tag according to the correspondence between the preset risk prediction tag and the risk prediction information, and according to the correspondence between the preset risk-free prediction tag and the risk-free prediction information Relationship to obtain risk-free prediction information corresponding to the risk-free prediction label.
  • the risk prediction information indicates that the corresponding company has risk information, and the prediction information received by the corresponding company from the server is risk prediction information, which needs to be further investigated and processed based on the received risk prediction information.
  • the risk-free forecast information indicates that there is no risk information for the corresponding enterprise, and the forecast information sent by the server received by the corresponding enterprise is the risk-free forecast information.
  • S210 The server sends the prediction information to the corresponding terminal.
  • the risk prediction information indicates that the corresponding company has risk information, and the prediction information received by the corresponding company from the server is risk prediction information, which needs to be further investigated and processed based on the received risk prediction information.
  • the risk-free forecast information indicates that there is no risk information for the corresponding enterprise, and the forecast information sent by the server received by the corresponding enterprise is the risk-free forecast information.
  • the server obtains the information of each dimension of the enterprise according to the prediction request sent by the terminal, and extracts the enterprise index from the information of each dimension of the enterprise. Therefore, the enterprise index can be input into the available prediction model that is pre-trained as the prediction feature, and the prediction information corresponding to the enterprise index can be obtained, and the prediction information can be sent to the terminal.
  • data update there is no need to repeatedly perform data preprocessing on each indicator, which can reduce resource consumption, and at the same time, send corresponding prediction information to corresponding terminals for different prediction tags, which can further improve the risk prediction effect.
  • the steps of constructing the initial model include:
  • the server obtains sample data from the database, and extracts sample labels from the sample data, and uses the sample labels as the prediction labels of the initial model.
  • the server extracts sample tags from the sample data, and obtains the attributes of the sample tags, and divides the sample data into the first type of samples and the second type of samples according to the attributes.
  • the server respectively obtains a first sample label corresponding to the first type of sample and a second sample label corresponding to the second type of sample.
  • the server uses the first sample label as the risk prediction label of the initial model, and uses the second sample label as the risk-free prediction label of the initial model.
  • the attribute of the sample label is used to indicate whether the sample data corresponding to the sample label carries risk data.
  • the server may divide the sample data into a first type sample and a second type sample according to the attributes of the sample label.
  • the first type sample is sample data carrying risk data
  • the second type sample is sample data not carrying risk data.
  • the first sample label is a sample label carrying risk data
  • the second sample label is a sample label without risk data.
  • the server acquires the sample index corresponding to the sample data, and uses the sample index as the prediction feature of the initial model.
  • the server obtains the index corresponding to the sample data by acquiring the correspondence between the sample data and the index, and according to the correspondence between the sample data and the index.
  • the server obtains the correlation between the sample data and the predicted features of the initial model, and uses the indexes corresponding to the sample data as the predicted features of the initial model according to the correlation between the sample data and the predicted features.
  • the server obtains the sample data set from the database and stores the sample data set in the preset object. Furthermore, the server acquires the training parameters corresponding to the training function by calling the training function in the database and according to the correspondence between the preset training function and the training parameter. The sample data set is called from the preset object, and the initial model is constructed according to the training parameters and the sample data set.
  • the training parameters include the maximum depth of the tree, the contraction step size and the number of iterations.
  • the maximum depth of the tree represents the longest path from the root node to the leaf node plus 1, and from a recursive point of view, the depth of the tree is equal to the depth of its largest left and right subtree plus 1.
  • the contraction step length means that a value is added to each contraction operation by adding a certain number (this is the step length), and this operation is repeatedly performed. Iteration refers to the process of repeatedly performing a series of calculation steps and sequentially determining the subsequent amounts from the previous amount. Each result of this process is obtained by performing the same calculation step on the previous result, and the number of iterations means The number of times to repeat a series of calculation steps.
  • the server generates a training data set according to the predicted feature and the predicted label.
  • the prediction feature is an indicator corresponding to the sample data
  • the prediction label is a sample label of the sample data, including a first sample label and a second sample label, where the first sample label is a sample label carrying risk data, the second The sample label is a sample label of non-risk data.
  • the server generates a training data set according to the index corresponding to the sample data and the first sample label, and the index corresponding to the sample data and the second sample label.
  • the server uses the prompt information corresponding to the training data set and the sample data to train the initial model to obtain an available prediction model.
  • the server trains the sample data set according to the training parameters to obtain the training data set, and applies the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage. According to the prediction percentage and sample data set, build a usable prediction model.
  • the training parameters include the maximum depth of the tree, the contraction step size and the number of iterations.
  • the maximum depth of the tree indicates that the longest path from the root node to the leaf node is increased by 1, and the contraction step indicates that a value is added to each contraction operation, plus a certain number (this is the step size), and repeated execution In this operation, the number of iterations indicates the number of times to repeat a series of calculation steps.
  • the initial model is an existing risk prediction model, but it is not suitable for all companies that need to make risk predictions.
  • the prompt information corresponding to the training data set and sample data is applied to the initial model to obtain the prediction percentage.
  • the prediction percentage is used to represent the prompt information corresponding to the training data set and the sample data. After being applied to the initial model, it is obtained by calculating the degree of association between the training data set and the sample data.
  • the server extracts the sample label from the obtained sample data, and uses the sample label as the prediction label of the initial model. Obtain the index corresponding to the sample data, and use the index corresponding to the sample data as the prediction feature of the initial model, and generate the training data set according to the prediction feature and the prediction label, and then use the prompt information corresponding to the training data set and the sample data to carry out the initial model Train to get a usable prediction model. Therefore, by directly using the sample label as the prediction label of the initial model, it is possible to reduce the need for reprocessing the sample data every time the data is updated, and reduce resource consumption.
  • the step of extracting the sample label from the sample data and using the sample label as the prediction label of the initial model includes:
  • the server extracts sample tags from the sample data and obtains the attributes of the sample tags.
  • the server divides the sample data into the first type of samples and the second type of samples according to the attributes.
  • the server obtains the sample data and the correspondence between the sample data and the sample label, and extracts the corresponding sample label from the sample data according to the correspondence between the sample data and the sample label.
  • the attribute of the sample label is used to indicate whether the sample data corresponding to the sample label carries risk data.
  • the server may divide the sample data into a first type sample and a second type sample according to the attributes of the sample label.
  • the first type sample is sample data carrying risk data
  • the second type sample is sample data not carrying risk data.
  • the server obtains a first sample label corresponding to the first type of sample and a second sample label corresponding to the second type of sample.
  • the server acquires the corresponding relationship between the first type of sample and the first sample label, and obtains the first corresponding to the first type of sample according to the corresponding relationship between the first type of sample and the first sample label Sample label.
  • the first sample label is a sample label that carries risk data.
  • the server obtains the second sample label corresponding to the second type sample by acquiring the corresponding relationship between the second type sample and the second sample label, and according to the corresponding relationship between the second type sample and the second sample label.
  • the second sample label is a sample label of non-risk data.
  • the server uses the first sample label as the risk prediction label of the initial model, and uses the second sample label as the risk-free prediction label of the initial model.
  • the first sample label is a sample label carrying risk data
  • the second sample label is a sample label without risk data.
  • the server uses the first sample label, that is, the sample label that carries risk data, as the risk prediction label of the initial model, and corresponds to the first type of sample that carries risk data.
  • the second sample label that is, the sample label that does not carry risk data, is used as the risk-free prediction label of the initial model, and corresponds to the second type sample that does not carry risk data.
  • the server extracts the sample label from the sample data and obtains the attribute of the sample label, and divides the sample data into the first type sample and the second type sample according to the attribute. Obtain the first sample label corresponding to the first type sample and the second sample label corresponding to the second type sample, and use the first sample label as the risk prediction label of the initial model and the second sample label as the initial model Of risk-free prediction labels. Therefore, the sample tags carrying the risk data and the non-risk data can be distinguished, which is beneficial for performing targeted sample data processing and improving work efficiency.
  • a risk prediction method is provided, and the method further includes the following steps:
  • the server obtains the sample data set from the database and stores the sample data set in the preset object; calls the training function in the database; according to the correspondence between the preset training function and the training parameter, obtains the training corresponding to the training function Parameters; call the sample data set from the preset object, and build the initial model based on the training parameters and the sample data set.
  • the training parameters include the maximum depth of the tree, the contraction step size and the number of iterations.
  • the maximum depth of the tree indicates that the longest path from the root node to the leaf node is increased by 1, and the contraction step indicates that a value is added to each contraction operation, plus a certain number (this is the step size), and repeated execution In this operation, the number of iterations indicates the number of times to repeat a series of calculation steps.
  • the server obtains the training parameters corresponding to the called training function by acquiring the correspondence between the training function and the training parameter, and according to the correspondence between the training function and the training parameter. Use the training parameters to train the sample data set called from the preset object to obtain the initial model.
  • the server obtains the training parameters corresponding to the training function according to the correspondence between the preset training function and the training parameter, and calls the sample data set from the preset object, according to the training parameter and the sample data set To build the initial model. Therefore, the establishment of the initial model can be achieved, and the training data can be used to train the sample data set, thereby providing a corresponding basis for the subsequent available prediction models, and improving work efficiency.
  • the step of receiving the prediction request sent by the terminal and obtaining the information of each dimension of the enterprise according to the prediction request includes:
  • the server receives and parses the prediction request sent by the terminal; obtains the enterprise logo carried in the prediction request; based on the enterprise logo
  • the prediction request carries the enterprise logo, and there is a correspondence between the enterprise logo and each dimension information of the enterprise, and the server can obtain each dimension information of the enterprise corresponding to the enterprise logo according to the correspondence between the enterprise logo and each dimension information of the enterprise .
  • the server receives the prediction request sent by the terminal, and parses the prediction request to obtain the enterprise identification carried in the prediction request. Obtain the correspondence between the enterprise logo and each dimension of the enterprise, and according to the correspondence between the acquired enterprise logo and each dimension of the enterprise, obtain the dimension information of the enterprise corresponding to the enterprise logo from the database.
  • the server receives and parses the prediction request sent by the terminal, obtains the request tag carried by the prediction request, and obtains the dimension information of the enterprise corresponding to the request tag. Therefore, it is possible to obtain the information of each dimension of the targeted enterprise, clarify the correspondence between different prediction requests and the corresponding enterprise, and facilitate the acquisition and storage of the dimension information of the personalized enterprise.
  • the step of obtaining prediction information corresponding to the enterprise index according to the prediction label includes:
  • the server obtains the risk prediction information corresponding to the risk prediction tag according to the correspondence between the preset risk prediction tag and the risk prediction information; according to the correspondence between the preset risk-free prediction tag and the risk-free prediction information, the Risk-free prediction information corresponding to the risk-free prediction label.
  • the risk prediction information indicates that there is risk information for the corresponding enterprise, and the prediction information received by the corresponding enterprise from the server is risk prediction information, which needs to be further investigated and processed according to the received risk prediction information.
  • the risk-free forecast information indicates that there is no risk information for the corresponding enterprise, and the forecast information sent by the server received by the corresponding enterprise is the risk-free forecast information.
  • the initial model is an existing risk prediction model, but it is not applicable to all companies that need to make risk predictions. It is necessary to use the sample label, that is, the prediction label, and the prediction feature, that is, the index corresponding to the sample data to generate the training data set.
  • the server can use the prompt information corresponding to the training data set and the sample data to train the initial model, and obtain the available test model corresponding to the enterprise according to the initial model after the training.
  • the server can obtain the risk prediction information corresponding to the risk prediction label and the risk-free prediction information corresponding to the risk-free prediction label by separately acquiring the risk prediction label and the risk-free prediction label corresponding to the enterprise index. Therefore, corresponding prediction labels are obtained for different enterprise indicators, and corresponding risk prediction information and risk-free prediction information can be obtained separately, thereby improving the accuracy of risk prediction.
  • using the prompt information corresponding to the training data set and the sample data to train the initial model to obtain a usable prediction model includes:
  • the server trains the sample data set according to the training parameters to obtain the training data set; applies the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage; according to the prediction percentage and the sample data set, constructs an available prediction model.
  • the training parameters include the maximum depth of the tree, the contraction step size and the number of iterations.
  • the maximum depth of the tree indicates that the longest path from the root node to the leaf node is increased by 1, and the contraction step indicates that a value is added to each contraction operation, plus a certain number (this is the step size), and repeated execution In this operation, the number of iterations indicates the number of times to repeat a series of calculation steps.
  • the initial model is an existing risk prediction model, but it is not applicable to all enterprises that need to make risk predictions.
  • the prompt information corresponding to the training data set and sample data is applied to the initial model to obtain the prediction percentage.
  • the prediction percentage is used to represent the prompt information corresponding to the training data set and the sample data. After being applied to the initial model, it is obtained by calculating the degree of association between the training data set and the sample data.
  • the server trains the sample data set according to the training parameters to obtain the training data set, and applies the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage, and then obtains the prediction percentage based on Sample data sets to build available predictive models. Therefore, by further training the initial model, the available prediction model corresponding to the enterprise is obtained, and the accuracy of risk prediction according to the enterprise index is improved.
  • steps in the flowcharts of FIGS. 2-4 are displayed in order according to the arrows, the steps are not necessarily executed in the order indicated by the arrows. Unless clearly stated in this article, the execution of these steps is not strictly limited in order, and these steps can be executed in other orders. Moreover, at least some of the steps in FIGS. 2-4 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but may be executed at different times. These sub-steps or stages The execution order of is not necessarily sequential, but may be executed in turn or alternately with at least a part of other steps or sub-steps or stages of other steps.
  • a risk prediction device which includes: a dimension information acquisition module 502, an enterprise index extraction module 504, a label acquisition module 506, a prediction information acquisition module 508, and a transmission module 510, where:
  • the dimension information obtaining module 502 is used to receive the prediction request sent by the terminal and obtain each dimension information of the enterprise corresponding to the prediction request; the prediction request carries the enterprise ID, and the enterprise ID corresponds to each dimension information of the enterprise.
  • the enterprise index extraction module 504 is used to extract enterprise indexes from all dimensions of the enterprise; the enterprise indexes include various financial indexes, legal litigation information, public opinion information, and import and export lists.
  • the label acquisition module 506 is used to input the enterprise index as the prediction feature into the available prediction model obtained by pre-training, and output the prediction label corresponding to the enterprise index; the available prediction model is trained based on the prompt information corresponding to the training data set and sample data; prediction Labels include risk prediction labels and risk-free prediction labels.
  • the prediction information obtaining module 508 is used to obtain the prediction information corresponding to the enterprise index according to the prediction label; the prediction information includes the risk prediction information corresponding to the risk prediction label and the risk-free prediction information corresponding to the risk-free prediction label.
  • the sending module 510 is used to send the prediction information to the corresponding terminal.
  • the server acquires the information of each dimension of the enterprise according to the prediction request sent by the terminal, and extracts the enterprise index from the information of each dimension of the enterprise. Therefore, the enterprise index can be input into the available prediction model that is pre-trained as the prediction feature, the risk prediction information and the risk-free prediction information corresponding to the enterprise index can be obtained, and the risk prediction information and the risk-free prediction information can be sent to the corresponding terminal respectively.
  • data update there is no need to repeatedly perform data preprocessing on each indicator, which can reduce resource consumption, and at the same time, send corresponding prediction information to corresponding terminals for different prediction tags, which can further improve the risk prediction effect.
  • a risk prediction device is provided, and the device further includes:
  • the prediction label generation module is used to obtain sample data from the database, and extract the sample label from the sample data, and use the sample label as the prediction label of the initial model;
  • the prediction feature generation module is used to obtain the sample index corresponding to the sample data, and The sample index is used as the prediction feature of the initial model;
  • the training data set generation module is used to generate the training data set based on the prediction feature and the prediction label;
  • the available prediction model generation module is used to use the prompt information corresponding to the training data set and the sample data
  • the model is trained to obtain an available prediction model.
  • the server extracts the sample label from the acquired sample data, and uses the sample label as the prediction label of the initial model. Obtain the index corresponding to the sample data, and use the index corresponding to the sample data as the prediction feature of the initial model, and generate the training data set according to the prediction feature and the prediction label, and then use the prompt information corresponding to the training data set and the sample data to carry out the initial model Train to get a usable prediction model. Therefore, by directly using the sample label as the prediction label of the initial model, it is possible to reduce the need for reprocessing the sample data every time the data is updated, and reduce resource consumption.
  • a risk prediction device is provided, and the device further includes:
  • the sample data set acquisition module is used to obtain the sample data set from the database and store the sample data set into a preset object;
  • the training function call module is used to call the training function in the database;
  • the training parameter acquisition module is used to Correspondence between the preset training function and training parameters to obtain the training parameters corresponding to the training function;
  • the initial model building module is used to call the sample data set from the preset object and build according to the training parameters and sample data set The initial model.
  • the server obtains the training parameters corresponding to the training function according to the correspondence between the preset training function and the training parameter, and calls the sample data set from the preset object, according to the training parameter and the sample data set, Build the initial model. Therefore, the establishment of the initial model can be achieved, and the training data can be used to train the sample data set, thereby providing a corresponding basis for the subsequent available prediction models, and improving work efficiency.
  • the dimensional information acquisition module is also used to:
  • Receive and parse the prediction request sent by the terminal obtain the enterprise ID carried in the prediction request based on the correspondence between the enterprise ID and each dimension information of the enterprise, and obtain each dimension information of the enterprise corresponding to the enterprise ID.
  • the server receives and parses the prediction request sent by the terminal, obtains the enterprise ID carried in the prediction request, and obtains each dimension information of the enterprise corresponding to the enterprise ID. Therefore, it is possible to obtain the information of each dimension of the targeted enterprise, clarify the correspondence between different prediction requests and the corresponding enterprise, and facilitate the acquisition and storage of the dimension information of the personalized enterprise.
  • the prompt information acquisition module is also used to:
  • the server inputs enterprise indicators as prediction features into the available prediction model, respectively obtains the risk prediction label and the risk-free prediction label corresponding to the enterprise index, and then can obtain the risk prediction information corresponding to the risk prediction label and Risk-free prediction information corresponding to the risk-free prediction label. Therefore, corresponding prediction labels are obtained for different enterprise indicators, and corresponding risk prediction information and risk-free prediction information can be obtained separately, thereby improving the accuracy of risk prediction.
  • the predicted label generation module is also used to:
  • the server extracts the sample label from the sample data and obtains the attribute of the sample label, and divides the sample data into the first type sample and the second type sample according to the attribute. Obtain the first sample label corresponding to the first type sample and the second sample label corresponding to the second type sample, and use the first sample label as the risk prediction label of the initial model and the second sample label as the initial model Of risk-free prediction labels. Therefore, the sample tags carrying the risk data and the non-risk data can be distinguished, which is beneficial for performing targeted sample data processing and improving work efficiency.
  • the prediction information acquisition module is also used to:
  • the server can obtain the risk prediction information corresponding to the risk prediction label and the risk-free prediction information corresponding to the risk-free prediction label by separately acquiring the risk prediction label and the risk-free prediction label corresponding to the enterprise index. Therefore, corresponding prediction labels are obtained for different enterprise indicators, and corresponding risk prediction information and risk-free prediction information can be obtained separately, thereby improving the accuracy of risk prediction.
  • a predictive model building module may be used, which is also used to:
  • the server trains the sample data set according to the training parameters to obtain the training data set, and applies the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage, and then obtains the prediction percentage according to the prediction percentage Data sets, build available predictive models. Therefore, by further training the initial model, the available prediction model corresponding to the enterprise is obtained, and the accuracy of risk prediction according to the enterprise index is improved.
  • Each module in the above risk prediction device may be implemented in whole or in part by software, hardware, or a combination thereof.
  • the above modules may be embedded in the hardware or independent of the processor in the computer device, or may be stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
  • a computer device is provided.
  • the computer device may be a server, and its internal structure may be as shown in FIG. 6.
  • the computer device includes a processor, memory, network interface, and database connected by a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system, computer-readable instructions, and a database.
  • the internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile storage medium.
  • the database of the computer device is used to store information data of various dimensions of the enterprise.
  • the network interface of the computer device is used to communicate with external terminals through a network connection.
  • the computer readable instructions are executed by the processor to implement a risk prediction method.
  • FIG. 6 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or less components than shown in the figure, or combine certain components, or have a different arrangement of components.
  • a computer device includes a memory and one or more processors.
  • the memory stores computer-readable instructions.
  • the computer-readable instructions are executed by the processor, the unbalanced sample data preprocessing method provided in any embodiment of the present application is implemented. A step of.
  • One or more non-volatile computer-readable storage media storing computer-readable instructions, which when executed by one or more processors, cause the one or more processors to implement any one of the embodiments of the present application The steps of the unbalanced sample data preprocessing method provided.
  • Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory can include random access memory (RAM) or external cache memory.
  • RAM random access memory
  • DRAM dynamic RAM
  • SDRAM synchronous DRAM
  • DDRSDRAM double data rate SDRAM
  • ESDRAM enhanced SDRAM
  • SLDRAM synchronous chain (Synchlink) DRAM
  • RDRAM direct RAM
  • DRAM direct memory bus dynamic RAM
  • RDRAM memory bus dynamic RAM

Landscapes

  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Operations Research (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Educational Administration (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A risk forecast method comprises: receiving a forecast request sent by a terminal, and acquiring information of each dimension of a company corresponding to the forecast request carrying a company identifier, the company identifier corresponding to the information of each dimension of the company; extracting a company index from the information of each dimension of the company, the company index comprising various financial indexes, legal proceeding information, public sentiment information, and import and export lists; inputting the company index as a forecasting feature into an available forecasting model obtained from previous training, and outputting a forecast label corresponding to the company index, wherein the available forecasting model is obtained from training according to a training data set and indication information corresponding to sample data, and the forecast label comprises a risk-involvement forecast label and a risk-free forecast label; acquiring forecast information corresponding to the company index according to the forecast label, the forecast information comprising risk-involvement forecast information corresponding to the risk-involvement forecast label and risk-free forecast information corresponding to the risk-free forecast label; and sending the forecast information to a corresponding terminal.

Description

风险预测方法、装置、计算机设备和存储介质Risk prediction method, device, computer equipment and storage medium
相关申请的交叉引用Cross-reference of related applications
本申请要求于2018年10月11日提交中国专利局,申请号为2018111836725,申请名称为“风险预测方法、装置、计算机设备和存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application requires priority to be submitted to the Chinese Patent Office on October 11, 2018, with the application number 2018111836725, and the priority of the Chinese patent application titled "risk prediction methods, devices, computer equipment and storage media", the entire contents of which are incorporated by reference In this application.
技术领域Technical field
本申请涉及一种风险预测方法、装置、计算机设备和存储介质。This application relates to a risk prediction method, device, computer equipment and storage medium.
背景技术Background technique
风险预警系统是根据所研究对象的特点,通过收集相关的资料信息,监控风险因素的变动趋势,并评价各种风险状态偏离预警线的强弱程度,向决策层发出预警信号并提前采取预控对策的系统。因此,要构建预警系统必须先构建评价指标体系,并对指标类别加以分析处理。其次,依据预警模型,对评价指标体系进行综合评判。最后,依据评判结果设置预警区间,并采取相应对策。The risk early warning system is based on the characteristics of the research object, by collecting relevant data and information, monitoring the changing trend of risk factors, and evaluating the strength of various risk states deviating from the early warning line, sending early warning signals to the decision-making layer and taking pre-control in advance Countermeasure system. Therefore, to build an early warning system, you must first build an evaluation index system and analyze and process the index categories. Second, based on the early warning model, comprehensive evaluation of the evaluation index system. Finally, the early warning interval is set according to the judgment results, and corresponding countermeasures are taken.
然而,发明人意识到,传统的企业风险预警系统,大多基于统计方法或者根据专家打分卡来搭建的,其关键部分是预先计算的各指标阈值,指标有效性,以及中间指标的权重等,而这些数据需要人工计算,在计算过程中容易出现失误进而影响风险预测效果。However, the inventor realized that most of the traditional enterprise risk early warning systems are built based on statistical methods or according to expert scorecards. The key part is the pre-calculated thresholds of indicators, the validity of indicators, and the weight of intermediate indicators. These data need to be calculated manually, and errors are likely to occur in the calculation process and affect the risk prediction effect.
发明内容Summary of the invention
根据本申请公开的各种实施例,提供一种风险预测方法、装置、计算机设备和存储介质。According to various embodiments disclosed in the present application, a risk prediction method, apparatus, computer equipment, and storage medium are provided.
一种风险预测方法包括:A risk prediction method includes:
接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息;所述预测请求携带企业标识,所述企业标识与所述企业的各维度信息对应;Receiving a prediction request sent by a terminal, and acquiring information of each dimension of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
从所述企业的各维度信息中提取企业指标;所述企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单;Extract enterprise indicators from all dimensions of the enterprise; the enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists;
将所述企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与所述企业指标对应的预测标签;所述可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;所述预测标签包括风险预测标签和无风险预测标签;Input the enterprise index as a prediction feature into an available prediction model obtained by pre-training, and output a prediction label corresponding to the enterprise index; the available prediction model is trained based on the prompt information corresponding to the training data set and sample data; Forecast labels include risk prediction labels and risk-free prediction labels;
根据所述预测标签获得与所述企业指标对应的预测信息;所述预测信息包括与所述风险预测标签对应的风险预测信息,以及与所述无风险预测标签对应的无风险预测信息;及Obtaining prediction information corresponding to the enterprise indicator according to the prediction tag; the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag; and
将所述预测信息发送至对应终端。Send the prediction information to the corresponding terminal.
一种风险预测装置包括:A risk prediction device includes:
维度信息获取模块,用于接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息;所述预测请求携带企业标识,所述企业标识与所述企业的各维度信息对应;The dimension information obtaining module is used to receive the prediction request sent by the terminal, and obtain each dimension information of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise ;
企业指标提取模块,用于从所述企业的各维度信息中提取企业指标;所述企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单;The enterprise index extraction module is used to extract enterprise indexes from all dimensions of the enterprise; the enterprise indexes include various financial indexes, legal litigation information, public opinion information, and import and export lists;
标签获取模块,用于将所述企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与所述企业指标对应的预测标签;所述可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;所述预测标签包括风险预测标签和无风险预测标签;The tag acquisition module is used to input the enterprise index as a prediction feature into an available prediction model that is pre-trained, and output a prediction tag corresponding to the enterprise index; the available prediction model is based on a prompt corresponding to the training data set and sample data Information training; the prediction labels include risk prediction labels and risk-free prediction labels;
预测信息获取模块,用于根据所述预测标签获得与所述企业指标对应的预测信息;所述预测信息包括与所述风险预测标签对应的风险预测信息,以及与所述无风险预测标签对应的无风险预测信息;及The prediction information obtaining module is used to obtain prediction information corresponding to the enterprise index according to the prediction label; the prediction information includes risk prediction information corresponding to the risk prediction label, and corresponding to the risk-free prediction label No risk prediction information; and
发送模块,用于将所述预测信息发送至对应终端。The sending module is used to send the prediction information to the corresponding terminal.
一种计算机设备,包括存储器和一个或多个处理器,所述存储器中储存有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device includes a memory and one or more processors. The memory stores computer-readable instructions. When the computer-readable instructions are executed by the processor, the one or more processors are executed The following steps:
接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息;所述预测请求携带企业标识,所述企业标识与所述企业的各维度信息对应;Receiving a prediction request sent by a terminal, and acquiring information of each dimension of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
从所述企业的各维度信息中提取企业指标;所述企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单;Extract enterprise indicators from all dimensions of the enterprise; the enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists;
将所述企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与所述企业指标对应的预测标签;所述可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;所述预测标签包括风险预测标签和无风险预测标签;Input the enterprise index as a prediction feature into an available prediction model obtained by pre-training, and output a prediction label corresponding to the enterprise index; the available prediction model is trained based on the prompt information corresponding to the training data set and sample data; Forecast labels include risk prediction labels and risk-free prediction labels;
根据所述预测标签获得与所述企业指标对应的预测信息;所述预测信息包括与所述风险预测标签对应的风险预测信息,以及与所述无风险预测标签对应的无风险预测信息;及Obtaining prediction information corresponding to the enterprise indicator according to the prediction tag; the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag; and
将所述预测信息发送至对应终端。Send the prediction information to the corresponding terminal.
一个或多个存储有计算机可读指令的非易失性存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行以下步骤:One or more non-volatile storage media storing computer readable instructions, which when executed by one or more processors, cause the one or more processors to perform the following steps:
接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息;所述预测请求携带企业标识,所述企业标识与所述企业的各维度信息对应;Receiving a prediction request sent by a terminal, and acquiring information of each dimension of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
从所述企业的各维度信息中提取企业指标;所述企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单;Extract enterprise indicators from all dimensions of the enterprise; the enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists;
将所述企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与所述企业指标对应的预测标签;所述可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;所述预测标签包括风险预测标签和无风险预测标签;Input the enterprise index as a prediction feature into an available prediction model obtained by pre-training, and output a prediction label corresponding to the enterprise index; the available prediction model is trained based on the prompt information corresponding to the training data set and sample data; Forecast labels include risk prediction labels and risk-free prediction labels;
根据所述预测标签获得与所述企业指标对应的预测信息;所述预测信息包括与所述风险预测标签对应的风险预测信息,以及与所述无风险预测标签对应的无风险预测信息;及Obtaining prediction information corresponding to the enterprise indicator according to the prediction tag; the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag; and
将所述预测信息发送至对应终端。Send the prediction information to the corresponding terminal.
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其它特征和优点将从说明书、附图以及权利要求书变得明显。The details of one or more embodiments of the application are set forth in the drawings and description below. Other features and advantages of this application will become apparent from the description, drawings, and claims.
附图说明BRIEF DESCRIPTION
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to more clearly explain the technical solutions in the embodiments of the present application, the following will briefly introduce the drawings required in the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. Those of ordinary skill in the art can obtain other drawings based on these drawings without creative efforts.
图1为根据一个或多个实施例中风险预测方法的应用场景图。FIG. 1 is an application scenario diagram of a risk prediction method according to one or more embodiments.
图2为根据一个或多个实施例中风险预测方法的流程示意图。FIG. 2 is a schematic flowchart of a risk prediction method according to one or more embodiments.
图3为根据一个或多个实施例中构建初始模型的流程示意图。FIG. 3 is a schematic flowchart of constructing an initial model according to one or more embodiments.
图4为根据一个或多个实施例中将样本标签作为初始模型的预测标签的流程示意图。4 is a schematic flowchart of using a sample label as a prediction label of an initial model according to one or more embodiments.
图5为根据一个或多个实施例中风险预测装置的框图。FIG. 5 is a block diagram of a risk prediction device according to one or more embodiments.
图6为根据一个或多个实施例中计算机设备的框图。Figure 6 is a block diagram of a computer device in accordance with one or more embodiments.
具体实施方式detailed description
为了使本申请的技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the technical solutions and advantages of the present application more clear, the following describes the present application in further detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, and are not used to limit the present application.
本申请提供的风险预测方法,可以应用于如图1所示的应用环境中。终端102与服务器104通过网络进行通信。服务器104接收终端102发送的预测请求,并获取与预测请求对应的企业的各维度信息,预测请求携带企业标识,企业标识与企业的各维度信息对应。服务器104从企业的各维度信息中提取企业指标,企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单。将企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与企业指标对应的预测标签,可用预测模型根据训练数据集和样本数据对应的提示信息训练得到,预测标签包括风险预测标签和无风险预测标签。服务器104根据预测标签获得与企业指标对应的预测信息,预测信息包括与风险预测标签对应的风险预测信息,以及与无风险预测标签对应的无风险预测信息,并将预测信息发送至对应终端102。终端102可以但不限于是各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备,服务器104可以用独立的服务器或者是多个服务器组成的服务器集群来实现。The risk prediction method provided by this application can be applied in the application environment shown in FIG. 1. The terminal 102 and the server 104 communicate via the network. The server 104 receives the prediction request sent by the terminal 102 and obtains the information of each dimension of the enterprise corresponding to the prediction request. The prediction request carries the enterprise logo, and the enterprise logo corresponds to the information of each dimension of the enterprise. The server 104 extracts enterprise indicators from various dimensions of the enterprise. The enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists. The enterprise index is used as a prediction feature to input into the available prediction model that is pre-trained, and the prediction label corresponding to the enterprise index is output. The prediction model can be trained according to the prompt information corresponding to the training data set and sample data. Risk prediction label. The server 104 obtains the prediction information corresponding to the enterprise index according to the prediction tag, the prediction information includes the risk prediction information corresponding to the risk prediction tag, and the risk-free prediction information corresponding to the risk-free prediction tag, and sends the prediction information to the corresponding terminal 102. The terminal 102 may be, but not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices. The server 104 may be implemented by an independent server or a server cluster composed of multiple servers.
在其中一个实施例中,如图2所示,提供了一种风险预测方法,以该方法应用于图1中的服务器为例进行说明,包括以下步骤:In one of the embodiments, as shown in FIG. 2, a risk prediction method is provided. Taking the method applied to the server in FIG. 1 as an example for illustration, it includes the following steps:
S202,服务器接收终端发送的预测请求,并获取与预测请求对应的企业的各维度信息, 预测请求携带企业标识,企业标识与企业的各维度信息对应。S202. The server receives the prediction request sent by the terminal, and obtains information on each dimension of the enterprise corresponding to the prediction request. The prediction request carries an enterprise ID, and the enterprise ID corresponds to each dimension information of the enterprise.
其中,预测请求携带企业标识,企业标识和企业的各维度信息之间存在对应关系,服务器可根据企业标识和企业的各维度信息之间的对应关系,获取与企业标识对应的企业的各维度信息。Among them, the prediction request carries the enterprise logo, and there is a correspondence between the enterprise logo and each dimension information of the enterprise, and the server can obtain each dimension information of the enterprise corresponding to the enterprise logo according to the correspondence between the enterprise logo and each dimension information of the enterprise .
具体地,服务器通过接收终端发送的预测请求,并解析预测请求,获取预测请求携带的企业标识。获取企业标识和企业的各维度信息之间的对应关系,根据所获取到的企业标识和企业的各维度信息之间的对应关系,从数据库中获取与企业标识对应的企业的各维度信息。Specifically, the server receives the prediction request sent by the terminal, and parses the prediction request to obtain the enterprise identification carried in the prediction request. Obtain the correspondence between the enterprise logo and each dimension of the enterprise, and according to the correspondence between the acquired enterprise logo and each dimension of the enterprise, obtain the dimension information of the enterprise corresponding to the enterprise logo from the database.
进一步地,在实际应用场景中,企业的管理系统进行日常运转,可通过调用接口,从与企业管理系统连接的其他平台上,获取与企业相关的信息和数据,根据所获取的企业相关的信息和数据,生成对应企业的各维度信息。服务器在接收到企业所在的对应终端发送的预测请求后,解析对应终端发送的预测请求,并获取预测请求携带的企业标识。进而服务器可根据企业标识识别对应的企业,从数据库中获取与企业标识对应的企业的各维度信息。Further, in the actual application scenario, the enterprise management system performs daily operations, and can call the interface to obtain information and data related to the enterprise from other platforms connected to the enterprise management system, based on the acquired enterprise-related information And data to generate information about each dimension of the corresponding enterprise. After receiving the prediction request sent by the corresponding terminal where the enterprise is located, the server parses the prediction request sent by the corresponding terminal and obtains the enterprise identification carried in the prediction request. Furthermore, the server can identify the corresponding enterprise according to the enterprise identification, and obtain the information of each dimension of the enterprise corresponding to the enterprise identification from the database.
S204,服务器从企业的各维度信息中提取企业指标,企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单。S204. The server extracts enterprise indicators from all dimensions of the enterprise. The enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists.
具体地,服务器通过从数据库中获取企业的各维度信息,并从企业的各维度信息中提取企业指标。其中,企业的各维度信息和企业指标之间存在对应关系,服务器可通过获取企业的各维度信息和企业指标之间的对应关系,并根据企业的各维度信息和企业指标之间的对应关系,分别提取与各维度信息对应的企业指标。Specifically, the server acquires the information of each dimension of the enterprise from the database, and extracts the enterprise index from the information of each dimension of the enterprise. Among them, there is a correspondence between each dimension information of the enterprise and the enterprise index. The server can obtain the correspondence between each dimension information of the enterprise and the enterprise index, and according to the correspondence between the dimension information of the enterprise and the enterprise index, Separately extract the enterprise indexes corresponding to the information of each dimension.
企业指标包括各项财务指标,法务诉讼信息,舆情信息以及进出口清单等。其中,财务指标是指企业总结和评价财务状况和经营成果的相对指标,具体包括:偿债能力指标,包括资产负债率、流动比率、速动比率;营运能力指标,包括应收账款周转率、存货周转率;盈利能力指标,包括资本金利润率、销售利润率(营业收入利税率)、成本费用利润率等。Enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists. Among them, financial indicators refer to the relative indicators for enterprises to summarize and evaluate the financial status and operating results, including: debt solvency indicators, including asset-liability ratio, current ratio, quick ratio; operating ability indicators, including accounts receivable turnover , Inventory turnover rate; profitability indicators, including capital profit rate, sales profit rate (operating income tax rate), cost expense profit rate, etc.
法务诉讼信息表示企业参与法律事务的诉讼情况。舆情是“舆论情况”的简称,是指在一定的社会空间内,围绕中介性社会事件的发生、发展和变化,作为主体的民众对作为客体的社会管理者、企业、个人及其他各类组织及其政治、社会、道德等方面的取向产生和持有的社会态度,包括正面舆情和负面舆情,也可表示较多群众关于社会中各种现象、问题所表达的信念、态度、意见和情绪等表现的总和。进出口清单表示企业产品的进出口情况,包括产品类型,对应产品数量以及产品售价及购买价等。The legal litigation information indicates the legal situation of the enterprise's participation in legal affairs. Public opinion is the abbreviation of "public opinion situation", which refers to the occurrence, development and change of intermediary social events in a certain social space, and the public as the subject of social managers, enterprises, individuals and other various organizations as the object Social attitudes generated and held by their political, social, and moral orientations, including positive and negative public opinion, can also express the beliefs, attitudes, opinions, and emotions expressed by more people about various phenomena and problems in society Wait for the sum of performance. The import and export list shows the import and export situation of the company's products, including product type, corresponding product quantity, product selling price and purchase price, etc.
S206,服务器将企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与企业指标对应的预测标签,可用预测模型根据训练数据集和样本数据对应的提示信息训练得到,预测标签包括风险预测标签和无风险预测标签。S206, the server inputs the enterprise index as a prediction feature into the available prediction model that is pre-trained, and outputs the prediction label corresponding to the enterprise index. The available prediction model is trained based on the prompt information corresponding to the training data set and sample data. Labels and risk-free predictive labels.
具体地,服务器通过将企业指标作为预测特征输入可用预测模型中,并分别获取与企 业指标对应的风险预测标签和无风险预测标签。进而服务器可根据预设的风险预测标签和风险预测信息之间的对应关系,获取与风险预测标签对应的风险预测信息,并根据预设的无风险预测标签和无风险预测信息之间的对应关系,获取与无风险预测标签对应的无风险预测信息。Specifically, the server inputs the enterprise index as a prediction feature into the available prediction model, and obtains the risk prediction label and the risk-free prediction label corresponding to the enterprise index, respectively. Furthermore, the server can obtain the risk prediction information corresponding to the risk prediction label according to the correspondence between the preset risk prediction label and the risk prediction information, and according to the correspondence between the preset risk-free prediction label and the risk-free prediction information To obtain the risk-free prediction information corresponding to the risk-free prediction label.
进一步地,服务器通过从数据库中获取样本数据,并从样本数据中提取样本标签,将样本标签作为初始模型的预测标签。同时,服务器通过获取样本数据对应的指标,并将样本数据对应的指标作为初始模型的预测特征,进而根据预测特征和预测标签生成训练数据集。利用训练数据集和样本数据对应的提示信息,对初始模型进行训练,获得可用预测模型。Further, the server obtains the sample data from the database and extracts the sample label from the sample data, and uses the sample label as the prediction label of the initial model. At the same time, the server obtains the indexes corresponding to the sample data and uses the indexes corresponding to the sample data as the prediction features of the initial model, and then generates a training data set according to the prediction features and the prediction labels. Use the prompt information corresponding to the training data set and the sample data to train the initial model to obtain an available prediction model.
初始模型为已有的风险预测模型,但并不适用于所有需要进行风险预测的企业,需要利用样本标签,即预测标签,以及预测特征,即样本数据对应的指标,生成训练数据集。服务器利用训练数据集和样本数据对应的提示信息,可对初始模型进行训练,根据进行训练过后的初始模型获得与企业对应的可用也测模型。The initial model is an existing risk prediction model, but it is not applicable to all companies that need to make risk predictions. It is necessary to use the sample label, that is, the prediction label, and the prediction feature, that is, the index corresponding to the sample data to generate the training data set. The server can use the prompt information corresponding to the training data set and the sample data to train the initial model, and obtain the available test model corresponding to the enterprise according to the initial model after the training.
S208,服务器根据预测标签获得与企业指标对应的预测信息,预测信息包括与风险预测标签对应的风险预测信息,以及与无风险预测标签对应的无风险预测信息。S208. The server obtains prediction information corresponding to the enterprise index according to the prediction tag. The prediction information includes risk prediction information corresponding to the risk prediction tag and risk-free prediction information corresponding to the risk-free prediction tag.
具体地,服务器根据预设的风险预测标签和风险预测信息之间的对应关系,获取与风险预测标签对应的风险预测信息,并根据预设的无风险预测标签和无风险预测信息之间的对应关系,获取与无风险预测标签对应的无风险预测信息。Specifically, the server obtains the risk prediction information corresponding to the risk prediction tag according to the correspondence between the preset risk prediction tag and the risk prediction information, and according to the correspondence between the preset risk-free prediction tag and the risk-free prediction information Relationship to obtain risk-free prediction information corresponding to the risk-free prediction label.
风险预测信息表示对应企业存在风险信息,对应企业接收到服务器发送的预测信息为风险预测信息,需要根据接收到的风险预测信息进行进一步地排查和处理。无风险预测信息表示对应企业不存在风险信息,对应企业接收到服务器发送的预测信息为无风险预测信息。The risk prediction information indicates that the corresponding company has risk information, and the prediction information received by the corresponding company from the server is risk prediction information, which needs to be further investigated and processed based on the received risk prediction information. The risk-free forecast information indicates that there is no risk information for the corresponding enterprise, and the forecast information sent by the server received by the corresponding enterprise is the risk-free forecast information.
S210,服务器将预测信息发送至对应终端。S210: The server sends the prediction information to the corresponding terminal.
风险预测信息表示对应企业存在风险信息,对应企业接收到服务器发送的预测信息为风险预测信息,需要根据接收到的风险预测信息进行进一步地排查和处理。无风险预测信息表示对应企业不存在风险信息,对应企业接收到服务器发送的预测信息为无风险预测信息。The risk prediction information indicates that the corresponding company has risk information, and the prediction information received by the corresponding company from the server is risk prediction information, which needs to be further investigated and processed based on the received risk prediction information. The risk-free forecast information indicates that there is no risk information for the corresponding enterprise, and the forecast information sent by the server received by the corresponding enterprise is the risk-free forecast information.
上述风险预测方法中,服务器通过根据终端发送的预测请求获取企业的各维度信息,并从企业的各维度信息中提取企业指标。从而可将企业指标作为预测特征输入预先训练得到的可用预测模型中,得到与企业指标获得对应的预测信息,并将预测信息至终端。在数据更新的情况下无需反复对各指标进行数据预处理工作,可降低资源消耗,同时针对不同预测标签分别向对应的终端发送对应的预测信息,可进一步提高风险预测效果。In the above risk prediction method, the server obtains the information of each dimension of the enterprise according to the prediction request sent by the terminal, and extracts the enterprise index from the information of each dimension of the enterprise. Therefore, the enterprise index can be input into the available prediction model that is pre-trained as the prediction feature, and the prediction information corresponding to the enterprise index can be obtained, and the prediction information can be sent to the terminal. In the case of data update, there is no need to repeatedly perform data preprocessing on each indicator, which can reduce resource consumption, and at the same time, send corresponding prediction information to corresponding terminals for different prediction tags, which can further improve the risk prediction effect.
在其中一个实施例中,如图3所示,构建初始模型的步骤包括:In one of the embodiments, as shown in FIG. 3, the steps of constructing the initial model include:
S302,服务器从数据库中获取样本数据,并从样本数据中提取样本标签,将样本标签作为初始模型的预测标签。S302. The server obtains sample data from the database, and extracts sample labels from the sample data, and uses the sample labels as the prediction labels of the initial model.
具体地,服务器从样本数据中提取样本标签,并获取样本标签的属性,根据属性将样本数据分为第一类样本和第二类样本。服务器分别获取与第一类样本对应的第一样本标签,以及与第二类样本对应的第二样本标签。服务器将第一样本标签作为初始模型的风险预测标签,将第二样本标签作为初始模型的无风险预测标签。Specifically, the server extracts sample tags from the sample data, and obtains the attributes of the sample tags, and divides the sample data into the first type of samples and the second type of samples according to the attributes. The server respectively obtains a first sample label corresponding to the first type of sample and a second sample label corresponding to the second type of sample. The server uses the first sample label as the risk prediction label of the initial model, and uses the second sample label as the risk-free prediction label of the initial model.
样本标签的属性,用于表示与样本标签对应的样本数据是否携带风险数据。服务器可根据样本标签的属性,将样本数据分为第一类样本和第二类样本,第一类样本为携带风险数据的样本数据,第二类样本为未携带风险数据的样本数据。进而,第一样本标签为携带风险数据的样本标签,第二样本标签为无风险数据的样本标签。The attribute of the sample label is used to indicate whether the sample data corresponding to the sample label carries risk data. The server may divide the sample data into a first type sample and a second type sample according to the attributes of the sample label. The first type sample is sample data carrying risk data, and the second type sample is sample data not carrying risk data. Furthermore, the first sample label is a sample label carrying risk data, and the second sample label is a sample label without risk data.
S304,服务器获取样本数据对应的样本指标,并将样本指标作为初始模型的预测特征。S304. The server acquires the sample index corresponding to the sample data, and uses the sample index as the prediction feature of the initial model.
具体地,服务器通过获取样本数据和指标之间的对应关系,并根据样本数据和指标之间的对应关系,获取与样本数据对应的指标。服务器通过获取样本数据和初始模型的预测特征之间的关联关系,并根据样本数据和预测特征之间的关联关系,将样本数据对应的指标作为初始模型的预测特征。Specifically, the server obtains the index corresponding to the sample data by acquiring the correspondence between the sample data and the index, and according to the correspondence between the sample data and the index. The server obtains the correlation between the sample data and the predicted features of the initial model, and uses the indexes corresponding to the sample data as the predicted features of the initial model according to the correlation between the sample data and the predicted features.
进一步地,服务器通过从数据库中获取样本数据集,并将样本数据集存储至预设对象中。进而,服务器通过调用数据库中的训练函数,并根据预设的训练函数和训练参数之间的对应关系,获取与训练函数对应的训练参数。从预设对象中调用样本数据集,并根据训练参数和样本数据集,构建初始模型。Further, the server obtains the sample data set from the database and stores the sample data set in the preset object. Furthermore, the server acquires the training parameters corresponding to the training function by calling the training function in the database and according to the correspondence between the preset training function and the training parameter. The sample data set is called from the preset object, and the initial model is constructed according to the training parameters and the sample data set.
训练参数包括树的最大深度,收缩步长以及迭代次数。树的最大深度表示从根结点到叶子结点的最长路径加1,而用递归的观点来看,树的深度,等于其最大左右子树的深度加1。收缩步长表示进行让一个数值在每次收缩运算中,加上某个数(此即步长),并重复执行此项运算。迭代表示重复执行一系列运算步骤,从前面的量依次求出后面的量的过程,此过程的每一次结果,都是由对前一次所得结果施行相同的运算步骤得到的,而迭代次数即表示执行重复一系列运算步骤的次数。The training parameters include the maximum depth of the tree, the contraction step size and the number of iterations. The maximum depth of the tree represents the longest path from the root node to the leaf node plus 1, and from a recursive point of view, the depth of the tree is equal to the depth of its largest left and right subtree plus 1. The contraction step length means that a value is added to each contraction operation by adding a certain number (this is the step length), and this operation is repeatedly performed. Iteration refers to the process of repeatedly performing a series of calculation steps and sequentially determining the subsequent amounts from the previous amount. Each result of this process is obtained by performing the same calculation step on the previous result, and the number of iterations means The number of times to repeat a series of calculation steps.
S306,服务器根据预测特征和预测标签生成训练数据集。S306. The server generates a training data set according to the predicted feature and the predicted label.
具体地,预测特征为样本数据对应的指标,预测标签为样本数据的样本标签,包括第一样本标签和第二样本标签,其中,第一样本标签为携带风险数据的样本标签,第二样本标签为无风险数据的样本标签。服务器根据样本数据对应的指标和第一样本标签,以及样本数据对应的指标和第二样本标签,生成训练数据集。Specifically, the prediction feature is an indicator corresponding to the sample data, and the prediction label is a sample label of the sample data, including a first sample label and a second sample label, where the first sample label is a sample label carrying risk data, the second The sample label is a sample label of non-risk data. The server generates a training data set according to the index corresponding to the sample data and the first sample label, and the index corresponding to the sample data and the second sample label.
S308,服务器利用训练数据集和样本数据对应的提示信息,对初始模型进行训练,获得可用预测模型。S308. The server uses the prompt information corresponding to the training data set and the sample data to train the initial model to obtain an available prediction model.
具体地,服务器根据训练参数对样本数据集进行训练,获得训练数据集,并将训练数据集和样本数据对应的提示信息应用于初始模型,获得预测百分数。根据预测百分数和样本数据集,构建可用预测模型。Specifically, the server trains the sample data set according to the training parameters to obtain the training data set, and applies the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage. According to the prediction percentage and sample data set, build a usable prediction model.
训练参数包括树的最大深度,收缩步长以及迭代次数。树的最大深度表示从根结点到叶子结点的最长路径加1,收缩步长表示进行让一个数值在每次收缩运算中,加上某个数 (此即步长),并重复执行此项运算,迭代次数即表示执行重复一系列运算步骤的次数。The training parameters include the maximum depth of the tree, the contraction step size and the number of iterations. The maximum depth of the tree indicates that the longest path from the root node to the leaf node is increased by 1, and the contraction step indicates that a value is added to each contraction operation, plus a certain number (this is the step size), and repeated execution In this operation, the number of iterations indicates the number of times to repeat a series of calculation steps.
初始模型为已有的风险预测模型,但并不适用于所有需要进行风险预测的企业,将训练数据集和样本数据对应的提示信息应用于初始模型,可获得预测百分数。预测百分数用于表示训练数据集和样本数据对应的提示信息,应用于初始模型后,通过计算训练数据集和样本数据之间的关联程度得到。The initial model is an existing risk prediction model, but it is not suitable for all companies that need to make risk predictions. The prompt information corresponding to the training data set and sample data is applied to the initial model to obtain the prediction percentage. The prediction percentage is used to represent the prompt information corresponding to the training data set and the sample data. After being applied to the initial model, it is obtained by calculating the degree of association between the training data set and the sample data.
上述风险预测方法中,服务器通过从获取到的样本数据中提取样本标签,并将样本标签作为初始模型的预测标签。获取样本数据对应的指标,并将样本数据对应的指标作为初始模型的预测特征,并根据预测特征和预测标签生成训练数据集,进而利用训练数据集和样本数据对应的提示信息,对初始模型进行训练,获得可用预测模型。从而通过直接将样本标签作为初始模型的预测标签,可减少每次数据更新时,需要重新对样本数据进行的预处理操作,可降低资源消耗。In the above risk prediction method, the server extracts the sample label from the obtained sample data, and uses the sample label as the prediction label of the initial model. Obtain the index corresponding to the sample data, and use the index corresponding to the sample data as the prediction feature of the initial model, and generate the training data set according to the prediction feature and the prediction label, and then use the prompt information corresponding to the training data set and the sample data to carry out the initial model Train to get a usable prediction model. Therefore, by directly using the sample label as the prediction label of the initial model, it is possible to reduce the need for reprocessing the sample data every time the data is updated, and reduce resource consumption.
在其中一个实施例中,如图4所示,从样本数据中提取样本标签,并将样本标签作为初始模型的预测标签的步骤,包括:In one of the embodiments, as shown in FIG. 4, the step of extracting the sample label from the sample data and using the sample label as the prediction label of the initial model includes:
S402,服务器从样本数据中提取样本标签,并获取样本标签的属性。S402. The server extracts sample tags from the sample data and obtains the attributes of the sample tags.
S404,服务器根据属性将样本数据分为第一类样本和第二类样本。S404. The server divides the sample data into the first type of samples and the second type of samples according to the attributes.
具体地,服务器通过获取样本数据,并获取样本数据和样本标签之间的对应关系,根据样本数据和样本标签之间的对应关系,从样本数据中提取对应的样本标签。其中,样本标签的属性,用于表示与样本标签对应的样本数据是否携带风险数据。服务器可根据样本标签的属性,将样本数据分为第一类样本和第二类样本,第一类样本为携带风险数据的样本数据,第二类样本为未携带风险数据的样本数据。Specifically, the server obtains the sample data and the correspondence between the sample data and the sample label, and extracts the corresponding sample label from the sample data according to the correspondence between the sample data and the sample label. The attribute of the sample label is used to indicate whether the sample data corresponding to the sample label carries risk data. The server may divide the sample data into a first type sample and a second type sample according to the attributes of the sample label. The first type sample is sample data carrying risk data, and the second type sample is sample data not carrying risk data.
S406,服务器获取与第一类样本对应的第一样本标签,以及与第二类样本对应的第二样本标签。S406. The server obtains a first sample label corresponding to the first type of sample and a second sample label corresponding to the second type of sample.
具体地,服务器通过获取第一类样本和第一样本标签之间的对应关系,并根据第一类样本和第一样本标签之间的对应关系,获取与第一类样本对应的第一样本标签。其中,第一样本标签为携带风险数据的样本标签。服务器通过获取第二类样本和第二样本标签之间的对应关系,并根据第二类样本和第二样本标签之间的对应关系,获取与第二类样本对应的第二样本标签。其中,第二样本标签为无风险数据的样本标签。Specifically, the server acquires the corresponding relationship between the first type of sample and the first sample label, and obtains the first corresponding to the first type of sample according to the corresponding relationship between the first type of sample and the first sample label Sample label. The first sample label is a sample label that carries risk data. The server obtains the second sample label corresponding to the second type sample by acquiring the corresponding relationship between the second type sample and the second sample label, and according to the corresponding relationship between the second type sample and the second sample label. Among them, the second sample label is a sample label of non-risk data.
S408,服务器将第一样本标签作为初始模型的风险预测标签,将第二样本标签作为初始模型的无风险预测标签。S408. The server uses the first sample label as the risk prediction label of the initial model, and uses the second sample label as the risk-free prediction label of the initial model.
具体地,第一样本标签为携带风险数据的样本标签,第二样本标签为无风险数据的样本标签。服务器将第一样本标签,即携带风险数据的样本标签,作为初始模型的风险预测标签,与携带风险数据的第一类样本对应。将第二样本标签即未携带风险数据的样本标签,作为初始模型的无风险预测标签,与未携带风险数据的第二类样本对应。Specifically, the first sample label is a sample label carrying risk data, and the second sample label is a sample label without risk data. The server uses the first sample label, that is, the sample label that carries risk data, as the risk prediction label of the initial model, and corresponds to the first type of sample that carries risk data. The second sample label, that is, the sample label that does not carry risk data, is used as the risk-free prediction label of the initial model, and corresponds to the second type sample that does not carry risk data.
上述将样本标签作为初始模型的预测标签的步骤中,服务器通过从样本数据中提取样本标签,并获取样本标签的属性,根据属性将样本数据分为第一类样本和第二类样本。获 取与第一类样本对应的第一样本标签,以及与第二类样本对应的第二样本标签,并将第一样本标签作为初始模型的风险预测标签,将第二样本标签作为初始模型的无风险预测标签。从而可实现将携带风险数据和无风险数据的样本标签区分开,有利于执行针对性地样本数据处理,提高了工作效率。In the above step of using the sample label as the prediction label of the initial model, the server extracts the sample label from the sample data and obtains the attribute of the sample label, and divides the sample data into the first type sample and the second type sample according to the attribute. Obtain the first sample label corresponding to the first type sample and the second sample label corresponding to the second type sample, and use the first sample label as the risk prediction label of the initial model and the second sample label as the initial model Of risk-free prediction labels. Therefore, the sample tags carrying the risk data and the non-risk data can be distinguished, which is beneficial for performing targeted sample data processing and improving work efficiency.
在其中一个实施例中,提供了一种风险预测方法,该方法还包括以下步骤:In one of the embodiments, a risk prediction method is provided, and the method further includes the following steps:
服务器从数据库中获取样本数据集,并将样本数据集存储至预设对象中;调用数据库中的训练函数;根据预设的训练函数和训练参数之间的对应关系,获取与训练函数对应的训练参数;从预设对象中调用样本数据集,并根据训练参数和样本数据集,构建初始模型。The server obtains the sample data set from the database and stores the sample data set in the preset object; calls the training function in the database; according to the correspondence between the preset training function and the training parameter, obtains the training corresponding to the training function Parameters; call the sample data set from the preset object, and build the initial model based on the training parameters and the sample data set.
训练参数包括树的最大深度,收缩步长以及迭代次数。树的最大深度表示从根结点到叶子结点的最长路径加1,收缩步长表示进行让一个数值在每次收缩运算中,加上某个数(此即步长),并重复执行此项运算,迭代次数即表示执行重复一系列运算步骤的次数。The training parameters include the maximum depth of the tree, the contraction step size and the number of iterations. The maximum depth of the tree indicates that the longest path from the root node to the leaf node is increased by 1, and the contraction step indicates that a value is added to each contraction operation, plus a certain number (this is the step size), and repeated execution In this operation, the number of iterations indicates the number of times to repeat a series of calculation steps.
具体地,服务器通过获取训练函数和训练参数之间的对应关系,并根据训练函数和训练参数之间的对应关系,获取与所调用的训练函数对应的训练参数。利用训练参数对从预设对象中调用的样本数据集进行训练,获得初始模型。Specifically, the server obtains the training parameters corresponding to the called training function by acquiring the correspondence between the training function and the training parameter, and according to the correspondence between the training function and the training parameter. Use the training parameters to train the sample data set called from the preset object to obtain the initial model.
上述风险预测方法中,服务器通过根据预设的训练函数和训练参数之间的对应关系,获取与训练函数对应的训练参数,并从预设对象中调用样本数据集,根据训练参数和样本数据集,构建初始模型。从而可实现初始模型的建立,利用训练参数对样本数据集的训练,进而为后续可用预测模型提供对应的基础,提高了工作效率。In the above risk prediction method, the server obtains the training parameters corresponding to the training function according to the correspondence between the preset training function and the training parameter, and calls the sample data set from the preset object, according to the training parameter and the sample data set To build the initial model. Therefore, the establishment of the initial model can be achieved, and the training data can be used to train the sample data set, thereby providing a corresponding basis for the subsequent available prediction models, and improving work efficiency.
在其中一个实施例中,接收终端发送的预测请求,并根据预测请求获取企业的各维度信息的步骤,包括:In one of the embodiments, the step of receiving the prediction request sent by the terminal and obtaining the information of each dimension of the enterprise according to the prediction request includes:
服务器接收并解析终端发送的预测请求;获取预测请求携带的企业标识;基于企业标The server receives and parses the prediction request sent by the terminal; obtains the enterprise logo carried in the prediction request; based on the enterprise logo
识和企业的各维度信息之间的对应关系,获取与企业标识对应的企业的各维度信息。Correspondence between the identification and the information of each dimension of the enterprise to obtain the information of each dimension of the enterprise corresponding to the enterprise logo.
其中,预测请求携带企业标识,企业标识和企业的各维度信息之间存在对应关系,服务器可根据企业标识和企业的各维度信息之间的对应关系,获取与企业标识对应的企业的各维度信息。Among them, the prediction request carries the enterprise logo, and there is a correspondence between the enterprise logo and each dimension information of the enterprise, and the server can obtain each dimension information of the enterprise corresponding to the enterprise logo according to the correspondence between the enterprise logo and each dimension information of the enterprise .
具体地,服务器通过接收终端发送的预测请求,并解析预测请求,获取预测请求携带的企业标识。获取企业标识和企业的各维度信息之间的对应关系,根据所获取到的企业标识和企业的各维度信息之间的对应关系,从数据库中获取与企业标识对应的企业的各维度信息。Specifically, the server receives the prediction request sent by the terminal, and parses the prediction request to obtain the enterprise identification carried in the prediction request. Obtain the correspondence between the enterprise logo and each dimension of the enterprise, and according to the correspondence between the acquired enterprise logo and each dimension of the enterprise, obtain the dimension information of the enterprise corresponding to the enterprise logo from the database.
上述根据预测请求获取企业的各维度信息的步骤,服务器接收并解析终端发送的预测请求,获取预测请求携带的请求标签,并获取与请求标签对应的企业的各维度信息。从而可实现针对性的企业的各维度信息获取,明确不同预测请求和对应企业之间的对应关系,有利于个性化的企业的维度信息的获取和存储。In the above step of obtaining the information of each dimension of the enterprise according to the prediction request, the server receives and parses the prediction request sent by the terminal, obtains the request tag carried by the prediction request, and obtains the dimension information of the enterprise corresponding to the request tag. Therefore, it is possible to obtain the information of each dimension of the targeted enterprise, clarify the correspondence between different prediction requests and the corresponding enterprise, and facilitate the acquisition and storage of the dimension information of the personalized enterprise.
在其中一个实施例中,根据预测标签获得与企业指标对应的预测信息的步骤,包括:In one of the embodiments, the step of obtaining prediction information corresponding to the enterprise index according to the prediction label includes:
服务器根据预设的风险预测标签和风险预测信息之间的对应关系,获取与风险预测标 签对应的风险预测信息;根据预设的无风险预测标签和无风险预测信息之间的对应关系,获取与无风险预测标签对应的无风险预测信息。The server obtains the risk prediction information corresponding to the risk prediction tag according to the correspondence between the preset risk prediction tag and the risk prediction information; according to the correspondence between the preset risk-free prediction tag and the risk-free prediction information, the Risk-free prediction information corresponding to the risk-free prediction label.
具体地,风险预测信息表示对应企业存在风险信息,对应企业接收到服务器发送的预测信息为风险预测信息,需要根据接收到的风险预测信息进行进一步地排查和处理。无风险预测信息表示对应企业不存在风险信息,对应企业接收到服务器发送的预测信息为无风险预测信息。Specifically, the risk prediction information indicates that there is risk information for the corresponding enterprise, and the prediction information received by the corresponding enterprise from the server is risk prediction information, which needs to be further investigated and processed according to the received risk prediction information. The risk-free forecast information indicates that there is no risk information for the corresponding enterprise, and the forecast information sent by the server received by the corresponding enterprise is the risk-free forecast information.
初始模型为已有的风险预测模型,但并不适用于所有需要进行风险预测的企业,需要利用样本标签,即预测标签,以及预测特征,即样本数据对应的指标,生成训练数据集。服务器利用训练数据集和样本数据对应的提示信息,可对初始模型进行训练,根据进行训练过后的初始模型获得与企业对应的可用也测模型。The initial model is an existing risk prediction model, but it is not applicable to all companies that need to make risk predictions. It is necessary to use the sample label, that is, the prediction label, and the prediction feature, that is, the index corresponding to the sample data to generate the training data set. The server can use the prompt information corresponding to the training data set and the sample data to train the initial model, and obtain the available test model corresponding to the enterprise according to the initial model after the training.
上述步骤中,服务器通过分别获取与企业指标对应的风险预测标签和无风险预测标签,进而可获取与风险预测标签对应的风险预测信息,以及与无风险预测标签对应的无风险预测信息。从而实现了针对不同企业指标获取对应的预测标签,进而可分别获取对应的风险预测信息和无风险预测信息,提高了风险预测的准确度。In the above steps, the server can obtain the risk prediction information corresponding to the risk prediction label and the risk-free prediction information corresponding to the risk-free prediction label by separately acquiring the risk prediction label and the risk-free prediction label corresponding to the enterprise index. Therefore, corresponding prediction labels are obtained for different enterprise indicators, and corresponding risk prediction information and risk-free prediction information can be obtained separately, thereby improving the accuracy of risk prediction.
在其中一个实施例中,利用训练数据集和样本数据对应的提示信息,对初始模型进行训练,获得可用预测模型的步骤,包括:In one of the embodiments, using the prompt information corresponding to the training data set and the sample data to train the initial model to obtain a usable prediction model includes:
服务器根据训练参数对样本数据集进行训练,获得训练数据集;将训练数据集和样本数据对应的提示信息应用于初始模型,获得预测百分数;根据预测百分数和样本数据集,构建可用预测模型。The server trains the sample data set according to the training parameters to obtain the training data set; applies the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage; according to the prediction percentage and the sample data set, constructs an available prediction model.
训练参数包括树的最大深度,收缩步长以及迭代次数。树的最大深度表示从根结点到叶子结点的最长路径加1,收缩步长表示进行让一个数值在每次收缩运算中,加上某个数(此即步长),并重复执行此项运算,迭代次数即表示执行重复一系列运算步骤的次数。The training parameters include the maximum depth of the tree, the contraction step size and the number of iterations. The maximum depth of the tree indicates that the longest path from the root node to the leaf node is increased by 1, and the contraction step indicates that a value is added to each contraction operation, plus a certain number (this is the step size), and repeated execution In this operation, the number of iterations indicates the number of times to repeat a series of calculation steps.
具体地,初始模型为已有的风险预测模型,但并不适用于所有需要进行风险预测的企业,将训练数据集和样本数据对应的提示信息应用于初始模型,可获得预测百分数。预测百分数用于表示训练数据集和样本数据对应的提示信息,应用于初始模型后,通过计算训练数据集和样本数据之间的关联程度得到。Specifically, the initial model is an existing risk prediction model, but it is not applicable to all enterprises that need to make risk predictions. The prompt information corresponding to the training data set and sample data is applied to the initial model to obtain the prediction percentage. The prediction percentage is used to represent the prompt information corresponding to the training data set and the sample data. After being applied to the initial model, it is obtained by calculating the degree of association between the training data set and the sample data.
上述获得可用预测模型的步骤,服务器根据训练参数对样本数据集进行训练,获得训练数据集,并将训练数据集和样本数据对应的提示信息应用于初始模型,获得预测百分数,进而根据预测百分数和样本数据集,构建可用预测模型。从而通过对初始模型的进一步训练,获得与企业对应的可用预测模型,提高了根据企业指标进行风险预测的准确度。In the above step of obtaining a usable prediction model, the server trains the sample data set according to the training parameters to obtain the training data set, and applies the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage, and then obtains the prediction percentage based on Sample data sets to build available predictive models. Therefore, by further training the initial model, the available prediction model corresponding to the enterprise is obtained, and the accuracy of risk prediction according to the enterprise index is improved.
应该理解的是,虽然图2-4的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图2-4中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次 进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the steps in the flowcharts of FIGS. 2-4 are displayed in order according to the arrows, the steps are not necessarily executed in the order indicated by the arrows. Unless clearly stated in this article, the execution of these steps is not strictly limited in order, and these steps can be executed in other orders. Moreover, at least some of the steps in FIGS. 2-4 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but may be executed at different times. These sub-steps or stages The execution order of is not necessarily sequential, but may be executed in turn or alternately with at least a part of other steps or sub-steps or stages of other steps.
在其中一个实施例中,如图5所示,提供了一种风险预测装置,包括:包括:维度信息获取模块502、企业指标提取模块504、标签获取模块506、预测信息获取模块508以及发送模块510,其中:In one of the embodiments, as shown in FIG. 5, a risk prediction device is provided, which includes: a dimension information acquisition module 502, an enterprise index extraction module 504, a label acquisition module 506, a prediction information acquisition module 508, and a transmission module 510, where:
维度信息获取模块502,用于接收终端发送的预测请求,并获取与预测请求对应的企业的各维度信息;预测请求携带企业标识,企业标识与企业的各维度信息对应。The dimension information obtaining module 502 is used to receive the prediction request sent by the terminal and obtain each dimension information of the enterprise corresponding to the prediction request; the prediction request carries the enterprise ID, and the enterprise ID corresponds to each dimension information of the enterprise.
企业指标提取模块504,用于从企业的各维度信息中提取企业指标;企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单。The enterprise index extraction module 504 is used to extract enterprise indexes from all dimensions of the enterprise; the enterprise indexes include various financial indexes, legal litigation information, public opinion information, and import and export lists.
标签获取模块506,用于将企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与企业指标对应的预测标签;可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;预测标签包括风险预测标签和无风险预测标签。The label acquisition module 506 is used to input the enterprise index as the prediction feature into the available prediction model obtained by pre-training, and output the prediction label corresponding to the enterprise index; the available prediction model is trained based on the prompt information corresponding to the training data set and sample data; prediction Labels include risk prediction labels and risk-free prediction labels.
预测信息获取模块508,用于根据预测标签获得与企业指标对应的预测信息;预测信息包括与风险预测标签对应的风险预测信息,以及与无风险预测标签对应的无风险预测信息。The prediction information obtaining module 508 is used to obtain the prediction information corresponding to the enterprise index according to the prediction label; the prediction information includes the risk prediction information corresponding to the risk prediction label and the risk-free prediction information corresponding to the risk-free prediction label.
发送模块510,用于将预测信息发送至对应终端。The sending module 510 is used to send the prediction information to the corresponding terminal.
上述风险预测装置中,服务器通过根据终端发送的预测请求获取企业的各维度信息,并从企业的各维度信息中提取企业指标。从而可将企业指标作为预测特征输入预先训练得到的可用预测模型中,得到与企业指标获得对应的风险预测信息和无风险预测信息,并分别将风险预测信息和无风险预测信息发送至对应终端。在数据更新的情况下无需反复对各指标进行数据预处理工作,可降低资源消耗,同时针对不同预测标签分别向对应的终端发送对应的预测信息,可进一步提高风险预测效果。In the above risk prediction device, the server acquires the information of each dimension of the enterprise according to the prediction request sent by the terminal, and extracts the enterprise index from the information of each dimension of the enterprise. Therefore, the enterprise index can be input into the available prediction model that is pre-trained as the prediction feature, the risk prediction information and the risk-free prediction information corresponding to the enterprise index can be obtained, and the risk prediction information and the risk-free prediction information can be sent to the corresponding terminal respectively. In the case of data update, there is no need to repeatedly perform data preprocessing on each indicator, which can reduce resource consumption, and at the same time, send corresponding prediction information to corresponding terminals for different prediction tags, which can further improve the risk prediction effect.
在其中一个实施例中,提供了一种风险预测装置,该装置还包括:In one of the embodiments, a risk prediction device is provided, and the device further includes:
预测标签生成模块,用于从数据库中获取样本数据,并从样本数据中提取样本标签,将样本标签作为初始模型的预测标签;预测特征生成模块,用于获取样本数据对应的样本指标,并将样本指标作为初始模型的预测特征;训练数据集生成模块,用于根据预测特征和预测标签生成训练数据集;可用预测模型生成模块,用于利用训练数据集和样本数据对应的提示信息,对初始模型进行训练,获得可用预测模型。The prediction label generation module is used to obtain sample data from the database, and extract the sample label from the sample data, and use the sample label as the prediction label of the initial model; the prediction feature generation module is used to obtain the sample index corresponding to the sample data, and The sample index is used as the prediction feature of the initial model; the training data set generation module is used to generate the training data set based on the prediction feature and the prediction label; the available prediction model generation module is used to use the prompt information corresponding to the training data set and the sample data The model is trained to obtain an available prediction model.
上述风险预测装置,服务器通过从获取到的样本数据中提取样本标签,并将样本标签作为初始模型的预测标签。获取样本数据对应的指标,并将样本数据对应的指标作为初始模型的预测特征,并根据预测特征和预测标签生成训练数据集,进而利用训练数据集和样本数据对应的提示信息,对初始模型进行训练,获得可用预测模型。从而通过直接将样本标签作为初始模型的预测标签,可减少每次数据更新时,需要重新对样本数据进行的预处理操作,可降低资源消耗。In the above risk prediction device, the server extracts the sample label from the acquired sample data, and uses the sample label as the prediction label of the initial model. Obtain the index corresponding to the sample data, and use the index corresponding to the sample data as the prediction feature of the initial model, and generate the training data set according to the prediction feature and the prediction label, and then use the prompt information corresponding to the training data set and the sample data to carry out the initial model Train to get a usable prediction model. Therefore, by directly using the sample label as the prediction label of the initial model, it is possible to reduce the need for reprocessing the sample data every time the data is updated, and reduce resource consumption.
在其中一个实施例中,提供了一种风险预测装置,该装置还包括:In one of the embodiments, a risk prediction device is provided, and the device further includes:
样本数据集获取模块,用于从数据库中获取样本数据集,并将样本数据集存储至预设对象中;训练函数调用模块,用于调用数据库中的训练函数;训练参数获取模块,用于根据预设的训练函数和训练参数之间的对应关系,获取与训练函数对应的训练参数;初始模型构建模块,用于从预设对象中调用样本数据集,并根据训练参数和样本数据集,构建初始模型。The sample data set acquisition module is used to obtain the sample data set from the database and store the sample data set into a preset object; the training function call module is used to call the training function in the database; the training parameter acquisition module is used to Correspondence between the preset training function and training parameters to obtain the training parameters corresponding to the training function; the initial model building module is used to call the sample data set from the preset object and build according to the training parameters and sample data set The initial model.
上述风险预测装置,服务器通过根据预设的训练函数和训练参数之间的对应关系,获取与训练函数对应的训练参数,并从预设对象中调用样本数据集,根据训练参数和样本数据集,构建初始模型。从而可实现初始模型的建立,利用训练参数对样本数据集的训练,进而为后续可用预测模型提供对应的基础,提高了工作效率。In the above risk prediction device, the server obtains the training parameters corresponding to the training function according to the correspondence between the preset training function and the training parameter, and calls the sample data set from the preset object, according to the training parameter and the sample data set, Build the initial model. Therefore, the establishment of the initial model can be achieved, and the training data can be used to train the sample data set, thereby providing a corresponding basis for the subsequent available prediction models, and improving work efficiency.
在其中一个实施例中,维度信息获取模块,还用于:In one of the embodiments, the dimensional information acquisition module is also used to:
接收并解析终端发送的预测请求;获取预测请求携带的企业标识基于企业标识和企业的各维度信息之间的对应关系,获取与企业标识对应的企业的各维度信息。Receive and parse the prediction request sent by the terminal; obtain the enterprise ID carried in the prediction request based on the correspondence between the enterprise ID and each dimension information of the enterprise, and obtain each dimension information of the enterprise corresponding to the enterprise ID.
上述维度信息获取模块,服务器接收并解析终端发送的预测请求,获取预测请求携带的企业标识,并获取与企业标识对应的企业的各维度信息。从而可实现针对性的企业的各维度信息获取,明确不同预测请求和对应企业之间的对应关系,有利于个性化的企业的维度信息的获取和存储。In the above-mentioned dimension information obtaining module, the server receives and parses the prediction request sent by the terminal, obtains the enterprise ID carried in the prediction request, and obtains each dimension information of the enterprise corresponding to the enterprise ID. Therefore, it is possible to obtain the information of each dimension of the targeted enterprise, clarify the correspondence between different prediction requests and the corresponding enterprise, and facilitate the acquisition and storage of the dimension information of the personalized enterprise.
在其中一个实施例中,提示信息获取模块,还用于:In one of the embodiments, the prompt information acquisition module is also used to:
将企业指标作为预测特征输入可用预测模型中;分别获取与企业指标对应的风险预测标签和无风险预测标签;根据预设的风险预测标签和风险预测信息之间的对应关系,获取与风险预测标签对应的风险预测信息;根据预设的无风险预测标签和无风险预测信息之间的对应关系,获取与无风险预测标签对应的无风险预测信息。Enter enterprise indicators as prediction features into available prediction models; obtain risk prediction tags and risk-free prediction tags corresponding to enterprise indexes; obtain and predict risk tags based on the correspondence between preset risk prediction tags and risk prediction information Corresponding risk prediction information; according to the correspondence between the preset risk-free prediction label and the risk-free prediction information, obtain the risk-free prediction information corresponding to the risk-free prediction label.
上述提示信息获取模块,服务器将企业指标作为预测特征输入可用预测模型中,分别获取与企业指标对应的风险预测标签和无风险预测标签,进而可获取与风险预测标签对应的风险预测信息,以及与无风险预测标签对应的无风险预测信息。从而实现了针对不同企业指标获取对应的预测标签,进而可分别获取对应的风险预测信息和无风险预测信息,提高了风险预测的准确度。In the above prompt information acquisition module, the server inputs enterprise indicators as prediction features into the available prediction model, respectively obtains the risk prediction label and the risk-free prediction label corresponding to the enterprise index, and then can obtain the risk prediction information corresponding to the risk prediction label and Risk-free prediction information corresponding to the risk-free prediction label. Therefore, corresponding prediction labels are obtained for different enterprise indicators, and corresponding risk prediction information and risk-free prediction information can be obtained separately, thereby improving the accuracy of risk prediction.
在其中一个实施例中,预测标签生成模块,还用于:In one of the embodiments, the predicted label generation module is also used to:
从样本数据中提取样本标签,并获取样本标签的属性;根据属性将样本数据分为第一类样本和第二类样本;获取与第一类样本对应的第一样本标签,以及与第二类样本对应的第二样本标签;其中,第一样本标签为携带风险数据的样本标签,第二样本标签为无风险数据的样本标签;将第一样本标签作为初始模型的风险预测标签,将第二样本标签作为初始模型的无风险预测标签。Extract the sample label from the sample data and obtain the attributes of the sample label; divide the sample data into the first type sample and the second type sample according to the attribute; obtain the first sample label corresponding to the first type sample and the second The second sample label corresponding to the class sample; where the first sample label is a sample label carrying risk data, and the second sample label is a sample label without risk data; using the first sample label as the risk prediction label of the initial model, Use the second sample label as the risk-free prediction label of the initial model.
上述预测标签生成模块,服务器通过从样本数据中提取样本标签,并获取样本标签的属性,根据属性将样本数据分为第一类样本和第二类样本。获取与第一类样本对应的第一样本标签,以及与第二类样本对应的第二样本标签,并将第一样本标签作为初始模型的风 险预测标签,将第二样本标签作为初始模型的无风险预测标签。从而可实现将携带风险数据和无风险数据的样本标签区分开,有利于执行针对性地样本数据处理,提高了工作效率。In the above prediction label generation module, the server extracts the sample label from the sample data and obtains the attribute of the sample label, and divides the sample data into the first type sample and the second type sample according to the attribute. Obtain the first sample label corresponding to the first type sample and the second sample label corresponding to the second type sample, and use the first sample label as the risk prediction label of the initial model and the second sample label as the initial model Of risk-free prediction labels. Therefore, the sample tags carrying the risk data and the non-risk data can be distinguished, which is beneficial for performing targeted sample data processing and improving work efficiency.
在其中一个实施例中,预测信息获取模块,还用于:In one of the embodiments, the prediction information acquisition module is also used to:
根据预设的风险预测标签和风险预测信息之间的对应关系,获取与风险预测标签对应的风险预测信息;根据预设的无风险预测标签和无风险预测信息之间的对应关系,获取与无风险预测标签对应的无风险预测信息。According to the corresponding relationship between the preset risk prediction label and the risk prediction information, obtain the risk prediction information corresponding to the risk prediction label; Risk-free prediction information corresponding to the risk prediction label.
上述预测信息获取模块,服务器通过分别获取与企业指标对应的风险预测标签和无风险预测标签,进而可获取与风险预测标签对应的风险预测信息,以及与无风险预测标签对应的无风险预测信息。从而实现了针对不同企业指标获取对应的预测标签,进而可分别获取对应的风险预测信息和无风险预测信息,提高了风险预测的准确度。In the above prediction information acquisition module, the server can obtain the risk prediction information corresponding to the risk prediction label and the risk-free prediction information corresponding to the risk-free prediction label by separately acquiring the risk prediction label and the risk-free prediction label corresponding to the enterprise index. Therefore, corresponding prediction labels are obtained for different enterprise indicators, and corresponding risk prediction information and risk-free prediction information can be obtained separately, thereby improving the accuracy of risk prediction.
在其中一个实施例中,可用预测模型构建模块,还用于:In one of the embodiments, a predictive model building module may be used, which is also used to:
根据训练参数对样本数据集进行训练,获得训练数据集;将训练数据集和样本数据对应的提示信息应用于初始模型,获得预测百分数;根据预测百分数和样本数据集,构建可用预测模型。Train the sample data set according to the training parameters to obtain the training data set; apply the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage; according to the prediction percentage and the sample data set, construct an available prediction model.
上述可用预测模型构建模块,服务器根据训练参数对样本数据集进行训练,获得训练数据集,并将训练数据集和样本数据对应的提示信息应用于初始模型,获得预测百分数,进而根据预测百分数和样本数据集,构建可用预测模型。从而通过对初始模型的进一步训练,获得与企业对应的可用预测模型,提高了根据企业指标进行风险预测的准确度。The above-mentioned available prediction model building module, the server trains the sample data set according to the training parameters to obtain the training data set, and applies the prompt information corresponding to the training data set and the sample data to the initial model to obtain the prediction percentage, and then obtains the prediction percentage according to the prediction percentage Data sets, build available predictive models. Therefore, by further training the initial model, the available prediction model corresponding to the enterprise is obtained, and the accuracy of risk prediction according to the enterprise index is improved.
关于风险预测装置的具体限定可以参见上文中对于风险预测方法的限定,在此不再赘述。上述风险预测装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For the specific definition of the risk prediction device, reference may be made to the definition of the risk prediction method above, which will not be repeated here. Each module in the above risk prediction device may be implemented in whole or in part by software, hardware, or a combination thereof. The above modules may be embedded in the hardware or independent of the processor in the computer device, or may be stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图6所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为非易失性存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于存储企业的各维度信息数据。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种风险预测方法。In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure may be as shown in FIG. 6. The computer device includes a processor, memory, network interface, and database connected by a system bus. Among them, the processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer-readable instructions, and a database. The internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile storage medium. The database of the computer device is used to store information data of various dimensions of the enterprise. The network interface of the computer device is used to communicate with external terminals through a network connection. The computer readable instructions are executed by the processor to implement a risk prediction method.
本领域技术人员可以理解,图6中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 6 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or less components than shown in the figure, or combine certain components, or have a different arrangement of components.
一种计算机设备,包括存储器和一个或多个处理器,存储器中存储有计算机可读指令,计算机可读指令被处理器执行时实现本申请任意一个实施例中提供的不平衡样本数据 预处理方法的步骤。A computer device includes a memory and one or more processors. The memory stores computer-readable instructions. When the computer-readable instructions are executed by the processor, the unbalanced sample data preprocessing method provided in any embodiment of the present application is implemented. A step of.
一个或多个存储有计算机可读指令的非易失性计算机可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器实现本申请任意一个实施例中提供的不平衡样本数据预处理方法的步骤。One or more non-volatile computer-readable storage media storing computer-readable instructions, which when executed by one or more processors, cause the one or more processors to implement any one of the embodiments of the present application The steps of the unbalanced sample data preprocessing method provided.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一非易失性计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。A person of ordinary skill in the art may understand that all or part of the process in the method of the foregoing embodiments may be completed by instructing relevant hardware through computer-readable instructions, and the computer-readable instructions may be stored in a non-volatile computer In the readable storage medium, when the computer-readable instructions are executed, they may include the processes of the foregoing method embodiments. Wherein, any reference to the memory, storage, database or other media used in the embodiments provided in this application may include non-volatile and / or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be arbitrarily combined. To simplify the description, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, all It is considered as the scope described in this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementations of the present application, and their descriptions are more specific and detailed, but they should not be understood as limiting the scope of the invention patent. It should be noted that, for those of ordinary skill in the art, without departing from the concept of the present application, a number of modifications and improvements can also be made, which all fall within the protection scope of the present application. Therefore, the protection scope of the patent of this application shall be subject to the appended claims.

Claims (20)

  1. 一种风险预测方法,应用于服务器,包括:A risk prediction method applied to the server, including:
    接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息;所述预测请求携带企业标识,所述企业标识与所述企业的各维度信息对应;Receiving a prediction request sent by a terminal, and acquiring information of each dimension of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
    从所述企业的各维度信息中提取企业指标;所述企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单;Extract enterprise indicators from all dimensions of the enterprise; the enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists;
    将所述企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与所述企业指标对应的预测标签;所述可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;所述预测标签包括风险预测标签和无风险预测标签;Input the enterprise index as a prediction feature into an available prediction model obtained by pre-training, and output a prediction label corresponding to the enterprise index; the available prediction model is trained based on the prompt information corresponding to the training data set and sample data; Forecast labels include risk prediction labels and risk-free prediction labels;
    根据所述预测标签获得与所述企业指标对应的预测信息;所述预测信息包括与所述风险预测标签对应的风险预测信息,以及与所述无风险预测标签对应的无风险预测信息;及Obtaining prediction information corresponding to the enterprise indicator according to the prediction tag; the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag; and
    将所述预测信息发送至对应终端。Send the prediction information to the corresponding terminal.
  2. 根据权利要求1所述的方法,其特征在于,还包括:The method according to claim 1, further comprising:
    从数据库中获取样本数据,并从所述样本数据中提取样本标签,将所述样本标签作为初始模型的预测标签;Obtain sample data from the database, and extract sample labels from the sample data, and use the sample labels as the prediction labels of the initial model;
    获取与所述样本数据对应的样本指标,并将所述样本指标作为初始模型的预测特征;Acquiring a sample index corresponding to the sample data, and using the sample index as a prediction feature of the initial model;
    根据所述预测特征和所述预测标签生成训练数据集;及Generating a training data set according to the prediction feature and the prediction label; and
    利用所述训练数据集和所述样本数据对应的提示信息,对所述初始模型进行训练,获得可用预测模型。Using the prompt information corresponding to the training data set and the sample data, the initial model is trained to obtain an available prediction model.
  3. 根据权利要求2所述的方法,其特征在于,还包括:The method of claim 2, further comprising:
    从数据库中获取样本数据集,并将所述样本数据集存储至预设对象中;Obtain the sample data set from the database and store the sample data set in a preset object;
    调用数据库中的训练函数;Call the training function in the database;
    根据预设的训练函数和训练参数之间的对应关系,获取与所述训练函数对应的训练参数;及Obtaining training parameters corresponding to the training function according to the correspondence between the preset training function and the training parameters; and
    从所述预设对象中调用样本数据集,并根据所述训练参数和所述样本数据集,构建初始模型。The sample data set is called from the preset object, and an initial model is constructed according to the training parameters and the sample data set.
  4. 根据权利要求1所述的方法,其特征在于,所述接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息,包括:The method according to claim 1, wherein the receiving of the prediction request sent by the terminal and obtaining the information of each dimension of the enterprise corresponding to the prediction request includes:
    接收并解析所述终端发送的预测请求;及Receive and parse the prediction request sent by the terminal; and
    获取所述预测请求携带的企业标识;Obtaining the enterprise identification carried in the prediction request;
    基于所述企业标识和所述企业的各维度信息之间的对应关系,获取与所述企业标识对应的企业的各维度信息。Based on the correspondence between the enterprise ID and the dimension information of the enterprise, acquire the dimension information of the enterprise corresponding to the enterprise ID.
  5. 根据权利要求2所述的方法,其特征在于,所述从所述样本数据中提取样本标签,将所述样本标签作为初始模型的预测标签,包括:The method according to claim 2, wherein the extracting the sample label from the sample data and using the sample label as the prediction label of the initial model includes:
    从样本数据中提取样本标签,并获取所述样本标签的属性;Extracting the sample label from the sample data, and obtaining the attributes of the sample label;
    根据所述属性将所述样本数据分为第一类样本和第二类样本;Divide the sample data into first-type samples and second-type samples according to the attributes;
    获取与所述第一类样本对应的第一样本标签,以及与所述第二类样本对应的第二样本标签;其中,所述第一样本标签为携带风险数据的样本标签,所述第二样本标签为无风险数据的样本标签;及Acquiring a first sample label corresponding to the first type of sample and a second sample label corresponding to the second type of sample; wherein the first sample label is a sample label carrying risk data, the The second sample label is a sample label of no risk data; and
    将所述第一样本标签作为初始模型的风险预测标签,将所述第二样本标签作为所述初始模型的无风险预测标签。The first sample label is used as a risk prediction label of the initial model, and the second sample label is used as a risk-free prediction label of the initial model.
  6. 根据权利要求5所述的方法,其特征在于,所述根据所述预测标签获得与所述企业指标对应的预测信息,包括:The method according to claim 5, wherein the obtaining prediction information corresponding to the enterprise index according to the prediction tag includes:
    根据预设的风险预测标签和风险预测信息之间的对应关系,获取与所述风险预测标签对应的风险预测信息;及Obtain the risk prediction information corresponding to the risk prediction tag according to the correspondence between the preset risk prediction tag and the risk prediction information; and
    根据预设的无风险预测标签和无风险预测信息之间的对应关系,获取与所述无风险预测标签对应的无风险预测信息。Obtain the risk-free prediction information corresponding to the risk-free prediction label according to the correspondence between the preset risk-free prediction label and the risk-free prediction information.
  7. 根据权利要求2所述的方法,其特征在于,所述利用所述训练数据集和所述样本数据对应的提示信息,对所述初始模型进行训练,获得可用预测模型,包括:The method according to claim 2, wherein the using the prompt information corresponding to the training data set and the sample data to train the initial model to obtain a usable prediction model includes:
    根据训练参数对样本数据集进行训练,获得训练数据集;Train the sample data set according to the training parameters to obtain the training data set;
    将所述训练数据集和所述样本数据对应的提示信息应用于初始模型,获得预测百分数;及Applying the prompt information corresponding to the training data set and the sample data to the initial model to obtain a prediction percentage; and
    根据所述预测百分数和样本数据集,构建所述可用预测模型。According to the prediction percentage and the sample data set, the available prediction model is constructed.
  8. 一种风险预测装置,包括:A risk prediction device, including:
    维度信息获取模块,用于接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息;所述预测请求携带企业标识,所述企业标识与所述企业的各维度信息对应;The dimension information obtaining module is used to receive the prediction request sent by the terminal, and obtain each dimension information of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise ;
    企业指标提取模块,用于从所述企业的各维度信息中提取企业指标;所述企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单;The enterprise index extraction module is used to extract enterprise indexes from all dimensions of the enterprise; the enterprise indexes include various financial indexes, legal litigation information, public opinion information, and import and export lists;
    标签获取模块,用于将所述企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与所述企业指标对应的预测标签;所述可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;所述预测标签包括风险预测标签和无风险预测标签;The tag acquisition module is used to input the enterprise index as a prediction feature into an available prediction model that is pre-trained, and output a prediction tag corresponding to the enterprise index; the available prediction model is based on a prompt corresponding to the training data set and sample data Information training; the prediction labels include risk prediction labels and risk-free prediction labels;
    预测信息获取模块,用于根据所述预测标签获得与所述企业指标对应的预测信息;所述预测信息包括与所述风险预测标签对应的风险预测信息,以及与所述无风险预测标签对应的无风险预测信息;及The prediction information obtaining module is used to obtain prediction information corresponding to the enterprise index according to the prediction label; the prediction information includes risk prediction information corresponding to the risk prediction label, and corresponding to the risk-free prediction label No risk prediction information; and
    发送模块,用于将所述预测信息发送至对应终端。The sending module is used to send the prediction information to the corresponding terminal.
  9. 根据权利要求8所述的装置,其特征在于,所述预测标签生成模块还用于:The apparatus according to claim 8, wherein the predicted label generation module is further used to:
    从样本数据中提取样本标签,并获取所述样本标签的属性;Extracting the sample label from the sample data, and obtaining the attributes of the sample label;
    根据所述属性将所述样本数据分为第一类样本和第二类样本;Divide the sample data into first-type samples and second-type samples according to the attributes;
    获取与所述第一类样本对应的第一样本标签,以及与所述第二类样本对应的第二样本 标签;其中,所述第一样本标签为携带风险数据的样本标签,所述第二样本标签为无风险数据的样本标签;及Acquiring a first sample label corresponding to the first type of sample and a second sample label corresponding to the second type of sample; wherein the first sample label is a sample label carrying risk data, the The second sample label is a sample label of no risk data; and
    将所述第一样本标签作为初始模型的风险预测标签,将所述第二样本标签作为所述初始模型的无风险预测标签。The first sample label is used as a risk prediction label of the initial model, and the second sample label is used as a risk-free prediction label of the initial model.
  10. 一种计算机设备,包括存储器及一个或多个处理器,所述存储器中储存有计算机可读指令,所述计算机可读指令被所述一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device includes a memory and one or more processors. The memory stores computer-readable instructions. When the computer-readable instructions are executed by the one or more processors, the one or more Each processor performs the following steps:
    接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息;所述预测请求携带企业标识,所述企业标识与所述企业的各维度信息对应;Receiving a prediction request sent by a terminal, and acquiring information of each dimension of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
    从所述企业的各维度信息中提取企业指标;所述企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单;Extract enterprise indicators from all dimensions of the enterprise; the enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists;
    将所述企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与所述企业指标对应的预测标签;所述可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;所述预测标签包括风险预测标签和无风险预测标签;Input the enterprise index as a prediction feature into an available prediction model obtained by pre-training, and output a prediction label corresponding to the enterprise index; the available prediction model is trained based on the prompt information corresponding to the training data set and sample data; Forecast labels include risk prediction labels and risk-free prediction labels;
    根据所述预测标签获得与所述企业指标对应的预测信息;所述预测信息包括与所述风险预测标签对应的风险预测信息,以及与所述无风险预测标签对应的无风险预测信息;及Obtaining prediction information corresponding to the enterprise indicator according to the prediction tag; the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag; and
    将所述预测信息发送至对应终端。Send the prediction information to the corresponding terminal.
  11. 根据权利要求10所述的计算机设备,其特征在于,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device of claim 10, wherein the processor further executes the following steps when executing the computer-readable instructions:
    从数据库中获取样本数据,并从所述样本数据中提取样本标签,将所述样本标签作为初始模型的预测标签;Obtain sample data from the database, and extract sample labels from the sample data, and use the sample labels as the prediction labels of the initial model;
    获取与所述样本数据对应的样本指标,并将所述样本指标作为初始模型的预测特征;Acquiring a sample index corresponding to the sample data, and using the sample index as a prediction feature of the initial model;
    根据所述预测特征和所述预测标签生成训练数据集;及Generating a training data set according to the prediction feature and the prediction label; and
    利用所述训练数据集和所述样本数据对应的提示信息,对所述初始模型进行训练,获得可用预测模型。Using the prompt information corresponding to the training data set and the sample data, the initial model is trained to obtain an available prediction model.
  12. 根据权利要求11所述的计算机设备,其特征在于,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device according to claim 11, wherein the processor further executes the following steps when executing the computer-readable instructions:
    从数据库中获取样本数据集,并将所述样本数据集存储至预设对象中;Obtain the sample data set from the database and store the sample data set in a preset object;
    调用数据库中的训练函数;Call the training function in the database;
    根据预设的训练函数和训练参数之间的对应关系,获取与所述训练函数对应的训练参数;及Obtaining training parameters corresponding to the training function according to the correspondence between the preset training function and the training parameters; and
    从所述预设对象中调用样本数据集,并根据所述训练参数和所述样本数据集,构建初始模型。The sample data set is called from the preset object, and an initial model is constructed according to the training parameters and the sample data set.
  13. 根据权利要求10所述的计算机设备,其特征在于,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device of claim 10, wherein the processor further executes the following steps when executing the computer-readable instructions:
    接收并解析所述终端发送的预测请求;及Receive and parse the prediction request sent by the terminal; and
    获取所述预测请求携带的企业标识;Obtaining the enterprise identification carried in the prediction request;
    基于所述企业标识和所述企业的各维度信息之间的对应关系,获取与所述企业标识对应的企业的各维度信息。Based on the correspondence between the enterprise ID and the dimension information of the enterprise, acquire the dimension information of the enterprise corresponding to the enterprise ID.
  14. 根据权利要求11所述的计算机设备,其特征在于,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device according to claim 11, wherein the processor further executes the following steps when executing the computer-readable instructions:
    从样本数据中提取样本标签,并获取所述样本标签的属性;Extracting the sample label from the sample data, and obtaining the attributes of the sample label;
    根据所述属性将所述样本数据分为第一类样本和第二类样本;Divide the sample data into first-type samples and second-type samples according to the attributes;
    获取与所述第一类样本对应的第一样本标签,以及与所述第二类样本对应的第二样本标签;其中,所述第一样本标签为携带风险数据的样本标签,所述第二样本标签为无风险数据的样本标签;及Acquiring a first sample label corresponding to the first type of sample and a second sample label corresponding to the second type of sample; wherein the first sample label is a sample label carrying risk data, the The second sample label is a sample label of no risk data; and
    将所述第一样本标签作为初始模型的风险预测标签,将所述第二样本标签作为所述初始模型的无风险预测标签。The first sample label is used as a risk prediction label of the initial model, and the second sample label is used as a risk-free prediction label of the initial model.
  15. 根据权利要求14所述的计算机设备,其特征在于,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device according to claim 14, wherein the processor further executes the following steps when executing the computer-readable instructions:
    根据预设的风险预测标签和风险预测信息之间的对应关系,获取与所述风险预测标签对应的风险预测信息;及Obtain the risk prediction information corresponding to the risk prediction tag according to the correspondence between the preset risk prediction tag and the risk prediction information; and
    根据预设的无风险预测标签和无风险预测信息之间的对应关系,获取与所述无风险预测标签对应的无风险预测信息。Obtain the risk-free prediction information corresponding to the risk-free prediction label according to the correspondence between the preset risk-free prediction label and the risk-free prediction information.
  16. 一个或多个存储有计算机可读指令的非易失性计算机可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:One or more non-volatile computer-readable storage media storing computer-readable instructions, which when executed by one or more processors, cause the one or more processors to perform the following steps:
    接收终端发送的预测请求,并获取与所述预测请求对应的企业的各维度信息;所述预测请求携带企业标识,所述企业标识与所述企业的各维度信息对应;Receiving a prediction request sent by a terminal, and acquiring information of each dimension of the enterprise corresponding to the prediction request; the prediction request carries an enterprise identifier, and the enterprise identifier corresponds to each dimension information of the enterprise;
    从所述企业的各维度信息中提取企业指标;所述企业指标包括各项财务指标、法务诉讼信息、舆情信息以及进出口清单;Extract enterprise indicators from all dimensions of the enterprise; the enterprise indicators include various financial indicators, legal litigation information, public opinion information, and import and export lists;
    将所述企业指标作为预测特征输入预先训练得到的可用预测模型中,输出与所述企业指标对应的预测标签;所述可用预测模型根据训练数据集和样本数据对应的提示信息训练得到;所述预测标签包括风险预测标签和无风险预测标签;Input the enterprise index as a prediction feature into an available prediction model obtained by pre-training, and output a prediction label corresponding to the enterprise index; the available prediction model is trained based on the prompt information corresponding to the training data set and sample data; Forecast labels include risk prediction labels and risk-free prediction labels;
    根据所述预测标签获得与所述企业指标对应的预测信息;所述预测信息包括与所述风险预测标签对应的风险预测信息,以及与所述无风险预测标签对应的无风险预测信息;及Obtaining prediction information corresponding to the enterprise indicator according to the prediction tag; the prediction information includes risk prediction information corresponding to the risk prediction tag, and risk-free prediction information corresponding to the risk-free prediction tag; and
    将所述预测信息发送至对应终端。Send the prediction information to the corresponding terminal.
  17. 根据权利要求16所述的存储介质,其特征在于,所述计算机可读指令被所述处理器执行时还执行以下步骤:The storage medium according to claim 16, wherein when the computer-readable instructions are executed by the processor, the following steps are further performed:
    从数据库中获取样本数据,并从所述样本数据中提取样本标签,将所述样本标签作为初始模型的预测标签;Obtain sample data from the database, and extract sample labels from the sample data, and use the sample labels as the prediction labels of the initial model;
    获取与所述样本数据对应的样本指标,并将所述样本指标作为初始模型的预测特征;Acquiring a sample index corresponding to the sample data, and using the sample index as a prediction feature of the initial model;
    根据所述预测特征和所述预测标签生成训练数据集;及Generating a training data set according to the prediction feature and the prediction label; and
    利用所述训练数据集和所述样本数据对应的提示信息,对所述初始模型进行训练,获得可用预测模型。Using the prompt information corresponding to the training data set and the sample data, the initial model is trained to obtain an available prediction model.
  18. 根据权利要求17所述的存储介质,其特征在于,所述计算机可读指令被所述处理器执行时还执行以下步骤:The storage medium according to claim 17, wherein when the computer-readable instructions are executed by the processor, the following steps are further performed:
    从数据库中获取样本数据集,并将所述样本数据集存储至预设对象中;Obtain the sample data set from the database and store the sample data set in a preset object;
    调用数据库中的训练函数;Call the training function in the database;
    根据预设的训练函数和训练参数之间的对应关系,获取与所述训练函数对应的训练参数;及Obtaining training parameters corresponding to the training function according to the correspondence between the preset training function and the training parameters; and
    从所述预设对象中调用样本数据集,并根据所述训练参数和所述样本数据集,构建初始模型。The sample data set is called from the preset object, and an initial model is constructed according to the training parameters and the sample data set.
  19. 根据权利要求16所述的存储介质,其特征在于,所述计算机可读指令被所述处理器执行时还执行以下步骤:The storage medium according to claim 16, wherein when the computer-readable instructions are executed by the processor, the following steps are further performed:
    接收并解析所述终端发送的预测请求;及Receive and parse the prediction request sent by the terminal; and
    获取所述预测请求携带的企业标识;Obtaining the enterprise identification carried in the prediction request;
    基于所述企业标识和所述企业的各维度信息之间的对应关系,获取与所述企业标识对应的企业的各维度信息。Based on the correspondence between the enterprise ID and the dimension information of the enterprise, acquire the dimension information of the enterprise corresponding to the enterprise ID.
  20. 根据权利要求17所述的存储介质,其特征在于,所述计算机可读指令被所述处理器执行时还执行以下步骤:The storage medium according to claim 17, wherein when the computer-readable instructions are executed by the processor, the following steps are further performed:
    从样本数据中提取样本标签,并获取所述样本标签的属性;Extracting the sample label from the sample data, and obtaining the attributes of the sample label;
    根据所述属性将所述样本数据分为第一类样本和第二类样本;Divide the sample data into first-type samples and second-type samples according to the attributes;
    获取与所述第一类样本对应的第一样本标签,以及与所述第二类样本对应的第二样本标签;其中,所述第一样本标签为携带风险数据的样本标签,所述第二样本标签为无风险数据的样本标签;及Acquiring a first sample label corresponding to the first type of sample and a second sample label corresponding to the second type of sample; wherein the first sample label is a sample label carrying risk data, the The second sample label is a sample label of no risk data; and
    将所述第一样本标签作为初始模型的风险预测标签,将所述第二样本标签作为所述初始模型的无风险预测标签。The first sample label is used as a risk prediction label of the initial model, and the second sample label is used as a risk-free prediction label of the initial model.
PCT/CN2019/099382 2018-10-11 2019-08-06 Risk forecast method, device, computer apparatus, and storage medium WO2020073727A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811183672.5A CN109523117A (en) 2018-10-11 2018-10-11 Risk Forecast Method, device, computer equipment and storage medium
CN201811183672.5 2018-10-11

Publications (1)

Publication Number Publication Date
WO2020073727A1 true WO2020073727A1 (en) 2020-04-16

Family

ID=65770500

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/099382 WO2020073727A1 (en) 2018-10-11 2019-08-06 Risk forecast method, device, computer apparatus, and storage medium

Country Status (2)

Country Link
CN (1) CN109523117A (en)
WO (1) WO2020073727A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109523117A (en) * 2018-10-11 2019-03-26 平安科技(深圳)有限公司 Risk Forecast Method, device, computer equipment and storage medium
CN111582751B (en) * 2020-05-19 2022-06-14 国网吉林省电力有限公司 Time-weighted electricity purchasing risk early warning method
CN111695908B (en) * 2020-06-04 2024-02-20 中国工商银行股份有限公司 Bill business risk prediction method and device
CN112561320A (en) * 2020-12-14 2021-03-26 中国建设银行股份有限公司 Training method of mechanism risk prediction model, mechanism risk prediction method and device
CN112465411B (en) * 2021-01-27 2021-06-04 支付宝(杭州)信息技术有限公司 Risk prediction method, device and equipment
CN113283795B (en) * 2021-06-11 2023-04-07 同盾科技有限公司 Data processing method and device based on two-classification model, medium and equipment
CN113487344B (en) * 2021-06-11 2023-01-17 中国联合网络通信集团有限公司 Communication service prediction method, centralized server, edge service terminal and medium
CN116028964B (en) * 2023-03-28 2023-05-23 中国标准化研究院 Information security risk management system
CN117273963A (en) * 2023-11-21 2023-12-22 之江实验室 Risk identification method and device based on car insurance scene

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140143134A1 (en) * 2010-01-13 2014-05-22 Corelogic Solutions, Llc System and method of detecting and assessing multiple types of risks related to mortgage lending
CN108197178A (en) * 2017-12-22 2018-06-22 国云科技股份有限公司 A kind of business risk appraisal procedure
CN108197740A (en) * 2017-12-29 2018-06-22 广东金赋科技股份有限公司 Business failure Forecasting Methodology, electronic equipment and computer storage media
CN108564286A (en) * 2018-04-19 2018-09-21 天合泽泰(厦门)征信服务有限公司 A kind of artificial intelligence finance air control credit assessment method and system based on big data reference
CN108596439A (en) * 2018-03-29 2018-09-28 北京中兴通网络科技股份有限公司 A kind of the business risk prediction technique and system of knowledge based collection of illustrative plates
CN109523117A (en) * 2018-10-11 2019-03-26 平安科技(深圳)有限公司 Risk Forecast Method, device, computer equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106611291A (en) * 2016-12-13 2017-05-03 税云网络科技服务有限公司 Information push method and device
CN107563614A (en) * 2017-08-16 2018-01-09 北京金堤科技有限公司 A kind of business risk appraisal procedure and device
CN107909274B (en) * 2017-11-17 2023-02-28 平安科技(深圳)有限公司 Enterprise investment risk assessment method and device and storage medium
CN108229806A (en) * 2017-12-27 2018-06-29 中国银行股份有限公司 A kind of method and system for analyzing business risk

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140143134A1 (en) * 2010-01-13 2014-05-22 Corelogic Solutions, Llc System and method of detecting and assessing multiple types of risks related to mortgage lending
CN108197178A (en) * 2017-12-22 2018-06-22 国云科技股份有限公司 A kind of business risk appraisal procedure
CN108197740A (en) * 2017-12-29 2018-06-22 广东金赋科技股份有限公司 Business failure Forecasting Methodology, electronic equipment and computer storage media
CN108596439A (en) * 2018-03-29 2018-09-28 北京中兴通网络科技股份有限公司 A kind of the business risk prediction technique and system of knowledge based collection of illustrative plates
CN108564286A (en) * 2018-04-19 2018-09-21 天合泽泰(厦门)征信服务有限公司 A kind of artificial intelligence finance air control credit assessment method and system based on big data reference
CN109523117A (en) * 2018-10-11 2019-03-26 平安科技(深圳)有限公司 Risk Forecast Method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN109523117A (en) 2019-03-26

Similar Documents

Publication Publication Date Title
WO2020073727A1 (en) Risk forecast method, device, computer apparatus, and storage medium
WO2020253358A1 (en) Service data risk control analysis processing method, apparatus and computer device
WO2020037942A1 (en) Risk prediction processing method and apparatus, computer device and medium
CN109657917B (en) Risk early warning method and device for evaluation object, computer equipment and storage medium
CN110400219B (en) Service processing method and system, and transaction monitoring method and system
US20170124494A1 (en) Systems, methods and devices for modelling operational risk
CA2910754C (en) Systems, methods and devices for modelling operational risk
CN112348321A (en) Risk user identification method and device and electronic equipment
CN112990281A (en) Abnormal bid identification model training method, abnormal bid identification method and abnormal bid identification device
CN115936895A (en) Risk assessment method, device and equipment based on artificial intelligence and storage medium
CN115630221A (en) Terminal application interface display data processing method and device and computer equipment
CN110910241B (en) Cash flow evaluation method, apparatus, server device and storage medium
CN117114901A (en) Method, device, equipment and medium for processing insurance data based on artificial intelligence
CN112163154A (en) Data processing method, device, equipment and storage medium
CN115222112A (en) Behavior prediction method, behavior prediction model generation method and electronic equipment
CN114723239A (en) Multi-party collaborative modeling method, device, equipment, medium and program product
CN114048330A (en) Risk conduction probability knowledge graph generation method, device, equipment and storage medium
CN114493686A (en) Operation content generation and pushing method and device
CN113177840A (en) Client risk identification method and device
CN117172632B (en) Enterprise abnormal behavior detection method, device, equipment and storage medium
CN117390258A (en) Financial service recommendation method, device, computer equipment and storage medium
CN115293783A (en) Risk user identification method and device, computer equipment and storage medium
CN117611352A (en) Vehicle insurance claim processing method, device, computer equipment and storage medium
CN117876021A (en) Data prediction method, device, equipment and storage medium based on artificial intelligence
CN116702789A (en) Semantic analysis method, semantic analysis device, computer equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19871766

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19871766

Country of ref document: EP

Kind code of ref document: A1