WO2023065545A1 - Risk prediction method and apparatus, and device and storage medium - Google Patents

Risk prediction method and apparatus, and device and storage medium Download PDF

Info

Publication number
WO2023065545A1
WO2023065545A1 PCT/CN2022/071241 CN2022071241W WO2023065545A1 WO 2023065545 A1 WO2023065545 A1 WO 2023065545A1 CN 2022071241 W CN2022071241 W CN 2022071241W WO 2023065545 A1 WO2023065545 A1 WO 2023065545A1
Authority
WO
WIPO (PCT)
Prior art keywords
risk
graph
preset
knowledge graph
sparse
Prior art date
Application number
PCT/CN2022/071241
Other languages
French (fr)
Chinese (zh)
Inventor
肖京
李娜
王磊
赵盟盟
王媛
谭韬
陈又新
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2023065545A1 publication Critical patent/WO2023065545A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities

Definitions

  • the present application relates to the technical field of artificial intelligence, and in particular to a risk prediction method, device, electronic equipment, and computer-readable storage medium.
  • the existing risk prediction methods are mainly based on the relatively simple network structure between enterprises, and the generation of the network structure depends on the relationship between enterprises, such as credit relationship.
  • a risk prediction method provided by this application includes:
  • Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
  • the present application also provides a risk prediction device, the device comprising:
  • the knowledge map construction module is used to obtain a multi-source information set, extract a risk-aware factor set from the multi-source information set, and construct a time-series knowledge map based on the risk-aware factor set, and use the preset implicit relationship to complement the algorithm
  • the implicit relationship in the time series knowledge graph is obtained to obtain a standard knowledge graph
  • the target risk entity prediction module is used to construct a risk prediction model based on a preset reinforcement learning algorithm, use the risk prediction model to perform risk prediction on entities in the standard knowledge map, obtain a risk probability, and set the risk probability to be greater than Or an entity equal to a preset probability threshold as a target risk entity;
  • An event map generation module configured to use a preset causal relation supplementary algorithm to supplement the time-series knowledge map with causal relations to obtain an event map;
  • the graph quantification module is used to use the preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the closeness of dependence, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the event hazard degree;
  • the macro prediction module is used to obtain a macro prediction model based on the event map, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised method training, and use the macro prediction model to predict the target risk entity Prediction is performed to obtain the macro risk probability, and the industry type corresponding to the entity whose macro risk probability is greater than or equal to the preset macro threshold is determined as a risk industry.
  • the present application also provides an electronic device, the electronic device comprising:
  • the memory stores a computer program executable by the at least one processor, the computer program is executed by the at least one processor, so that the at least one processor can perform the following steps:
  • Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
  • the present application also provides a computer-readable storage medium, at least one computer program is stored in the computer-readable storage medium, and the at least one computer program is executed by a processor in an electronic device to implement the following steps:
  • Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
  • Fig. 1 is a schematic flow chart of a risk prediction method provided by an embodiment of the present application
  • FIG. 2 is a functional block diagram of a risk prediction device provided by an embodiment of the present application.
  • Fig. 3 is a schematic structural diagram of an electronic device implementing the risk prediction method provided by an embodiment of the present application.
  • the embodiment of this application provides a risk prediction method.
  • the subject of execution of the risk prediction method includes but is not limited to at least one of electronic devices such as a server and a terminal that can be configured to execute the method provided by the embodiment of the present application.
  • the risk prediction method can be executed by software or hardware installed on the terminal device or server device, and the software can be a block chain platform.
  • the server includes, but is not limited to: a single server, a server cluster, a cloud server or a cloud server cluster, and the like.
  • the server can be an independent server, or it can provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, content delivery network (ContentDelivery Network) , CDN), and cloud servers for basic cloud computing services such as big data and artificial intelligence platforms.
  • the risk prediction method includes:
  • the multi-source information set can be obtained from multi-source heterogeneous information sources, wherein the multi-source heterogeneous information sources come from multiple channels such as Internet public data, business data and government data, including but not It is limited to various types such as video, electronic pictures, remote sensing images, and texts.
  • multi-source heterogeneous information sources come from multiple channels such as Internet public data, business data and government data, including but not It is limited to various types such as video, electronic pictures, remote sensing images, and texts.
  • the extracting the risk perception factor set from the multi-source information set includes:
  • the information in the multi-source information set contains different types, and the text information and image information in the multi-source information set are screened out, and the text information is factor-extracted using a preset natural language processing technology, wherein, Described natural language processing technique can be to utilize Word2Vec algorithm to carry out word embedding, utilize FastText model or ELMO (Embedding from language models, two-way language model) model to carry out text feature encoding, utilize BERT (Bidirectional Encoder Representations from Transformer, two-way encoder representation) The model performs text feature extraction.
  • Word2Vec algorithm to carry out word embedding
  • ELMO Embedding from language models, two-way language model
  • BERT Bidirectional Encoder Representations from Transformer, two-way encoder representation
  • the image recognition technology can be target detection algorithms such as YOLO V2-V4 or SSD, target recognition algorithms such as AlexNet or ResNet, and weakly supervised semantic segmentation methods and other semantic segmentation algorithms.
  • optical character recognition (OCR) and NLP technology can be used to complete information extraction.
  • targets such as crops, shipping goods, and sea and land transportation can be identified from ultra-high-resolution satellite images, and then early warnings can be made for changes in important links of economic production.
  • OCR technology can be used to extract useful information from non-standard information such as financial notes and transaction notes Important information for risk assessment, while nighttime light remote sensing data can be used to dynamically predict population density and urban expansion speed.
  • NLP natural language processing
  • Text data such as news, public opinion, and forum information
  • find the relationship between financial events and extract relevant factors that describe economic uncertainty
  • Text data which can mine information such as enterprise income, business development scale, and company development strategy tendency; it can also extract event tendency scores, attention Degree index, risk volatility and other factors.
  • constructing a time series knowledge map based on the risk perception factor set includes:
  • a graph is constructed based on the entity and the entity relationship to obtain a time series knowledge graph.
  • the algorithm for entity relationship extraction includes but is not limited to the following: supervised learning (SVM, NN, naive Bayesian), semi-supervised learning (distance supervision, Bootstrapping), deep learning ( Pipeline such as Att-CNN&Att-BLSTM, Joint Model such as LSTM-RNNs).
  • supervised learning SVM, NN, naive Bayesian
  • semi-supervised learning distance supervision, Bootstrapping
  • deep learning Pipeline such as Att-CNN&Att-BLSTM, Joint Model such as LSTM-RNNs.
  • SWRL graph-based reasoning Path Ranking
  • the relationship between entities in the time-series knowledge graph can be enriched and calibrated, and the application range of the time-series knowledge graph can be broadened.
  • the implicit relationship refers to the entity relationship that is difficult to obtain directly between entities, and the time-series knowledge graph contains multiple entities and the relationship between entities, but the relationship between entities in the time-series knowledge graph is more obvious Therefore, it is necessary to deeply mine each entity to obtain the implicit relationship, and add the implicit relationship to the time-series knowledge graph to obtain a standard knowledge graph.
  • the use of the preset implicit relationship complement algorithm to complement the implicit relationship in the time-series knowledge graph to obtain a standard knowledge graph includes:
  • this solution uses the graph convolutional network model to complete the implicit relationship of the time-series knowledge graph, mainly in that the graph sparse processing is performed on the time-series knowledge graph first, and the sparse data obtained after the graph sparse Knowledge graphs are used as input to relational graph convolutional networks for relational prediction.
  • This method can avoid the problem of low prediction accuracy of the model due to sparse features when the observed entity relationship samples are insufficient and the entity relationship types are diverse.
  • it can solve the problem of nodes on large graphs as the size of the graph increases. Functions such as classification and relation prediction can also be time and space intensive problems.
  • the graph convolutional network model converts the input knowledge map into a sparse knowledge map and performs the next step of relationship prediction.
  • the preset sparse graph convolutional network performs graph sparse processing on the time series knowledge graph to obtain a sparse knowledge graph, including:
  • E ⁇ e 1 ,..., e m ⁇ is a set of relationships between entities in the time-series knowledge graph
  • the adjacency matrix corresponding to the time-series knowledge graph is a two-dimensional array storing the relationship between entities.
  • the preset weight matrix of the graph convolutional network can be W (0) and W (1) .
  • the construction of the sparse output function based on the adjacency matrix, the feature matrix and the preset weight matrix includes:
  • W is the preset weight matrix
  • A is the adjacency matrix
  • IN is a fixed parameter
  • diag is a diagonal matrix
  • W (0) and W (1) are the preset weight matrices of the sparse graph convolutional network
  • ReLU is linear rectification function
  • X is the feature matrix.
  • optimization of the sparse output function by using the multiplier-based alternating direction algorithm includes:
  • An adaptive moment estimation optimizer is used to update the gradients of the variables in the simplified output function to obtain the updated gradients of the variables.
  • simplifying the sparse output function means that the output of the sparse graph convolutional network depends on and W, but can be expressed as a function of A, then the output can be expressed as a function of A and W, that is, Z(A,W), because W remains unchanged, Z(A,W) can be simplified to Z(A) .
  • the adaptive moment estimation optimizer is used to update the gradient of the variables in the simplified output function, and the variables in the optimized sparse output function are updated to the time-series knowledge graph to obtain a sparse knowledge graph .
  • the optimization process using the multiplier-based Alternating Direction Algorithm can preserve the network backbone of the structural and hierarchical information in the temporal knowledge graph, and preserve the performance of the temporal knowledge graph while maintaining the performance of node classification prediction. edge information.
  • the method further includes:
  • the relational graph convolutional network is output as a trained relational graph convolutional network.
  • the relational graph convolutional network includes an entity encoder and a decoder
  • the entity encoder is used to generate a latent feature representation of an entity
  • the decoder is used to score the latent feature representation through a scoring function.
  • the entity encoder in the preset relational graph convolutional network is used to perform feature prediction on the sparse knowledge graph, that is, the R-GCN is used as the encoder to generate a real vector representation e i of each entity.
  • the R-GCN model stacks L layers according to the preset method, the output of the previous layer in the R-GCN model will be used as the input of the next layer, and the entity encoder uses the output of R-GCN as the vector of each entity means that is the hidden vector (hidden state) of node v i in the l-layer neural network, and d (l) is the dimension represented by the vector of this layer.
  • the scoring of the latent features corresponding to the entity based on the decoder in the relational graph convolutional network includes:
  • the latent features corresponding to the entities are scored using the DistMult factorization model in the decoder.
  • the DistMult factorization model is a kind of semantic matching model.
  • the semantic matching model uses a similarity-based scoring function to measure the possibility of the existence or establishment of this triple by matching the latent semantics of entities and relations in the latent space.
  • DistMult factorization model in the decoder to score the potential features corresponding to the entity, including:
  • cross-entropy loss value calculated according to the target potential feature and the preset cross-entropy loss function includes:
  • the cross-entropy loss function is:
  • the relationship prediction is carried out on the sparse knowledge map, and the implicit relationship is obtained as "using the PPP mode to trigger the oriental garden event", then the implicit relationship "using the PPP mode to trigger the oriental garden event" is added to the time series knowledge map In the process, the standard knowledge graph is obtained.
  • the preset reinforcement learning algorithm is a framework that can be applied to sequential decision-making and control tasks, wherein the agent (Agent) in the reinforcement learning algorithm optimizes its behavior by interacting with the environment (Environment).
  • the construction of a risk prediction model based on a preset reinforcement learning algorithm includes:
  • the time difference is used as an objective function, and the reinforcement learning algorithm is used as a framework to train a risk prediction model.
  • the original risk status data refers to the identification data of the risk situation to which the current data belongs.
  • Pre-training data can be obtained by extracting data conforming to preset sampling standards, for example, extracting data whose risk profile satisfies high-risk and medium-risk conditions as pre-training data.
  • the reinforcement learning algorithm is the Actor-Critic algorithm.
  • Actor-Critic the role of the agent is divided into a participant (Actor) and a critic (Critic).
  • Actor and Critic respectively represent Policy and Value function.
  • the critic is responsible for processing the received reward r, i.e. evaluating the quality of the current policy by adjusting the value function.
  • the participants are updated by using information from the critic.
  • the risk prediction model can be used to perform risk prediction on a plurality of different entities in the standard knowledge map, and the entities are input into the risk prediction model to obtain the risk corresponding to the entity probability, and the entity whose risk probability is greater than or equal to the preset probability threshold is taken as the target risk entity.
  • the entities in the standard knowledge graph include different types of enterprises.
  • an enterprise whose risk probability in the standard knowledge graph is predicted to be greater than or equal to the preset probability threshold according to the risk prediction model is used as a target risk enterprise.
  • the causal relationship supplement refers to supplementing the causal sequence and other relationships among entities in the time series knowledge graph.
  • the use of the preset causality supplement algorithm to supplement the time-series knowledge map with causality to obtain the event map includes:
  • Event fusion is performed on multiple standard triples to obtain fusion events, and the fusion events are added to the time series knowledge graph to obtain an event graph.
  • an event refers to an event or state change that occurs at a specific point in time or a period of time, or in a specific geographical area, and consists of one or more actions involving one or more roles.
  • One event causes or causes or causes another event between two events.
  • the causal relationship includes positive, negative, explicit and implicit relationships, and includes other things such as exclusion of turning, juxtaposition, etc., which can help event fusion and reasoning various relationships.
  • the causality induction is to form a causal triplet in the data form of a triplet of "causal event-relationship-result event" from two events with causal relationship extracted from the training text set.
  • event extraction and causality induction can be completed through a pre-training model, wherein the basic model in the pre-training model adopts the structural idea of BERT+CRF, and in fact, RoBERTa, an improved version of BERT, is used as the pre-training model.
  • the input is a word embedding vector.
  • multiple transformer modules are used to output multiple hidden vectors. Then the sequence label generation task is completed by the Seq2Seq model.
  • this model can also complete the standardized expression of events and remove adverbs, particles and other tasks.
  • the joint model makes full use of the semantic information of the pre-trained model, and achieves good results in both event extraction and induction.
  • the preset screening criteria are pre-constructed rules or templates to filter event nodes that do not conform to expression habits or incomplete expressions, and after obtaining standardized event representation and causal relationship, use pre-constructed rules or templates to filter events Express custom or incomplete event nodes.
  • Different event nodes after screening may refer to the same entity in the real world because they have the same meaning. Therefore, event fusion is required to obtain fusion events.
  • performing event fusion on multiple standard triples to obtain a fusion event includes:
  • the standard triplet corresponding to the similarity is divided into the first cluster of events
  • the incremental clustering algorithm can obtain real-time clustering results, so after clustering some event samples, samples can be extracted from the clustering results to expand the training set and retrain the model so that the model can learn new Event and text features to enhance the clustering effect. Finally, after the above steps are completed, combine all the clusters with fewer samples in the clustering result with Buffer as uncertain samples, and use the model after multiple trainings to cluster these samples to obtain the final clustering result. Thus, the process of this part of event fusion is completed.
  • the use of the preset social network analysis algorithm to quantify the relationship of the standard knowledge map to obtain the closeness of dependence includes:
  • the degree centrality refers to the number of edges connected to a node, which is used to represent the connection degree of nodes
  • the modularity class is used for community detection, and is used to measure community division Quality or stability, a modular classification measure equal to the number of edges within a group minus the expected number of edges in an equivalent network with randomly set edges.
  • the Betweenness centrality measures how easy it is for a node to reach other nodes.
  • the calculation of the proximity centrality of the standard knowledge graph using a preset proximity centrality calculation formula includes:
  • C B (v) represents the proximity centrality value of node v
  • ⁇ st (v) represents the sum of the shortest paths from node s to node t passing through v
  • ⁇ st represents the distance between node s and node t The sum of all the shortest path numbers of , v,s,t ⁇ V.
  • the preset calculation formula of dependence closeness includes:
  • the graph attention network learns the attention coefficients on all neighbors of the node to perform feature aggregation, and can Improving the performance of many graph learning tasks.
  • the scoring function depends on the attention coefficient of the network and the related entity feature vector, and then train the model with the goal of minimizing the mean square error loss, and finally output the relationship between each pair of entities in the form of a matrix. risk hazard score. Therefore, for different knowledge graphs and entity characteristics, it is possible to quantitatively evaluate entity relationships in the financial field, such as corporate credit relationships, supply chain relationships, and inter-industry input-output relationships.
  • the S7 includes:
  • the supervised model and the unsupervised time series model are combined into a macro forecasting model by using a preset semi-supervised Bayesian algorithm.
  • the macro forecast model is used to predict the risk of the industry, and the macro forecast model is used to predict the target risk entity to obtain the macro risk probability, and the macro risk probability is greater than or equal to the predicted
  • the industry type corresponding to the entity of the set macro threshold is a risk industry.
  • the target risk entity is the construction and environmental protection sector
  • the macro forecast model is used to predict the target risk entity
  • the macro risk probability is 0.6. If the preset macro threshold is 0.5, the macro risk probability is greater than As for the macro threshold, the target risk entity is the construction industry corresponding to the construction and environmental protection sector as the risk industry.
  • the embodiment of the present application extracts the risk perception factor set from the pre-acquired multi-source information set, and builds a time-series knowledge map based on the risk-aware factor set.
  • the relation complement algorithm completes the implicit relation in the time series knowledge map to obtain the standard knowledge map.
  • Building a risk prediction model based on a preset reinforcement learning algorithm, using the reinforcement learning algorithm to construct the model can ensure the stability of the model, and using the risk prediction model to perform risk prediction on entities in the standard knowledge map, and obtain Target risk entity.
  • the risk forecasting model can realize risk forecasting from the perspective of entities, and the macro forecasting model can predict the target risk entities that have undergone risk forecasting from an industry perspective, thereby improving the accuracy of risk forecasting for industries. Therefore, the risk prediction method proposed in this application can solve the problem that the accuracy of risk prediction for the industry is not high enough.
  • FIG. 2 it is a functional block diagram of a risk prediction device provided by an embodiment of the present application.
  • the risk prediction device 100 described in this application can be installed in an electronic device. According to the realized functions, the risk prediction device 100 may include a knowledge graph construction module 101 , a target risk entity prediction module 102 , an event graph generation module 103 , a graph quantification module 104 and a macro prediction module 105 .
  • the module described in this application can also be called a unit, which refers to a series of computer program segments that can be executed by the processor of the electronic device and can complete fixed functions, and are stored in the memory of the electronic device.
  • each module/unit is as follows:
  • the knowledge map construction module 101 is used to obtain a multi-source information set, extract a risk-aware factor set from the multi-source information set, and construct a time-series knowledge map based on the risk-aware factor set, and use preset implicit relationships to supplement The algorithm completes the implicit relationship in the time-series knowledge graph to obtain a standard knowledge graph;
  • the target risk entity prediction module 102 is configured to construct a risk prediction model based on a preset reinforcement learning algorithm, use the risk prediction model to perform risk prediction on entities in the standard knowledge graph, obtain a risk probability, and convert the Entities whose risk probability is greater than or equal to the preset probability threshold are regarded as target risk entities;
  • the event map generation module 103 is used to supplement the sequence knowledge map with causality by using a preset causal relation supplementary algorithm to obtain an event map;
  • the graph quantification module 104 is used to quantify the relationship of the standard knowledge graph by using a preset social network analysis algorithm to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph, Get the degree of hazard of the event;
  • the macro-prediction module 105 is used to obtain a macro-prediction model based on the event map, the dependency closeness and the degree of hazard of the event in combination with graph neural network and semi-supervised method training, and use the macro-prediction model to analyze the
  • the target risk entity performs prediction to obtain the macro risk probability, and determines that the industry type corresponding to the entity whose macro risk probability is greater than or equal to the preset macro threshold is a risk industry.
  • each module described in the risk prediction device 100 in the embodiment of the present application uses the same technical means as the risk prediction method described in Figure 1 above, and can produce the same technical effect, and will not be repeated here. .
  • FIG. 3 it is a schematic structural diagram of an electronic device implementing a risk prediction method provided by an embodiment of the present application.
  • the electronic device 1 may include a processor 10, a memory 11, a communication bus 12, and a communication interface 13, and may also include a computer program stored in the memory 11 and operable on the processor 10, such as a risk prediction program .
  • the processor 10 may be composed of integrated circuits in some embodiments, for example, may be composed of a single packaged integrated circuit, or may be composed of multiple integrated circuits with the same function or different functions packaged, including one or A combination of multiple central processing units (Central Processing unit, CPU), microprocessors, digital processing chips, graphics processors and various control chips, etc.
  • the processor 10 is the control core (Control Unit) of the electronic device, and uses various interfaces and lines to connect the various components of the entire electronic device, and runs or executes programs or modules stored in the memory 11 (such as executing risk prediction program, etc.), and call the data stored in the memory 11 to execute various functions of the electronic device and process data.
  • Control Unit Control Unit
  • the memory 11 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, mobile hard disk, multimedia card, card type memory (for example: SD or DX memory, etc.), magnetic memory, magnetic disk, optical disk, etc. .
  • the storage 11 may be an internal storage unit of the electronic device in some embodiments, such as a mobile hard disk of the electronic device.
  • the memory 11 can also be an external storage device of an electronic device in other embodiments, such as a plug-in mobile hard disk equipped on an electronic device, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD ) card, flash card (Flash Card), etc.
  • the memory 11 may also include both an internal storage unit of the electronic device and an external storage device.
  • the memory 11 can not only be used to store application software and various data installed in electronic equipment, such as codes of risk prediction programs, but also can be used to temporarily store data that has been output or will be output.
  • the communication bus 12 may be a peripheral component interconnect (PCI for short) bus or an extended industry standard architecture (EISA for short) bus or the like.
  • PCI peripheral component interconnect
  • EISA extended industry standard architecture
  • the bus can be divided into address bus, data bus, control bus and so on.
  • the bus is configured to realize connection and communication between the memory 11 and at least one processor 10 and the like.
  • the communication interface 13 is used for communication between the electronic device and other devices, including a network interface and a user interface.
  • the network interface may include a wired interface and/or a wireless interface (such as a WI-FI interface, a Bluetooth interface, etc.), which are generally used to establish a communication connection between the electronic device and other electronic devices.
  • the user interface may be a display (Display) or an input unit (such as a keyboard (Keyboard)).
  • the user interface may also be a standard wired interface or a wireless interface.
  • the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode, organic light-emitting diode) touch device, and the like.
  • the display may also be properly referred to as a display screen or a display unit, and is used for displaying information processed in the electronic device and for displaying a visualized user interface.
  • FIG. 3 only shows an electronic device with components. Those skilled in the art can understand that the structure shown in FIG. 3 does not constitute a limitation to the electronic device 1, and may include fewer or more components, or combinations of certain components, or different arrangements of components.
  • the electronic device may also include a power supply (such as a battery) for supplying power to various components.
  • the power supply may be logically connected to the at least one processor 10 through a power management device, so that Realize functions such as charge management, discharge management, and power consumption management.
  • the power supply may also include one or more DC or AC power supplies, recharging devices, power failure detection circuits, power converters or inverters, power status indicators and other arbitrary components.
  • the electronic device may also include various sensors, a Bluetooth module, a Wi-Fi module, etc., which will not be repeated here.
  • the risk prediction program stored in the memory 11 in the electronic device 1 is a combination of multiple instructions. When running in the processor 10, it can realize:
  • Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
  • the integrated modules/units of the electronic device 1 are realized in the form of software function units and sold or used as independent products, they can be stored in a computer-readable storage medium.
  • the computer-readable storage medium may be volatile or non-volatile.
  • the computer-readable storage medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, removable hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read- Only Memory).
  • the present application also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor of an electronic device, it can realize:
  • Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
  • modules described as separate components may or may not be physically separated, and the components shown as modules may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • each functional module in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units can be implemented in the form of hardware, or in the form of hardware plus software function modules.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with each other using cryptographic methods. Each data block contains a batch of network transaction information, which is used to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • AI artificial intelligence
  • digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results.

Abstract

The present application relates to artificial intelligence technology. Disclosed is a risk prediction method, comprising: constructing a temporal knowledge graph on the basis of a risk perception factor set that is extracted from a multi-source information set, and performing implicit relationship supplementation and causal relationship supplementation on the temporal knowledge graph, so as to obtain a standard knowledge graph and an event evolutionary graph; performing prediction by using a risk prediction model that is constructed on the basis of a reinforcement learning algorithm, so as to obtain a target risk entity; and performing relationship quantification and degree quantification on the standard knowledge graph, so as to obtain a dependency closeness and an event hazard degree, performing training on the basis of the event evolutionary graph, the dependency closeness and the event hazard degree and in view of a graph neural network and a semi-supervised method, so as to obtain a macro prediction model, and performing prediction by using the macro prediction model, so as to obtain a risk industry corresponding to the target risk entity. In addition, the present application further relates to blockchain technology. The event evolutionary graph can be stored in a node of a blockchain. Also provided in the present application are a risk prediction apparatus, an electronic device and a storage medium. By means of the present application, the accuracy of performing risk prediction on an industry can be improved.

Description

风险预测方法、装置、设备及存储介质Risk prediction method, device, equipment and storage medium
本申请要求于2021年10月19日提交中国专利局、申请号为CN202111216347.6、名称为“风险预测方法、装置、设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application with application number CN202111216347.6 and titled "Risk Prediction Method, Device, Equipment, and Storage Medium" filed with the China Patent Office on October 19, 2021, the entire contents of which are incorporated by reference in this application.
技术领域technical field
本申请涉及人工智能技术领域,尤其涉及一种风险预测方法、装置、电子设备及计算机可读存储介质。The present application relates to the technical field of artificial intelligence, and in particular to a risk prediction method, device, electronic equipment, and computer-readable storage medium.
背景技术Background technique
随着科技的发展和社会的进步,为了保证经济的稳定发展,提前进行关于行业的金融风险的预测是十分有必要的。行业的金融风险可能危及整个金融体系稳定,对实体经济造成严重负面效应的风险。With the development of science and technology and the progress of society, in order to ensure the stable development of the economy, it is very necessary to predict the financial risks of the industry in advance. The financial risk of the industry may endanger the stability of the entire financial system and cause serious negative effects on the real economy.
现有的风险预测方法主要基于较为简单的企业之间的网络结构,网络结构的生成要依赖于企业关系,如信贷关系。发明人意识到,企业间的关系复杂多样,例如合作研发关系、竞争关系、供应链关系等难以观测,因此会导致网络结构中缺失多种无法观测的隐含关系,进而导致对行业进行风险预测的准确度较低。The existing risk prediction methods are mainly based on the relatively simple network structure between enterprises, and the generation of the network structure depends on the relationship between enterprises, such as credit relationship. The inventor realized that the relationships among enterprises are complex and diverse, such as cooperative research and development relationships, competitive relationships, supply chain relationships, etc., which are difficult to observe, which will lead to the lack of many unobservable implicit relationships in the network structure, which in turn leads to risk prediction for the industry The accuracy is lower.
发明内容Contents of the invention
本申请提供的一种风险预测方法,包括:A risk prediction method provided by this application includes:
获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱;Obtaining a multi-source information set, extracting a risk-aware factor set from the multi-source information set, and constructing a time-series knowledge map based on the risk-aware factor set;
利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;Complementing the implicit relationship in the time-series knowledge graph by using a preset implicit relationship complement algorithm to obtain a standard knowledge graph;
基于预设的强化学习算法构建风险预测模型;Build a risk prediction model based on a preset reinforcement learning algorithm;
利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;Using the risk prediction model to perform risk prediction on the entities in the standard knowledge graph to obtain a risk probability, and use the entity whose risk probability is greater than or equal to a preset probability threshold as a target risk entity;
利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;Using a preset causality supplement algorithm to supplement the time series knowledge map with causality to obtain an event map;
利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;Using a preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard;
基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型;Based on the event map, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised method training to obtain a macro-prediction model;
利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
本申请还提供一种风险预测装置,所述装置包括:The present application also provides a risk prediction device, the device comprising:
知识图谱构建模块,用于获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱,利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;The knowledge map construction module is used to obtain a multi-source information set, extract a risk-aware factor set from the multi-source information set, and construct a time-series knowledge map based on the risk-aware factor set, and use the preset implicit relationship to complement the algorithm The implicit relationship in the time series knowledge graph is obtained to obtain a standard knowledge graph;
目标风险实体预测模块,用于基于预设的强化学习算法构建风险预测模型,利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;The target risk entity prediction module is used to construct a risk prediction model based on a preset reinforcement learning algorithm, use the risk prediction model to perform risk prediction on entities in the standard knowledge map, obtain a risk probability, and set the risk probability to be greater than Or an entity equal to a preset probability threshold as a target risk entity;
事理图谱生成模块,用于利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;An event map generation module, configured to use a preset causal relation supplementary algorithm to supplement the time-series knowledge map with causal relations to obtain an event map;
图谱量化模块,用于利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件 危害程度;The graph quantification module is used to use the preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the closeness of dependence, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the event hazard degree;
宏观预测模块,用于基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型,利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。The macro prediction module is used to obtain a macro prediction model based on the event map, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised method training, and use the macro prediction model to predict the target risk entity Prediction is performed to obtain the macro risk probability, and the industry type corresponding to the entity whose macro risk probability is greater than or equal to the preset macro threshold is determined as a risk industry.
本申请还提供一种电子设备,所述电子设备包括:The present application also provides an electronic device, the electronic device comprising:
至少一个处理器;以及,at least one processor; and,
与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
所述存储器存储有可被所述至少一个处理器执行的计算机程序,所述计算机程序被所述至少一个处理器执行,以使所述至少一个处理器能够执行如下步骤:The memory stores a computer program executable by the at least one processor, the computer program is executed by the at least one processor, so that the at least one processor can perform the following steps:
获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱;Obtaining a multi-source information set, extracting a risk-aware factor set from the multi-source information set, and constructing a time-series knowledge map based on the risk-aware factor set;
利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;Complementing the implicit relationship in the time-series knowledge graph by using a preset implicit relationship complement algorithm to obtain a standard knowledge graph;
基于预设的强化学习算法构建风险预测模型;Build a risk prediction model based on a preset reinforcement learning algorithm;
利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;Using the risk prediction model to perform risk prediction on the entities in the standard knowledge graph to obtain a risk probability, and use the entity whose risk probability is greater than or equal to a preset probability threshold as a target risk entity;
利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;Using a preset causality supplement algorithm to supplement the time series knowledge map with causality to obtain an event map;
利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;Using a preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard;
基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型;Based on the event map, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised method training to obtain a macro-prediction model;
利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
本申请还提供一种计算机可读存储介质,所述计算机可读存储介质中存储有至少一个计算机程序,所述至少一个计算机程序被电子设备中的处理器执行以实现如下步骤:The present application also provides a computer-readable storage medium, at least one computer program is stored in the computer-readable storage medium, and the at least one computer program is executed by a processor in an electronic device to implement the following steps:
获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱;Obtaining a multi-source information set, extracting a risk-aware factor set from the multi-source information set, and constructing a time-series knowledge map based on the risk-aware factor set;
利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;Complementing the implicit relationship in the time-series knowledge graph by using a preset implicit relationship complement algorithm to obtain a standard knowledge graph;
基于预设的强化学习算法构建风险预测模型;Build a risk prediction model based on a preset reinforcement learning algorithm;
利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;Using the risk prediction model to perform risk prediction on the entities in the standard knowledge graph to obtain a risk probability, and use the entity whose risk probability is greater than or equal to a preset probability threshold as a target risk entity;
利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;Using a preset causality supplement algorithm to supplement the time series knowledge map with causality to obtain an event map;
利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;Using a preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard;
基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型;Based on the event map, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised method training to obtain a macro-prediction model;
利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
附图说明Description of drawings
图1为本申请一实施例提供的风险预测方法的流程示意图;Fig. 1 is a schematic flow chart of a risk prediction method provided by an embodiment of the present application;
图2为本申请一实施例提供的风险预测装置的功能模块图;FIG. 2 is a functional block diagram of a risk prediction device provided by an embodiment of the present application;
图3为本申请一实施例提供的实现所述风险预测方法的电子设备的结构示意图。Fig. 3 is a schematic structural diagram of an electronic device implementing the risk prediction method provided by an embodiment of the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional features and advantages of the present application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
具体实施方式Detailed ways
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.
本申请实施例提供一种风险预测方法。所述风险预测方法的执行主体包括但不限于服务端、终端等能够被配置为执行本申请实施例提供的该方法的电子设备中的至少一种。换言之,所述风险预测方法可以由安装在终端设备或服务端设备的软件或硬件来执行,所述软件可以是区块链平台。所述服务端包括但不限于:单台服务器、服务器集群、云端服务器或云端服务器集群等。所述服务器可以是独立的服务器,也可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、内容分发网络(ContentDelivery Network,CDN)、以及大数据和人工智能平台等基础云计算服务的云服务器。The embodiment of this application provides a risk prediction method. The subject of execution of the risk prediction method includes but is not limited to at least one of electronic devices such as a server and a terminal that can be configured to execute the method provided by the embodiment of the present application. In other words, the risk prediction method can be executed by software or hardware installed on the terminal device or server device, and the software can be a block chain platform. The server includes, but is not limited to: a single server, a server cluster, a cloud server or a cloud server cluster, and the like. The server can be an independent server, or it can provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, content delivery network (ContentDelivery Network) , CDN), and cloud servers for basic cloud computing services such as big data and artificial intelligence platforms.
参照图1所示,为本申请一实施例提供的风险预测方法的流程示意图。在本实施例中,所述风险预测方法包括:Referring to FIG. 1 , it is a schematic flowchart of a risk prediction method provided by an embodiment of the present application. In this embodiment, the risk prediction method includes:
S1、获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱。S1. Acquire a multi-source information set, extract a risk perception factor set from the multi-source information set, and construct a time-series knowledge graph based on the risk perception factor set.
本申请实施例中,可以从多源异构信息源中获取多源信息集,其中,所述多源异构信息源来自于互联网公开数据、业务数据和政府数据等多个渠道,包括但不限于视频、电子图片、遥感影像、文本等多种类型。In the embodiment of this application, the multi-source information set can be obtained from multi-source heterogeneous information sources, wherein the multi-source heterogeneous information sources come from multiple channels such as Internet public data, business data and government data, including but not It is limited to various types such as video, electronic pictures, remote sensing images, and texts.
具体地,所述从所述多源信息集中提取风险感知因子集,包括:Specifically, the extracting the risk perception factor set from the multi-source information set includes:
识别所述多源信息集中的文本信息和图像信息;identifying text information and image information in the multi-source information set;
利用预设的自然语言处理技术对所述文本信息进行因子提取,得到文本感知因子集;performing factor extraction on the text information using a preset natural language processing technology to obtain a text-aware factor set;
利用预设的图像识别技术对所述图像信息进行因子提取,得到图像感知因子集;performing factor extraction on the image information by using a preset image recognition technology to obtain an image perception factor set;
将所述文本感知因子集和所述图像感知因子集进行汇总,得到风险感知因子集。Summarizing the text perception factor set and the image perception factor set to obtain a risk perception factor set.
详细地,所述多源信息集中的信息包含不同的类型,筛选出所述多源信息集中的文本信息和图像信息,利用预设的自然语言处理技术对所述文本信息进行因子提取,其中,所述自然语言处理技术可以为利用Word2Vec算法进行词嵌入,利用FastText模型或者ELMO(Embedding from language models,双向语言模型)模型进行文本特征编码,利用BERT(Bidirectional Encoder Representations from Transformer,双向编码器表征)模型进行文本特征提取。利用预设的图像识别技术对所述图像信息进行因子提取,其中,所述图像识别技术可以为YOLO V2~V4或者SSD等目标检测算法、AlexNet或者ResNet等目标识别算法、弱监督的语义分割方法等语义分割算法。In detail, the information in the multi-source information set contains different types, and the text information and image information in the multi-source information set are screened out, and the text information is factor-extracted using a preset natural language processing technology, wherein, Described natural language processing technique can be to utilize Word2Vec algorithm to carry out word embedding, utilize FastText model or ELMO (Embedding from language models, two-way language model) model to carry out text feature encoding, utilize BERT (Bidirectional Encoder Representations from Transformer, two-way encoder representation) The model performs text feature extraction. Use preset image recognition technology to extract factors from the image information, wherein the image recognition technology can be target detection algorithms such as YOLO V2-V4 or SSD, target recognition algorithms such as AlexNet or ResNet, and weakly supervised semantic segmentation methods and other semantic segmentation algorithms.
在本方案中,对于图片信息而言,例如可以利用卫星图像识别技术、光学字符识别(OCR)并结合NLP技术等技术完成信息提取。例如可以从超高分辨率卫星图像中识别农作物、航运货物、海陆运输等目标,进而对经济生产重要环节走势变化做出预警,可以使用OCR技术从财务票据、交易票据等非标准信息中提取用于风险审核的重要信息,而夜间灯光遥感数据则可用来动态预测人口密度、城市扩张速度。对于文本信息内容,可以利用自然语言处理(NLP)结合机器学习等技术完成信息提取。如可以从新闻、舆情、论坛资讯类文本数据中实时识别金融实体、发现金融事件的关联关系,提取刻画经济不确定性等的相关因子;从上市公司年报、IPO招股说明书和公司前瞻性陈述类文本数据,可挖掘企业收入、业务发展规模、公司发展战略倾向等信息;也可从社交媒体类文本信息中,包括推特、微博、微信公众号和论坛帖子等,提取事件倾向评分、关注度指数、风险波动率等因子。In this solution, for image information, for example, satellite image recognition technology, optical character recognition (OCR) and NLP technology can be used to complete information extraction. For example, targets such as crops, shipping goods, and sea and land transportation can be identified from ultra-high-resolution satellite images, and then early warnings can be made for changes in important links of economic production. OCR technology can be used to extract useful information from non-standard information such as financial notes and transaction notes Important information for risk assessment, while nighttime light remote sensing data can be used to dynamically predict population density and urban expansion speed. For text information content, natural language processing (NLP) combined with machine learning and other technologies can be used to complete information extraction. For example, it is possible to identify financial entities in real time from text data such as news, public opinion, and forum information, discover the relationship between financial events, and extract relevant factors that describe economic uncertainty; from listed company annual reports, IPO prospectuses, and company forward-looking statements Text data, which can mine information such as enterprise income, business development scale, and company development strategy tendency; it can also extract event tendency scores, attention Degree index, risk volatility and other factors.
进一步地,所述基于所述风险感知因子集构建时序知识图谱,包括:Further, the constructing a time series knowledge map based on the risk perception factor set includes:
抽取所述风险感知因子集中的实体和实体关系;extracting entities and entity relationships in the set of risk perception factors;
基于所述实体和所述实体关系进行图谱构建,得到时序知识图谱。A graph is constructed based on the entity and the entity relationship to obtain a time series knowledge graph.
其中,利用Word2Vec或者LSTM+CRF进行实体抽取,进行实体关系抽取的算法包括但不限于以下:监督学习(SVM,NN,朴素贝叶斯)、半监督学习(远程监督,Bootstrapping)、深度学习(Pipeline如Att-CNN&Att-BLSTM,Joint Model如LSTM-RNNs)。利用基于规则的推理SWRL或者基于图的推理Path Ranking进行图谱构建,得到时序知识图谱。Among them, using Word2Vec or LSTM+CRF for entity extraction, the algorithm for entity relationship extraction includes but is not limited to the following: supervised learning (SVM, NN, naive Bayesian), semi-supervised learning (distance supervision, Bootstrapping), deep learning ( Pipeline such as Att-CNN&Att-BLSTM, Joint Model such as LSTM-RNNs). Use rule-based reasoning SWRL or graph-based reasoning Path Ranking to construct graphs to obtain time-series knowledge graphs.
例如,以PPP(Public-Private Partnership,政府和社会资本合作)合作模式为诱因引发的“东方园林”2018年发债事件为例,所述实体可以为东方园林、10亿公司债等,所述实体关系可以为宣布、发行等。For example, taking the PPP (Public-Private Partnership, government and social capital cooperation) cooperation model as an example of the bond issuance event of "Oriental Garden" in 2018, the entity mentioned can be Oriental Garden, 1 billion corporate bonds, etc. An entity relationship can be Announcement, Issue, etc.
S2、利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱。S2. Completing the implicit relationship in the time-series knowledge graph by using the preset implicit relationship complement algorithm to obtain a standard knowledge graph.
本申请实施例中,通过补全所述时序知识图谱中的隐含关系可以丰富和校准所述时序知识图谱中各个实体之间的关系,扩宽所述时序知识图谱的应用范围。In the embodiment of the present application, by completing the implicit relationship in the time-series knowledge graph, the relationship between entities in the time-series knowledge graph can be enriched and calibrated, and the application range of the time-series knowledge graph can be broadened.
其中,所述隐含关系是指实体之间难以直接获得的实体关系,所述时序知识图谱中包含多个实体和实体之间的关系,但时序知识图谱中的实体之间的关系为较为明显的观测得到的关系,因此需要对各个实体进行深度挖掘,得到隐含关系,并将所述隐含关系补充至所述时序知识图谱中,得到标准知识图谱。Wherein, the implicit relationship refers to the entity relationship that is difficult to obtain directly between entities, and the time-series knowledge graph contains multiple entities and the relationship between entities, but the relationship between entities in the time-series knowledge graph is more obvious Therefore, it is necessary to deeply mine each entity to obtain the implicit relationship, and add the implicit relationship to the time-series knowledge graph to obtain a standard knowledge graph.
具体地,所述利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱,包括:Specifically, the use of the preset implicit relationship complement algorithm to complement the implicit relationship in the time-series knowledge graph to obtain a standard knowledge graph includes:
基于预设的稀疏图卷积网络对所述时序知识图谱进行图稀疏化处理,得到稀疏知识图谱;performing graph sparse processing on the time-series knowledge graph based on a preset sparse graph convolutional network to obtain a sparse knowledge graph;
利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系;Using the trained relational graph convolutional network to predict the relationship of the sparse knowledge graph to obtain the hidden relationship;
将所述隐含关系补全在所述时序知识图谱中,得到标准知识图谱。Completing the implicit relationship in the time series knowledge graph to obtain a standard knowledge graph.
详细地,本方案采用图卷积网络模型对所述时序知识图谱进行隐含关系的补全,主要在于先对所述时序知识图谱进行图稀疏化处理,并将经过图稀疏化后得到的稀疏知识图谱作为关系图卷积网络的输入进行关系预测。这种方法可以避免在观测的实体关系样本不足、实体关系类型多样的情况下出现的由于特征稀疏而造成模型预测精度较低的问题,同时可以解决随着图形的大小增长,大型图上的节点分类和关系预测等功能也可能占用大量的时间和空间的问题。In detail, this solution uses the graph convolutional network model to complete the implicit relationship of the time-series knowledge graph, mainly in that the graph sparse processing is performed on the time-series knowledge graph first, and the sparse data obtained after the graph sparse Knowledge graphs are used as input to relational graph convolutional networks for relational prediction. This method can avoid the problem of low prediction accuracy of the model due to sparse features when the observed entity relationship samples are insufficient and the entity relationship types are diverse. At the same time, it can solve the problem of nodes on large graphs as the size of the graph increases. Functions such as classification and relation prediction can also be time and space intensive problems.
其中,图卷积网络模型会将输入的知识图谱转换为稀疏知识图谱并进行下一步的关系预测。Among them, the graph convolutional network model converts the input knowledge map into a sparse knowledge map and performs the next step of relationship prediction.
进一步地,所述基于预设的稀疏图卷积网络对所述时序知识图谱进行图稀疏化处理,得到稀疏知识图谱,包括:Further, the preset sparse graph convolutional network performs graph sparse processing on the time series knowledge graph to obtain a sparse knowledge graph, including:
确定所述时序知识图谱对应的邻接矩阵及特征矩阵,并获取所述稀疏图卷积网络的预设权重矩阵;Determine the adjacency matrix and feature matrix corresponding to the time series knowledge graph, and obtain the preset weight matrix of the sparse graph convolutional network;
基于所述邻接矩阵、所述特征矩阵和所述预设权重矩阵构建得到稀疏输出函数;constructing a sparse output function based on the adjacency matrix, the feature matrix and the preset weight matrix;
利用基于乘数的交替方向算法对所述稀疏输出函数进行优化处理,将优化后的所述稀疏输出函数中的变量对所述时序知识图谱进行变量更新,得到稀疏知识图谱。Optimizing the sparse output function by using a multiplier-based alternating direction algorithm, updating variables in the optimized sparse output function on the time-series knowledge graph to obtain a sparse knowledge graph.
详细地,若所述时序知识图谱为G=(V,E),其中,V={v 1,…,v n}为所述时序知识图谱中实体的集合,E={e 1,…,e m}为所述时序知识图谱中实体与实体之间关系的集合,而所述时序知识图谱对应的邻接矩阵即为存放实体与实体之间关系的二维数组。所述特征矩阵为每个实体对应的节点的特征,表示为X(v)=[x 1,…,x k],当所述稀疏图卷积网络中包含两层子网络时,所述稀疏图卷积网络的预设权重矩阵可以为W (0)和W (1)In detail, if the time-series knowledge graph is G=(V,E), where V={v 1 ,...,v n } is the set of entities in the time-series knowledge graph, E={e 1 ,..., e m } is a set of relationships between entities in the time-series knowledge graph, and the adjacency matrix corresponding to the time-series knowledge graph is a two-dimensional array storing the relationship between entities. The feature matrix is the feature of the node corresponding to each entity, expressed as X(v)=[x 1 ,...,x k ], when the sparse graph convolutional network contains two layers of sub-networks, the sparse The preset weight matrix of the graph convolutional network can be W (0) and W (1) .
其中,对于所述时序知识图谱为G=(V,E)来说,其对应的邻接矩阵为:Wherein, for the time series knowledge map is G=(V, E), its corresponding adjacency matrix is:
Figure PCTCN2022071241-appb-000001
Figure PCTCN2022071241-appb-000001
具体地,所述基于所述邻接矩阵、所述特征矩阵和所述预设权重矩阵构建得到稀疏输出函数,包括:Specifically, the construction of the sparse output function based on the adjacency matrix, the feature matrix and the preset weight matrix includes:
Figure PCTCN2022071241-appb-000002
Figure PCTCN2022071241-appb-000002
Figure PCTCN2022071241-appb-000003
Figure PCTCN2022071241-appb-000003
Figure PCTCN2022071241-appb-000004
Figure PCTCN2022071241-appb-000004
Figure PCTCN2022071241-appb-000005
Figure PCTCN2022071241-appb-000005
其中,
Figure PCTCN2022071241-appb-000006
为所述稀疏输出函数,W为所述预设权重矩阵,
Figure PCTCN2022071241-appb-000007
为更新邻接矩阵,A为所述邻接矩阵,I N为固定参数,diag为对角矩阵,W (0)和W (1)为所述稀疏图卷积网络的预设权重矩阵,ReLU为线性整流函数,X为所述特征矩阵。
in,
Figure PCTCN2022071241-appb-000006
is the sparse output function, W is the preset weight matrix,
Figure PCTCN2022071241-appb-000007
To update the adjacency matrix, A is the adjacency matrix, IN is a fixed parameter, diag is a diagonal matrix, W (0) and W (1) are the preset weight matrices of the sparse graph convolutional network, and ReLU is linear rectification function, X is the feature matrix.
进一步地,所述利用基于乘数的交替方向算法对所述稀疏输出函数进行优化处理,包括:Further, the optimization of the sparse output function by using the multiplier-based alternating direction algorithm includes:
对所述稀疏输出函数进行简化处理,得到简化输出函数;Simplifying the sparse output function to obtain a simplified output function;
利用自适应矩估计优化器对所述简化输出函数中的变量的梯度进行更新,得到更新后的变量的梯度。An adaptive moment estimation optimizer is used to update the gradients of the variables in the simplified output function to obtain the updated gradients of the variables.
详细地,对所述稀疏输出函数进行简化处理是指所述稀疏图卷积网络的输出依赖于
Figure PCTCN2022071241-appb-000008
和W,而但是
Figure PCTCN2022071241-appb-000009
可以被表示为关于A的函数,则输出可以被表示为关于A和W的函数,即Z(A,W),因为W保持不变,Z(A,W)可被简化为Z(A)。
In detail, simplifying the sparse output function means that the output of the sparse graph convolutional network depends on
Figure PCTCN2022071241-appb-000008
and W, but
Figure PCTCN2022071241-appb-000009
can be expressed as a function of A, then the output can be expressed as a function of A and W, that is, Z(A,W), because W remains unchanged, Z(A,W) can be simplified to Z(A) .
其中,所述简化输出函数为:Wherein, the simplified output function is:
Figure PCTCN2022071241-appb-000010
Figure PCTCN2022071241-appb-000010
具体地,利用自适应矩估计优化器对所述简化输出函数中的变量的梯度进行更新,将优化后的所述稀疏输出函数中的变量对所述时序知识图谱进行变量更新,得到稀疏知识图谱。Specifically, the adaptive moment estimation optimizer is used to update the gradient of the variables in the simplified output function, and the variables in the optimized sparse output function are updated to the time-series knowledge graph to obtain a sparse knowledge graph .
详细地,利用基于乘数的交替方向算法(ADMN)进行优化处理可以保留所述时序知识图谱中的结构和层次信息的网络主干,并在保持节点分类预测的性能的同时又保留的时序知识图谱的边缘信息。In detail, the optimization process using the multiplier-based Alternating Direction Algorithm (ADMN) can preserve the network backbone of the structural and hierarchical information in the temporal knowledge graph, and preserve the performance of the temporal knowledge graph while maintaining the performance of node classification prediction. edge information.
进一步地,所述利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系之前,所述方法还包括:Further, before using the trained relational graph convolutional network to perform relation prediction on the sparse knowledge map, and obtain the hidden relation, the method further includes:
利用预设的关系图卷积网络中的实体编码器对所述稀疏知识图谱进行特征预测,得到所述稀疏知识图谱中的实体对应的潜在特征;Using the entity encoder in the preset relational graph convolution network to perform feature prediction on the sparse knowledge graph, and obtain the potential features corresponding to the entities in the sparse knowledge graph;
基于所述关系图卷积网络中的解码器对所述实体对应的潜在特征进行评分,将所述评分大于或者等于预设的评分阈值的对应潜在特征作为目标潜在特征;Scoring the latent features corresponding to the entity based on the decoder in the relational graph convolutional network, and using the corresponding latent features whose scores are greater than or equal to a preset scoring threshold as target latent features;
根据所述目标潜在特征和预设的交叉熵损失函数计算得到交叉熵损失值;Calculate and obtain a cross-entropy loss value according to the target latent feature and a preset cross-entropy loss function;
当所述交叉熵损失值小于或者等于预设的损失阈值时,将所述关系图卷积网络输出为训练好的关系图卷积网络。When the cross-entropy loss value is less than or equal to a preset loss threshold, the relational graph convolutional network is output as a trained relational graph convolutional network.
其中,所述关系图卷积网络中包含实体编码器和解码器,所述实体编码器用于产生实体的潜在特征表示,所述解码器用于通过一个评分函数对所述潜在特征表示进行评分处理。Wherein, the relational graph convolutional network includes an entity encoder and a decoder, the entity encoder is used to generate a latent feature representation of an entity, and the decoder is used to score the latent feature representation through a scoring function.
具体地,利用预设的关系图卷积网络中的实体编码器对所述稀疏知识图谱进行特征预测即利用R-GCN作为编码器生成每个实体的实向量表示e i。其中,R-GCN模型按照预设的方式堆叠L层,R-GCN模型中上一层的输出会作为下一层的输入,所述实体编码器使用R-GCN的输出作为每个实体的向量表示,即
Figure PCTCN2022071241-appb-000011
是节点v i在第l层 神经网络中的隐向量(hidden state),d (l)是该层向量表示的维度。
Specifically, the entity encoder in the preset relational graph convolutional network is used to perform feature prediction on the sparse knowledge graph, that is, the R-GCN is used as the encoder to generate a real vector representation e i of each entity. Among them, the R-GCN model stacks L layers according to the preset method, the output of the previous layer in the R-GCN model will be used as the input of the next layer, and the entity encoder uses the output of R-GCN as the vector of each entity means that
Figure PCTCN2022071241-appb-000011
is the hidden vector (hidden state) of node v i in the l-layer neural network, and d (l) is the dimension represented by the vector of this layer.
进一步地,所述基于所述关系图卷积网络中的解码器对所述实体对应的潜在特征进行评分,包括:Further, the scoring of the latent features corresponding to the entity based on the decoder in the relational graph convolutional network includes:
利用所述解码器中的DistMult因子分解模型对所述实体对应的潜在特征进行评分。The latent features corresponding to the entities are scored using the DistMult factorization model in the decoder.
详细地,DistMult因子分解模型是语义匹配模型的一种,语义匹配模型利用基于相似性的评分函数,通过匹配实体和关系在隐空间的潜在语义衡量此三元组存在或成立的可能性。In detail, the DistMult factorization model is a kind of semantic matching model. The semantic matching model uses a similarity-based scoring function to measure the possibility of the existence or establishment of this triple by matching the latent semantics of entities and relations in the latent space.
具体地,利用所述解码器中的DistMult因子分解模型对所述实体对应的潜在特征进行评分,包括:Specifically, use the DistMult factorization model in the decoder to score the potential features corresponding to the entity, including:
Figure PCTCN2022071241-appb-000012
Figure PCTCN2022071241-appb-000012
其中,
Figure PCTCN2022071241-appb-000013
是头实体s的隐性向量表示,
Figure PCTCN2022071241-appb-000014
是尾实体o的隐向量表示,
Figure PCTCN2022071241-appb-000015
是对于关系类型r的邻接矩阵,d是实体向量的维度。
in,
Figure PCTCN2022071241-appb-000013
is the implicit vector representation of the head entity s,
Figure PCTCN2022071241-appb-000014
is the implicit vector representation of the tail entity o,
Figure PCTCN2022071241-appb-000015
is the adjacency matrix for relation type r, and d is the dimension of the entity vector.
进一步地,所述根据所述目标潜在特征和预设的交叉熵损失函数计算得到交叉熵损失值,包括:Further, the cross-entropy loss value calculated according to the target potential feature and the preset cross-entropy loss function includes:
所述交叉熵损失函数为:The cross-entropy loss function is:
Figure PCTCN2022071241-appb-000016
Figure PCTCN2022071241-appb-000016
其中,
Figure PCTCN2022071241-appb-000017
为所述交叉熵损失值,
Figure PCTCN2022071241-appb-000018
是所有正负三元组样本的集合,对于集合中的每个元素(s,r,o,y),
Figure PCTCN2022071241-appb-000019
分别为头实体和尾实体,
Figure PCTCN2022071241-appb-000020
为关系类型,y则是一个指示器,y=1表示正样本,y=0表示负样本。
in,
Figure PCTCN2022071241-appb-000017
is the cross-entropy loss value,
Figure PCTCN2022071241-appb-000018
Is the set of all positive and negative triplet samples, for each element (s, r, o, y) in the set,
Figure PCTCN2022071241-appb-000019
are head entity and tail entity, respectively,
Figure PCTCN2022071241-appb-000020
is the relationship type, y is an indicator, y=1 means a positive sample, and y=0 means a negative sample.
具体地,利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系,并将所述隐含关系补全在所述时序知识图谱中,得到标准知识图谱。Specifically, use the trained relational graph convolutional network to predict the relationship of the sparse knowledge graph to obtain implicit relationships, and complete the implicit relationships in the time-series knowledge graph to obtain a standard knowledge graph.
例如,对所述稀疏知识图谱进行关系预测,得到隐含关系为“用PPP模式引发东方园林事件”,则将所述隐含关系“用PPP模式引发东方园林事件”补充在所述时序知识图谱中,得到标准知识图谱。For example, the relationship prediction is carried out on the sparse knowledge map, and the implicit relationship is obtained as "using the PPP mode to trigger the oriental garden event", then the implicit relationship "using the PPP mode to trigger the oriental garden event" is added to the time series knowledge map In the process, the standard knowledge graph is obtained.
S3、基于预设的强化学习算法构建风险预测模型。S3. Construct a risk prediction model based on a preset reinforcement learning algorithm.
本申请实施例中,预设的强化学习算法是一个可以应用于顺序决策和控制任务的框架,其中,强化学习算法中的代理(Agent)通过与环境(Environment)交互来优化其行为。In the embodiment of the present application, the preset reinforcement learning algorithm is a framework that can be applied to sequential decision-making and control tasks, wherein the agent (Agent) in the reinforcement learning algorithm optimizes its behavior by interacting with the environment (Environment).
具体地,所述基于预设的强化学习算法构建风险预测模型,包括:Specifically, the construction of a risk prediction model based on a preset reinforcement learning algorithm includes:
获取原始风险状态数据,对所述原始风险状态数据进行抽样处理,得到预训练数据;Obtaining original risk state data, performing sampling processing on the original risk state data, and obtaining pre-training data;
利用预设的深度神经网络对所述预训练数据进行拟合处理,得到所述预训练数据对应的状态动作;Using a preset deep neural network to perform fitting processing on the pre-training data to obtain state actions corresponding to the pre-training data;
获取执行所述状态动作下的初始风险状态数据,并计算所述初始风险状态数据和所述原始风险状态数据之间的时间差分;Acquiring initial risk state data under execution of the state action, and calculating a time difference between the initial risk state data and the original risk state data;
以所述时间差分作为目标函数,以所述强化学习算法为框架训练得到风险预测模型。The time difference is used as an objective function, and the reinforcement learning algorithm is used as a framework to train a risk prediction model.
其中,所述原始风险状态数据是指现在数据所属的风险情况的标识数据。可以抽取符合预设的抽样标准的数据得到预训练数据,例如,抽取风险情况满足高风险和中风险条件的数据作为预训练数据。Wherein, the original risk status data refers to the identification data of the risk situation to which the current data belongs. Pre-training data can be obtained by extracting data conforming to preset sampling standards, for example, extracting data whose risk profile satisfies high-risk and medium-risk conditions as pre-training data.
详细地,所述强化学习算法为Actor-Critic算法,在基于策略梯度的Actor-Critic方法中,代理的角色被分为了参与者(Actor)和批评者(Critic),本质上Actor和Critic分别代表策略(Policy)和值函数(Value function)。给定当前状态x,参与者仅负责生成动作u。批评者负责处理收到的奖励r,即通过调整值函数来评估当前策略的质量。在批评者执行了多个策略评估步骤后,通过使用来自批评者的信息以更新参与者。In detail, the reinforcement learning algorithm is the Actor-Critic algorithm. In the Actor-Critic method based on the policy gradient, the role of the agent is divided into a participant (Actor) and a critic (Critic). In essence, Actor and Critic respectively represent Policy and Value function. Given a current state x, an actor is only responsible for generating action u. The critic is responsible for processing the received reward r, i.e. evaluating the quality of the current policy by adjusting the value function. After the critic has performed multiple policy evaluation steps, the participants are updated by using information from the critic.
S4、利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体。S4. Use the risk prediction model to perform risk prediction on the entities in the standard knowledge graph to obtain risk probabilities, and use entities whose risk probabilities are greater than or equal to a preset probability threshold as target risk entities.
本申请实施例中,可以利用所述风险预测模型对所述标准知识图谱中的多个不同的实体进行风险预测,将所述实体输入至所述风险预测模型中,得到所述实体对应的风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体。In the embodiment of the present application, the risk prediction model can be used to perform risk prediction on a plurality of different entities in the standard knowledge map, and the entities are input into the risk prediction model to obtain the risk corresponding to the entity probability, and the entity whose risk probability is greater than or equal to the preset probability threshold is taken as the target risk entity.
其中,所述标准知识图谱中的实体包含不同类型的企业。Wherein, the entities in the standard knowledge graph include different types of enterprises.
例如,在本方案中,即根据所述风险预测模型预测出所述标准知识图谱中风险概率大于或者等于所述预设概率阈值的企业作为目标风险企业。For example, in this solution, an enterprise whose risk probability in the standard knowledge graph is predicted to be greater than or equal to the preset probability threshold according to the risk prediction model is used as a target risk enterprise.
S5、利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱。S5. Using a preset causality supplement algorithm to supplement the time series knowledge graph with causality to obtain an event graph.
本申请实施例中,所述因果关系补充是指将所述时序知识图谱中各个实体之间的因果顺序等关系进行补充。In the embodiment of the present application, the causal relationship supplement refers to supplementing the causal sequence and other relationships among entities in the time series knowledge graph.
具体地,所述利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱,包括:Specifically, the use of the preset causality supplement algorithm to supplement the time-series knowledge map with causality to obtain the event map includes:
获取训练文本集,对所述训练文本集进行事件抽取及因果关系归纳,得到多个因果三元组;Obtain a training text set, perform event extraction and causal relationship induction on the training text set, and obtain multiple causal triplets;
保留所述因果三元组中符合预设筛选标准的多个因果事件作为标准三元组;Retaining a plurality of causal events that meet preset screening criteria in the causal triplet as a standard triplet;
对多个所述标准三元组进行事件融合,得到融合事件,并将所述融合事件补充至所述时序知识图谱中,得到事理图谱。Event fusion is performed on multiple standard triples to obtain fusion events, and the fusion events are added to the time series knowledge graph to obtain an event graph.
详细地,事件是指发生在某个特定时间点或者时间段、某个特定的地域范围,由一个或者多个角色参与的一个或者多个动作组成的事情或者状态的改变,因果关系是指两个事件之间一个事件导致或引起或造成了另一个事件,所述因果关系包含正向、负向、显性以及隐性关系,并且包含了其他例如排除转折、并列等可以帮助事件融合和推理的各种关系。所述因果关系归纳即将从所述训练文本集中抽取的具有因果关系的两个事件形成“原因事件-关系-结果事件”的三元组的数据形式的因果三元组。In detail, an event refers to an event or state change that occurs at a specific point in time or a period of time, or in a specific geographical area, and consists of one or more actions involving one or more roles. One event causes or causes or causes another event between two events. The causal relationship includes positive, negative, explicit and implicit relationships, and includes other things such as exclusion of turning, juxtaposition, etc., which can help event fusion and reasoning various relationships. The causality induction is to form a causal triplet in the data form of a triplet of "causal event-relationship-result event" from two events with causal relationship extracted from the training text set.
具体地,可以通过预训练模型完成事件抽取及因果关系归纳,其中,所述预训练模型中的基础模型采用BERT+CRF的结构思想,实际上以BERT改进版本RoBERTa作为预训练模型。输入为词嵌入向量,通过段落嵌入、位置嵌入和标点嵌入后,再经过多个transformer模块,输出多个隐向量。然后通过Seq2Seq模型完成序列标签生成任务。此外,因为输入的每个字都会得到相应的标签(例如事件开头字、事件中间字、事件结尾字、其他字等),此模型同时能完成事件的标准化表达以及除去副词、助词等任务。该联合模型充分运用了预训练模型的语义信息,在事件抽取和归纳上都取得了较好的效果。Specifically, event extraction and causality induction can be completed through a pre-training model, wherein the basic model in the pre-training model adopts the structural idea of BERT+CRF, and in fact, RoBERTa, an improved version of BERT, is used as the pre-training model. The input is a word embedding vector. After paragraph embedding, position embedding and punctuation embedding, multiple transformer modules are used to output multiple hidden vectors. Then the sequence label generation task is completed by the Seq2Seq model. In addition, because each input word will get a corresponding label (such as the beginning word of the event, the middle word of the event, the ending word of the event, other words, etc.), this model can also complete the standardized expression of events and remove adverbs, particles and other tasks. The joint model makes full use of the semantic information of the pre-trained model, and achieves good results in both event extraction and induction.
详细地,所述预设筛选标准为提前构建的规则或模板过滤不符合表达习惯或者表述不完整的事件节点,在得到标准化的事件表示和因果关系后,使用提前构建的规则或模板过滤不符合表达习惯或者表述不完整的事件节点。筛选后的不同事件节点可能因为其表义相同,指代的是现实世界中的同一实体,为此需要进行事件融合,得到融合事件。In detail, the preset screening criteria are pre-constructed rules or templates to filter event nodes that do not conform to expression habits or incomplete expressions, and after obtaining standardized event representation and causal relationship, use pre-constructed rules or templates to filter events Express custom or incomplete event nodes. Different event nodes after screening may refer to the same entity in the real world because they have the same meaning. Therefore, event fusion is required to obtain fusion events.
进一步地,所述对多个所述标准三元组进行事件融合,得到融合事件,包括:Further, performing event fusion on multiple standard triples to obtain a fusion event includes:
对多个所述标准三元组进行向量化,得到多个三元向量组;vectorizing multiple standard triples to obtain multiple triple vector groups;
计算多个所述三元向量组中事件之间的相似度;calculating the similarity between events in a plurality of said triplet vector groups;
若所述相似度大于预设的第一阈值时,将所述相似度对应的标准三元组划分至第一簇事件中;If the similarity is greater than a preset first threshold, the standard triplet corresponding to the similarity is divided into the first cluster of events;
若所述相似度小于预设的第二阈值时,将所述相似度对应的标准三元组划分至第二簇事件中;If the similarity is less than a preset second threshold, dividing the standard triplet corresponding to the similarity into a second cluster of events;
若所述相似度小于所述第一阈值且大于所述第二阈值时,将所述相似度对应的标准三元组划分至缓冲簇事件中。If the similarity is less than the first threshold and greater than the second threshold, classify the standard triplet corresponding to the similarity into buffered cluster events.
详细地,增量聚类算法可以得到实时的聚类结果,所以对部分事件样本进行聚类以后,可从聚类结果中抽取样本扩充训练集,对模型进行再训练,使模型能学习新的事件和文本 特征,增强聚类效果。最后,在上述步骤结束后,将聚类结果中所含样本较少的所有簇与Buffer结合作为不确定样本,利用多次训练后的模型对这些样本进行聚类,得到最终的聚类结果,从而完成此部分事件融合的过程。In detail, the incremental clustering algorithm can obtain real-time clustering results, so after clustering some event samples, samples can be extracted from the clustering results to expand the training set and retrain the model so that the model can learn new Event and text features to enhance the clustering effect. Finally, after the above steps are completed, combine all the clusters with fewer samples in the clustering result with Buffer as uncertain samples, and use the model after multiple trainings to cluster these samples to obtain the final clustering result. Thus, the process of this part of event fusion is completed.
S6、利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度。S6. Use the preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the dependency closeness, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the event hazard degree.
本申请实施例中,所述利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,包括:In the embodiment of the present application, the use of the preset social network analysis algorithm to quantify the relationship of the standard knowledge map to obtain the closeness of dependence includes:
根据度中心性和模块化分类度量的定义确定所述标准知识图谱的度中心性及模块化分类度量;Determine the degree centrality and modular classification metrics of the standard knowledge graph according to the definitions of degree centrality and modular classification metrics;
利用预设的接近中心性计算公式计算所述标准知识图谱的接近中心性,并将所述度中心性、所述模块化分类度量和所述接近中心性代入至预设的依存紧密度计算公式中,得到依存紧密度。Calculate the proximity centrality of the standard knowledge graph using a preset proximity centrality calculation formula, and substitute the degree centrality, the modular classification measure, and the proximity centrality into the preset dependency closeness calculation formula , get the dependency closeness.
详细地,所述度中心性(Degree centrality)是指一个节点相连的边的数量,用于表示节点的连接程度,所述模块化分类度量(Modularity class)用于社区检测,用来衡量社区划分质量或稳定性,模块化分类度量等于组内的边数减去随机设置边的等效网络中的预期边数。所述接近中心性(Betweenness centrality)衡量了一个节点到达其他节点的难易程度。In detail, the degree centrality (Degree centrality) refers to the number of edges connected to a node, which is used to represent the connection degree of nodes, and the modularity class is used for community detection, and is used to measure community division Quality or stability, a modular classification measure equal to the number of edges within a group minus the expected number of edges in an equivalent network with randomly set edges. The Betweenness centrality measures how easy it is for a node to reach other nodes.
具体地,所述利用预设的接近中心性计算公式计算所述标准知识图谱的接近中心性,包括:Specifically, the calculation of the proximity centrality of the standard knowledge graph using a preset proximity centrality calculation formula includes:
Figure PCTCN2022071241-appb-000021
Figure PCTCN2022071241-appb-000021
其中,C B(v)代表节点v的接近中心性值,σ st(v)代表从节点s到节点t之间经过v的最短路径数之和,σ st代表从节点s到节点t之间的所有最短路径数之和,v,s,t∈V。 Among them, C B (v) represents the proximity centrality value of node v, σ st (v) represents the sum of the shortest paths from node s to node t passing through v, and σ st represents the distance between node s and node t The sum of all the shortest path numbers of , v,s,t∈V.
进一步地,所述预设的依存紧密度计算公式,包括:Further, the preset calculation formula of dependence closeness includes:
Figure PCTCN2022071241-appb-000022
Figure PCTCN2022071241-appb-000022
其中,T(v i,v j)是v i和v j之间的依存紧密度,w D,w B,w M∈(0,1)为各子指标的权重且w D+w B+w M=1。 Among them, T(v i , v j ) is the closeness of dependence between v i and v j , w D , w B , w M ∈ (0,1) are the weights of each sub-indicator and w D +w B + w M =1.
具体地,利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度,所述图注意力网络(GAT)学习节点所有邻居上的注意力系数以进行特征聚合,并可以提高许多图学习任务的性能。利用图注意网络对于风险危害程度进行打分,评分函数依赖于网络的注意力系数和相关的实体特征向量,然后以最小化均方差损失为目标训练模型,最后以矩阵形式输出每一对实体之间的风险危害程度分数。由此针对不同的知识图谱和实体特征,可以量化评估针对金融领域的实体关系,例如企业信贷关系、供应链关系、行业间投入产出等。Specifically, using the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard, the graph attention network (GAT) learns the attention coefficients on all neighbors of the node to perform feature aggregation, and can Improving the performance of many graph learning tasks. Use the graph attention network to score the degree of risk hazard. The scoring function depends on the attention coefficient of the network and the related entity feature vector, and then train the model with the goal of minimizing the mean square error loss, and finally output the relationship between each pair of entities in the form of a matrix. risk hazard score. Therefore, for different knowledge graphs and entity characteristics, it is possible to quantitatively evaluate entity relationships in the financial field, such as corporate credit relationships, supply chain relationships, and inter-industry input-output relationships.
S7、基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型。S7. Obtain a macro forecasting model based on the event graph, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised training.
本申请实施例中,所述S7,包括:In the embodiment of this application, the S7 includes:
将所述事理图谱、所述依存紧密度和所述事件危害程度汇总为标注数据,并以所述标注数据为风险标签构建有监督模型;Summarizing the event map, the degree of dependency and the degree of hazard of the event into labeled data, and using the labeled data as a risk label to construct a supervised model;
获取无标注数据,以所述无标注数据和所述标注数据为基础构建无监督时序模型;Obtaining unlabeled data, constructing an unsupervised time series model based on the unlabeled data and the labeled data;
利用预设的半监督贝叶斯算法将所述有监督模型和所述无监督时序模型结合为宏观预测模型。The supervised model and the unsupervised time series model are combined into a macro forecasting model by using a preset semi-supervised Bayesian algorithm.
S8、利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。S8. Use the macro prediction model to predict the target risk entity to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
本申请实施例中,所述宏观预测模型用于进行行业的的风险预测,利用所述宏观预测 模型对所述目标风险实体进行预测,得到宏观风险概率,将所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。In the embodiment of the present application, the macro forecast model is used to predict the risk of the industry, and the macro forecast model is used to predict the target risk entity to obtain the macro risk probability, and the macro risk probability is greater than or equal to the predicted The industry type corresponding to the entity of the set macro threshold is a risk industry.
例如,所述目标风险实体为建筑环保板块,利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率为0.6,若预设的宏观阈值为0.5,则所述宏观风险概率大于所述宏观阈值,则将目标风险实体为建筑环保板块对应的建筑行业作为风险行业。For example, the target risk entity is the construction and environmental protection sector, and the macro forecast model is used to predict the target risk entity, and the macro risk probability is 0.6. If the preset macro threshold is 0.5, the macro risk probability is greater than As for the macro threshold, the target risk entity is the construction industry corresponding to the construction and environmental protection sector as the risk industry.
本申请实施例从预获取的多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱,所述时序知识图谱作为后续风险预测的数据基础,利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱。基于预设的强化学习算法构建风险预测模型,利用所述强化学习算法进行模型的构建可以保证模型的稳定性,并利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到目标风险实体。对所述标准知识图谱分别进行关系量化和程度量化,得到依存紧密度和事件危害程度,结合进行因果补充得到的事理图谱训练得到宏观预测模型,并利用宏观预测模型对所述目标风险实体进行预测,得到对应的风险预测行业。所述风险预测模型可以实现从实体角度的风险预测,通过宏观预测模型对已经经过风险预测的目标风险实体进行行业角度的预测,提高了对行业进行风险预测的准确度。因此本申请提出的风险预测方法可以实现解决对行业进行风险预测的准确度不够高的问题。The embodiment of the present application extracts the risk perception factor set from the pre-acquired multi-source information set, and builds a time-series knowledge map based on the risk-aware factor set. The relation complement algorithm completes the implicit relation in the time series knowledge map to obtain the standard knowledge map. Building a risk prediction model based on a preset reinforcement learning algorithm, using the reinforcement learning algorithm to construct the model can ensure the stability of the model, and using the risk prediction model to perform risk prediction on entities in the standard knowledge map, and obtain Target risk entity. Respectively quantify the relationship and degree of the standard knowledge map to obtain the closeness of dependence and the degree of hazard of the event, and combine the training of the event map obtained by causal supplementation to obtain a macro-prediction model, and use the macro-prediction model to predict the target risk entity , to get the corresponding risk prediction industry. The risk forecasting model can realize risk forecasting from the perspective of entities, and the macro forecasting model can predict the target risk entities that have undergone risk forecasting from an industry perspective, thereby improving the accuracy of risk forecasting for industries. Therefore, the risk prediction method proposed in this application can solve the problem that the accuracy of risk prediction for the industry is not high enough.
如图2所示,是本申请一实施例提供的风险预测装置的功能模块图。As shown in FIG. 2 , it is a functional block diagram of a risk prediction device provided by an embodiment of the present application.
本申请所述风险预测装置100可以安装于电子设备中。根据实现的功能,所述风险预测装置100可以包括知识图谱构建模块101、目标风险实体预测模块102、事理图谱生成模块103、图谱量化模块104及宏观预测模块105。本申请所述模块也可以称之为单元,是指一种能够被电子设备处理器所执行,并且能够完成固定功能的一系列计算机程序段,其存储在电子设备的存储器中。The risk prediction device 100 described in this application can be installed in an electronic device. According to the realized functions, the risk prediction device 100 may include a knowledge graph construction module 101 , a target risk entity prediction module 102 , an event graph generation module 103 , a graph quantification module 104 and a macro prediction module 105 . The module described in this application can also be called a unit, which refers to a series of computer program segments that can be executed by the processor of the electronic device and can complete fixed functions, and are stored in the memory of the electronic device.
在本实施例中,关于各模块/单元的功能如下:In this embodiment, the functions of each module/unit are as follows:
所述知识图谱构建模块101,用于获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱,利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;The knowledge map construction module 101 is used to obtain a multi-source information set, extract a risk-aware factor set from the multi-source information set, and construct a time-series knowledge map based on the risk-aware factor set, and use preset implicit relationships to supplement The algorithm completes the implicit relationship in the time-series knowledge graph to obtain a standard knowledge graph;
所述目标风险实体预测模块102,用于基于预设的强化学习算法构建风险预测模型,利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;The target risk entity prediction module 102 is configured to construct a risk prediction model based on a preset reinforcement learning algorithm, use the risk prediction model to perform risk prediction on entities in the standard knowledge graph, obtain a risk probability, and convert the Entities whose risk probability is greater than or equal to the preset probability threshold are regarded as target risk entities;
所述事理图谱生成模块103,用于利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;The event map generation module 103 is used to supplement the sequence knowledge map with causality by using a preset causal relation supplementary algorithm to obtain an event map;
所述图谱量化模块104,用于利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;The graph quantification module 104 is used to quantify the relationship of the standard knowledge graph by using a preset social network analysis algorithm to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph, Get the degree of hazard of the event;
所述宏观预测模块105,用于基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型,利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。The macro-prediction module 105 is used to obtain a macro-prediction model based on the event map, the dependency closeness and the degree of hazard of the event in combination with graph neural network and semi-supervised method training, and use the macro-prediction model to analyze the The target risk entity performs prediction to obtain the macro risk probability, and determines that the industry type corresponding to the entity whose macro risk probability is greater than or equal to the preset macro threshold is a risk industry.
详细地,本申请实施例中所风险预测装置100中所述的各模块在使用时采用与上述图1所述的风险预测方法一样的技术手段,并能够产生相同的技术效果,这里不再赘述。In detail, each module described in the risk prediction device 100 in the embodiment of the present application uses the same technical means as the risk prediction method described in Figure 1 above, and can produce the same technical effect, and will not be repeated here. .
如图3所示,是本申请一实施例提供的实现风险预测方法的电子设备的结构示意图。As shown in FIG. 3 , it is a schematic structural diagram of an electronic device implementing a risk prediction method provided by an embodiment of the present application.
所述电子设备1可以包括处理器10、存储器11、通信总线12以及通信接口13,还可以包括存储在所述存储器11中并可在所述处理器10上运行的计算机程序,如风险预测程序。The electronic device 1 may include a processor 10, a memory 11, a communication bus 12, and a communication interface 13, and may also include a computer program stored in the memory 11 and operable on the processor 10, such as a risk prediction program .
其中,所述处理器10在一些实施例中可以由集成电路组成,例如可以由单个封装的 集成电路所组成,也可以是由多个相同功能或不同功能封装的集成电路所组成,包括一个或者多个中央处理器(Central Processing unit,CPU)、微处理器、数字处理芯片、图形处理器及各种控制芯片的组合等。所述处理器10是所述电子设备的控制核心(Control Unit),利用各种接口和线路连接整个电子设备的各个部件,通过运行或执行存储在所述存储器11内的程序或者模块(例如执行风险预测程序等),以及调用存储在所述存储器11内的数据,以执行电子设备的各种功能和处理数据。Wherein, the processor 10 may be composed of integrated circuits in some embodiments, for example, may be composed of a single packaged integrated circuit, or may be composed of multiple integrated circuits with the same function or different functions packaged, including one or A combination of multiple central processing units (Central Processing unit, CPU), microprocessors, digital processing chips, graphics processors and various control chips, etc. The processor 10 is the control core (Control Unit) of the electronic device, and uses various interfaces and lines to connect the various components of the entire electronic device, and runs or executes programs or modules stored in the memory 11 (such as executing risk prediction program, etc.), and call the data stored in the memory 11 to execute various functions of the electronic device and process data.
所述存储器11至少包括一种类型的可读存储介质,所述可读存储介质包括闪存、移动硬盘、多媒体卡、卡型存储器(例如:SD或DX存储器等)、磁性存储器、磁盘、光盘等。所述存储器11在一些实施例中可以是电子设备的内部存储单元,例如该电子设备的移动硬盘。所述存储器11在另一些实施例中也可以是电子设备的外部存储设备,例如电子设备上配备的插接式移动硬盘、智能存储卡(Smart Media Card,SMC)、安全数字(Secure Digital,SD)卡、闪存卡(Flash Card)等。进一步地,所述存储器11还可以既包括电子设备的内部存储单元也包括外部存储设备。所述存储器11不仅可以用于存储安装于电子设备的应用软件及各类数据,例如风险预测程序的代码等,还可以用于暂时地存储已经输出或者将要输出的数据。The memory 11 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, mobile hard disk, multimedia card, card type memory (for example: SD or DX memory, etc.), magnetic memory, magnetic disk, optical disk, etc. . The storage 11 may be an internal storage unit of the electronic device in some embodiments, such as a mobile hard disk of the electronic device. The memory 11 can also be an external storage device of an electronic device in other embodiments, such as a plug-in mobile hard disk equipped on an electronic device, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD ) card, flash card (Flash Card), etc. Further, the memory 11 may also include both an internal storage unit of the electronic device and an external storage device. The memory 11 can not only be used to store application software and various data installed in electronic equipment, such as codes of risk prediction programs, but also can be used to temporarily store data that has been output or will be output.
所述通信总线12可以是外设部件互连标准(peripheral component interconnect,简称PCI)总线或扩展工业标准结构(extended industry standard architecture,简称EISA)总线等。该总线可以分为地址总线、数据总线、控制总线等。所述总线被设置为实现所述存储器11以及至少一个处理器10等之间的连接通信。The communication bus 12 may be a peripheral component interconnect (PCI for short) bus or an extended industry standard architecture (EISA for short) bus or the like. The bus can be divided into address bus, data bus, control bus and so on. The bus is configured to realize connection and communication between the memory 11 and at least one processor 10 and the like.
所述通信接口13用于上述电子设备与其他设备之间的通信,包括网络接口和用户接口。可选地,所述网络接口可以包括有线接口和/或无线接口(如WI-FI接口、蓝牙接口等),通常用于在该电子设备与其他电子设备之间建立通信连接。所述用户接口可以是显示器(Display)、输入单元(比如键盘(Keyboard)),可选地,用户接口还可以是标准的有线接口、无线接口。可选地,在一些实施例中,显示器可以是LED显示器、液晶显示器、触控式液晶显示器以及OLED(Organic Light-Emitting Diode,有机发光二极管)触摸器等。其中,显示器也可以适当的称为显示屏或显示单元,用于显示在电子设备中处理的信息以及用于显示可视化的用户界面。The communication interface 13 is used for communication between the electronic device and other devices, including a network interface and a user interface. Optionally, the network interface may include a wired interface and/or a wireless interface (such as a WI-FI interface, a Bluetooth interface, etc.), which are generally used to establish a communication connection between the electronic device and other electronic devices. The user interface may be a display (Display) or an input unit (such as a keyboard (Keyboard)). Optionally, the user interface may also be a standard wired interface or a wireless interface. Optionally, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode, organic light-emitting diode) touch device, and the like. Wherein, the display may also be properly referred to as a display screen or a display unit, and is used for displaying information processed in the electronic device and for displaying a visualized user interface.
图3仅示出了具有部件的电子设备,本领域技术人员可以理解的是,图3示出的结构并不构成对所述电子设备1的限定,可以包括比图示更少或者更多的部件,或者组合某些部件,或者不同的部件布置。FIG. 3 only shows an electronic device with components. Those skilled in the art can understand that the structure shown in FIG. 3 does not constitute a limitation to the electronic device 1, and may include fewer or more components, or combinations of certain components, or different arrangements of components.
例如,尽管未示出,所述电子设备还可以包括给各个部件供电的电源(比如电池),优选地,电源可以通过电源管理装置与所述至少一个处理器10逻辑相连,从而通过电源管理装置实现充电管理、放电管理、以及功耗管理等功能。电源还可以包括一个或一个以上的直流或交流电源、再充电装置、电源故障检测电路、电源转换器或者逆变器、电源状态指示器等任意组件。所述电子设备还可以包括多种传感器、蓝牙模块、Wi-Fi模块等,在此不再赘述。For example, although not shown, the electronic device may also include a power supply (such as a battery) for supplying power to various components. Preferably, the power supply may be logically connected to the at least one processor 10 through a power management device, so that Realize functions such as charge management, discharge management, and power consumption management. The power supply may also include one or more DC or AC power supplies, recharging devices, power failure detection circuits, power converters or inverters, power status indicators and other arbitrary components. The electronic device may also include various sensors, a Bluetooth module, a Wi-Fi module, etc., which will not be repeated here.
应该了解,所述实施例仅为说明之用,在专利申请范围上并不受此结构的限制。It should be understood that the embodiments are only for illustration, and are not limited by the structure in terms of the scope of the patent application.
所述电子设备1中的所述存储器11存储的风险预测程序是多个指令的组合,在所述处理器10中运行时,可以实现:The risk prediction program stored in the memory 11 in the electronic device 1 is a combination of multiple instructions. When running in the processor 10, it can realize:
获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱;Obtaining a multi-source information set, extracting a risk-aware factor set from the multi-source information set, and constructing a time-series knowledge map based on the risk-aware factor set;
利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;Complementing the implicit relationship in the time-series knowledge graph by using a preset implicit relationship complement algorithm to obtain a standard knowledge graph;
基于预设的强化学习算法构建风险预测模型;Build a risk prediction model based on a preset reinforcement learning algorithm;
利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率, 并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;Using the risk prediction model to perform risk prediction on the entities in the standard knowledge map to obtain a risk probability, and use the entity whose risk probability is greater than or equal to a preset probability threshold as a target risk entity;
利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;Using a preset causality supplement algorithm to supplement the time series knowledge map with causality to obtain an event map;
利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;Using a preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard;
基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型;Based on the event map, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised method training to obtain a macro-prediction model;
利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
具体地,所述处理器10对上述指令的具体实现方法可参考附图对应实施例中相关步骤的描述,在此不赘述。Specifically, for the specific implementation method of the above instructions by the processor 10, reference may be made to the description of relevant steps in the corresponding embodiments in the drawings, and details are not repeated here.
进一步地,所述电子设备1集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读存储介质中。所述计算机可读存储介质可以是易失性的,也可以是非易失性的。例如,所述计算机可读存储介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)。Further, if the integrated modules/units of the electronic device 1 are realized in the form of software function units and sold or used as independent products, they can be stored in a computer-readable storage medium. The computer-readable storage medium may be volatile or non-volatile. For example, the computer-readable storage medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, removable hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read- Only Memory).
本申请还提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序在被电子设备的处理器所执行时,可以实现:The present application also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor of an electronic device, it can realize:
获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱;Obtaining a multi-source information set, extracting a risk-aware factor set from the multi-source information set, and constructing a time-series knowledge map based on the risk-aware factor set;
利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;Complementing the implicit relationship in the time-series knowledge graph by using a preset implicit relationship complement algorithm to obtain a standard knowledge graph;
基于预设的强化学习算法构建风险预测模型;Build a risk prediction model based on a preset reinforcement learning algorithm;
利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;Using the risk prediction model to perform risk prediction on the entities in the standard knowledge graph to obtain a risk probability, and use the entity whose risk probability is greater than or equal to a preset probability threshold as a target risk entity;
利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;Using a preset causality supplement algorithm to supplement the time series knowledge map with causality to obtain an event map;
利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;Using a preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard;
基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型;Based on the event map, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised method training to obtain a macro-prediction model;
利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
在本申请所提供的几个实施例中,应该理解到,所揭露的设备,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。In the several embodiments provided in this application, it should be understood that the disclosed devices, devices and methods can be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the modules is only a logical function division, and there may be other division methods in actual implementation.
所述作为分离部件说明的模块可以是或者也可以不是物理上分开的,作为模块显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The modules described as separate components may or may not be physically separated, and the components shown as modules may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本申请各个实施例中的各功能模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能模块的形式实现。In addition, each functional module in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware, or in the form of hardware plus software function modules.
对于本领域技术人员而言,显然本申请不限于上述示范性实施例的细节,而且在不背离本申请的精神或基本特征的情况下,能够以其他的具体形式实现本申请。It will be apparent to those skilled in the art that the present application is not limited to the details of the exemplary embodiments described above, but that the present application can be implemented in other specific forms without departing from the spirit or essential characteristics of the present application.
因此,无论从哪一点来看,均应将实施例看作是示范性的,而且是非限制性的,本申请的范围由所附权利要求而不是上述说明限定,因此旨在将落在权利要求的等同要件的含义和范围内的所有变化涵括在本申请内。不应将权利要求中的任何附关联图标记视为限制 所涉及的权利要求。Therefore, the embodiments should be regarded as exemplary and not restrictive in all points of view, and the scope of the application is defined by the appended claims rather than the foregoing description, and it is intended that the scope of the present application be defined by the appended claims rather than by the foregoing description. All changes within the meaning and range of equivalents of the elements are embraced in this application. Any reference sign in a claim shall not be construed as limiting the claim concerned.
本申请所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The blockchain referred to in this application is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain (Blockchain), essentially a decentralized database, is a series of data blocks associated with each other using cryptographic methods. Each data block contains a batch of network transaction information, which is used to verify its Validity of information (anti-counterfeiting) and generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
本申请实施例可以基于人工智能技术对相关的数据进行获取和处理。其中,人工智能(Artificial Intelligence,AI)是利用数字计算机或者数字计算机控制的机器模拟、延伸和扩展人的智能,感知环境、获取知识并使用知识获得最佳结果的理论、方法、技术及应用系统。The embodiments of the present application may acquire and process relevant data based on artificial intelligence technology. Among them, artificial intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. .
此外,显然“包括”一词不排除其他单元或步骤,单数不排除复数。系统权利要求中陈述的多个单元或装置也可以由一个单元或装置通过软件或者硬件来实现。第一、第二等词语用来表示名称,而并不表示任何特定的顺序。In addition, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. A plurality of units or devices stated in the system claims may also be realized by one unit or device through software or hardware. The terms first, second, etc. are used to denote names and do not imply any particular order.
最后应说明的是,以上实施例仅用以说明本申请的技术方案而非限制,尽管参照较佳实施例对本申请进行了详细说明,本领域的普通技术人员应当理解,可以对本申请的技术方案进行修改或等同替换,而不脱离本申请技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present application without limitation. Although the present application has been described in detail with reference to the preferred embodiments, those skilled in the art should understand that the technical solutions of the present application can be Make modifications or equivalent replacements without departing from the spirit and scope of the technical solutions of the present application.

Claims (20)

  1. 一种风险预测方法,其中,所述方法包括:A risk prediction method, wherein the method comprises:
    获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱;Obtaining a multi-source information set, extracting a risk-aware factor set from the multi-source information set, and constructing a time-series knowledge map based on the risk-aware factor set;
    利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;Complementing the implicit relationship in the time-series knowledge graph by using a preset implicit relationship complement algorithm to obtain a standard knowledge graph;
    基于预设的强化学习算法构建风险预测模型;Build a risk prediction model based on a preset reinforcement learning algorithm;
    利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;Using the risk prediction model to perform risk prediction on the entities in the standard knowledge graph to obtain a risk probability, and use the entity whose risk probability is greater than or equal to a preset probability threshold as a target risk entity;
    利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;Using a preset causality supplement algorithm to supplement the time series knowledge map with causality to obtain an event map;
    利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;Using a preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard;
    基于所述事理图谱、所述依存紧密度和所述事件危害程度,并结合图神经网络和半监督方法训练得到宏观预测模型;Based on the event map, the closeness of dependence and the degree of hazard of the event, combined with graph neural network and semi-supervised method training to obtain a macro-prediction model;
    利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
  2. 如权利要求1所述的风险预测方法,其中,所述利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱,包括:The risk prediction method according to claim 1, wherein the use of the preset implicit relationship supplement algorithm to complement the implicit relationship in the time-series knowledge graph to obtain a standard knowledge graph includes:
    基于预设的稀疏图卷积网络对所述时序知识图谱进行图稀疏化处理,得到稀疏知识图谱;performing graph sparse processing on the time-series knowledge graph based on a preset sparse graph convolutional network to obtain a sparse knowledge graph;
    利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系;Using the trained relational graph convolutional network to predict the relationship of the sparse knowledge graph to obtain the hidden relationship;
    将所述隐含关系补全在所述时序知识图谱中,得到标准知识图谱。Completing the implicit relationship in the time series knowledge graph to obtain a standard knowledge graph.
  3. 如权利要求2所述的风险预测方法,其中,所述基于预设的稀疏图卷积网络对所述时序知识图谱进行图稀疏化处理,得到稀疏知识图谱,包括:The risk prediction method according to claim 2, wherein said sparse graph convolution network based on the preset sparse graph performs graph sparse processing on said time series knowledge graph to obtain a sparse knowledge graph, comprising:
    确定所述时序知识图谱对应的邻接矩阵及特征矩阵,并获取所述稀疏图卷积网络的预设权重矩阵;Determine the adjacency matrix and feature matrix corresponding to the time series knowledge graph, and obtain the preset weight matrix of the sparse graph convolutional network;
    基于所述邻接矩阵、所述特征矩阵和所述预设权重矩阵构建得到稀疏输出函数;constructing a sparse output function based on the adjacency matrix, the feature matrix and the preset weight matrix;
    利用基于乘数的交替方向算法对所述稀疏输出函数进行优化处理,将优化后的所述稀疏输出函数中的变量对所述时序知识图谱进行变量更新,得到稀疏知识图谱。Optimizing the sparse output function by using a multiplier-based alternating direction algorithm, updating variables in the optimized sparse output function on the time-series knowledge graph to obtain a sparse knowledge graph.
  4. 如权利要求2所述的风险预测方法,其中,所述利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系之前,所述方法还包括:The risk prediction method according to claim 2, wherein, using the trained relational graph convolutional network to predict the relationship of the sparse knowledge graph, before obtaining the implicit relationship, the method further includes:
    利用预设的关系图卷积网络中的实体编码器对所述稀疏知识图谱进行特征预测,得到所述稀疏知识图谱中的实体对应的潜在特征;Using the entity encoder in the preset relational graph convolution network to perform feature prediction on the sparse knowledge graph, and obtain the potential features corresponding to the entities in the sparse knowledge graph;
    基于所述关系图卷积网络中的解码器对所述实体对应的潜在特征进行评分,将所述评分大于或者等于预设的评分阈值的对应潜在特征作为目标潜在特征;Scoring the latent features corresponding to the entity based on the decoder in the relational graph convolutional network, and using the corresponding latent features whose scores are greater than or equal to a preset scoring threshold as target latent features;
    根据所述目标潜在特征和预设的交叉熵损失函数计算得到交叉熵损失值;Calculate and obtain a cross-entropy loss value according to the target latent feature and a preset cross-entropy loss function;
    当所述交叉熵损失值小于或者等于预设的损失阈值时,将所述关系图卷积网络输出为训练好的关系图卷积网络。When the cross-entropy loss value is less than or equal to a preset loss threshold, the relational graph convolutional network is output as a trained relational graph convolutional network.
  5. 如权利要求1所述的风险预测方法,其中,所述基于所述风险感知因子集构建时序知识图谱,包括:The risk prediction method according to claim 1, wherein said constructing a time series knowledge graph based on said risk perception factor set comprises:
    抽取所述风险感知因子集中的实体和实体关系;extracting entities and entity relationships in the set of risk perception factors;
    基于所述实体和所述实体关系进行图谱构建,得到时序知识图谱。A graph is constructed based on the entity and the entity relationship to obtain a time series knowledge graph.
  6. 如权利要求1至5中任一项所述的风险预测方法,其中,所述利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱,包括:The risk prediction method according to any one of claims 1 to 5, wherein said use of a preset causality supplementary algorithm to supplement the time-series knowledge map with causality to obtain an event map includes:
    获取训练文本集,对所述训练文本集进行事件抽取及因果关系归纳,得到多个因果三元组;Obtain a training text set, perform event extraction and causal relationship induction on the training text set, and obtain multiple causal triplets;
    保留所述因果三元组中符合预设筛选标准的多个因果事件作为标准三元组;Retaining a plurality of causal events that meet preset screening criteria in the causal triplet as a standard triplet;
    对多个所述标准三元组进行事件融合,得到融合事件,并将所述融合事件补充至所述时序知识图谱中,得到事理图谱。Event fusion is performed on multiple standard triples to obtain fusion events, and the fusion events are added to the time series knowledge graph to obtain an event graph.
  7. 如权利要求1至5中任一项所述的风险预测方法,其中,所述从所述多源信息集中提取风险感知因子集,包括:The risk prediction method according to any one of claims 1 to 5, wherein said extracting a risk perception factor set from said multi-source information set comprises:
    识别所述多源信息集中的文本信息和图像信息;identifying text information and image information in the multi-source information set;
    利用预设的自然语言处理技术对所述文本信息进行因子提取,得到文本感知因子集;performing factor extraction on the text information using a preset natural language processing technology to obtain a text-aware factor set;
    利用预设的图像识别技术对所述图像信息进行因子提取,得到图像感知因子集;performing factor extraction on the image information by using a preset image recognition technology to obtain an image perception factor set;
    将所述文本感知因子集和所述图像感知因子集进行汇总,得到风险感知因子集。Summarizing the text perception factor set and the image perception factor set to obtain a risk perception factor set.
  8. 一种风险预测装置,其中,所述装置包括:A risk prediction device, wherein the device comprises:
    知识图谱构建模块,用于获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱,利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;The knowledge map construction module is used to obtain a multi-source information set, extract a risk-aware factor set from the multi-source information set, and construct a time-series knowledge map based on the risk-aware factor set, and use the preset implicit relationship to complement the algorithm The implicit relationship in the time series knowledge graph is obtained to obtain a standard knowledge graph;
    目标风险实体预测模块,用于基于预设的强化学习算法构建风险预测模型,利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;The target risk entity prediction module is used to construct a risk prediction model based on a preset reinforcement learning algorithm, use the risk prediction model to perform risk prediction on entities in the standard knowledge map, obtain a risk probability, and set the risk probability to be greater than Or an entity equal to a preset probability threshold as a target risk entity;
    事理图谱生成模块,用于利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;An event map generation module, configured to use a preset causal relation supplementary algorithm to supplement the time-series knowledge map with causal relations to obtain an event map;
    图谱量化模块,用于利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;The graph quantification module is used to use the preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the closeness of dependence, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the event hazard degree;
    宏观预测模块,用于基于所述事理图谱、所述依存紧密度和所述事件危害程度并结合图神经网络和半监督方法训练得到宏观预测模型,利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。The macro prediction module is used to obtain a macro prediction model based on the event map, the dependency closeness and the event hazard degree combined with graph neural network and semi-supervised method training, and use the macro prediction model to predict the target risk entity Prediction is performed to obtain the macro risk probability, and the industry type corresponding to the entity whose macro risk probability is greater than or equal to the preset macro threshold is determined as a risk industry.
  9. 一种电子设备,其中,所述电子设备包括:An electronic device, wherein the electronic device includes:
    至少一个处理器;以及,at least one processor; and,
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的计算机程序,所述计算机程序被所述至少一个处理器执行,以使所述至少一个处理器能够执行如下步骤:The memory stores a computer program executable by the at least one processor, the computer program is executed by the at least one processor, so that the at least one processor can perform the following steps:
    获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱;Obtaining a multi-source information set, extracting a risk-aware factor set from the multi-source information set, and constructing a time-series knowledge map based on the risk-aware factor set;
    利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;Complementing the implicit relationship in the time-series knowledge graph by using a preset implicit relationship complement algorithm to obtain a standard knowledge graph;
    基于预设的强化学习算法构建风险预测模型;Build a risk prediction model based on a preset reinforcement learning algorithm;
    利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;Using the risk prediction model to perform risk prediction on the entities in the standard knowledge graph to obtain a risk probability, and use the entity whose risk probability is greater than or equal to a preset probability threshold as a target risk entity;
    利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;Using a preset causality supplement algorithm to supplement the time series knowledge map with causality to obtain an event map;
    利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;Using a preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard;
    基于所述事理图谱、所述依存紧密度和所述事件危害程度,并结合图神经网络和半监督方法训练得到宏观预测模型;Based on the event map, the closeness of dependence and the degree of hazard of the event, combined with graph neural network and semi-supervised method training to obtain a macro-prediction model;
    利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述 宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。Use the macro forecasting model to predict the target risk entity to obtain the macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to the preset macro threshold is a risk industry.
  10. 如权利要求9所述的电子设备,其中,所述利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱,包括:The electronic device according to claim 9, wherein the use of the preset implicit relationship complement algorithm to complement the implicit relationship in the time-series knowledge graph to obtain a standard knowledge graph includes:
    基于预设的稀疏图卷积网络对所述时序知识图谱进行图稀疏化处理,得到稀疏知识图谱;performing graph sparse processing on the time-series knowledge graph based on a preset sparse graph convolutional network to obtain a sparse knowledge graph;
    利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系;Using the trained relational graph convolutional network to predict the relationship of the sparse knowledge graph to obtain the hidden relationship;
    将所述隐含关系补全在所述时序知识图谱中,得到标准知识图谱。Completing the implicit relationship in the time series knowledge graph to obtain a standard knowledge graph.
  11. 如权利要求10所述的电子设备,其中,所述基于预设的稀疏图卷积网络对所述时序知识图谱进行图稀疏化处理,得到稀疏知识图谱,包括:The electronic device according to claim 10, wherein said sparse graph convolutional network based on a preset performs graph sparse processing on said time-series knowledge graph to obtain a sparse knowledge graph, comprising:
    确定所述时序知识图谱对应的邻接矩阵及特征矩阵,并获取所述稀疏图卷积网络的预设权重矩阵;Determine the adjacency matrix and feature matrix corresponding to the time series knowledge graph, and obtain the preset weight matrix of the sparse graph convolutional network;
    基于所述邻接矩阵、所述特征矩阵和所述预设权重矩阵构建得到稀疏输出函数;constructing a sparse output function based on the adjacency matrix, the feature matrix and the preset weight matrix;
    利用基于乘数的交替方向算法对所述稀疏输出函数进行优化处理,将优化后的所述稀疏输出函数中的变量对所述时序知识图谱进行变量更新,得到稀疏知识图谱。Optimizing the sparse output function by using a multiplier-based alternating direction algorithm, updating variables in the optimized sparse output function on the time-series knowledge graph to obtain a sparse knowledge graph.
  12. 如权利要求10所述的电子设备,其中,所述利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系之前,所述计算机程序被所述至少一个处理器执行时还实现如下步骤:The electronic device according to claim 10, wherein, the trained relational graph convolutional network is used to predict the relationship of the sparse knowledge graph, and before the implicit relationship is obtained, the computer program is executed by the at least one processor The following steps are also implemented during execution:
    利用预设的关系图卷积网络中的实体编码器对所述稀疏知识图谱进行特征预测,得到所述稀疏知识图谱中的实体对应的潜在特征;Using the entity encoder in the preset relational graph convolution network to perform feature prediction on the sparse knowledge graph, and obtain the potential features corresponding to the entities in the sparse knowledge graph;
    基于所述关系图卷积网络中的解码器对所述实体对应的潜在特征进行评分,将所述评分大于或者等于预设的评分阈值的对应潜在特征作为目标潜在特征;Scoring the latent features corresponding to the entity based on the decoder in the relational graph convolutional network, and using the corresponding latent features whose scores are greater than or equal to a preset scoring threshold as target latent features;
    根据所述目标潜在特征和预设的交叉熵损失函数计算得到交叉熵损失值;Calculate and obtain a cross-entropy loss value according to the target latent feature and a preset cross-entropy loss function;
    当所述交叉熵损失值小于或者等于预设的损失阈值时,将所述关系图卷积网络输出为训练好的关系图卷积网络。When the cross-entropy loss value is less than or equal to a preset loss threshold, the relational graph convolutional network is output as a trained relational graph convolutional network.
  13. 如权利要求9所述的电子设备,其中,所述基于所述风险感知因子集构建时序知识图谱,包括:The electronic device according to claim 9, wherein said constructing a time series knowledge graph based on said risk perception factor set comprises:
    抽取所述风险感知因子集中的实体和实体关系;extracting entities and entity relationships in the set of risk perception factors;
    基于所述实体和所述实体关系进行图谱构建,得到时序知识图谱。A graph is constructed based on the entity and the entity relationship to obtain a time series knowledge graph.
  14. 如权利要求9至13中任一项所述的电子设备,其中,所述利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱,包括:The electronic device according to any one of claims 9 to 13, wherein said use of a preset causality complement algorithm to supplement the sequence knowledge graph with causality to obtain an event graph includes:
    获取训练文本集,对所述训练文本集进行事件抽取及因果关系归纳,得到多个因果三元组;Obtain a training text set, perform event extraction and causal relationship induction on the training text set, and obtain multiple causal triplets;
    保留所述因果三元组中符合预设筛选标准的多个因果事件作为标准三元组;Retaining a plurality of causal events that meet preset screening criteria in the causal triplet as a standard triplet;
    对多个所述标准三元组进行事件融合,得到融合事件,并将所述融合事件补充至所述时序知识图谱中,得到事理图谱。Event fusion is performed on multiple standard triples to obtain fusion events, and the fusion events are added to the time series knowledge graph to obtain an event graph.
  15. 如权利要求9至14中任一项所述的电子设备,其中,所述从所述多源信息集中提取风险感知因子集,包括:The electronic device according to any one of claims 9 to 14, wherein said extracting a risk perception factor set from said multi-source information set comprises:
    识别所述多源信息集中的文本信息和图像信息;identifying text information and image information in the multi-source information set;
    利用预设的自然语言处理技术对所述文本信息进行因子提取,得到文本感知因子集;performing factor extraction on the text information using a preset natural language processing technology to obtain a text-aware factor set;
    利用预设的图像识别技术对所述图像信息进行因子提取,得到图像感知因子集;performing factor extraction on the image information by using a preset image recognition technology to obtain an image perception factor set;
    将所述文本感知因子集和所述图像感知因子集进行汇总,得到风险感知因子集。Summarizing the text perception factor set and the image perception factor set to obtain a risk perception factor set.
  16. 一种计算机可读存储介质,存储有计算机程序,其中,所述计算机程序被处理器执行时实现如下步骤:A computer-readable storage medium storing a computer program, wherein the computer program implements the following steps when executed by a processor:
    获取多源信息集,从所述多源信息集中提取风险感知因子集,并基于所述风险感知因子集构建时序知识图谱;Obtaining a multi-source information set, extracting a risk-aware factor set from the multi-source information set, and constructing a time-series knowledge map based on the risk-aware factor set;
    利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱;Complementing the implicit relationship in the time-series knowledge graph by using a preset implicit relationship complement algorithm to obtain a standard knowledge graph;
    基于预设的强化学习算法构建风险预测模型;Build a risk prediction model based on a preset reinforcement learning algorithm;
    利用所述风险预测模型对所述标准知识图谱中的实体进行风险预测,得到风险概率,并将所述风险概率大于或者等于预设概率阈值的实体作为目标风险实体;Using the risk prediction model to perform risk prediction on the entities in the standard knowledge graph to obtain a risk probability, and use the entity whose risk probability is greater than or equal to a preset probability threshold as a target risk entity;
    利用预设的因果关系补充算法对所述时序知识图谱进行因果关系补充,得到事理图谱;Using a preset causality supplement algorithm to supplement the time series knowledge map with causality to obtain an event map;
    利用预设的社交网络分析算法对所述标准知识图谱进行关系量化,得到依存紧密度,并利用预设的图注意网络对所述标准知识图谱进行程度量化,得到事件危害程度;Using a preset social network analysis algorithm to quantify the relationship of the standard knowledge graph to obtain the degree of dependency, and use the preset graph attention network to quantify the degree of the standard knowledge graph to obtain the degree of event hazard;
    基于所述事理图谱、所述依存紧密度和所述事件危害程度,并结合图神经网络和半监督方法训练得到宏观预测模型;Based on the event map, the closeness of dependence and the degree of hazard of the event, combined with graph neural network and semi-supervised method training to obtain a macro-prediction model;
    利用所述宏观预测模型对所述目标风险实体进行预测,得到宏观风险概率,确定所述宏观风险概率大于或者等于预设的宏观阈值的实体所对应的行业类型为风险行业。Predict the target risk entity by using the macro prediction model to obtain a macro risk probability, and determine that the industry type corresponding to the entity whose macro risk probability is greater than or equal to a preset macro threshold is a risk industry.
  17. 如权利要求16所述的计算机可读存储介质,其中,所述利用预设的隐含关系补充算法补全所述时序知识图谱中的隐含关系,得到标准知识图谱,包括:The computer-readable storage medium according to claim 16, wherein said using a preset implicit relationship complement algorithm to complement the implicit relationship in the time-series knowledge graph to obtain a standard knowledge graph includes:
    基于预设的稀疏图卷积网络对所述时序知识图谱进行图稀疏化处理,得到稀疏知识图谱;performing graph sparse processing on the time-series knowledge graph based on a preset sparse graph convolutional network to obtain a sparse knowledge graph;
    利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系;Using the trained relational graph convolutional network to predict the relationship of the sparse knowledge graph to obtain the hidden relationship;
    将所述隐含关系补全在所述时序知识图谱中,得到标准知识图谱。Completing the implicit relationship in the time series knowledge graph to obtain a standard knowledge graph.
  18. 如权利要求17所述的计算机可读存储介质,其中,所述基于预设的稀疏图卷积网络对所述时序知识图谱进行图稀疏化处理,得到稀疏知识图谱,包括:The computer-readable storage medium according to claim 17, wherein said sparse graph convolutional network based on a preset performs graph sparse processing on said time-series knowledge graph to obtain a sparse knowledge graph, comprising:
    确定所述时序知识图谱对应的邻接矩阵及特征矩阵,并获取所述稀疏图卷积网络的预设权重矩阵;Determine the adjacency matrix and feature matrix corresponding to the time series knowledge graph, and obtain the preset weight matrix of the sparse graph convolutional network;
    基于所述邻接矩阵、所述特征矩阵和所述预设权重矩阵构建得到稀疏输出函数;constructing a sparse output function based on the adjacency matrix, the feature matrix and the preset weight matrix;
    利用基于乘数的交替方向算法对所述稀疏输出函数进行优化处理,将优化后的所述稀疏输出函数中的变量对所述时序知识图谱进行变量更新,得到稀疏知识图谱。Optimizing the sparse output function by using a multiplier-based alternating direction algorithm, updating variables in the optimized sparse output function on the time-series knowledge graph to obtain a sparse knowledge graph.
  19. 如权利要求17所述的计算机可读存储介质,其中,所述利用训练好的关系图卷积网络对所述稀疏知识图谱进行关系预测,得到隐含关系之前,所述计算机程序被处理器执行时还实现如下步骤:The computer-readable storage medium according to claim 17, wherein, the trained relational graph convolutional network is used to predict the relationship of the sparse knowledge graph, and before the implicit relationship is obtained, the computer program is executed by the processor Also implement the following steps:
    利用预设的关系图卷积网络中的实体编码器对所述稀疏知识图谱进行特征预测,得到所述稀疏知识图谱中的实体对应的潜在特征;Using the entity encoder in the preset relational graph convolution network to perform feature prediction on the sparse knowledge graph, and obtain the potential features corresponding to the entities in the sparse knowledge graph;
    基于所述关系图卷积网络中的解码器对所述实体对应的潜在特征进行评分,将所述评分大于或者等于预设的评分阈值的对应潜在特征作为目标潜在特征;Scoring the latent features corresponding to the entity based on the decoder in the relational graph convolutional network, and using the corresponding latent features whose scores are greater than or equal to a preset scoring threshold as target latent features;
    根据所述目标潜在特征和预设的交叉熵损失函数计算得到交叉熵损失值;Calculate and obtain a cross-entropy loss value according to the target latent feature and a preset cross-entropy loss function;
    当所述交叉熵损失值小于或者等于预设的损失阈值时,将所述关系图卷积网络输出为训练好的关系图卷积网络。When the cross-entropy loss value is less than or equal to a preset loss threshold, the relational graph convolutional network is output as a trained relational graph convolutional network.
  20. 如权利要求16所述的计算机可读存储介质,其中,所述基于所述风险感知因子集构建时序知识图谱,包括:The computer-readable storage medium according to claim 16, wherein said constructing a time series knowledge graph based on said risk perception factor set comprises:
    抽取所述风险感知因子集中的实体和实体关系;extracting entities and entity relationships in the set of risk perception factors;
    基于所述实体和所述实体关系进行图谱构建,得到时序知识图谱。A graph is constructed based on the entity and the entity relationship to obtain a time series knowledge graph.
PCT/CN2022/071241 2021-10-19 2022-01-11 Risk prediction method and apparatus, and device and storage medium WO2023065545A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111216347.6 2021-10-19
CN202111216347.6A CN113822494B (en) 2021-10-19 2021-10-19 Risk prediction method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2023065545A1 true WO2023065545A1 (en) 2023-04-27

Family

ID=78917077

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/071241 WO2023065545A1 (en) 2021-10-19 2022-01-11 Risk prediction method and apparatus, and device and storage medium

Country Status (2)

Country Link
CN (1) CN113822494B (en)
WO (1) WO2023065545A1 (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116452323A (en) * 2023-04-28 2023-07-18 广州市良策网络技术有限公司 Risk assessment method, system, equipment and storage medium
CN116564408A (en) * 2023-04-28 2023-08-08 上海科技大学 Synthetic lethal gene pair prediction method, device, equipment and medium based on knowledge-graph reasoning
CN116611523A (en) * 2023-07-18 2023-08-18 山东师范大学 Method and system for predicting interpretable faults of turbofan engine
CN116629604A (en) * 2023-05-15 2023-08-22 国网冀北电力有限公司信息通信分公司 Method and device for processing and analyzing power grid operation risk
CN116662577A (en) * 2023-08-02 2023-08-29 北京网智天元大数据科技有限公司 Knowledge graph-based large language model training method and device
CN116668095A (en) * 2023-05-16 2023-08-29 江苏信创网安数据科技有限公司 Intelligent network security assessment method and system
CN116718198A (en) * 2023-08-10 2023-09-08 湖南璟德科技有限公司 Unmanned aerial vehicle cluster path planning method and system based on time sequence knowledge graph
CN116847222A (en) * 2023-09-01 2023-10-03 西安格威石油仪器有限公司 Remote monitoring method and system applied to petroleum underground measuring equipment
CN116882767A (en) * 2023-09-08 2023-10-13 之江实验室 Risk prediction method and device based on imperfect heterogeneous relation network diagram
CN116911618A (en) * 2023-09-07 2023-10-20 北京网藤科技有限公司 Artificial intelligence decision-making system and method for safety production risk
CN117035087A (en) * 2023-10-09 2023-11-10 北京壹永科技有限公司 Method, device, equipment and medium for generating a rational map for medical reasoning
CN117093728A (en) * 2023-10-19 2023-11-21 杭州同花顺数据开发有限公司 Financial field management map construction method, device, equipment and storage medium
CN117113148A (en) * 2023-08-30 2023-11-24 上海智租物联科技有限公司 Risk identification method, device and storage medium based on time sequence diagram neural network
CN117112808A (en) * 2023-10-24 2023-11-24 中国标准化研究院 Information knowledge graph construction method of credit belief-losing main body
CN117196323A (en) * 2023-11-07 2023-12-08 之江实验室 Gao Weitu network-based early warning method for complex social events
CN117196025A (en) * 2023-09-13 2023-12-08 安徽国麒科技有限公司 Knowledge-graph higher-order relation prediction method and system based on multi-view decoder
CN117235465A (en) * 2023-11-15 2023-12-15 国网江西省电力有限公司电力科学研究院 Transformer fault type diagnosis method based on graph neural network wave recording analysis
CN117332261A (en) * 2023-12-01 2024-01-02 深圳市伟昊净化设备有限公司 Efficiency evaluation method, system and storage medium for filter element
CN117350547A (en) * 2023-11-14 2024-01-05 深圳市明心数智科技有限公司 Method, device, equipment and storage medium for determining risk processing scheme of order
CN117540035A (en) * 2024-01-09 2024-02-09 安徽思高智能科技有限公司 RPA knowledge graph construction method based on entity type information fusion
CN117574328A (en) * 2024-01-15 2024-02-20 德阳联动检测科技有限公司 Coupling slip analysis method and system based on torque data
CN117573809A (en) * 2024-01-12 2024-02-20 中电科大数据研究院有限公司 Event map-based public opinion deduction method and related device
CN117788203A (en) * 2024-02-28 2024-03-29 西安华联电力电缆有限公司 Improved high-efficiency production and preparation method of crosslinked polyethylene insulated power cable

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113822494B (en) * 2021-10-19 2022-07-22 平安科技(深圳)有限公司 Risk prediction method, device, equipment and storage medium
CN113992429B (en) * 2021-12-22 2022-04-29 支付宝(杭州)信息技术有限公司 Event processing method, device and equipment
CN115964503B (en) * 2021-12-28 2023-07-07 北方工业大学 Safety risk prediction method and system based on community equipment facilities
CN114357191B (en) * 2021-12-31 2023-05-30 科大讯飞(苏州)科技有限公司 Knowledge extraction, question and answer and recommendation method, related device, equipment and medium
CN114022058A (en) * 2022-01-06 2022-02-08 成都晓多科技有限公司 Small and medium-sized enterprise confidence loss risk prediction method based on time sequence knowledge graph
US20230281565A1 (en) * 2022-03-04 2023-09-07 HireTeamMate Incorporated System and method for generating lower-dimension graph representations in talent acquisition platforms
CN114357197B (en) * 2022-03-08 2022-07-26 支付宝(杭州)信息技术有限公司 Event reasoning method and device
CN114611816B (en) * 2022-03-21 2024-02-27 中国电信股份有限公司 Potential event prediction method, device, equipment and storage medium
CN115393270B (en) * 2022-07-14 2023-06-23 北京建筑大学 Automatic identification and prediction method, device and equipment for building heritage diseases
CN115225532B (en) * 2022-07-21 2023-04-07 北京天融信网络安全技术有限公司 Network security situation prediction method, device, equipment and storage medium
CN115063056B (en) * 2022-08-18 2022-11-18 西南交通大学 Construction behavior safety risk dynamic analysis method based on graph topology analysis improvement
CN115345727B (en) * 2022-08-19 2023-04-28 中电金信软件有限公司 Method and device for identifying fraudulent loan application
CN115374296B (en) * 2022-10-25 2023-04-04 科大讯飞(苏州)科技有限公司 Question-answering method based on time sequence knowledge graph, entity representation method and related device
CN115797044B (en) * 2022-11-15 2024-03-29 东方微银科技股份有限公司 Credit wind control early warning method and system based on cluster analysis
CN116187448B (en) * 2023-04-25 2023-08-01 之江实验室 Information display method and device, storage medium and electronic equipment
CN116821374A (en) * 2023-07-27 2023-09-29 中国人民解放军陆军工程大学 Event prediction method based on information
CN117012374B (en) * 2023-10-07 2024-01-26 之江实验室 Medical follow-up system and method integrating event map and deep reinforcement learning

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110795568A (en) * 2019-09-30 2020-02-14 北京淇瑀信息科技有限公司 Risk assessment method and device based on user information knowledge graph and electronic equipment
CN111026874A (en) * 2019-11-22 2020-04-17 海信集团有限公司 Data processing method and server of knowledge graph
US20210019674A1 (en) * 2015-10-28 2021-01-21 Qomplx, Inc. Risk profiling and rating of extended relationships using ontological databases
CN112395423A (en) * 2020-09-09 2021-02-23 北京邮电大学 Recursive time-series knowledge graph completion method and device
CN113822494A (en) * 2021-10-19 2021-12-21 平安科技(深圳)有限公司 Risk prediction method, device, equipment and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI702557B (en) * 2017-12-12 2020-08-21 浩霆 黃 Financial risk forecast system and the method thereof
CN109857871B (en) * 2019-01-28 2021-04-20 重庆邮电大学 User relationship discovery method based on social network mass contextual data
CN112927082A (en) * 2021-03-22 2021-06-08 中国工商银行股份有限公司 Credit risk prediction method, apparatus, device, medium, and program product

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210019674A1 (en) * 2015-10-28 2021-01-21 Qomplx, Inc. Risk profiling and rating of extended relationships using ontological databases
CN110795568A (en) * 2019-09-30 2020-02-14 北京淇瑀信息科技有限公司 Risk assessment method and device based on user information knowledge graph and electronic equipment
CN111026874A (en) * 2019-11-22 2020-04-17 海信集团有限公司 Data processing method and server of knowledge graph
CN112395423A (en) * 2020-09-09 2021-02-23 北京邮电大学 Recursive time-series knowledge graph completion method and device
CN113822494A (en) * 2021-10-19 2021-12-21 平安科技(深圳)有限公司 Risk prediction method, device, equipment and storage medium

Cited By (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116452323A (en) * 2023-04-28 2023-07-18 广州市良策网络技术有限公司 Risk assessment method, system, equipment and storage medium
CN116564408A (en) * 2023-04-28 2023-08-08 上海科技大学 Synthetic lethal gene pair prediction method, device, equipment and medium based on knowledge-graph reasoning
CN116452323B (en) * 2023-04-28 2024-01-30 广州市良策网络技术有限公司 Risk assessment method, system, equipment and storage medium
CN116564408B (en) * 2023-04-28 2024-03-01 上海科技大学 Synthetic lethal gene pair prediction method, device, equipment and medium based on knowledge-graph reasoning
CN116629604A (en) * 2023-05-15 2023-08-22 国网冀北电力有限公司信息通信分公司 Method and device for processing and analyzing power grid operation risk
CN116668095A (en) * 2023-05-16 2023-08-29 江苏信创网安数据科技有限公司 Intelligent network security assessment method and system
CN116668095B (en) * 2023-05-16 2024-03-29 江苏信创网安数据科技有限公司 Intelligent network security assessment method and system
CN116611523B (en) * 2023-07-18 2023-09-26 山东师范大学 Method and system for predicting interpretable faults of turbofan engine
CN116611523A (en) * 2023-07-18 2023-08-18 山东师范大学 Method and system for predicting interpretable faults of turbofan engine
CN116662577A (en) * 2023-08-02 2023-08-29 北京网智天元大数据科技有限公司 Knowledge graph-based large language model training method and device
CN116662577B (en) * 2023-08-02 2023-11-03 北京网智天元大数据科技有限公司 Knowledge graph-based large language model training method and device
CN116718198B (en) * 2023-08-10 2023-11-03 湖南璟德科技有限公司 Unmanned aerial vehicle cluster path planning method and system based on time sequence knowledge graph
CN116718198A (en) * 2023-08-10 2023-09-08 湖南璟德科技有限公司 Unmanned aerial vehicle cluster path planning method and system based on time sequence knowledge graph
CN117113148A (en) * 2023-08-30 2023-11-24 上海智租物联科技有限公司 Risk identification method, device and storage medium based on time sequence diagram neural network
CN116847222B (en) * 2023-09-01 2023-11-14 西安格威石油仪器有限公司 Remote monitoring method and system applied to petroleum underground measuring equipment
CN116847222A (en) * 2023-09-01 2023-10-03 西安格威石油仪器有限公司 Remote monitoring method and system applied to petroleum underground measuring equipment
CN116911618A (en) * 2023-09-07 2023-10-20 北京网藤科技有限公司 Artificial intelligence decision-making system and method for safety production risk
CN116911618B (en) * 2023-09-07 2023-12-05 北京网藤科技有限公司 Artificial intelligence decision-making system and method for safety production risk
CN116882767A (en) * 2023-09-08 2023-10-13 之江实验室 Risk prediction method and device based on imperfect heterogeneous relation network diagram
CN116882767B (en) * 2023-09-08 2024-01-05 之江实验室 Risk prediction method and device based on imperfect heterogeneous relation network diagram
CN117196025A (en) * 2023-09-13 2023-12-08 安徽国麒科技有限公司 Knowledge-graph higher-order relation prediction method and system based on multi-view decoder
CN117035087B (en) * 2023-10-09 2023-12-26 北京壹永科技有限公司 Method, device, equipment and medium for generating a rational map for medical reasoning
CN117035087A (en) * 2023-10-09 2023-11-10 北京壹永科技有限公司 Method, device, equipment and medium for generating a rational map for medical reasoning
CN117093728A (en) * 2023-10-19 2023-11-21 杭州同花顺数据开发有限公司 Financial field management map construction method, device, equipment and storage medium
CN117093728B (en) * 2023-10-19 2024-02-02 杭州同花顺数据开发有限公司 Financial field management map construction method, device, equipment and storage medium
CN117112808B (en) * 2023-10-24 2024-01-19 中国标准化研究院 Information knowledge graph construction method of credit belief-losing main body
CN117112808A (en) * 2023-10-24 2023-11-24 中国标准化研究院 Information knowledge graph construction method of credit belief-losing main body
CN117196323B (en) * 2023-11-07 2024-02-27 之江实验室 Gao Weitu network-based early warning method for complex social events
CN117196323A (en) * 2023-11-07 2023-12-08 之江实验室 Gao Weitu network-based early warning method for complex social events
CN117350547A (en) * 2023-11-14 2024-01-05 深圳市明心数智科技有限公司 Method, device, equipment and storage medium for determining risk processing scheme of order
CN117350547B (en) * 2023-11-14 2024-03-26 深圳市明心数智科技有限公司 Method, device, equipment and storage medium for determining risk processing scheme of order
CN117235465A (en) * 2023-11-15 2023-12-15 国网江西省电力有限公司电力科学研究院 Transformer fault type diagnosis method based on graph neural network wave recording analysis
CN117235465B (en) * 2023-11-15 2024-03-12 国网江西省电力有限公司电力科学研究院 Transformer fault type diagnosis method based on graph neural network wave recording analysis
CN117332261B (en) * 2023-12-01 2024-03-22 深圳市伟昊净化设备有限公司 Efficiency evaluation method, system and storage medium for filter element
CN117332261A (en) * 2023-12-01 2024-01-02 深圳市伟昊净化设备有限公司 Efficiency evaluation method, system and storage medium for filter element
CN117540035A (en) * 2024-01-09 2024-02-09 安徽思高智能科技有限公司 RPA knowledge graph construction method based on entity type information fusion
CN117573809A (en) * 2024-01-12 2024-02-20 中电科大数据研究院有限公司 Event map-based public opinion deduction method and related device
CN117574328A (en) * 2024-01-15 2024-02-20 德阳联动检测科技有限公司 Coupling slip analysis method and system based on torque data
CN117574328B (en) * 2024-01-15 2024-03-29 德阳联动检测科技有限公司 Coupling slip analysis method and system based on torque data
CN117788203A (en) * 2024-02-28 2024-03-29 西安华联电力电缆有限公司 Improved high-efficiency production and preparation method of crosslinked polyethylene insulated power cable

Also Published As

Publication number Publication date
CN113822494A (en) 2021-12-21
CN113822494B (en) 2022-07-22

Similar Documents

Publication Publication Date Title
WO2023065545A1 (en) Risk prediction method and apparatus, and device and storage medium
Swathi et al. An optimal deep learning-based LSTM for stock price prediction using twitter sentiment analysis
WO2022057669A1 (en) Method for pre-training knowledge graph on the basis of structured context information
CN111737552A (en) Method, device and equipment for extracting training information model and acquiring knowledge graph
CN113392209B (en) Text clustering method based on artificial intelligence, related equipment and storage medium
CN113505244A (en) Knowledge graph construction method, system, equipment and medium based on deep learning
CN112905801A (en) Event map-based travel prediction method, system, device and storage medium
CN113254675B (en) Knowledge graph construction method based on self-adaptive few-sample relation extraction
CN113626607B (en) Abnormal work order identification method and device, electronic equipment and readable storage medium
CN116383399A (en) Event public opinion risk prediction method and system
Dai et al. Hybrid deep model for human behavior understanding on industrial internet of video things
CN113628059A (en) Associated user identification method and device based on multilayer graph attention network
CN116205482A (en) Important personnel risk level assessment method and related equipment
CN113704392A (en) Method, device and equipment for extracting entity relationship in text and storage medium
CN113837307A (en) Data similarity calculation method and device, readable medium and electronic equipment
CN115049397A (en) Method and device for identifying risk account in social network
CN116821373A (en) Map-based prompt recommendation method, device, equipment and medium
CN116402352A (en) Enterprise risk prediction method and device, electronic equipment and medium
CN116340516A (en) Entity relation cluster extraction method, device, equipment and storage medium
WO2023284516A1 (en) Information recommendation method and apparatus based on knowledge graph, and device, medium, and product
CN114898426A (en) Synonym label aggregation method, device, equipment and storage medium
Shao et al. Web and Big Data: Third International Joint Conference, APWeb-WAIM 2019, Chengdu, China, August 1–3, 2019, Proceedings, Part I
Song et al. Prior-guided multi-scale fusion transformer for face attribute recognition
CN116975256B (en) Method and system for processing multisource information in construction process of underground factory building of pumped storage power station
US20230129870A1 (en) Method, apparatus, electronic device and medium for training models

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22882172

Country of ref document: EP

Kind code of ref document: A1