CN111222681A

CN111222681A - Data processing method, device, equipment and storage medium for enterprise bankruptcy risk prediction

Info

Publication number: CN111222681A
Application number: CN201911075577.8A
Authority: CN
Inventors: 宋仲伟
Original assignee: Quantum Shuju Beijing Technology Co ltd
Current assignee: Quantum Shuju Beijing Technology Co ltd
Priority date: 2019-11-05
Filing date: 2019-11-05
Publication date: 2020-06-02

Abstract

The application discloses a data processing method, apparatus, device and storage medium for enterprise bankruptcy risk prediction. The method comprises: establishing an enterprise relationship graph from collected enterprise data; training a prediction model using at least the enterprise node attributes and enterprise node relationships contained in the enterprise relationship graph as input, wherein the prediction model performs enterprise classification based on a graph neural network and whether an enterprise is bankrupt serves as the data label; and predicting the bankruptcy risk of an enterprise from received enterprise information using the trained prediction model. The application addresses the technical problem that enterprise bankruptcy risk prediction performs poorly. By constructing the enterprise relationship graph, the association information among enterprises is quantified; at the same time, a new graph neural network model is provided that makes fuller use of multidimensional edge features, so that the accuracy of graph node classification can be improved.

Description

Data processing method, device, equipment and storage medium for enterprise bankruptcy risk prediction

Technical Field

The application relates to the field of data processing, in particular to a data processing method, a data processing device, data processing equipment and a data processing storage medium for enterprise bankruptcy risk prediction.

Background

Traditional enterprise risk prediction evaluates only the information of a single enterprise itself, including the enterprise profile, the legal representative and senior management, the registered capital and its composition, intellectual property and other information. The enterprise credit information is analyzed manually or with machine learning methods such as SVM and XGBoost to produce an enterprise credit rating.

The inventor found that such machine learning methods take only the basic information of a single enterprise as input and cannot relate it to the senior-management and shareholder relationships among enterprises, external investment and cooperation, and similar conditions, so important reference information about enterprise risk is lost. As a result, the accuracy of enterprise risk prediction is insufficient.

Aiming at the problem of poor effect of enterprise bankruptcy risk prediction in the related technology, no effective solution is provided at present.

Disclosure of Invention

The application mainly aims to provide a data processing method, a data processing device, equipment and a storage medium for enterprise bankruptcy risk prediction, so as to solve the problem that the effect of enterprise bankruptcy risk prediction is poor.

In order to achieve the above object, according to one aspect of the present application, a data processing method for enterprise bankruptcy risk prediction is provided.

The data processing method for enterprise bankruptcy risk prediction comprises the following steps: establishing an enterprise relationship graph from the collected enterprise data; training a prediction model using at least the enterprise node attributes and enterprise node relationships contained in the enterprise relationship graph as input, wherein the prediction model performs enterprise classification based on a graph neural network and whether an enterprise is bankrupt serves as the data label; and predicting the bankruptcy risk of an enterprise from received enterprise information using the trained prediction model.

Further, training the prediction model using at least the enterprise node attributes and enterprise node relationships contained in the enterprise relationship graph as input includes:

for an enterprise relationship graph of N enterprise nodes, the N × F matrix of node features of the entire graph is denoted as X, and for a graph G = (V, E) the node features are:

$X_{ij}$, the j-th feature of node i,

where N is the number of nodes and F is the feature dimension of each node;

and the edge features are:

$E_{ij}$, the edge feature vector between node i and node j,

$E_{ijp}$, the p-th dimension of the feature vector $E_{ij}$,

where E is the N × N × P tensor of edge features of the graph, and $E_{ij} = 0$ when no link exists between two nodes.

Further, predicting the enterprise risk from the received enterprise information using the trained prediction model includes:

obtaining, by a network embedding method, low-dimensional latent feature representations of the nodes in the graph neural network from the received enterprise information, and using these representations as the features of a graph-based classification task;

and fusing the node attribute information and node association information of the enterprises in the enterprise relationship graph, mapping the high-dimensional sparse matrix of the graph to low-dimensional dense vectors, and training a prediction model for node classification.

In order to achieve the above object, according to another aspect of the present application, a data processing apparatus for enterprise bankruptcy risk prediction is provided.

The data processing apparatus for enterprise bankruptcy risk prediction according to the application comprises: an enterprise relationship graph module, configured to establish an enterprise relationship graph from the collected enterprise data; a model training module, configured to train a prediction model using at least the enterprise node attributes and enterprise node relationships contained in the enterprise relationship graph as input, wherein the prediction model performs enterprise classification based on a graph neural network and whether an enterprise is bankrupt serves as the data label; and a risk prediction module, configured to predict the bankruptcy risk of an enterprise from received enterprise information using the trained prediction model.

Further, the model training module is configured such that, for an enterprise relationship graph of N enterprise nodes, the N × F matrix of node features of the entire graph is denoted as X, and for a graph G = (V, E) the node features are:

$X_{ij}$, the j-th feature of node i.

Further, the model training module is configured such that the edge features are:

$E_{ij}$, the edge feature vector between node i and node j,

$E_{ijp}$, the p-th dimension of the feature vector $E_{ij}$.

In order to achieve the above object, according to yet another aspect of the present application, there is provided an electronic device including a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor executes the computer program to implement the steps of the data processing method for enterprise bankruptcy risk prediction.

In order to achieve the above object, according to yet another aspect of the present application, a computer-readable storage medium is provided, on which a computer program is stored, which, when being executed by a processor, implements the steps of the data processing method for enterprise bankruptcy risk prediction.

According to the data processing method, apparatus, device and storage medium for enterprise bankruptcy risk prediction of the embodiments of the present application, an enterprise relationship graph is established from collected enterprise data, and a prediction model is trained using at least the enterprise node attributes and enterprise node relationships contained in the graph as input. The bankruptcy risk of an enterprise is then predicted from received enterprise information using the trained prediction model. This achieves the technical effect of improving the ability to anticipate enterprise operating risk and solves the technical problem that enterprise bankruptcy risk prediction performs poorly.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this application, serve to provide a further understanding of the application and to enable other features, objects, and advantages of the application to be more apparent. The drawings and their description illustrate the embodiments of the invention and do not limit it. In the drawings:

FIG. 1 is a schematic flow chart of a data processing method for enterprise bankruptcy risk prediction according to an embodiment of the present application;

FIG. 2 is a schematic structural diagram of a data processing apparatus for enterprise bankruptcy risk prediction according to an embodiment of the present application;

FIG. 3 is a schematic diagram of an apparatus according to an embodiment of the present application;

FIG. 4 is a schematic diagram of an implementation principle according to an embodiment of the present application;

FIG. 5 is a schematic diagram of business relationships according to an embodiment of the present application;

FIG. 6 is an architecture diagram of a graph neural network model oriented to enterprise risk according to an embodiment of the present application.

Detailed Description

In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances such that embodiments of the application described herein may be used. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.

In this application, the terms "upper", "lower", "left", "right", "front", "rear", "top", "bottom", "inner", "outer", "middle", "vertical", "horizontal", "lateral", "longitudinal", and the like indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings. These terms are used primarily to better describe the present application and its embodiments, and are not used to limit the indicated devices, elements or components to a particular orientation or to be constructed and operated in a particular orientation.

Moreover, some of the above terms may be used to indicate other meanings besides the orientation or positional relationship, for example, the term "on" may also be used to indicate some kind of attachment or connection relationship in some cases. The specific meaning of these terms in this application will be understood by those of ordinary skill in the art as appropriate.

Furthermore, the terms "mounted," "disposed," "provided," "connected," and "sleeved" are to be construed broadly. For example, it may be a fixed connection, a removable connection, or a unitary construction; can be a mechanical connection, or an electrical connection; may be directly connected, or indirectly connected through intervening media, or may be in internal communication between two devices, elements or components. The specific meaning of the above terms in the present application can be understood by those of ordinary skill in the art as appropriate.

It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.

As shown in FIG. 1, the method includes steps S101 to S103 as follows:

Step S101, establishing an enterprise relationship graph from the collected enterprise data.

In the data collection and preprocessing stage, the system collects enterprise data and constructs the enterprise relationship graph from the enterprise information.

Specifically, the relationships among knowledge graph entities and the entity attributes can be defined through ontology modeling, and the knowledge graph is constructed using algorithms such as Chinese word segmentation, entity extraction, attribute extraction, entity alignment, entity disambiguation, semantic understanding, knowledge inference, knowledge fusion and semantic matching.

Optionally, for semi-structured and unstructured data, representations of knowledge points can be obtained through knowledge extraction such as entity extraction, relation extraction and attribute extraction; for structured data, they can be obtained by direct import. Once the knowledge representations are obtained, entity alignment is performed and nodes representing the same entity are merged.
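The merging of nodes that represent the same entity can be illustrated with a minimal sketch. The alignment rule used here (a normalized enterprise name as the key) and the names `normalize_name`, `merge_aligned_nodes` and the sample records are hypothetical and chosen only for illustration; the application does not specify a particular alignment algorithm.

```python
from collections import defaultdict


def normalize_name(name: str) -> str:
    """Hypothetical alignment key: lower-case, trim, and collapse whitespace."""
    return " ".join(name.lower().split())


def merge_aligned_nodes(records):
    """Merge records whose names normalize to the same key into one node.

    records: iterable of (name, attributes_dict) pairs extracted from
    structured, semi-structured, or unstructured sources.
    Returns {alignment_key: merged_attributes}; a later record only fills in
    attributes that earlier records did not provide.
    """
    merged = defaultdict(dict)
    for name, attrs in records:
        node = merged[normalize_name(name)]
        for key, value in attrs.items():
            node.setdefault(key, value)
    return dict(merged)


# Illustrative records; the first two refer to the same enterprise.
records = [
    ("Acme Trading Co", {"registered_capital": 5_000_000}),
    ("acme  trading co", {"legal_representative": "Zhang San"}),
    ("Beta Logistics", {"registered_capital": 1_200_000}),
]
nodes = merge_aligned_nodes(records)
```

In practice the alignment key would likely be a stable identifier or the output of an entity disambiguation model rather than a normalized string.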

Step S102, training a prediction model using at least the enterprise node attributes and enterprise node relationships contained in the enterprise relationship graph as input.

The prediction model performs enterprise classification based on a graph neural network, with whether an enterprise is bankrupt serving as the data label.

Specifically, based on the node and structure representation of the enterprise relationship graph, the enterprise node attributes and the relationship structure between nodes in the graph are represented as vectors and used as training input for the subsequent model.

Step S103, predicting the bankruptcy risk of an enterprise from the received enterprise information using the trained prediction model.

For received enterprise information that carries no label, the prediction model trained in the above steps predicts the probability of bankruptcy risk for the enterprise.

Specifically, an enterprise node classification algorithm based on a graph neural network is constructed, with whether an enterprise is bankrupt as the label; the enterprise node attributes and the relationship structure vectors in the graph are input together into the improved graph neural network, which is trained to output the bankruptcy probability of enterprise nodes. The model is then used to predict the bankruptcy probability of unlabeled data, thereby realizing enterprise risk prediction.

From the above description, it can be seen that the following technical effects are achieved by the present application:

According to the embodiment of the present application, as a preferred embodiment, training the prediction model using at least the enterprise node attributes and enterprise node relationships contained in the enterprise relationship graph as input includes the following.

Specifically, given an enterprise relationship graph with N enterprise nodes, let X be the N × F matrix of node features of the entire graph. Elements of a matrix or tensor are denoted by indices in the subscript. For a graph G = (V, E), the node features are as follows: $X_{ij}$ denotes the j-th feature of node i, so all node features can be represented by the N × F matrix X, where N is the number of nodes and F is the feature dimension of each node.

The edge features are as follows: $E_{ij}$ denotes the edge feature vector between nodes i and j, and $E_{ijp}$ denotes the p-th dimension of $E_{ij}$. E is the N × N × P tensor of edge features of the graph, and $E_{ij} = 0$ when no link exists between two nodes.

According to the embodiment of the present application, as a preferred embodiment, predicting the enterprise risk from the received enterprise information using the trained prediction model includes the following.

Specifically, a network embedding method is adopted to learn low-dimensional latent representations of the nodes in the network, and the learned representations can serve as features for various graph-based tasks such as classification and clustering. Enterprise risk prediction is converted into a node classification problem on the enterprise relationship graph: the enterprise relationship information is input into the neural network as an important feature, the complex relationships among enterprise nodes are fully utilized to build a relationship-mining model that is combined with enterprise operation and management features, and closely related enterprises and individuals are mined to infer the operating condition and bankruptcy risk of an enterprise, thereby improving the ability to anticipate enterprise operating risk.

Optionally, in the node classification setting, each node v is characterized by its feature $x_v$ and associated with a label $t_v$. Each node is represented by a d-dimensional state vector $h_v$ that contains information about its neighborhood. The goal is to learn, for each vertex v, an embedded state $h_v$ containing its neighborhood information, and to use $h_v$ to predict the unlabeled nodes.

$h_v = f(x_v, x_{co[v]}, h_{ne[v]}, x_{ne[v]})$

By fusing the node attribute information and node association information of the enterprises in the enterprise relationship graph, the high-dimensional sparse matrix of the graph is mapped to low-dimensional dense vectors, and a prediction model for node classification is obtained by training.

Specifically, by constructing the enterprise relationship graph, the association information among enterprises is expressed quantitatively, and graph structure information such as the senior executives, shareholders and external investments of enterprises is used as feature input, which solves the problem that association information among enterprises is difficult to quantify. At the same time, the relationship features are used as input to the subsequent risk prediction model, which overcomes the limitation that graph neural networks cannot make effective use of edge information and improves the accuracy of enterprise risk prediction.

It should be noted that the steps illustrated in the flowcharts of the figures may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowcharts, in some cases, the steps illustrated or described may be performed in an order different than presented herein.

According to an embodiment of the present application, there is also provided a data processing apparatus for enterprise bankruptcy risk prediction, for implementing the above method. As shown in FIG. 2, the apparatus includes: an enterprise relationship graph module 10, configured to establish an enterprise relationship graph from the collected enterprise data; a model training module 11, configured to train a prediction model using at least the enterprise node attributes and enterprise node relationships contained in the enterprise relationship graph as input, wherein the prediction model performs enterprise classification based on a graph neural network and whether an enterprise is bankrupt serves as the data label; and a risk prediction module 12, configured to predict the bankruptcy risk of an enterprise from received enterprise information using the trained prediction model.

In the enterprise relationship graph module 10 according to the embodiment of the present application, an enterprise relationship graph may be established according to the collected enterprise data. And in the data collection and preprocessing stage, the system collects enterprise data and constructs an enterprise relationship map through enterprise information.

In the model training module 11 of the embodiment of the present application, a prediction model is trained using the enterprise node attributes and the enterprise node relationships in the enterprise relationship graph as input.

In the risk prediction module 12 of the embodiment of the present application, for received enterprise information that carries no label, the prediction model trained as above predicts the probability of bankruptcy risk for the enterprise.

According to the embodiment of the present application, as a preferred embodiment, the model training module 11 is configured such that, for an enterprise relationship graph of N enterprise nodes, the N × F matrix of node features of the entire graph is denoted as X, and for a graph G = (V, E) the node features are as follows: $X_{ij}$ denotes the j-th feature of node i, where N is the number of nodes and F is the feature dimension of each node.

Specifically, given an enterprise relationship graph with N enterprise nodes, let X be the N × F matrix of node features of the entire graph. Elements of a matrix or tensor are denoted by indices in the subscript, so all node features can be represented by the N × F matrix X.

According to the embodiment of the present application, as a preferred embodiment, the model training module 11 is further configured such that, for the graph G = (V, E), the edge features are as follows: $E_{ij}$ denotes the edge feature vector between nodes i and j, and $E_{ijp}$ denotes the p-th dimension of $E_{ij}$. E is the N × N × P tensor of edge features of the graph, and $E_{ij} = 0$ when no link exists between two nodes.

The embodiment of the application also provides computer equipment. As shown in fig. 3, the computer device 20 may include: the at least one processor 201, e.g., CPU, the at least one network interface 204, the user interface 203, the memory 205, the at least one communication bus 202, and optionally, a display 206. Wherein a communication bus 202 is used to enable the connection communication between these components. The user interface 203 may include a touch screen, a keyboard or a mouse, among others. The network interface 204 may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface), and a communication connection may be established with the server via the network interface 204. The memory 205 may be a high-speed RAM memory or a non-volatile memory (non-volatile memory), such as at least one disk memory, and the memory 205 includes a flash in the embodiment of the present invention. The memory 205 may optionally be at least one memory system located remotely from the processor 201. As shown in fig. 3, memory 205, which is a type of computer storage medium, may include an operating system, a network communication module, a user interface module, and program instructions.

It should be noted that the network interface 204 may be connected to a receiver, a transmitter or other communication module, and the other communication module may include, but is not limited to, a WiFi module, a bluetooth module, etc., and it is understood that the computer device in the embodiment of the present invention may also include a receiver, a transmitter, other communication module, etc.

Processor 201 may be used to call program instructions stored in memory 205 and cause computer device 20 to perform the following operations:

establishing an enterprise relationship graph from the collected enterprise data;

training a prediction model using at least the enterprise node attributes and enterprise node relationships contained in the enterprise relationship graph as input, wherein the prediction model performs enterprise classification based on a graph neural network and whether an enterprise is bankrupt serves as the data label;

and predicting the bankruptcy risk of an enterprise from received enterprise information using the trained prediction model.

Please refer to fig. 4 to fig. 6, which illustrate the present application in detail:

As shown in FIG. 4, in step 1, the data collection and preprocessing stage, the system collects enterprise data and constructs an enterprise relationship graph from the enterprise information;

in step 2, the nodes and structure of the enterprise relationship graph are represented as vectors, and the enterprise node attributes and the relationship structure between nodes in the graph are used as training input for the subsequent model;

in step 3, an enterprise node classification algorithm based on a graph neural network is constructed, with whether an enterprise is bankrupt as the label; the enterprise node attributes and relationship structure vectors in the graph are input together into the improved graph neural network, which is trained to output the bankruptcy probability of enterprise nodes;

in step 4, the model is used to predict the bankruptcy probability of unlabeled data, thereby realizing enterprise risk prediction.

A preferred embodiment of the present application is described below with reference to the accompanying drawings:

(I) Enterprise relationship knowledge graph construction

The relationships among knowledge graph entities and the entity attributes are defined through ontology modeling, and the knowledge graph is constructed using algorithms such as Chinese word segmentation, entity extraction, attribute extraction, entity alignment, entity disambiguation, semantic understanding, knowledge inference, knowledge fusion and semantic matching. For semi-structured and unstructured data, representations of knowledge points are obtained through knowledge extraction (entity extraction, relation extraction and attribute extraction); for structured data, they are obtained by direct import. Once the knowledge representations are obtained, entity alignment is performed and nodes representing the same entity are merged.

As shown in FIG. 5, the data sources include basic enterprise information held within the bank, transaction data, external public opinion data, litigation data and business data. Various algorithms are used to extract triples describing close associations of enterprises. Two enterprises may be considered closely associated if the frequency and amount of fund transfers between them exceed a threshold, if they have a relatively large number of cross-shareholders or shared officers, or if there are a large number of litigation events between them. An enterprise and an individual may be considered closely associated if the fund transfers between them exceed a threshold, if the individual is a senior executive or shareholder of the enterprise, or if there are many disputes between them.
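The threshold rule for fund transfers between enterprises can be sketched as follows. This is only an illustration under stated assumptions: the threshold value, the relation name `closely_associated_by_funds`, and the choice to count transfer frequency alone (ignoring amounts and transfer direction) are hypothetical and not taken from the application.

```python
from collections import Counter

# Hypothetical threshold: number of fund transfers that must be exceeded.
TRANSFER_COUNT_THRESHOLD = 3


def close_association_triples(transfers, threshold=TRANSFER_COUNT_THRESHOLD):
    """Return (head, relation, tail) triples for pairs with more than `threshold` transfers.

    transfers: iterable of (payer, payee) pairs; direction is ignored when counting.
    """
    counts = Counter(tuple(sorted(pair)) for pair in transfers)
    return sorted(
        (a, "closely_associated_by_funds", b)
        for (a, b), n in counts.items()
        if n > threshold
    )


# Enterprises A and B exchange funds five times in total, A and C only twice.
transfers = [("A", "B")] * 3 + [("B", "A")] * 2 + [("A", "C")] * 2
triples = close_association_triples(transfers)
```

The other rules (shareholders, officers, litigation, disputes) could be expressed in the same way, each producing triples with its own relation name.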

(II) node and structure representation method based on enterprise relation graph

Given an enterprise relationship graph with N enterprise nodes, let X be the N × F matrix of node features of the entire graph. Elements of a matrix or tensor are denoted by indices in the subscript. For a graph G = (V, E), the following features are used:

Node features: $X_{ij}$ denotes the j-th feature of node i, so all node features can be represented by the N × F matrix X, where N is the number of nodes and F is the feature dimension of each node.

Edge features: $E_{ij}$ denotes the edge feature vector between nodes i and j, and $E_{ijp}$ denotes the p-th dimension of $E_{ij}$. E is the N × N × P tensor of edge features of the graph, and $E_{ij} = 0$ when no link exists between two nodes.

Table of enterprise graph edge features

No.	Feature	Value	Dimension
1	Common shareholders	Shareholding proportion	1×3
2	Common senior executives	Number	1×2
3	Cross-shareholding between enterprises	Shareholding proportion	1×3
4	Inter-enterprise investment	Investment amount	1×4
5	Litigation between enterprises	Number and categories of litigation	1×4
6	Upstream and downstream relationship of enterprises	Upstream or downstream	1×2
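As an illustration of how the edge features in the table could populate the N × N × P tensor E, the following sketch assumes that the six feature groups are simply concatenated, which gives P = 3 + 2 + 3 + 4 + 4 + 2 = 18. The concatenation, the group names, the encoding of the values and the function `build_edge_tensor` are assumptions made for illustration; the application gives only the per-feature dimensions.

```python
import numpy as np

# Widths of the six edge-feature groups, taken from the table above.
FEATURE_WIDTHS = {
    "common_shareholder": 3,
    "common_executive": 2,
    "cross_shareholding": 3,
    "investment": 4,
    "litigation": 4,
    "upstream_downstream": 2,
}
P = sum(FEATURE_WIDTHS.values())  # 18, assuming simple concatenation

# Start offset of each group inside the length-P edge vector.
OFFSETS = {}
_start = 0
for _name, _width in FEATURE_WIDTHS.items():
    OFFSETS[_name] = _start
    _start += _width


def build_edge_tensor(num_nodes, edges):
    """Build the N x N x P tensor E; E[i, j] stays all-zero when i and j are not linked.

    edges: iterable of (i, j, group_name, values), where len(values) equals
    the width of the named group.
    """
    E = np.zeros((num_nodes, num_nodes, P))
    for i, j, group, values in edges:
        start = OFFSETS[group]
        width = FEATURE_WIDTHS[group]
        values = np.asarray(values, dtype=float)
        if values.shape != (width,):
            raise ValueError(f"{group} expects {width} values")
        E[i, j, start:start + width] = values
    return E


# Illustrative values only: nodes 0 and 1 share shareholders and have litigation.
E = build_edge_tensor(3, [
    (0, 1, "common_shareholder", [0.2, 0.5, 0.3]),
    (0, 1, "litigation", [1, 0, 2, 0]),
])
```

This sketch stores directed edges; for symmetric relations such as common shareholders, the same vector would also be written to E[j, i].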

(III) classified learning method based on enterprise relation structure representation

Network embedding aims to learn low-dimensional latent representations of the nodes in a network; the learned representations can serve as features for various graph-based tasks such as classification and clustering.
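To make the idea of mapping a sparse graph matrix to low-dimensional dense vectors concrete, the sketch below uses a truncated singular value decomposition of the adjacency matrix. This is a simple stand-in technique chosen for illustration, not the embedding method of the application; the function `svd_embedding` and the toy graph are hypothetical.

```python
import numpy as np


def svd_embedding(adjacency, dim):
    """Map each row of an N x N adjacency matrix to a dense `dim`-dimensional vector.

    Stand-in technique: truncated SVD, keeping the `dim` largest singular values.
    """
    U, S, _ = np.linalg.svd(adjacency, full_matrices=False)
    return U[:, :dim] * np.sqrt(S[:dim])


# Toy graph: two triangles (0, 1, 2) and (3, 4, 5) joined by the edge 2-3.
A = np.zeros((6, 6))
for i, j in [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]:
    A[i, j] = A[j, i] = 1.0

Z = svd_embedding(A, dim=2)  # one 2-dimensional vector per node
```

Each row of Z could then be passed to a classifier as the feature vector of the corresponding node.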

In the node classification problem setting, each node v is represented as a characteristic x _ v of the node and is associated with a label t _ v, each node is represented by a d-dimensional state vector h _ v, the information of the neighborhood of each node is contained, finally, an embedded state h _ v containing the adjacent information of each vertex v is learned, and the unmarked node is predicted by using the h _ v.

h_v＝f(X_v,X_co[v],h_ne[v],X_ne[v])

Where x _ co [ v ] represents the characteristic of the edge connected to v, h _ ne [ v ] represents the neighbor node to v, and x _ ne [ v ] represents the characteristic of the neighbor node to v. The function f is a mapping function that projects these inputs into a d-dimensional space. Since a unique solution for h _ v needs to be found, the Banach stationary point theorem can be applied and the above equation rewritten as an iterative update process.

H^(t+1) = F(H^t, X)
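The iterative update can be illustrated with a small NumPy sketch. It assumes a hypothetical transition function F(H) = tanh(A H W + X U), where A is an adjacency matrix and W, U are random weight matrices; none of these are specified in the application, and edge features are left out for brevity. W is rescaled so that F is a contraction, which is the condition under which the fixed-point iteration is guaranteed to converge to the same state from any starting point.

```python
import numpy as np


def make_transition(A, X, d, contraction=0.5, seed=0):
    """Build a hypothetical transition F(H) = tanh(A @ H @ W + X @ U).

    W is rescaled so that ||A||_2 * ||W||_2 == contraction < 1, which makes F a
    contraction in the Frobenius norm (tanh is 1-Lipschitz).
    """
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(d, d))
    U = rng.normal(size=(X.shape[1], d))
    W *= contraction / (np.linalg.norm(A, 2) * np.linalg.norm(W, 2))

    def F(H):
        return np.tanh(A @ H @ W + X @ U)

    return F


def fixed_point(F, H0, tol=1e-10, max_iter=1000):
    """Iterate H^(t+1) = F(H^t) until successive states differ by less than tol."""
    H = H0
    for _ in range(max_iter):
        H_next = F(H)
        if np.linalg.norm(H_next - H) < tol:
            return H_next
        H = H_next
    raise RuntimeError("fixed-point iteration did not converge")


# Toy graph: 4 nodes on a path, 3 node features, state dimension d = 2.
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
X = np.arange(12, dtype=float).reshape(4, 3) / 10.0
F = make_transition(A, X, d=2)

# Two different initial states should reach the same fixed point.
H_a = fixed_point(F, np.zeros((4, 2)))
H_b = fixed_point(F, np.full((4, 2), 5.0))
```

Comparing `H_a` and `H_b` shows the independence from the initial state H^0 in this toy setting.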

H and X denote the collections of all states h and all features x, respectively. The output of the GNN is computed by passing the state h_v and the feature x_v to the output function g:

O_v = g(h_v, x_v)

Here F and G denote the transition function and the output function, respectively, each implemented as a feed-forward neural network. By the Banach fixed-point theorem, the iteration H^(t+1) converges to the solution from any initial value H^0. Assuming that the target information is t_v, the L1 loss function is:
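The loss formula itself is not reproduced in this text. Under the definitions above, writing t_i for the target and o_i for the output O_v of the i-th supervised vertex, and p for the number of supervised vertices, an L1 loss would take the following form. This is a reconstruction offered as an assumption, not the application's verbatim formula:

```latex
\mathrm{loss} = \sum_{i=1}^{p} \left| t_i - o_i \right|
```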

Training is carried out on the training set (data labeled with enterprise bankruptcy outcomes), and the bankruptcy probability of each enterprise is then inferred on the test set. Here p is the number of supervised vertices, and training uses gradient descent. The model structure is shown in Fig. 6.
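Training by gradient descent on the supervised vertices can be sketched as follows. The sketch makes several simplifying assumptions that are not in the application: the state matrix H is taken as already computed and held fixed, the output function g is a single logistic unit on the concatenation [h_v, x_v] rather than a feed-forward network, and the L1 loss is assumed to be the sum of |t_i - o_i| over the labeled vertices. All function names are hypothetical.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def output_g(H, X, w, b):
    """Hypothetical output function g: logistic unit on concatenated [h_v, x_v]."""
    return sigmoid(np.concatenate([H, X], axis=1) @ w + b)


def l1_loss(o, t, labeled):
    """Sum of |t_i - o_i| over the supervised (labeled) vertices only."""
    return np.abs(t[labeled] - o[labeled]).sum()


def train_output(H, X, t, labeled, lr=0.1, steps=200):
    """Fit w, b by (sub)gradient descent on the L1 loss; H is held fixed."""
    Z = np.concatenate([H, X], axis=1)
    w = np.zeros(Z.shape[1])
    b = 0.0
    for _ in range(steps):
        o = sigmoid(Z @ w + b)
        # d|t - o|/dz = sign(o - t) * o * (1 - o); zero for unlabeled vertices.
        grad_z = np.where(labeled, np.sign(o - t) * o * (1.0 - o), 0.0)
        w -= lr * (Z.T @ grad_z)
        b -= lr * grad_z.sum()
    return w, b


# Toy data: 4 enterprises, state dimension 2, one node feature.
H = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
X = np.array([[0.2], [0.1], [0.8], [0.9]])
t = np.array([1.0, 0.0, 0.0, 0.0])  # entries for unlabeled vertices are ignored
labeled = np.array([True, False, True, False])  # p = 2 supervised vertices

loss_before = l1_loss(output_g(H, X, np.zeros(3), 0.0), t, labeled)
w, b = train_output(H, X, t, labeled)
o = output_g(H, X, w, b)  # predicted bankruptcy probability for every vertex
loss_after = l1_loss(o, t, labeled)
```

The entries of `o` at the unlabeled positions play the role of the inferred bankruptcy probabilities for enterprises without labels.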

It will be apparent to those skilled in the art that the modules or steps of the present application described above may be implemented by a general-purpose computing device. They may be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by a computing device, so that they may be stored in a storage device and executed by a computing device; or they may each be fabricated as an individual integrated circuit module, or multiple modules or steps among them may be fabricated as a single integrated circuit module. Thus, the present application is not limited to any specific combination of hardware and software.

The above description is only a preferred embodiment of the present application and is not intended to limit the present application, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims

1. A data processing method for enterprise bankruptcy risk prediction, characterized by comprising the following steps:

2. The data processing method for enterprise bankruptcy risk prediction according to claim 1, wherein training a prediction model based on at least enterprise node attributes and enterprise node relationships included in the enterprise relationship graph as inputs comprises:

X_ij represents the j-th feature of node i,

3. The data processing method for enterprise bankruptcy risk prediction according to claim 1, wherein training a prediction model based on at least enterprise node attributes and enterprise node relationships included in the enterprise relationship graph as inputs comprises:

E_ij represents the edge feature vector between node i and node j,

E_ijp represents the p-th dimension of E_ij,

4. The data processing method for enterprise bankruptcy risk prediction as defined in claim 1, wherein predicting enterprise risk by the trained predictive model based on the received enterprise information comprises:

5. The data processing method for enterprise bankruptcy risk prediction as defined in claim 1, wherein predicting enterprise risk by the trained predictive model based on the received enterprise information comprises:

6. A data processing apparatus for enterprise bankruptcy risk prediction, comprising:

the enterprise relation map module is used for establishing an enterprise relation map according to the collected enterprise data;

the model training module is used for training to obtain a prediction model according to enterprise node attributes and enterprise node relations at least included in the enterprise relation graph as input; the prediction model is based on enterprise classification of the graph neural network, and whether the enterprise is bankruptcy is used as a data tag;

and the risk prediction module is used for predicting the bankruptcy risk of the enterprise through the prediction model obtained by training according to the received enterprise information.

7. The data processing apparatus for enterprise bankruptcy risk prediction of claim 6, wherein the model training module is configured to train the model to be used for enterprise bankruptcy risk prediction

X_ij represents the j-th feature of node i,

8. The data processing apparatus for enterprise bankruptcy risk prediction of claim 6, wherein the model training module is configured to train the model to be used for enterprise bankruptcy risk prediction

E_ij represents the edge feature vector between node i and node j,

E_ijp represents the p-th dimension of E_ij,

9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executing the program implements the steps of the data processing method for enterprise bankruptcy risk prediction of any of claims 1 to 5.

10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the data processing method for enterprise bankruptcy risk prediction according to any one of claims 1 to 5.