CN113177831A

CN113177831A - Financial early warning system and method constructed by applying public data

Info

Publication number: CN113177831A
Application number: CN202110271010.9A
Authority: CN
Inventors: 廖丽娟; 林巍
Original assignee: Xian University of Technology
Current assignee: Xian University of Technology
Priority date: 2021-03-12
Filing date: 2021-03-12
Publication date: 2021-07-27
Anticipated expiration: 2041-03-12

Abstract

The invention relates to a financial early warning system constructed by applying public data, which comprises: the searching and alarming module is used for monitoring internet information to obtain input data; the warning sign identifying and analyzing module is used for extracting document data characteristics in the process of converting input data into warning signs so as to clearly identify event information related to early warning enterprises; the pre-warning degree module is used for predicting the warning degree information of an enterprise, early warning the financial risk and dividing the risk level; the invention also discloses a financial early warning method constructed by applying the public data, which can efficiently, quickly and accurately obtain early warning information.

Description

Financial early warning system and method constructed by applying public data

Technical Field

The invention belongs to the technical field of data processing, and particularly relates to a financial early warning system constructed by using public data.

Background

The financial risk early warning system is an early warning system established by a financial supervision authority for better and effectively monitoring financial operation institutions and is used for early warning and forecasting the financial risk which may occur. It is a dual function of risk management and operation evaluation, especially for the prevention and warning of finance and business of listed companies. The meaning of the method is that according to the related regulatory law and the operation principle of the company financial business, a set of pre-warning function, index (Indicator), critical Value or reference Value (default Value), or discriminant model, etc. are selected and defined for a plurality of variables, and in the aspect of financial pre-warning operation, for the prediction of the financial crisis company, the supervisor mostly monitors the operation condition of the financial operation organization by the traditional financial index.

However, the existing early warning system only performs early warning on financial risks according to the volume price data directly related to finance, neglects the influence of the mass text data in the internet and has the problem of low early warning accuracy.

Disclosure of Invention

The invention aims to provide a financial early warning system constructed by applying public data, which can be used for early warning financial risks.

The second purpose of the invention is to provide a financial early warning method constructed by applying public data, which can provide an effective financial risk early warning mode.

The financial early warning system is constructed by applying public data, and comprises a searching alarm source module, a searching alarm source module and a monitoring module, wherein the searching alarm source module is used for monitoring internet information to obtain input data;

the warning sign identifying and analyzing module is used for extracting document data characteristics in the process of converting input data into warning signs so as to clearly identify event information related to early warning enterprises;

and the pre-alarming degree module is used for predicting the alarming degree information of the enterprise.

The present invention is also characterized in that,

the internet information comprises official business networks, financial and newspaper, transactions, news, impurities, media, networks and public opinion public data.

The alert level comprises two levels of alert and no alert.

The second technical scheme adopted by the invention is that the financial early warning method constructed by applying public data is implemented according to the following steps:

step 1, monitoring internet information in real time to obtain input data;

step 2, document data characteristics of a process of changing input data into warning signals are extracted by using a BERT model so as to clarify event information related to early warning enterprises;

and 3, predicting the police degree information of the enterprise by using the LSTM and dividing different police degree grades.

The present invention is also characterized in that,

in the step 1, the internet information comprises official business networks, financial and newspaper, transactions, news, impurities, media, networks and public opinion public data.

The specific process of the step 2 is as follows:

step 2.1, word embedding is carried out, and the expression of word embedding is as follows:

X＝E_pos(onehot(S)+E_seg) (1)

in the formula (1), onehot represents word insertion, E_segRepresenting sentence embedding, E_posIndicating position embedding;

step 2.2, defining a single attention machine mechanism:

in the formula (2), W_Q、W_K、W_VIs a trainable parameter; softmax represents a softmax normalization function, mapping values to the range of 0-1; d represents the dimension of the vector;

step 2.3, combining a plurality of single attention mechanisms to form a multi-head attention mechanism layer, wherein the expression of the multi-head attention mechanism layer is as follows:

MultiHead(X)＝Concat(Att₁,...,Att_h)W^O (3)

in the formula (3), Concate represents a link operation, h represents the number of attention heads, W^ORepresenting trainable parameters;

step 2.4, the output vector of the multi-head attention mechanism layer is added with the word in an embedding mode to form a residual error network, and the final output of the attention mechanism layer is obtained through batch normalization:

M_att＝BN(MultiHead(X)+X) (4)

in formula (4), BN represents batch normalization;

step 2.5, the final output obtained in step 2.4 passes through a full connection layer with a residual error network and batch normalization to obtain a document file characteristic B:

B＝BERT(X)＝BN(Dense(M_att)+M_att) (5)；

for a document S with n words, the character file characteristic B ═ B₁,…,b_n+1},

Where d represents the dimension of the vector.

The specific process of the step 3 is as follows:

step 3.1, assume the hidden layer output h of the last LSTM_t-1And the tth document vector B_tThen LSTM can be expressed as:

f_t＝σ(W_f·Y+b_f) (7)

i_t＝σ(W_i·Y+b_i) (8)

o_t＝σ(W_o·Y+b_o) (9)

c_t＝f_t⊙c_t-1+i_t⊙tanh(W_c·Y+b_c) (10)

h_t＝o_t⊙tanh(c_t) (11)

in the formula, W_i，W_f，W_o，W_c，b_i，b_f，b_o，b_cIs a trainable parameter, σ is a sigmod initiation function,. indicates a Hadamard product, c_tRepresents the output of the t-th document;

step 3.2, vector h obtained by step 3.1_tPredicting the alert level of the enterprise:

l_t＝softmax(Dense(h_t)) (12)。

when l is_tA warning is indicated by more than or equal to 0.5, namely abnormal; when l is_t< 0.5 indicates no alarm, i.e., normal.

The invention has the beneficial effects that:

(1) the financial early warning system constructed by applying the public data avoids the problem of low accuracy caused by the early warning mode of financial risks according to the volume price data directly related to finance;

(2) according to the financial early warning method established by applying the public data, the police degree information of a prediction enterprise is generated by extracting the characteristics of mass text data in the Internet, and the accuracy of financial risk prediction is improved.

Drawings

FIG. 1 is a flow chart of a financial early warning method using public data construction according to the present invention;

fig. 2 is a flowchart of step 2 in the financial early warning method constructed by applying public data according to the present invention.

Detailed Description

The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

The invention relates to a financial early warning system constructed by applying public data, which comprises: the system comprises a searching and warning source module, a searching and warning module and a warning display module, wherein the searching and warning source module is used for monitoring internet information to obtain input data, and the internet information comprises public data such as official networks, financial reports, transactions, news, impurities, media, networks, public opinions and the like;

the pre-alarming degree module is used for predicting the alarming degree information of the enterprise, and the alarming degree grade is divided into two grades of alarming and non-alarming.

The invention provides a financial early warning method constructed by applying public data, which is realized by a financial early warning system constructed by applying the public data, and is specifically implemented according to the following steps as shown in figure 1:

step 1, monitoring internet information in real time to obtain input data, wherein the internet information comprises public data such as enterprise official networks, financial newspapers, transactions, news, impurities, media, networks, public opinions and the like;

step 2, extracting document data characteristics in the process of converting input data into warning signals by using a BERT model pre-trained by financial news data so as to clarify event information related to early warning enterprises;

BERT is a language model proposed by google in 2018, which improves the result on a specific task by pre-training on a large number of documents, and BERT network mainly consists of three parts, a word embedding layer, an attention mechanism layer and a full link layer. The word embedding layer converts the words into codes, and the codes pass through a multi-layer network module consisting of an attention mechanism layer and a full connection layer, and are finally obtained output. The word embedding layer includes three kinds of word embedding, sentence embedding, and position embedding. The word embedding is to code the words, the sentence embedding is to express different sentences by using codes, and the position embedding is to use the learned position embedding;

step 2.1, assume that the text file set S of n text files is ═ S₁,s₂,…,s_nAnd (4) sorting the text file set according to time, and embedding words, wherein the word embedding expression is as follows:

X＝E_pos(onehot(S)+E_seg) (1)

step 2.2, defining a single attention machine mechanism:

MultiHead(X)＝Concat(Att₁(X),...,Att_h(X))W^O (3)

M_att＝BN(MultiHead(X)+X) (4)

in formula (4), BN represents batch normalization;

B＝BERT(X)＝BN(Dense(M_att)+M_att) (5)；

Wherein d represents the dimension of the vector, and Dense represents the fully-connected layer;

step 3, predicting the police degree information of the enterprise by using the LSTM, and dividing different police degree grades;

long-term memory (LSTM) is a neural network used to process column data. He can process the data that changes compared to a normal network. For example, the meaning of a certain document has different meanings due to different contents of the previous document, and the LSTM can well solve the problems;

step 3.1, since there are two inputs for each LSTM unit, assume that the hidden layer output h of the last LSTM is_t-1And the tth document vector B_tThen LSTM can be expressed as:

f_t＝σ(W_f·Y+b_f) (7)

i_t＝σ(W_i·Y+b_i) (8)

o_t＝σ(W_o·Y+b_o) (9)

c_t＝f_t⊙c_t-1+i_t⊙tanh(W_c·Y+b_c) (10)

h_t＝o_t⊙tanh(c_t) (11)

l_t＝softmax(Dense(h_t)) (12)

when l is_tA warning is indicated by more than or equal to 0.5, namely abnormal;

when l is_t< 0.5 indicates no alarm, i.e., normal.

Claims

1. A financial early warning system constructed by applying public data is characterized by comprising:

the searching and alarming module is used for monitoring internet information to obtain input data;

2. The financial early warning system constructed by public data as claimed in claim 1, wherein the internet information comprises corporate public data, financial reports, transactions, news, impurities, media, networks and public opinion public data.

3. The financial early warning system constructed by applying the public data as claimed in claim 1, wherein the alert level comprises two levels of alert and no alert.

4. A financial early warning method constructed by using public data is characterized in that the financial early warning system constructed by using the public data according to any one of claims 1 to 3 is implemented by the following steps:

step 1, monitoring internet information in real time to obtain input data;

5. The financial early warning method built by public data as claimed in claim 4, wherein in the step 1, the internet information comprises official business networks, financial and newspaper, transactions, news, impurities, media, network and public opinion public data.

6. The financial early warning method constructed by applying the public data as claimed in claim 4, wherein the specific process of the step 2 is as follows:

X＝E_pos(onehot(S)+E_seg) (1)

step 2.2, defining a single attention machine mechanism:

MultiHead(X)＝Concat(Att₁，...，Att_h)W^O (3)

M_att＝BN(MultiHead(X)+X) (4)

in formula (4), BN represents batch normalization;

B＝BERT(X)＝BN(Dense(M_att)+M_att) (5)；

for a document S with n words, the character file characteristic B ═ B₁，...，b_n+1}，

Where d represents the dimension of the vector.

7. The financial early warning method constructed by applying the public data as claimed in claim 4, wherein the specific process of the step 3 is as follows:

f_t＝σ(W_f·Y+b_f) (7)

i_t＝σ(W_i·Y+b_i) (8)

o_t＝σ(W_o·Y+b_o) (9)

c_t＝f_t⊙c_t-1+i_t⊙tanh(W_c·Y+b_c) (10)

h_t＝o_t⊙tanh(c_t) (11)

l_t＝softmax(Dense(h_t)) (12)。

8. the financial early warning method constructed by applying public data as claimed in claim 7, wherein when l is equal to_tA warning is indicated by more than or equal to 0.5, namely abnormal; when l is_t< 0.5 indicates no alarm, i.e., normal.