CN115482101A

CN115482101A - Stock prediction method based on historical data screening and momentum overflow effect

Info

Publication number: CN115482101A
Application number: CN202210943495.6A
Authority: CN
Inventors: 高忠科; 苏静钰; 郭嘉仪; 田源; 薄地阔
Original assignee: Shanghai Chunda Asset Management Co ltd; Tianjin University
Current assignee: Shanghai Chunda Asset Management Co ltd; Tianjin University
Priority date: 2022-08-08
Filing date: 2022-08-08
Publication date: 2022-12-16

Abstract

A stock forecasting method based on historical data screening and momentum overflow effect comprises the following steps: collecting historical trading data and text information of the target stock and stocks related to the target stock; data preprocessing, namely converting historical transaction data and text information into two-dimensional feature matrixes respectively; constructing a relation network matrix among N stocks; constructing a fusion model of a time convolution network with a channel domain attention mechanism and a graph convolution network; and training the time convolution network with the channel domain attention mechanism by combining with a fusion model of the graph convolution network, and finally selecting the super-parameter combination with the minimum cross entropy loss value as the optimal super-parameter combination according to a training result. The invention collects the text information on the basis of collecting the historical trading data of the target stock and the related stocks, and realizes the fusion of multi-source information; the historical data is screened by a deep learning method, the price rise and fall of the stocks are predicted, and the precision of predicting the price rise and fall of the stocks is improved.

Description

Stock prediction method based on historical data screening and momentum overflow effect

Technical Field

The invention relates to a stock forecasting method. In particular to a stock forecasting method based on historical data screening and momentum overflow effect.

Background

The stock market is one of important components of the financial market, and the action of investing through buying and selling stocks becomes a common phenomenon in daily life of people, so that the method for predicting the change trend of the stock price by applying a scientific method is very important, a basis can be provided for investors to make investment plans, and risks are avoided to a certain extent. In recent years, many ideas and implementation methods have been proposed by scholars in the financial and computer fields for stock forecasting.

In the aspect of feature selection, the forecasting of the stocks all the time depends on the analysis of historical data of the stocks, such as opening prices, highest prices, lowest prices, volume of interest, market profitability and the like. However, the stock price sequence can be influenced by aspects such as trading behavior, market information and the like, and has the characteristics of nonlinearity, volatility and chaos. Nowadays, investment behaviors of stockholders are extremely easy to be influenced by public opinions, and whether related news reports or opinions found by people of a plurality of investors can influence transaction strategies of partial investors. A great deal of text information on channels such as an investor forum, a financial and economic news platform and the like not only reflects the stock price change situation in the Chinese and foreign languages, but also reveals the direct opinion of investors. The mining, fusion and analysis of the massive information characteristics become one of the keys for analyzing the stock price change.

In the selection of the prediction model, the evolution from a traditional time series model to a machine learning model and then to a deep learning model is undergone. Many practices prove that the deep learning model achieves better effects on stock forecasting problems due to strong adaptability, excellent portability and data analysis capability, and the recurrent neural network and the variant thereof become more widely applied methods. However, the recurrent neural network adopts a sequential processing sequence and a weight sharing mechanism, so that the network occupies a large amount of memory under the condition of large data volume, and each transaction day for prediction is given the same attention. It is therefore a better choice to find a network that processes data in parallel, adding a mechanism of attention.

The momentum spillover effect indicates that the revenue of a company can be predicted from the past revenue of the company because stock prices are generally slow to react to the news related to its associated company. Therefore, analysis of inter-enterprise relationships is an essential link in prediction, and changes of a certain stock may affect other stocks in the industry to different degrees.

Disclosure of Invention

The invention aims to solve the technical problem of providing a stock prediction method based on historical data screening and momentum overflow effect, which can improve the prediction accuracy, for overcoming the defects of the prior art.

The technical scheme adopted by the invention is as follows: a stock forecasting method based on historical data screening and momentum overflow effect is characterized by comprising the following steps:

1) Collecting historical trading data and text information of the target stock and stocks related to the target stock;

2) Data preprocessing, namely converting historical transaction data and text information into two-dimensional feature matrixes respectively;

3) Constructing a relationship network matrix among N stocks, comprising the following steps:

(3.1) constructing a space-time dynamic hierarchical complex network by using each trading day in each historical trading data to obtain a stock relation network under the span of T trading days;

(3.2) calculating degree correlation between layers in the N-layer stock relation network;

(3.3) calculating the clustering coefficient correlation between layers in the N layers of stock relation networks;

(3.4) defining the average value of degree correlation and clustering coefficient correlation among N stocks as the stock correlation R _i,j Is represented as

Normalizing the stock relativity to obtain normalized stock relativity

Thereby obtaining N stock relation network matrixes

4) Constructing a fusion model of a time convolution network with a channel domain attention mechanism and a graph convolution network;

5) Continuously changing the hyper-parameters of the fusion model of the time convolution network combined graph convolution network with the channel domain attention mechanism to train the fusion model for many times, namely calculating the cross entropy loss value of the predicted value and the true value of the target stock closing price, continuously updating the parameters of the fusion model of the time convolution network combined graph convolution network with the channel domain attention mechanism, and finally selecting the hyper-parameter combination with the minimum cross entropy loss value as the optimal hyper-parameter combination according to the training result.

The stock forecasting method based on historical data screening and momentum overflow effect collects text information on the basis of collecting the historical trading data of the target stock and the stock related to the target stock, thereby adding the consideration to the emotion of an investor and realizing the fusion of multi-source information; based on the principle of momentum overflow effect, a stock relation network is added by using a space-time hierarchical complex network method, the influence of information of related stocks of the target stock on the target stock is integrated, and the overall development rule of the industry is more conformed; the historical data is screened by a deep learning method, and the stock price fluctuation is predicted, so that different attention degrees are given to different trading days, and the precision of the stock price fluctuation prediction is improved.

Drawings

FIG. 1 is a flow chart of a stock forecasting method based on historical data screening and momentum overflow effect according to the present invention;

FIG. 2 is a diagram of a converged model architecture for a time convolution network with a channel domain attention mechanism in combination with a graph convolution network;

FIG. 3 is a schematic diagram of a single stock network constructed by using a space-time dynamic hierarchical complex network when a finite traversal view distance d = 1;

fig. 4 is a schematic diagram of a single stock network node relationship when the limited traversal line-of-sight d = 1.

Detailed Description

The stock forecasting method based on historical data screening and momentum overflow effect of the invention is described in detail with reference to the embodiments and the accompanying drawings.

The invention discloses a stock forecasting method based on historical data screening and momentum overflow effect, which is creative in that fusion of multi-source data is realized by combining a time-space convolution network with channel domain attention and a fusion model of a graph convolution network, and a stock relation network is constructed according to the momentum overflow effect principle, so that the forecasting of the target stock closing price rise and fall is realized. Therefore, the invention collects the historical trading data of the target stock and the stocks related to the target stock and obtains the text information in the financial news and the investor forum. And a channel domain attention mechanism and a space-time convolution network are adopted to carry out historical data screening and data parallel processing on a fusion characteristic vector obtained after the historical transaction data characteristics and the text information characteristics are fused, so that more attention is given to important transaction days while the training cost is reduced.

On the aspect of analyzing the relationship among enterprises, the invention provides a method for constructing a relationship network among a plurality of stocks by utilizing a space-time dynamic hierarchical complex network, and further fusing a deep fusion feature vector and a stock relationship network by using a graph convolution network. And finally, constructing a model output layer by utilizing a feedforward neural network to predict the development trend of the price rise and fall of the target stock.

The stock forecasting method based on historical data screening and momentum overflow effect comprises the steps of collecting historical trading data and text information of a target stock and N-1 stocks related to the target stock, preprocessing the historical trading data and the text information, and meanwhile, obtaining a stock relation network by constructing a space-time dynamic hierarchical complex network. By training the time convolution network with the channel domain attention mechanism and the fusion model of the graph convolution network, the comprehensive characteristics of the historical trading data characteristics and the text information characteristics are subjected to data screening, and further characteristic extraction is performed by combining the stock relation network, and finally the rise and fall prediction of the target stock closing price is realized. As shown in fig. 1, the method comprises the following steps:

the closing price of the target stock on the t-th trading day is higher than that of the previous day, and is classified as rising, and the closing price of the target stock on the t-th trading day is lower than that of the previous day, and is classified as falling. Historical trading data and text information for N stocks, including the target stock, is collected.

The historical transaction data comprises five indexes which are respectively as follows: opening price, closing price, transaction amount, highest price and lowest price; the text information comprises financial news for selecting the target stock and the stock related to the target stock, and comments issued by investors in the eastern wealth network forum.

2) Data preprocessing, namely converting historical transaction data and text information into two-dimensional feature matrixes respectively; the method comprises the following steps:

performing data cleaning on the collected historical transaction data (the data cleaning is the last procedure for finding and correcting recognizable errors in the data file and comprises checking data consistency, processing invalid values, missing values and the like); performing text quantization on the collected text information, and obtaining daily emotional tendency vectors of each stock through an emotional dictionary to form daily text information characteristics of each stock; setting the ith stock at the tThe historical transaction data of each transaction day is characterized by

Wherein L' is a characteristic dimension of the historical transaction data; similarly, the text message characteristic of the ith stock on the tth trading day is set as

Where L is the feature dimension of the text information feature.

(3.1) constructing a space-time dynamic hierarchical complex network by using each trading day in each historical trading data to obtain a stock relation network under the span of T trading days; the method comprises the following steps:

regarding each trading day as a node of a single stock network, two nodes

And

between the vertical bars, a horizontal connecting line is constructed, wherein, the nodes

Indicating that the ith stock is at the t _m The closing price of each trading day is

Node point

Denoted as the ith stock at the t _n The closing price of each trading day is

The height of the connecting line being the minimum of the square bars, i.e.

t _m ,t _n E [ T-T, T) represents the T-th transaction day ranging from the T-T transaction day to the T-th transaction day _m Date of transaction and tth _n Setting the limited crossing visual range as d on each trading day, and setting the two nodes if

And

the horizontal connecting line between the two nodes is intersected with d intermediate nodes or less, then

And

a connecting edge exists between the two, otherwise, the connecting edge does not exist; fig. 3 shows a process of constructing a single stock network by using a spatio-temporal dynamic hierarchical complex network when a limited traversal line-of-sight d =1 is set, in which all network nodes with connecting edges are summarized, and a black solid line represents that no intermediate node exists in two histogram horizontal connecting lines; the black dashed line indicates that there is an intermediate node between the two histogram horizontal lines, and fig. 4 shows the relationship corresponding to the network node in fig. 3. Respectively constructing a layer of single stock network for each stock in the N stocks to obtain N layers of stock relation networks

Where N is the total number of the target stock and the stocks associated with the target stock.

(3.2) calculating degree correlation between layers in the N-layer stock relation network; the method comprises the following steps:

nodes according to ith stock network

Value of (A)

And j (th) nodes of stock network

Value of (A)

Calculating the value sequence of the ith stock network

And j-th stock network value sequence

The mutual information between the stock and the stock is used for representing the degree correlation of the ith stock and the jth stock in the T trading days

Namely degree correlation between layers in the N-layer stock relation network, the calculation formula is as follows:

wherein, p (k) ⁱ ) Is the degree distribution of the ith stock, p (k) ^j ) Is the degree distribution of the jth stock, p (k) ⁱ ,k ^j ) Is the joint degree distribution of the ith stock and the jth stock.

(3.3) calculating the clustering coefficient correlation between layers in the N layers of stock relation networks; the method comprises the following steps:

node according to ith layer single stock network

Cluster coefficient of (2)

Node of single stock network of j-th layer

Cluster coefficient of (2)

Calculating the clustering coefficient sequence of the ith layer of single stock network

And the clustering coefficient sequence of the j-th layer single stock network

The mutual information between the stock and the stock is used for expressing the clustering coefficient correlation of the ith stock and the jth stock in the T trading days

The calculation formula is as follows:

wherein, p (z) ⁱ ) Is the clustering coefficient distribution of the ith stock, p (z) ^j ) Is the clustering coefficient distribution of the jth stock, p (z) ⁱ ,z ^j ) Is the joint clustering coefficient distribution of the ith stock and the jth stock.

(3.4) defining the average value of degree correlation and clustering coefficient correlation among N stocks as the stock correlation R _i,j Is shown as

Normalizing the stock relativity to obtain normalized stock relativity

Thereby obtaining N stock relation network matrixes

4) Constructing a fusion model of a time convolution network with a channel domain attention mechanism and a graph convolution network; as shown in fig. 2, includes:

(4.1) Using K-dimensional bilinear tensor product terms

The calculation method of (1) fuses the historical transaction data characteristics and the text information characteristics of each stock, and calculates the kth item in the K-dimensional bilinear tensor product through tensor slicing, wherein the calculation formula is as follows:

wherein the content of the first and second substances,

is a third order tensor, Γ ^[1:K] ＝[Γ ¹ ，...，Γ ^k ，...，Γ ^K ]L' is the dimension of table historical transaction data characteristics, L is the dimension of text information characteristics, K is the dimension of bilinear vector product items,

the characteristic of the ith dimension in the historical trading data characteristic representing the ith stock,

the ith dimension of the text information features representing the ith stock is characterized by the historical trading data

And text information features

By a weight matrix

Performing series and linear transformation to obtain the fusion characteristic vector of the ith stock

Wherein, | | represents a concatenation,

representing the offset, tanh is the activation function;

(4.2) processing the fusion feature vector of each stock by using a channel attention mechanism, wherein the channel attention gives corresponding weight according to the influence degree of the fusion feature vectors of different trading days on the current trading day to complete the screening of historical data, and the specific processing formula is as follows:

wherein, the first and the second end of the pipe are connected with each other,

is a channel attention feature vector of the ith stock in T trading days, f is a feature dimension of each trading day output by the channel attention mechanism,

representing the fusion characteristic vector set of the ith stock in T trading days; CA represents the channel attention mechanism;

(4.3) respectively carrying out high-order feature extraction on the channel attention feature vector of each stock through a time convolution network model, wherein the extraction formula is as follows:

wherein the content of the first and second substances,

the depth fusion characteristic vector of the ith stock output by the time convolution network model on the tth trading day is represented, and D is

TCN represents a time convolutional network model;

(4.4) deep fusion feature vector of N stocks and N layers of stock relation network

The characteristic extraction is carried out by using a graph convolution network with a gating mechanism, and the introduction of the gating mechanism can abandon small-amplitude price change and target stocks s which do not influence the stock price fluctuation _a And with the target stock s _a Relationship feature vector between related stocks

Is represented as follows:

wherein the content of the first and second substances,

representing the target stock s normalized on the tth trading day _a And with the target stock s _a Related stocks s _b Normalized stock correlation therebetween, and

the larger the value is, the stronger the correlation between the two stocks is, and the weaker the correlation is;

is a target stock s _a A weight matrix shared with stocks related to the target stock, wherein D' is a dimension of the outputted relational feature vector; sigma represents a sigmoid function;

represents a series connection; p (-) represents a gating mechanism,

is a weight matrix in the gating mechanism;

is an offset;

(4.5) sending the obtained relation characteristic vector and the obtained depth fusion characteristic vector into an output layer, wherein the output layer is a two-layer feedforward neural network with a softmax function, and the output result is a prediction result of future fluctuation of the target stock:

wherein the content of the first and second substances,

is the target stock s _a And with the target stock s _a The relationship feature vector between the related stocks,

is a target stock s _a The depth-fused feature vector of (a),

is a weight matrix obtained by neural network training, P represents the number of classifications,

which represents the amount of offset, is,

is to the target stock _sa The result of the future rise and fall prediction is also the final output of the fusion model of the time convolution network with the channel domain attention mechanism and the graph convolution network.

5) And continuously changing the hyper-parameters of the fusion model of the time convolution network with the channel domain attention mechanism combined with the graph convolution network to train the fusion model for multiple times, namely calculating the cross entropy loss value of the predicted value and the true value of the target stock closing price, continuously updating the parameters of the fusion model of the time convolution network with the channel domain attention mechanism combined with the graph convolution network, and finally selecting the hyper-parameter combination with the minimum cross entropy loss value as the optimal hyper-parameter combination according to the training result.

The above description of the present invention and the embodiments is not limited thereto, and the description of the embodiments is only one of the implementation manners of the present invention, and any structure or embodiment similar to the technical solution without inventive design is within the protection scope of the present invention without departing from the inventive spirit of the present invention.

Claims

1. A stock forecasting method based on historical data screening and momentum overflow effect is characterized by comprising the following steps:

Normalizing the stock relativity to obtain normalized stock phaseSex of concern

Thereby obtaining N stock relation network matrixes

2. The method as claimed in claim 1, wherein the historical trading data of step 1) includes five indexes, which are: opening price, closing price, transaction amount, highest price and lowest price; the text information comprises financial news for selecting the target stock and the stocks related to the target stock, and comments issued by investors in the oriental wealth network forum.

3. The stock forecasting method based on historical data screening and momentum overflow effect as claimed in claim 1, wherein the step 2) comprises:

performing data cleaning on the collected historical transaction data; performing text quantization on the collected text information, and obtaining daily emotional tendency vectors of each stock through an emotional dictionary to form daily text information characteristics of each stock; setting the historical trading data characteristics of the ith stock on the tth trading day as

Where L is the feature dimension of the text information feature.

4. The stock forecasting method based on historical data screening and momentum overflow effect as claimed in claim 1, wherein the step 3) the (3.1) th step comprises:

regarding each trading day as a node of a single stock network, two nodes

And

between the vertical bars of (1) a horizontal connecting line is constructed, wherein, the nodes

Node point

Denoted as the ith stock at the t _n The closing price of each trading day is

The height of the connecting line being the minimum of the square bars, i.e.

Indicating the tth transaction day ranging from tth-tth transaction day to tth transaction day _m Date of transaction and tth _n Setting the limited crossing visual range as d on each trading day, and setting the two nodes if

And

the horizontal connecting line between the intermediate nodes is intersected with d intermediate nodes which are less than or equal to

And

a connecting edge exists between the two, otherwise, the connecting edge does not exist; respectively constructing a layer of single stock network for each stock in the N stocks to obtain N layers of stock relation networks

5. The stock forecasting method based on historical data screening and momentum overflow effect as claimed in claim 1, wherein the step 3) and the step (3.2) comprise:

nodes according to ith stock network

Value of (A)

And j (th) nodes of stock network

Value of (A)

Calculating the value sequence of the ith stock network

And the value sequence of the j' th stock network

Mutual information between them, which is used to represent the degree correlation between the ith stock and the jth stock in T trading days

6. The stock forecasting method based on historical data screening and momentum overflow effect as claimed in claim 1, wherein the step 3) and the (3.3) step comprise:

nodes according to ith layer single stock network

Cluster coefficient of (2)

Node of single stock network of j layer

Cluster coefficient of (2)

And the clustering coefficient sequence of the j-th layer single stock network

The calculation formula is as follows:

wherein, p (z) ⁱ ) Is the clustering coefficient distribution of the ith stock, p (z) ^j ) Is the clustering coefficient distribution of the j-th stock, p (z) ⁱ ,z ^j ) Is the joint clustering coefficient distribution of the ith stock and the jth stock.

7. The stock forecasting method based on historical data screening and momentum overflow effect as claimed in claim 1, wherein the step 4) comprises:

(4.1) Using K-dimensional bilinear tensor product terms

wherein the content of the first and second substances,

is a third order tensor, gamma ^[1:K] ＝[Γ ¹ ，...，Γ ^k ，...，Γ ^K ]L' is the dimension of the table historical transaction data characteristic, L is the dimension of the text information characteristic, K is the dimension of the bilinear vector product item,

the ith dimension of the text information feature representing the ith stock is used for characterizing the historical transaction data

And text information features

By a weight matrix

Wherein, | | represents a concatenation,

representing the offset, tanh is the activation function;

(4.2) processing the fusion feature vector of each stock by using a channel attention mechanism to complete the screening of historical data, wherein the specific processing formula is as follows:

wherein the content of the first and second substances,

TCN represents a time convolutional network model;

Feature extraction using graph convolution networks with gating mechanisms, target stocks s _a And with target stocks s _a Related strandRelationship feature vector between tickets

Is represented as follows:

wherein the content of the first and second substances,

is the target stock s _a A weight matrix shared with stocks related to the target stock, wherein D' is a dimension of the outputted relational feature vector; sigma represents a sigmoid function;

represents a series connection; p (-) represents a gating mechanism,

is a weight matrix in the gating mechanism;

is an offset;

wherein the content of the first and second substances,

is the target stock s _a The depth-fused feature vector of (a),

which represents the amount of offset, is,

is to the target stock s _a The result of the future rise and fall prediction is also the final output of the fusion model of the time convolution network with the channel domain attention mechanism and the graph convolution network.