CN109033521B

CN109033521B - Newly-built railway slope-limiting optimization decision method

Info

Publication number: CN109033521B
Application number: CN201810658482.8A
Authority: CN
Inventors: 蒲浩; 张洪; 李伟; 王雷; 宋陶然; 李晓明; 谢佳; 王杰; 彭先宝; 胡建平
Original assignee: Central South University
Current assignee: Central South University
Priority date: 2018-06-25
Filing date: 2018-06-25
Publication date: 2021-04-20
Anticipated expiration: 2038-06-25
Also published as: CN109033521A

Abstract

The invention discloses a newly-built railway slope-limiting optimization decision method, which comprises the following steps: firstly, constructing a deep convolutional neural network model; then establishing a railway case database, representing various factors influencing the decision of limiting the gradient of the newly-built railway into a gray-scale image, and fusing the gray-scale image into a multi-channel image for training a network model; and finally, providing a sliding scanning technology, and performing railway slope limit decision by combining the trained deep convolutional neural network model. Compared with the prior art, the method has the advantages of high automation degree, strong practicability, high operation efficiency, good application prospect and the like.

Description

Newly-built railway slope-limiting optimization decision method

Technical Field

The invention relates to a railway design method, in particular to a newly-built railway slope-limiting optimization decision method.

Background

The grade limit is a main technical standard of railways with global significance, and directly influences the transportation capacity, engineering cost, operation cost and traffic safety of a line, and even possibly determines the trend of the line. With the rapid development of economy in China, the railway transportation demand is continuously increased, meanwhile, railway construction is gradually changed from the eastern plain to the western mountain area, and the contradiction between railway engineering construction and the increasing transportation demand is more prominent due to the complex environment of the hard mountain area: in order to better adapt to complex terrain and geological conditions, shorten the line length and save the engineering construction cost, the adoption of a larger limited gradient is an effective means; however, the line transportation capacity is also affected by the maximum limit gradient, and in the case of the same model number (i.e. the same traction power), the use of a larger limit gradient will reduce the traction tonnage of the locomotive, thereby reducing the line transportation capacity, and also increasing the operation cost and the risk of the downhill section. In addition, the limiting gradient is a fixed equipment standard and is difficult to modify once the railway is built. Therefore, how to scientifically and reasonably decide the limit gradient which is optimally matched with the natural, economic and social environments is a great problem in the design of the railway line at present.

The decision of limiting the gradient of the newly-built railway is essentially to explore the mapping rule of multi-dimensional influence factors (such as terrain conditions, transportation requirements and the like) and the limiting gradient, so as to select the optimal scheme. In a traditional limited gradient optimization decision method, a rule between elements is assumed to conform to a certain mathematical model expression, and then a mapping rule is obtained by counting regression model parameters. For example, the Wang mansion of southwest university of transportation obtains a general empirical formula (1) of a limited slope and engineering cost mapping rule by performing statistical regression on design data of railways in thousands of kilometers of mountain areas in China. Wherein A is engineering cost, I is limiting gradient, and a, b and c are model parameters related to terrain conditions obtained through statistical regression.

However, the mapping rule between the multidimensional influencing factor and the limiting gradient is complex and nonlinear, and is difficult to completely and accurately express through a fixed functional relation. Therefore, a method for comprehensively and accurately identifying the mapping rule between the multidimensional influence factors and the limited gradient is urgently needed, and the optimization decision of the limited gradient of the newly-built railway is realized.

Disclosure of Invention

The technical problem to be solved by the invention is as follows: the method can comprehensively and accurately identify the mapping rule between the multidimensional influence factors and the limited gradient, and further realize the optimization decision of the limited gradient of the newly-built railway.

In order to solve the technical problems, the invention adopts the technical scheme that: a newly-built railway slope-limiting optimization decision method comprises the following steps:

S₁: constructing a deep convolution neural network model for newly building a railway slope limiting optimization decision;

S₂: establishing a training data set D for training a deep convolutional neural network_trainAnd validating the data set D_validate；

S_2-1: collecting N₁Establishing a railway case data set D by adopting built passenger-cargo collinear railway cases with different limiting slopes₁；

S_2-2: based on the railway case data set D₁Dividing rectangular research areas of each railway case at the starting and ending positions of each railway line, extracting grid elevation data information in each rectangular research area, and establishing a railway case elevation data set D₂；

S_2-3: based on D₂Drawing the elevation gray-scale map P of each rectangular research area according to the grid elevation data information of each railway case research area_elevationEstablishing an elevation gray level atlas D for representing the terrain elevation change characteristics of each railway case research area_elevation；

S_2-4: based on D₂Drawing gradient gray level graph P of each rectangular research area according to grid elevation data information of each railway case research area_slopeAnd establishing a gradient gray level atlas D for representing the terrain gradient characteristics of the railway case research area_slope；

S_2-5: representing different railway grades as grey-scale maps with different grey-scale values according to D₁Each of which isActual grade of railway case, drawing railway grade gray-scale map P corresponding to each railway case_{classification}Establishing a railway grade gray level map set D_{classification}；

S_2-6: characterizing different locomotive models as gray-scale maps with different gray-scale values according to D₁The actual locomotive model used by each railway case is drawn, and a locomotive model gray scale map P corresponding to each railway case is drawn_locomotiveEstablishing a model gray level map set D of the locomotive_locomotive；

S_2-7: elevation gray scale atlas D based on establishment_elevationGradient gray scale atlas D_slopeRailway grade gray scale atlas D_{classification}Locomotive model gray scale atlas D_locomotiveFused to D₁Elevation gray-scale map P of each railway case_elevationGradient gray scale map P_slopeRailway grade gray scale map P_{classification}Gray scale map P of motor vehicle model_locomotiveForming a four-channel map P capable of representing the information of each railway case_mergeAnd creating a data set D_merge；

S_2-8: data set D_mergeCutting a four-channel image representing information of each railway case into images with the size of 333 multiplied by 333 pixels, and giving label data, wherein the label data is a limited gradient value actually used by each railway case;

S_2-9: will S_2-8Dividing the obtained labeled data graph according to the proportion of 4:1, and establishing a training data set D for training the deep convolutional neural network_trainAnd validating the data set D_validate；

S₃: by using S₂Established training data set D_trainTraining the constructed network model and adopting S₂Created validation data set D_validateVerifying the model precision to obtain a trained and verified deep convolutional neural network model;

S₄: in addition, collecting N₂Bars and data sets D₁In different built passenger-cargo collinear railway cases and according to stepsStep S_2-2To S_2-7Generating a four-channel map P characterizing railway case information_mergeEstablishing a test data set D_test；

S₅: providing a sliding scanning technology, scanning a data set D by a trained deep convolution neural network model from left to right and from top to bottom_testRepresenting four-channel map of elevation information, gradient information, railway grade information and locomotive model information of each railway case, and determining D according to output times of each limited gradient value_testAnd (4) the recommended limit gradient value of each railway case.

Further, the step S₁The deep convolutional neural network model constructed in (1) comprises 5 convolutional layers (Conv), 3 pooling layers (Pool), 2 full-link layers (FC) and 1 Softmax output layer:

(1) the convolution kernel size adopted by the first convolution layer (Conv1) is 33 multiplied by 3, the step size is 4, the number of convolution kernels is 96, and a modified linear unit (ReLU) is connected behind Conv1 to be used as a nonlinear activation function, so that the model has nonlinear characteristics;

(2) conv1 is connected with a first pooling layer (Pool1) after nonlinear treatment, the size of a pooling core adopted by Pool1 is 4 x 4, and the step size is 2;

(3) a second convolutional layer (Conv2) is connected behind the Pool1, the size of a convolution kernel adopted by the Conv2 is 3 multiplied by 96, the step size is 1, the number of the convolution kernels is 256, and a modified linear unit (ReLU) is connected behind the Conv2 for nonlinear processing;

(4) conv2 was treated non-linearly and then connected to a second pooling layer (Pool2), Pool2 with pooling kernel size of 3 × 3 and step size of 2;

(5) a third convolutional layer (Conv3) is connected behind the Pool2, the size of a convolution kernel adopted by the Conv3 is 3 multiplied by 256, the step size is 1, the number of the convolution kernels is 384, and a modified linear unit (ReLU) is connected behind the Conv3 for nonlinear processing;

(6) conv3 is connected with a fourth convolutional layer (Conv4) after nonlinear processing, the size of a convolution kernel adopted by Conv4 is 3 multiplied by 384, the step size is 1, the number of the convolution kernels is 384, and a modified linear unit (ReLU) is connected after Conv4 for nonlinear processing;

(7) conv4 is connected with a fifth convolutional layer (Conv5) after nonlinear processing, the size of a convolution kernel adopted by Conv5 is 3 multiplied by 384, the step size is 1, the number of convolution kernels is 256, and a modified linear unit (ReLU) is connected after Conv5 for nonlinear processing;

(8) conv5 was treated non-linearly and then connected to a third pooling layer (Pool3), Pool3 with pooling kernel size of 3 × 3 and step size of 2;

(9) a first full connection layer (FC1) is connected behind the Pool3, in order to prevent an overfitting phenomenon, a dropout function is adopted for connecting the Pool3 layer to the FC1 layer, and a modified linear unit (ReLU) is connected behind the FC1 layer for nonlinear processing;

(10) FC1 is connected with a second full connection layer (FC2) after nonlinear processing, a dropout function is adopted to prevent the over-fitting phenomenon, and a correction linear unit (ReLU) is connected with the FC2 for nonlinear processing;

(11) and the FC2 is connected with a Softmax output layer after nonlinear processing and is used for outputting the newly-built railway limit gradient value recommendation.

Further, the step S_2-1The railway cases collected in (1) cover different grades of railway and different locomotive models.

Further, the step S_2-2The middle railway rectangular research area division method comprises the following steps: setting the starting point and the ending point of a certain railway case line as S_i：(x_Si,y_Si) And E_i：(x_Ei,y_Ei) Then the area of investigation of the railway case is S_iAnd E_iAs diagonal point, with | x_Ei-x_SiL is long, | y_Ei-y_SiAnd | is a wide rectangular area.

Further, the step S_2-3、S_2-4And drawing the elevation gray-scale map and the gradient gray-scale map of the rectangular research area of each railway case by adopting Global Mapper software.

Further, the step S_2-5Middle and railway grade gray scale map P_{classification}Is the same size as the rectangular study area of the railway case.

Further, the stepsS_2-6Grayscale map P of middle and middle locomotive model_locomotiveIs the same size as the rectangular study area of the railway case using the locomotive model.

Further, the step S_2-7Four-channel map P of each railway case_mergeThe elevation gray-scale map P of each railway case is obtained by adopting merge function in computer vision library OpenCV_elevationGradient gray scale map P_slopeRailway grade gray scale map P_{classification}And a locomotive type gray scale map P_locomotiveAnd obtaining the fusion protein after fusion.

Further, the step S₃The network model constructed by the middle training is based on S₂Created tag data set D_trainContinuously updating the connection weight between each layer in the network model by a gradient descent algorithm, which comprises the following specific steps:

(1) softmax layer connection weight update

The Softmax layer is used for outputting the limited gradient value recommended by the model, calculating the output probability of each limited gradient value according to the output value of each neuron in the previous layer, and selecting the gradient value with the maximum output probability as the limited gradient value recommended by the model, wherein the function expression of the gradient value is shown as the formula (2):

in the formula: p (y)⁽ⁱ⁾＝j|x⁽ⁱ⁾(ii) a W) is the probability that the ith picture is taken as input data, the jth value is selected as the limiting gradient in the Softmax layer, and x⁽ⁱ⁾Is input data of the Softmax layer (i.e. output data of the previous layer), and W is a connection weight of the Softmax layer and the previous layer.

Establishing a model loss function E based on a Softmax function, wherein the function expression of the model loss function E is shown as a formula (3):

in the formula: 1{ y⁽ⁱ⁾J is logicExpression, if the i input pictures are marked as the jth limiting gradient, 1{ y }⁽ⁱ⁾1, otherwise 1{ y }⁽ⁱ⁾J is 0, and λ is a weight attenuation coefficient.

Based on the loss function E, the residuals of the neurons in the Softmax layer can be calculated as equation (4):

the connection weights of the neurons in the Softmax layer are updated according to equations (5) and (6):

(2) full connection layer connection weight update

Each neuron of the full connection layer is connected with all neurons of the previous layer, and the connection weight updating formula is as follows:

in the formula: w^lA connection weight matrix for each neuron of the current layer (full connection layer), b^lThe connecting bias vector of each neuron in the current layer is alpha, which is the learning rate.

Partial derivative of loss function to neuron connection weight of full connection layer

And partial derivatives of bias for neuron connections of full connection layer

Can be calculated according to equation (9) and equation (10), respectively.

In the formula: x is the number of^l-1Is the output vector, delta, of a connected layer above the current layer (full connected layer)^lThe residual error of each neuron in the current layer (full connection layer) can be determined according to the residual error delta of each neuron in the next connection layer^l+1And (4) calculating.

In the formula: w^l+1F (-) is the ReLU activation function, which is the connection weight matrix of each neuron of the posterior connection layer of the current layer (full connection layer).

(3) Convolutional layer connection weight update

Each neuron of the convolution layer is connected with the previous layer through a convolution kernel, and the connection weight updating formula of each convolution kernel is as follows:

in the formula:

the connection weight matrix of the (d) th convolution kernel of the current layer (convolutional layer),

the connected offset vector of the d-th convolution kernel of the current layer (convolution layer) is denoted by α as the learning rate.

The d-th convolution kernel of the current layer (convolution layer) is connected with the weight partial derivative by the loss function

The calculation formula of (a) is as follows:

in the formula:

is the output value of the D' th characteristic diagram of the previous connection layer of the current layer (convolution layer), D^l-1The number of feature maps of the previous connection layer of the current layer (convolution layer),

the residual matrix is the d-th characteristic diagram of the current layer (convolutional layer).

The d convolution kernel of the current layer (convolution layer) is connected with the bias partial derivative by the loss function

The calculation formula of (a) is as follows:

in the formula:

the connected offset vector of the d-th feature map in the current layer (convolutional layer),

and

respectively the number of rows and columns of the d-th feature map in the current layer (convolutional layer),

the residual values of i row and j column in the d-th feature map in the current layer (convolutional layer) are shown.

The residual of the current layer (convolutional layer) is calculated based on the layer residual of the next connection by back propagation. If the current layer (convolutional layer) is connected to the subsequent pooling layer, the residual matrix of the d-th feature map of the current layer (convolutional layer) is calculated by equation (17).

In the formula: x^l-1Is the output matrix of the previous connection layer of the current layer (convolutional layer),

residual matrix of the d-th characteristic diagram in a connected layer behind the current layer (convolutional layer).

If a convolutional layer is connected after the current layer (convolutional layer), the weight matrix of the current layer (convolutional layer) is calculated by equation (18).

In the formula:

is the residual matrix of the d' th characteristic diagram in the next connection layer after the current layer (convolution layer),

a d-th layer weight matrix of a d' th convolution kernel of a connection layer subsequent to the current layer (convolutional layer),

the output matrix of the d-th characteristic diagram of the current layer (convolutional layer).

Further, the step S₅The sliding scanning technique in (1) is specifically as follows: when scanning the test data set D_testAnd when a certain four-channel image is scanned, the recommended limit gradient value of a 333 x 333 pixel area in the four-channel image can be output each time, and after the whole four-channel image is scanned, the gradient value with the maximum output times is selected as the recommended limit gradient value of the railway case represented by the four-channel image.

The invention has the beneficial effects that: the deep learning simulates the hierarchical structure of the brain, can automatically acquire hierarchical multi-layer feature expression from massive data, and explores the potential rules existing between input data and output data without giving mathematical expressions. The scheme of the invention adopts a deep learning algorithm to make the slope limiting decision of the newly-built railway feasible. The invention provides a newly-built railway limited gradient optimization decision method based on a convolutional neural network in a deep learning algorithm. The scheme of the invention adopts a sliding scanning technology, and realizes the decision of limiting the gradient of different railway cases. The method has the advantages of high automation degree, strong practicability, high operation efficiency and good popularization and application prospects.

Drawings

FIG. 1 is a schematic flow chart of a newly built railway slope limiting optimization decision method of the invention;

FIG. 2 is a deep convolutional neural network model according to an embodiment of the present invention;

fig. 3 is a schematic structural diagram of a sliding scanning technique according to an embodiment of the present invention.

Detailed Description

In order to explain technical contents, achieved objects, and effects of the present invention in detail, the following description is made with reference to the accompanying drawings in combination with the embodiments.

An embodiment of the present invention is a newly-built railway slope-limiting optimization decision method, as shown in fig. 1, the optimization decision method includes the following steps:

S₁: the method comprises the following steps of constructing a deep convolutional neural network model for newly-built railway slope-limiting optimization decision, wherein the constructed network model comprises 5 convolutional layers (Conv), 3 pooling layers (Pool), 2 full-connection layers (FC) and 1 Softmax output layer:

(9) a first full connection layer (FC1) is connected behind the Pool3, in order to prevent an overfitting phenomenon, a dropout function is adopted from a Pool3 layer to an FC1 layer, and a modified linear unit (ReLU) is connected behind the FC1 layer for nonlinear processing;

(11) and the FC2 is connected with a Softmax output layer after nonlinear processing and is used for outputting the slope limit recommended value of the newly-built railway.

S_2-1: 246 passenger-cargo collinear railway cases with the gradient limited by 6 per thousand, 12 per thousand and 24 per thousand are collected, the collected railway cases cover four railway grades of grade I, grade II, grade III and grade IV, three locomotive models of Shaoshan model 1, Shaoshan model 3 and Shaoshan model 4, and a railway case data set D is established₁；

S_2-5: with gray-scale values of 0, 40, 80, 120 respectivelyThe grey scale maps characterize four railway classes and are based on D₁The actual grade of each railway case is drawn, and a railway grade gray scale map P corresponding to each railway case is drawn_{classification}Establishing a railway grade gray level map set D_{classification}；

S_2-6: representing three electric locomotive models of Shaoshan 1 type, Shaoshan 3 type and Shaoshan 4 type by gray scale graphs with gray scale values of 160, 200 and 240 respectively, and according to D₁The actual locomotive model used by each railway case is drawn, and a locomotive model gray scale map P corresponding to each railway case is drawn_locomotiveEstablishing a model gray level map set D of the locomotive_locomotive；

S_2-9: will S_2-8Dividing the obtained picture with the label according to the proportion of 4:1, and establishing a training data set D for training the deep convolutional neural network_trainAnd validating the data set D_validate；

S₃: by using S₂Established training data set D_trainTraining the constructed network model and adopting S₈Created validation data set D_validateAnd verifying the model precision to obtain a trained and verified deep convolution neural network model. The training and verification takes 9 hours and 35 hoursIn minutes (i7 processor, 16G memory and GTX 1080 video card), a deep convolutional neural network model with the accuracy of 83.35% is obtained.

S₄: in addition, 36 pieces of data were collected together with the data set D₁In different built passenger-cargo collinear railway cases and according to the step S_2-2To S_2-7Establishing a four-channel map P representing railway case information_mergeEstablishing a test data set D_test；

S₅: providing a sliding scanning technology, scanning a data set D by a trained deep convolution neural network model from left to right and from top to bottom_testRepresenting four-channel map of elevation information, gradient information, railway grade information and locomotive model information of each railway case, and determining D according to output times of each limited gradient value_testAnd (4) the recommended limit gradient value of each railway case. In 36 railway cases tested at this time, the limited gradient of 34 railway cases is accurately decided (that is, the limited gradient value recommended by the model is the same as the limited gradient value of the manual decision), and the accuracy can reach 94.44%.

The sliding scanning technology refers to that the limited gradient value is determined according to the output times of different limited gradients by scanning the whole picture.

In summary, the invention provides a newly-built railway limited gradient optimization decision method, which comprises the steps of firstly constructing a deep convolution neural network model, then establishing a railway case database, representing various factors influencing the newly-built railway limited gradient decision into a gray-scale image, and fusing the gray-scale image into a multi-channel image for training the network model; and finally, providing a sliding scanning technology, and combining the trained deep convolutional neural network model to perform newly-built railway slope limit decision.

The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent changes made by using the contents of the present specification and the drawings, or applied directly or indirectly to the related technical fields, are included in the scope of the present invention.

Claims

1. A newly-built railway slope-limiting optimization decision method is characterized by comprising the following steps: the method comprises the following steps:

S_2-5: representing different railway grades as grey-scale maps with different grey-scale values according to D₁The actual grade of each railway case is drawn, and a railway grade gray scale map P corresponding to each railway case is drawn_{classification}Establishing a railway grade gray level map set D_{classification}；

S₄: in addition, collecting N₂Bars and data sets D₁In different built passenger-cargo collinear railway cases and according to the step S_2-2To S_2-7Generating a four-channel map P characterizing railway case information_mergeEstablishing a test data set D_test；

S₅: scanning the data set D by the trained deep convolution neural network model from left to right and from top to bottom_testRepresenting four-channel map of elevation information, gradient information, railway grade information and locomotive model information of each railway case according to each limitThe number of outputs of the gradient value, determining D_testAnd (4) the recommended limit gradient value of each railway case.

2. The newly-built railway slope limiting optimization decision method according to claim 1, characterized in that: said step S₁The deep convolutional neural network model comprises 5 convolutional layers, 3 pooling layers, 2 full-link layers and 1 Softmax output layer:

1) the size of a convolution kernel adopted by the first convolution layer is 33 multiplied by 3, the step size is 4, the number of the convolution kernels is 96, and a correction linear unit is connected behind the first convolution layer to be used as a nonlinear activation function, so that the model has nonlinear characteristics;

2) the first convolutional layer is connected with a first pooling layer after nonlinear processing, the size of a pooling kernel adopted by the first pooling layer is 4 multiplied by 4, and the step size is 2;

3) the second convolution layer is connected behind the first pooling layer, the size of a convolution kernel adopted by the second convolution layer is 3 multiplied by 96, the step size is 1, the number of the convolution kernels is 256, and the second convolution layer is connected with a correction linear unit for nonlinear processing;

4) the second convolutional layer is connected with a second pooling layer after nonlinear processing, the size of a pooling kernel adopted by the second pooling layer is 3 multiplied by 3, and the step size is 2;

5) the second pooling layer is connected with a third convolution layer, the size of convolution kernels adopted by the third convolution layer is 3 x 256, the step size is 1, the number of the convolution kernels is 384, and the third convolution layer is connected with a correction linear unit for nonlinear processing;

6) the third convolutional layer is connected with a fourth convolutional layer after nonlinear processing, the size of a convolutional kernel adopted by the fourth convolutional layer is 3 multiplied by 384, the step size is 1, the number of the convolutional kernels is 384, and the fourth convolutional layer is connected with a correction linear unit for nonlinear processing;

7) the fourth convolutional layer is connected with a fifth convolutional layer after nonlinear processing, the size of a convolutional kernel adopted by the fifth convolutional layer is 3 multiplied by 384, the step size is 1, the number of the convolutional kernels is 256, and the fifth convolutional layer is connected with a correction linear unit for nonlinear processing;

8) the fifth convolutional layer is connected with a third pooling layer after nonlinear processing, the size of a pooling core adopted by the third pooling layer is 3 multiplied by 3, and the step size is 2;

9) a first full-connection layer is connected behind the third pooling layer, in order to prevent the over-fitting phenomenon, a dropout function is adopted for connecting the third pooling layer to the first full-connection layer, and a correction linear unit is connected behind the first full-connection layer for nonlinear processing;

10) the first full-connection layer is connected with the second full-connection layer after nonlinear processing, a dropout function is adopted to prevent the over-fitting phenomenon, and the second full-connection layer is connected with a correction linear unit for nonlinear processing;

11) and the second full-connection layer is connected with the Softmax output layer after nonlinear processing and is used for outputting the slope-limiting recommended value of the newly-built railway.

3. The newly-built railway slope limiting optimization decision method according to claim 1, characterized in that: said step S_2-1The collected railway cases cover different grades of railway and different locomotive models.

4. The newly-built railway slope limiting optimization decision method according to claim 1, characterized in that: said step S_2-2In the middle, the method for dividing the rectangular research area based on the starting and ending positions of the railway line is as follows:

setting the starting point and the ending point of a certain railway case line as S_i：(x_Si,y_Si) And E_i：(x_Ei,y_Ei) Then the area of investigation of the railway case is S_iAnd E_iAs diagonal point, with | x_Ei-x_SiL is long, | y_Ei-y_SiAnd | is a wide rectangular area.

5. The newly-built railway slope limiting optimization decision method according to claim 1, characterized in that: said step S_2-5Middle and railway grade gray scale map P_{classification}Size of and the railwayThe rectangular study area of the cases was the same size.

6. The newly-built railway slope limiting optimization decision method according to claim 1, characterized in that: said step S_2-6Grayscale map P of middle and middle locomotive model_locomotiveIs the same size as the rectangular study area of the railway case using the locomotive model.

7. The newly-built railway slope limiting optimization decision method according to claim 1, characterized in that: said step S_2-7Four-channel map P of each railway case_mergeThe elevation gray-scale map P of each railway case is obtained by adopting merge function in computer vision library OpenCV_elevationGradient gray scale map P_slopeRailway grade gray scale map P_{classification}And a locomotive type gray scale map P_locomotiveAnd obtaining the fusion protein after fusion.

8. The newly-built railway slope limiting optimization decision method according to claim 1, characterized in that: said step S₃The network model constructed by the middle training is based on S₂Created tag data set D_trainContinuously updating the connection weight between each layer in the network model by a gradient descent algorithm, which comprises the following specific steps:

1) softmax layer connection weight update:

in the formula: p (y)⁽ⁱ⁾＝j|x⁽ⁱ⁾(ii) a W) is the probability that the ith picture is taken as input data, the jth value is selected as the limiting gradient in the Softmax layer, and x⁽ⁱ⁾The input data of the Softmax layer, W is the connection weight of the Softmax layer and the previous layer;

in the formula: 1{ y⁽ⁱ⁾J is a logic expression, if the ith input picture is marked as the jth limited gradient value, 1{ y }⁽ⁱ⁾1, otherwise 1{ y }⁽ⁱ⁾J is 0, and lambda is a weight attenuation coefficient;

based on the loss function E, the residuals of the neurons in the Softmax layer are calculated according to equation (4):

2) updating the connection weight of the full connection layer:

in the formula: w^lA connection weight matrix for each neuron of the current layer, b^lConnecting bias vectors of each neuron of the current layer, wherein alpha is a learning rate;

And partial derivatives of bias for neuron connections of full connection layer

Respectively calculating according to the formula (9) and the formula (10);

in the formula: x is the number of^l-1Is the output vector, delta, of a connection layer above the current layer^lThe residual error of each neuron in the current layer can be determined according to the residual error delta of each neuron in the next connection layer^l+1Calculating;

δ^l＝(W^l+1)δ^l+1⊙f′(W^lx^l-1+b^l) (11)

in the formula: w^l+1A connection weight matrix of each neuron of a posterior connection layer of the current layer, wherein f (·) is a ReLU activation function;

ReLU：

3) convolutional layer connection weight update:

in the formula:

the connection weight matrix for the d-th convolution kernel of the current layer,

connecting offset vectors of the d-th convolution kernel of the current layer, wherein alpha is a learning rate;

each connection weight partial derivative of the d-th convolution kernel of the current layer by the loss function

The calculation formula of (a) is as follows:

in the formula:

is the output value of the D' th characteristic diagram of the previous connection layer of the current layer, D^l-1The number of feature maps of the connection layer before the current layer,

a residual error matrix of the d-th characteristic diagram of the current layer;

the d-th convolution kernel of the current layer is connected with the bias partial derivative by the loss function

The calculation formula of (a) is as follows:

in the formula:

for the concatenated offset vector of the d-th feature map in the current layer,

and

respectively the number of rows and columns of the d-th feature map in the current layer,

residual values of i rows and j columns in the d-th feature map in the current layer are obtained;

the residual error of the current layer is calculated based on the residual error of the next connected layer through back propagation; if the current layer is connected with the pooling layer, calculating a residual error matrix of the d-th characteristic diagram of the current layer according to the formula (17);

in the formula: x^l-1The output matrix of the connection layer previous to the current layer,

a residual error matrix of the d-th characteristic diagram in a connecting layer behind the current layer;

if the convolution layer is connected behind the current layer, the weight matrix of the current layer is calculated according to the formula (18):

in the formula:

is the residual matrix of the d' th feature map in the next connection layer after the current layer,

a layer d weight matrix for the layer d "of the convolution kernel of the next layer after the current layer,

and (4) an output matrix of the d-th feature map of the current layer.

9. The newly-built railway slope limiting optimization decision method according to claim 1, characterized in that: said step S₅When scanning the test data set D_testAnd when a certain four-channel image is scanned, the recommended limit gradient value of a 333 x 333 pixel area in the four-channel image can be output each time, and after the scanning of the whole four-channel image is completed, the gradient value with the maximum output times is selected as the recommended limit gradient value of the railway case represented by the four-channel image.