CN109033521A - A kind of newly built railway ruling grade Study on Decision-making Method for Optimization - Google Patents

A kind of newly built railway ruling grade Study on Decision-making Method for Optimization Download PDF

Info

Publication number
CN109033521A
CN109033521A CN201810658482.8A CN201810658482A CN109033521A CN 109033521 A CN109033521 A CN 109033521A CN 201810658482 A CN201810658482 A CN 201810658482A CN 109033521 A CN109033521 A CN 109033521A
Authority
CN
China
Prior art keywords
layer
railway
formula
locomotive
convolution kernel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810658482.8A
Other languages
Chinese (zh)
Other versions
CN109033521B (en
Inventor
蒲浩
张洪
李伟
王雷
宋陶然
李晓明
谢佳
王杰
彭先宝
胡建平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Central South University
Original Assignee
Central South University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN201810658482.8A priority Critical patent/CN109033521B/en
Publication of CN109033521A publication Critical patent/CN109033521A/en
Application granted granted Critical
Publication of CN109033521B publication Critical patent/CN109033521B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Hardware Design (AREA)
  • Geometry (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a kind of newly built railway ruling grade Study on Decision-making Method for Optimization, the Study on Decision-making Method for Optimization the following steps are included: construct depth convolutional neural networks model first;Then railway case database is established, the every factor that will affect newly built railway ruling grade decision is characterized into grayscale image, and is fused into multichannel image for training network model;Finally propose that a kind of slip sweep, the depth convolutional neural networks model that combined training is completed carry out railway ruling grade decision.Compared with prior art, this method has many advantages, such as that high degree of automation, practical, operational efficiency is high and application prospect is good.

Description

一种新建铁路限制坡度优化决策方法An Optimal Decision-Making Method for Constrained Slopes of Newly Built Railways

技术领域technical field

本发明涉及铁路设计方法,具体涉及一种新建铁路限制坡度优化决策方法。The invention relates to a railway design method, in particular to an optimization decision-making method for a new railway limited slope.

背景技术Background technique

限制坡度是一项具有全局意义的铁路主要技术标准,它直接影响线路的运输能力、工程费用、运营费用以及行车安全,甚至可能决定线路走向。随着我国经济的飞速发展,铁路运输需求不断增加,于此同时铁路建设逐渐从东部平原向西部山区转变,艰险山区的复杂环境使得铁路工程建设与日益增长运输需求的矛盾愈加突出:为更好地适应复杂的地形、地质条件,缩短线路长度,节省工程建造费用,采用较大的限制坡度是一种行之有效的手段;然而,线路运输能力同样受最大限制坡度影响,在机车型号相同(即牵引功率相同)的情况下,使用较大的限制坡度会降低机车牵引吨数,进而降低线路运输能力,同时也会增加运营费用以及下坡路段的危险性。此外,限制坡度属于固定设备标准,一旦铁路建成,将很难更改。因此,如何科学合理地决策出与所处自然、经济、社会环境最佳匹配的限制坡度,是目前铁路线路设计面临的一项重大难题。Slope restriction is a major railway technical standard with overall significance, which directly affects the transportation capacity, engineering cost, operating cost and driving safety of the line, and may even determine the direction of the line. With the rapid development of my country's economy, the demand for railway transportation continues to increase. At the same time, railway construction gradually shifts from the eastern plains to the western mountainous areas. The complex environment of difficult and dangerous mountainous areas makes the contradiction between railway engineering construction and growing transportation demand more prominent: for better It is an effective means to adapt to complex terrain and geological conditions, shorten the length of the line, save engineering construction costs, and adopt a larger limit slope; however, the line transportation capacity is also affected by the maximum limit slope. That is, when the traction power is the same), using a larger restricted slope will reduce the traction tonnage of the locomotive, thereby reducing the transportation capacity of the line, and will also increase the operating cost and the danger of the downhill section. In addition, the limit slope is a fixed equipment standard, which will be difficult to change once the railway is completed. Therefore, how to scientifically and rationally determine the limit slope that best matches the natural, economic, and social environments is a major problem in the design of railway lines.

新建铁路限制坡度决策本质上是探索多维影响因素(如:地形条件、运输需求等)与限制坡度的映射规律,从而选出最佳方案。传统的限制坡度优化决策方法通常先假定要素间的规律符合某一数学模型表达式,再通过统计回归模型参数,得到映射规律。如西南交通大学的王邸教授通过对我国上千公里山区铁路的设计资料进行统计回归,得到了限制坡度与工程费用映射规律的一般经验公式(1)。式中A为工程费用,I为限制坡度,a、b、c是通过统计回归得到的与地形条件相关的模型参数。The essence of decision-making on the slope limit of new railways is to explore the mapping law between multi-dimensional influencing factors (such as terrain conditions, transportation demand, etc.) and the limit slope, so as to select the best plan. The traditional limited slope optimization decision-making method usually assumes that the law between elements conforms to a certain mathematical model expression, and then obtains the mapping law through statistical regression model parameters. For example, Professor Wang Di of Southwest Jiaotong University obtained the general empirical formula (1) for the mapping law of limiting slope and engineering cost through statistical regression on the design data of thousands of kilometers of mountainous railways in my country. In the formula, A is the project cost, I is the limit slope, and a, b, c are the model parameters related to terrain conditions obtained through statistical regression.

然而,多维影响因素与限制坡度间的映射规律是复杂且非线性的,难以通过一个固定的函数关系式完整、准确地表达出来。因此,迫切需要一种可全面、准确识别多维影响因素与限制坡度间映射规律的方法,实现新建铁路限制坡度的优化决策。However, the mapping law between multidimensional influencing factors and limiting slope is complex and nonlinear, and it is difficult to express it completely and accurately through a fixed functional relationship. Therefore, there is an urgent need for a method that can comprehensively and accurately identify the mapping rules between multi-dimensional influencing factors and limiting slopes, so as to realize the optimal decision-making of new railway limiting slopes.

发明内容Contents of the invention

本发明所要解决的技术问题是:提供一种可全面、准确识别多维影响因素与限制坡度间映射规律的方法,进而实现新建铁路限制坡度的优化决策。The technical problem to be solved by the present invention is to provide a method that can comprehensively and accurately identify the mapping law between multi-dimensional influencing factors and limiting slopes, thereby realizing optimal decision-making on limiting slopes of newly built railways.

为了解决上述技术问题,本发明采用的技术方案为:一种新建铁路限制坡度优化决策方法,包括以下步骤:In order to solve the above-mentioned technical problems, the technical solution adopted in the present invention is: a kind of new-built railway limit slope optimization decision-making method, comprises the following steps:

S1:构建用于新建铁路限制坡度优化决策的深度卷积神经网络模型;S 1 : Construct a deep convolutional neural network model for the optimal decision-making of new railway limit slope;

S2:建立用于训练深度卷积神经网络的训练数据集Dtrain和验证数据集DvalidateS 2 : Establish a training data set D train and a verification data set D validate for training a deep convolutional neural network;

S2-1:收集N1条采用不同限制坡度的已建客货共线铁路案例,建立铁路案例数据集D1S 2-1 : collect N 1 cases of passenger and freight collinear railways with different restricted slopes, and establish railway case data set D 1 ;

S2-2:基于铁路案例数据集D1中各条铁路线路的起、终点位置划分各铁路案例的矩形研究区域,并提取各矩形研究区域内的格网高程数据信息,建立铁路案例高程数据集D2S 2-2 : Divide the rectangular research area of each railway case based on the start and end positions of each railway line in the railway case data set D 1 , and extract the grid elevation data information in each rectangular research area to establish the elevation data of the railway case set D2 ;

S2-3:基于D2中各铁路案例研究区域的格网高程数据信息,绘制各矩形研究区域的高程灰度图Pelevation,建立用于表征各铁路案例研究区域地形高程变化特征的高程灰度图集DelevationS 2-3 : Based on the grid elevation data information of each railway case study area in D 2 , draw the elevation gray map P elevation of each rectangular study area, and establish the elevation gray used to characterize the terrain elevation change characteristics of each railway case study area Degree atlas D elevation ;

S2-4:基于D2中各铁路案例研究区域的格网高程数据信息,绘制各矩形研究区域的坡度灰度图Pslope,建立用于表征各铁路案例研究区域地形坡度特征的坡度灰度图集DslopeS 2-4 : Based on the grid elevation data information of each railway case study area in D 2 , draw the slope grayscale map P slope of each rectangular study area, and establish the slope grayscale used to characterize the terrain slope characteristics of each railway case study area Atlas D slope ;

S2-5:将不同铁路等级表征为灰度值不同的灰度图,根据D1中各条铁路案例的实际等级,绘制与各条铁路案例相对应的铁路等级灰度图Pclassification,建立铁路等级灰度图集DclassificationS 2-5 : Representing different railway grades as grayscale images with different grayscale values, drawing a railway grade grayscale image P classification corresponding to each railway case according to the actual grades of each railway case in D 1 , and establishing Railway grade grayscale atlas D classification ;

S2-6:将不同机车型号表征为灰度值不同的灰度图,根据D1中各条铁路案例所使用的实际机车型号,绘制与各条铁路案例相对应的机车型号灰度图Plocomotive,建立机车型号灰度图集DlocomotiveS 2-6 : Represent different locomotive models as grayscale images with different grayscale values, draw the locomotive model grayscale image P corresponding to each railway case according to the actual locomotive models used in each railway case in D 1 locomotive , build locomotive model grayscale atlas D locomotive ;

S2-7:基于建立的高程灰度图集Delevation、坡度灰度图集Dslope、铁路等级灰度图集Dclassification、机车型号灰度图集Dlocomotive,融合D1中各条铁路案例的高程灰度图Pelevation、坡度灰度图Pslope、铁路等级灰度图Pclassification和机车型号灰度图Plocomotive,形成可表征各条铁路案例信息的四通道图Pmerge,并建立数据集DmergeS 2-7 : Based on the established elevation grayscale atlas D elevation , slope grayscale atlas D slope , railway grade grayscale atlas D classification , and locomotive model grayscale atlas D locomotive , integrate each railway case in D 1 The elevation grayscale map P elevation , slope grayscale map P slope , railway grade grayscale map P classification and locomotive model grayscale map P locomotive form a four-channel map P merge that can represent the information of each railway case, and establish a data set D merge ;

S2-8:将数据集Dmerge中表征各条铁路案例信息的四通道图切割成大小为333×333像素的图片,并赋予标签数据,其标签数据为各条铁路案例所实际使用的限制坡度值;S 2-8 : Cut the four-channel image representing the information of each railway case in the data set D merge into a picture with a size of 333×333 pixels, and assign label data. The label data is the limit actually used by each railway case slope value;

S2-9:将S2-8中所得带标签数据图按4:1的比例划分,建立用于训练深度卷积神经网络的训练数据集Dtrain和验证数据集DvalidateS 2-9 : Divide the labeled data graph obtained in S 2-8 at a ratio of 4:1, and establish a training data set D train and a verification data set D validate for training a deep convolutional neural network;

S3:采用S2建立的训练数据集Dtrain训练所构建的网络模型,并采用S2建立的验证数据集Dvalidate验证模型精度,得到经过训练和验证的深度卷积神经网络模型;S 3 : Use the training data set D train established by S 2 to train the constructed network model, and use the verification data set D validate established by S 2 to verify the accuracy of the model, and obtain a trained and verified deep convolutional neural network model;

S4:另外收集N2条与数据集D1中不同的已建客货共线铁路案例,并按照步骤S2-2至S2-7生成表征铁路案例信息的四通道图Pmerge,建立测试数据集DtestS 4 : In addition, collect N 2 cases of passenger and freight collinear railways that are different from those in the data set D 1 , and generate a four-channel graph P merge representing the railway case information according to steps S 2-2 to S 2-7 , and establish Test data set D test ;

S5:提出一种滑动扫描技术,将训练好的深度卷积神经网络模型按由左向右、由上到下的顺序扫描数据集Dtest中表征各条铁路案例高程信息、坡度信息、铁路等级信息、机车型号信息的四通道图,根据各限制坡度值的输出次数,确定Dtest中各条铁路案例的限制坡度推荐值。S 5 : Propose a sliding scanning technology, scan the trained deep convolutional neural network model from left to right and from top to bottom in the data set D test to represent the elevation information, slope information, railway The four-channel diagram of grade information and locomotive model information determines the recommended limit slope value for each railway case in the D test according to the output times of each limit slope value.

进一步地,所述步骤S1中构建的深度卷积神经网络模型包括5个卷积层(Conv),3个池化层(Pool),2个全连接层(FC)和1个Softmax输出层:Further, the deep convolutional neural network model constructed in the step S1 includes 5 convolutional layers (Conv), 3 pooling layers (Pool), 2 fully connected layers (FC) and 1 Softmax output layer :

(1)第一个卷积层(Conv1)采用的卷积核大小为33×33×3,步幅大小为4,卷积核个数为96,Conv1后连接修正线性单元(ReLU)作为非线性激活函数,使模型具备非线性特征;(1) The convolution kernel size used in the first convolutional layer (Conv1) is 33×33×3, the stride size is 4, and the number of convolution kernels is 96. After Conv1, the corrected linear unit (ReLU) is connected as the non- The linear activation function makes the model have nonlinear characteristics;

(2)Conv1经非线性处理后连接第一个池化层(Pool1),Pool1采用的池化核大小为4×4,步幅大小为2;(2) Conv1 is connected to the first pooling layer (Pool1) after nonlinear processing. The pooling kernel size used by Pool1 is 4×4, and the stride size is 2;

(3)Pool1后连接第二个卷积层(Conv2),Conv2采用的卷积核大小为3×3×96,步幅大小为1,卷积核个数为256,Conv2后连接修正线性单元(ReLU)进行非线性处理;(3) Pool1 is connected to the second convolutional layer (Conv2). The convolution kernel size used by Conv2 is 3×3×96, the stride size is 1, the number of convolution kernels is 256, and the modified linear unit is connected after Conv2. (ReLU) for nonlinear processing;

(4)Conv2经非线性处理后连接第二个池化层(Pool2),Pool2采用的池化核大小为3×3,步幅大小为2;(4) Conv2 is connected to the second pooling layer (Pool2) after nonlinear processing. The pooling kernel size used by Pool2 is 3×3, and the stride size is 2;

(5)Pool2后连接第三个卷积层(Conv3),Conv3采用的卷积核大小为3×3×256,步幅大小为1,卷积核个数为384,Conv3后连接修正线性单元(ReLU)进行非线性处理;(5) The third convolutional layer (Conv3) is connected after Pool2. The convolution kernel size used by Conv3 is 3×3×256, the stride size is 1, the number of convolution kernels is 384, and the corrected linear unit is connected after Conv3. (ReLU) for nonlinear processing;

(6)Conv3经非线性处理后连接第四个卷积层(Conv4),Conv4采用的卷积核大小为3×3×384,步幅大小为1,卷积核个数为384,Conv4后连接修正线性单元(ReLU)进行非线性处理;(6) Conv3 is connected to the fourth convolutional layer (Conv4) after nonlinear processing. The convolution kernel size used by Conv4 is 3×3×384, the stride size is 1, and the number of convolution kernels is 384. After Conv4 Connect the Rectified Linear Unit (ReLU) for nonlinear processing;

(7)Conv4经非线性处理后连接第五个卷积层(Conv5),Conv5采用的卷积核大小为3×3×384,步幅大小为1,卷积核个数为256,Conv5后连接修正线性单元(ReLU)进行非线性处理;(7) Conv4 is connected to the fifth convolutional layer (Conv5) after nonlinear processing. The convolution kernel size used by Conv5 is 3×3×384, the stride size is 1, and the number of convolution kernels is 256. After Conv5 Connect the Rectified Linear Unit (ReLU) for nonlinear processing;

(8)Conv5经非线性处理后连接第三个池化层(Pool3),Pool3采用的池化核大小为3×3,步幅大小为2;(8) Conv5 is connected to the third pooling layer (Pool3) after nonlinear processing. The pooling kernel size used by Pool3 is 3×3, and the stride size is 2;

(9)Pool3后连接第一个全连接层(FC1),为防止产生过拟合现象,在Pool3层到FC1层连接采用dropout函数,FC1后连接修正线性单元(ReLU)进行非线性处理;(9) Pool3 is connected to the first fully connected layer (FC1). In order to prevent overfitting, the dropout function is used to connect the Pool3 layer to the FC1 layer, and FC1 is connected to the corrected linear unit (ReLU) for nonlinear processing;

(10)FC1经非线性处理后连接第二个全连接层(FC2),并采用dropout函数防止产生过拟合现象,FC2后连接修正线性单元(ReLU)进行非线性处理;(10) FC1 is connected to the second fully connected layer (FC2) after nonlinear processing, and the dropout function is used to prevent overfitting, and FC2 is connected to the corrected linear unit (ReLU) for nonlinear processing;

(11)FC2经非线性处理后连接Softmax输出层,用于输出新建铁路限制坡度值推荐。(11) After nonlinear processing, FC2 is connected to the Softmax output layer, which is used to output the limit slope value recommendation of the new railway.

进一步地,所述步骤S2-1中收集的铁路案例涵盖不同等级铁路以及不同机车型号。Further, the railway cases collected in the step S2-1 cover different grades of railways and different locomotive models.

进一步地,所述步骤S2-2中铁路矩形研究区域划分方法如下:设某条铁路案例线路起、终点分别为Si:(xSi,ySi)和Ei:(xEi,yEi),则该铁路案例的研究区域为以Si和Ei为对角点,以|xEi-xSi|为长,|yEi-ySi|为宽的矩形区域。Further, the division method of the railway rectangular research area in the step S2-2 is as follows: set the start and end points of a certain railway case line as S i : (x Si , y Si ) and E i : (x Ei , y Ei ), the research area of the railway case is a rectangular area with S i and E i as the diagonal points, |x Ei -x Si | as the length, and |y Ei -y Si | as the width.

进一步地,所述步骤S2-3、S2-4中各条铁路案例矩形研究区域的高程灰度图和坡度灰度图均采用Global Mapper软件绘制。Further, in the steps S 2-3 and S 2-4 , the elevation grayscale map and gradient grayscale map of each railway case rectangular research area are drawn by Global Mapper software.

进一步地,所述步骤S2-5中,铁路等级灰度图Pclassification的大小与该铁路案例的矩形研究区域大小相同。Further, in the step S2-5 , the size of the railway grade grayscale map P classification is the same as the size of the rectangular research area of the railway case.

进一步地,所述步骤S2-6中,机车型号灰度图Plocomotive的大小与采用该机车型号铁路案例的矩形研究区域大小相同。Further, in the step S2-6 , the size of the locomotive model grayscale map P locomotive is the same as the size of the rectangular research area using the locomotive model railway case.

进一步地,所述步骤S2-7中各条铁路案例的四通道图Pmerge为采用计算机视觉库OpenCV中的merge函数对各条铁路案例的高程灰度图Pelevation、坡度灰度图Pslope、铁路等级灰度图Pclassification和机车类型灰度图Plocomotive进行融合后得到。Further, the four-channel map P merge of each railway case in the step S2-7 is the elevation grayscale map P elevation and slope grayscale map P slope of each railway case using the merge function in the computer vision library OpenCV , the railway grade grayscale map P classification and the locomotive type grayscale map P locomotive are obtained after fusion.

进一步地,所述步骤S3中训练所构建的网络模型为基于S2建立的标签数据集Dtrain,通过梯度下降算法不断更新网络模型中各层间的连接权重,具体如下:Further, the network model constructed by training in the step S3 is the label data set D train established based on S2, and the connection weights between the layers in the network model are continuously updated through the gradient descent algorithm, as follows:

(1)Softmax层连接权重更新(1) Softmax layer connection weight update

Softmax层用于输出模型推荐的限制坡度值,该层根据前一层各神经元的输出值,计算各限制坡度值的输出概率,从而选择输出概率最大的坡度值作为模型推荐的限制坡度值,其函数表达如式(2)所示:The Softmax layer is used to output the limit slope value recommended by the model. This layer calculates the output probability of each limit slope value according to the output value of each neuron in the previous layer, so that the slope value with the highest output probability is selected as the limit slope value recommended by the model. Its function expression is shown in formula (2):

式中:P(y(i)=j|x(i);W)为以第i张图片为输入数据,在Softmax层选择第j个值作为限制坡度的概率,x(i)为Softmax层的输入数据(即前一层的输出数据),W为Softmax层与前一层的连接权重。In the formula: P(y (i) =j|x (i) ; W) is the i-th picture as the input data, and the j-th value is selected as the probability of limiting the slope in the Softmax layer, and x (i) is the Softmax layer The input data (that is, the output data of the previous layer), W is the connection weight between the Softmax layer and the previous layer.

基于Softmax函数建立模型损失函数E,其函数表达式如式(3)所示:The model loss function E is established based on the Softmax function, and its function expression is shown in formula (3):

式中:1{y(i)=j}为逻辑表达式,如果i张输入图片标注的为第j个限制坡度,则1{y(i)=j}=1,否则1{y(i)=j}=0,λ为权重衰减系数。In the formula: 1{y (i) =j} is a logical expression, if the i input picture is labeled as the jth limit slope, then 1{y (i) =j}=1, otherwise 1{y (i) ) =j}=0, λ is the weight attenuation coefficient.

基于损失函数E,可按式(4)计算Softmax层各神经元的残差:Based on the loss function E, the residual error of each neuron in the Softmax layer can be calculated according to formula (4):

Softmax层各神经元的连接权重按式(5)、式(6)更新:The connection weights of each neuron in the Softmax layer are updated according to formula (5) and formula (6):

(2)全连接层连接权重更新(2) Fully connected layer connection weight update

全连接层的每一个神经元都与上一层的所有神经元相连,其连接权重更新公式如下:Each neuron in the fully connected layer is connected to all neurons in the previous layer, and its connection weight update formula is as follows:

式中:Wl为当前层(全连接层)各神经元的连接权重矩阵,bl为当前层各神经元的连接偏置向量,α为学习率。In the formula: W l is the connection weight matrix of each neuron in the current layer (fully connected layer), b l is the connection bias vector of each neuron in the current layer, and α is the learning rate.

损失函数对全连接层各神经元连接权重的偏导数和对全连接层各神经元连接偏置的偏导数可分别按式(9)和式(10)计算。The partial derivative of the loss function to the connection weights of each neuron in the fully connected layer and the partial derivative of the connection bias of each neuron in the fully connected layer It can be calculated according to formula (9) and formula (10) respectively.

式中:xl-1为当前层(全连接层)上一连接层的输出向量,δl为当前层(全连接层)各神经元的残差,可根据其后连接层各神经元的残差δl+1计算。In the formula: x l-1 is the output vector of the previous connection layer of the current layer (full connection layer), δ l is the residual error of each neuron in the current layer (full connection layer), which can be calculated according to the neurons of the subsequent connection layer The residual δ l+1 is calculated.

式中:Wl+1为当前层(全连接层)后连接层各神经元的连接权重矩阵,f(·)为ReLU激活函数。In the formula: W l+1 is the connection weight matrix of each neuron in the connection layer after the current layer (full connection layer), and f(·) is the ReLU activation function.

(3)卷积层连接权重更新(3) Convolutional layer connection weight update

卷积层各神经元通过卷积核与前一层相连,各卷积核连接权重更新公式如下:Each neuron in the convolution layer is connected to the previous layer through the convolution kernel, and the weight update formula of each convolution kernel connection is as follows:

式中:为当前层(卷积层)第d个卷积核的连接权重矩阵,为当前层(卷积层)第d个卷积核的连接偏置向量,α为学习率。In the formula: is the connection weight matrix of the dth convolution kernel of the current layer (convolutional layer), is the connection bias vector of the dth convolution kernel of the current layer (convolutional layer), and α is the learning rate.

损失函数对当前层(卷积层)第d个卷积核各连接权重偏导数的计算公式如下:The loss function is the partial derivative of each connection weight of the dth convolution kernel of the current layer (convolution layer) The calculation formula is as follows:

式中:为当前层(卷积层)前一连接层第d′个特征图的输出值,Dl-1为当前层(卷积层)前一连接层的特征图数目,为当前层(卷积层)第d个特征图的残差矩阵。In the formula: is the output value of the d′th feature map of the previous connection layer of the current layer (convolutional layer), D l-1 is the number of feature maps of the previous connection layer of the current layer (convolutional layer), is the residual matrix of the dth feature map of the current layer (convolutional layer).

损失函数对当前层(卷积层)第d个卷积核各连接偏置偏导数的计算公式如下:The partial derivative of the loss function to each connection bias of the dth convolution kernel of the current layer (convolutional layer) The calculation formula is as follows:

式中:为当前层(卷积层)中第d个特征图的连接偏置向量,分别为当前层(卷积层)中第d个特征图的行数和列数,为当前层(卷积层)中第d个特征图中i行,j列的残差值。In the formula: is the connection bias vector of the dth feature map in the current layer (convolutional layer), and Respectively, the number of rows and columns of the dth feature map in the current layer (convolutional layer), is the residual value of row i and column j in the dth feature map of the current layer (convolutional layer).

当前层(卷积层)的残差是通过反向传播,基于后一连接的层残差计算。如当前层(卷积层)后连接的是池化层,则当前层(卷积层)第d个特征图的残差矩阵按式(17)计算。The residual of the current layer (convolutional layer) is calculated based on the residual of the next connected layer through backpropagation. If the pooling layer is connected after the current layer (convolutional layer), then the residual matrix of the dth feature map of the current layer (convolutional layer) is calculated according to formula (17).

式中:Xl-1为当前层(卷积层)前一连接层的输出矩阵,当前层(卷积层)后一连接层中第d个特征图的残差矩阵。In the formula: X l-1 is the output matrix of the previous connection layer of the current layer (convolutional layer), The residual matrix of the dth feature map in the next connection layer after the current layer (convolutional layer).

如当前层(卷积层)后连接的是卷积层,则当前层(卷积层)的权重矩阵按式(18)计算。If the current layer (convolutional layer) is connected to a convolutional layer, then the weight matrix of the current layer (convolutional layer) is calculated according to formula (18).

式中:为当前层(卷积层)后一连接层中第d′个特征图的残差矩阵,为当前层(卷积层)后一连接层的第d″个卷积核的第d层权重矩阵,当前层(卷积层)第d个特征图的输出矩阵。In the formula: is the residual matrix of the d′th feature map in the next connection layer after the current layer (convolutional layer), is the weight matrix of the dth layer of the d″th convolution kernel of the next connection layer after the current layer (convolutional layer), The output matrix of the dth feature map of the current layer (convolutional layer).

进一步地,所述步骤S5中的滑动扫描技术具体如下:当扫描测试数据集Dtest中某张四通道图时,每次扫描可输出该四通道图内333×333像素大小区域的限制坡度推荐值,完成整张四通道图扫描后,选取输出次数最多的坡度值作为该四通道图所表征铁路案例的限制坡度推荐值。Further, the sliding scanning technique in step S5 is specifically as follows: when scanning a certain four -channel image in the test data set D test , each scan can output the limited slope of the 333×333 pixel area in the four-channel image Recommended value, after scanning the entire four-channel map, select the slope value with the most output times as the recommended limit slope value for the railway case represented by the four-channel map.

本发明的有益效果在于:深度学习模拟大脑的分层结构,可从海量数据中自动地获取具有层次性的多层特征表达,在无需给定数学表达式的情况下,探索出输入数据与输出数据间存在的潜在规律。本发明方案采用深度学习算法进行新建铁路限制坡度决策切实可行。本发明基于深度学习算法中的卷积神经网络,提出一种新建铁路限制坡度优化决策方法,该方法通过学习人工决策经验,识别多维影响因素与限制坡度间的映射规律,实现新建铁路限制坡度的类人决策。本发明方案采用滑动扫描技术,实现了对不同铁路案例限制坡度的决策。本发明方法自动化程度高、实用性强、运行效率高,具有良好的推广应用前景。The beneficial effect of the present invention is that deep learning simulates the hierarchical structure of the brain, and can automatically obtain hierarchical multi-layer feature expressions from massive data, and explore input data and output without giving mathematical expressions. underlying patterns in the data. The scheme of the present invention adopts the deep learning algorithm to make the decision-making of limiting the slope of the newly-built railway, which is practical and feasible. Based on the convolutional neural network in the deep learning algorithm, the present invention proposes an optimal decision-making method for limiting the slope of newly-built railways. The method learns manual decision-making experience, identifies the mapping law between multi-dimensional influencing factors and limiting slopes, and realizes the determination of limiting slopes for newly-built railways. Human decision making. The scheme of the present invention adopts the sliding scanning technology, and realizes the decision-making of restricting the slope of different railway cases. The method of the invention has high degree of automation, strong practicability and high operating efficiency, and has good prospects for popularization and application.

附图说明Description of drawings

图1为本发明的新建铁路限制坡度优化决策方法的流程示意图;Fig. 1 is the schematic flow sheet of new-built railway limit slope optimization decision-making method of the present invention;

图2为本发明实施例的深度卷积神经网络模型;Fig. 2 is the deep convolutional neural network model of the embodiment of the present invention;

图3为本发明实施例的滑动扫描技术的结构示意图。FIG. 3 is a schematic structural diagram of a sliding scanning technology according to an embodiment of the present invention.

具体实施方式Detailed ways

为详细说明本发明的技术内容、所实现目的及效果,以下结合实施方式并配合附图予以说明。In order to describe the technical content, achieved goals and effects of the present invention in detail, the following descriptions will be made in conjunction with the embodiments and accompanying drawings.

本发明实施例一为一种新建铁路限制坡度优化决策方法,如图1所示,该优化决策方法包括以下步骤:Embodiment 1 of the present invention is a kind of newly-built railway limit slope optimization decision-making method, as shown in Figure 1, this optimization decision-making method comprises the following steps:

S1:构建用于新建铁路限制坡度优化决策的深度卷积神经网络模型,所构建的网络模型包括5个卷积层(Conv),3个池化层(Pool),2个全连接层(FC)和1个Softmax输出层:S 1 : Construct a deep convolutional neural network model for the optimization decision-making of new railway limit slopes. The constructed network model includes 5 convolutional layers (Conv), 3 pooling layers (Pool), and 2 fully connected layers ( FC) and 1 Softmax output layer:

(1)第一个卷积层(Conv1)采用的卷积核大小为33×33×3,步幅大小为4,卷积核个数为96,Conv1后连接修正线性单元(ReLU)作为非线性激活函数,使模型具备非线性特征;(1) The convolution kernel size used in the first convolutional layer (Conv1) is 33×33×3, the stride size is 4, and the number of convolution kernels is 96. After Conv1, the corrected linear unit (ReLU) is connected as the non- The linear activation function makes the model have nonlinear characteristics;

(2)Conv1经非线性处理后连接第一个池化层(Pool1),Pool1采用的池化核大小为4×4,步幅大小为2;(2) Conv1 is connected to the first pooling layer (Pool1) after nonlinear processing. The pooling kernel size used by Pool1 is 4×4, and the stride size is 2;

(3)Pool1后连接第二个卷积层(Conv2),Conv2采用的卷积核大小为3×3×96,步幅大小为1,卷积核个数为256,Conv2后连接修正线性单元(ReLU)进行非线性处理;(3) Pool1 is connected to the second convolutional layer (Conv2). The convolution kernel size used by Conv2 is 3×3×96, the stride size is 1, the number of convolution kernels is 256, and the modified linear unit is connected after Conv2. (ReLU) for nonlinear processing;

(4)Conv2经非线性处理后连接第二个池化层(Pool2),Pool2采用的池化核大小为3×3,步幅大小为2;(4) Conv2 is connected to the second pooling layer (Pool2) after nonlinear processing. The pooling kernel size used by Pool2 is 3×3, and the stride size is 2;

(5)Pool2后连接第三个卷积层(Conv3),Conv3采用的卷积核大小为3×3×256,步幅大小为1,卷积核个数为384,Conv3后连接修正线性单元(ReLU)进行非线性处理;(5) The third convolutional layer (Conv3) is connected after Pool2. The convolution kernel size used by Conv3 is 3×3×256, the stride size is 1, the number of convolution kernels is 384, and the corrected linear unit is connected after Conv3. (ReLU) for nonlinear processing;

(6)Conv3经非线性处理后连接第四个卷积层(Conv4),Conv4采用的卷积核大小为3×3×384,步幅大小为1,卷积核个数为384,Conv4后连接修正线性单元(ReLU)进行非线性处理;(6) Conv3 is connected to the fourth convolutional layer (Conv4) after nonlinear processing. The convolution kernel size used by Conv4 is 3×3×384, the stride size is 1, and the number of convolution kernels is 384. After Conv4 Connect the Rectified Linear Unit (ReLU) for nonlinear processing;

(7)Conv4经非线性处理后连接第五个卷积层(Conv5),Conv5采用的卷积核大小为3×3×384,步幅大小为1,卷积核个数为256,Conv5后连接修正线性单元(ReLU)进行非线性处理;(7) Conv4 is connected to the fifth convolutional layer (Conv5) after nonlinear processing. The convolution kernel size used by Conv5 is 3×3×384, the stride size is 1, and the number of convolution kernels is 256. After Conv5 Connect the Rectified Linear Unit (ReLU) for nonlinear processing;

(8)Conv5经非线性处理后连接第三个池化层(Pool3),Pool3采用的池化核大小为3×3,步幅大小为2;(8) Conv5 is connected to the third pooling layer (Pool3) after nonlinear processing. The pooling kernel size used by Pool3 is 3×3, and the stride size is 2;

(9)Pool3后连接第一个全连接层(FC1),为防止产生过拟合现象,在Pool3层到FC1层采用dropout函数,FC1后连接修正线性单元(ReLU)进行非线性处理;(9) Pool3 is connected to the first fully connected layer (FC1). In order to prevent overfitting, the dropout function is used from the Pool3 layer to the FC1 layer, and FC1 is connected to the corrected linear unit (ReLU) for nonlinear processing;

(10)FC1经非线性处理后连接第二个全连接层(FC2),并采用dropout函数防止产生过拟合现象,FC2后连接修正线性单元(ReLU)进行非线性处理;(10) FC1 is connected to the second fully connected layer (FC2) after nonlinear processing, and the dropout function is used to prevent overfitting, and FC2 is connected to the corrected linear unit (ReLU) for nonlinear processing;

(11)FC2经非线性处理后连接Softmax输出层,用于输出新建铁路限制坡度推荐值。(11) After nonlinear processing, FC2 is connected to the Softmax output layer, which is used to output the recommended value of the new railway limit slope.

S2:建立用于训练深度卷积神经网络的训练数据集Dtrain和验证数据集DvalidateS 2 : Establish a training data set D train and a verification data set D validate for training a deep convolutional neural network;

S2-1:收集采用6‰、12‰、24‰为限制坡度的客货共线铁路案例246条,所收集的铁路案例涵盖I级、II级,III级,IV级四种铁路等级,韶山1型、韶山3型、韶山4型三种机车型号,建立铁路案例数据集D1S 2-1 : Collect 246 cases of passenger and freight collinear railways with 6‰, 12‰, and 24‰ as limit slopes. The collected railway cases cover four railway grades: I, II, III, and IV. Three types of locomotives, Shaoshan 1, Shaoshan 3, and Shaoshan 4, establish a railway case data set D 1 ;

S2-2:基于铁路案例数据集D1中各条铁路线路的起、终点位置划分各铁路案例的矩形研究区域,并提取各矩形研究区域内的格网高程数据信息,建立铁路案例高程数据集D2S 2-2 : Divide the rectangular research area of each railway case based on the start and end positions of each railway line in the railway case data set D 1 , and extract the grid elevation data information in each rectangular research area to establish the elevation data of the railway case set D2 ;

S2-3:基于D2中各铁路案例研究区域的格网高程数据信息,绘制各矩形研究区域的高程灰度图Pelevation,建立用于表征各铁路案例研究区域地形高程变化特征的高程灰度图集DelevationS 2-3 : Based on the grid elevation data information of each railway case study area in D 2 , draw the elevation gray map P elevation of each rectangular study area, and establish the elevation gray used to characterize the terrain elevation change characteristics of each railway case study area Degree atlas D elevation ;

S2-4:基于D2中各铁路案例研究区域的格网高程数据信息,绘制各矩形研究区域的坡度灰度图Pslope,建立用于表征各铁路案例研究区域地形坡度特征的坡度灰度图集DslopeS 2-4 : Based on the grid elevation data information of each railway case study area in D 2 , draw the slope grayscale map P slope of each rectangular study area, and establish the slope grayscale used to characterize the terrain slope characteristics of each railway case study area Atlas D slope ;

S2-5:分别以灰度值为0、40、80、120的灰度图表征四种铁路等级,并根据D1中各条铁路案例的实际等级,绘制与各条铁路案例相对应的铁路等级灰度图Pclassification,建立铁路等级灰度图集DclassificationS 2-5 : Represent four kinds of railway grades with grayscale images with grayscale values of 0, 40, 80, and 120, and draw the corresponding railway cases according to the actual grades of each railway case in D 1 P classification of the railway grade grayscale map, and the establishment of the railway grade grayscale atlas D classification ;

S2-6:分别以灰度值为160、200、240的灰度图表征韶山1型、韶山3型和韶山4型三种电力机车型号,并根据D1中各条铁路案例所使用的实际机车型号,绘制与各条铁路案例相对应的机车型号灰度图Plocomotive,建立机车型号灰度图集DlocomotiveS 2-6 : The three types of electric locomotives of Shaoshan 1 , Shaoshan 3 and Shaoshan 4 are represented by grayscale images with grayscale values of 160, 200, and 240 respectively, and according to the For the actual locomotive model, draw the locomotive model grayscale map P locomotive corresponding to each railway case, and establish the locomotive model grayscale atlas D locomotive ;

S2-7:基于建立的高程灰度图集Delevation、坡度灰度图集Dslope、铁路等级灰度图集Dclassification、机车型号灰度图集Dlocomotive,融合D1中各条铁路案例的高程灰度图Pelevation、坡度灰度图Pslope、铁路等级灰度图Pclassification和机车型号灰度图Plocomotive,形成可表征各条铁路案例信息的四通道图Pmerge,并建立数据集DmergeS 2-7 : Based on the established elevation grayscale atlas D elevation , slope grayscale atlas D slope , railway grade grayscale atlas D classification , and locomotive model grayscale atlas D locomotive , integrate each railway case in D 1 The elevation grayscale map P elevation , slope grayscale map P slope , railway grade grayscale map P classification and locomotive model grayscale map P locomotive form a four-channel map P merge that can represent the information of each railway case, and establish a data set D merge ;

S2-8:将数据集Dmerge中表征各条铁路案例信息的四通道图切割成大小为333×333像素的图片,并赋予标签数据,其标签数据为各条铁路案例所实际使用的限制坡度值;S 2-8 : Cut the four-channel image representing the information of each railway case in the data set D merge into a picture with a size of 333×333 pixels, and assign label data. The label data is the limit actually used by each railway case slope value;

S2-9:将S2-8中所得带标签图片按4:1的比例划分,建立用于训练深度卷积神经网络的训练数据集Dtrain和验证数据集DvalidateS 2-9 : Divide the labeled pictures obtained in S 2-8 in a ratio of 4:1, and establish a training data set D train and a verification data set D validate for training a deep convolutional neural network;

S3:采用S2建立的训练数据集Dtrain训练所构建的网络模型,并采用S8建立的验证数据集Dvalidate验证模型精度,得到经过训练和验证的深度卷积神经网络模型。本次训练和验证耗时9小时35分钟(i7处理器、16G内存和GTX 1080显卡),得到精度为83.35%的深度卷积神经网络模型。S 3 : Use the training data set D train established by S 2 to train the constructed network model, and use the verification data set D validate established by S 8 to verify the accuracy of the model, and obtain a trained and verified deep convolutional neural network model. The training and verification took 9 hours and 35 minutes (i7 processor, 16G memory and GTX 1080 graphics card), and the deep convolutional neural network model with an accuracy of 83.35% was obtained.

S4:另外收集36条与数据集D1中不同的已建客货共线铁路案例,并按照步骤S2-2至S2-7建立表征铁路案例信息的四通道图Pmerge,建立测试数据集DtestS 4 : Collect another 36 cases of passenger and freight collinear railways that are different from those in data set D 1 , and follow steps S 2-2 to S 2-7 to establish a four-channel graph P merge representing the railway case information, and establish a test Data set Dtest ;

S5:提出一种滑动扫描技术,将训练好的深度卷积神经网络模型按由左向右、由上到下的顺序扫描数据集Dtest中表征各条铁路案例高程信息、坡度信息、铁路等级信息、机车型号信息的四通道图,并根据各个限制坡度值的输出次数,确定Dtest中各条铁路案例的限制坡度推荐值。在本次测试的36条铁路案例中,34条铁路案例的限制坡度得到了准确的决策(即模型推荐的限制坡度值与人工决策的限制坡度值相同),准确率可达94.44%。S 5 : Propose a sliding scanning technology, scan the trained deep convolutional neural network model from left to right and from top to bottom in the data set D test to represent the elevation information, slope information, railway The four-channel diagram of grade information and locomotive model information, and according to the output times of each limit slope value, determine the limit slope recommendation value of each railway case in the D test . Among the 36 railway cases tested in this test, the limit slopes of 34 railway cases have been accurately decided (that is, the limit slope values recommended by the model are the same as the limit slope values determined manually), and the accuracy rate can reach 94.44%.

本发明所称滑动扫描技术是指通过扫描整张图片,根据不同限制坡度的输出次数决策限制坡度值。The sliding scanning technology referred to in the present invention refers to scanning the entire picture, and determining the limit slope value according to the output times of different limit slopes.

综上所述,本发明提供一种新建铁路限制坡度优化决策方法,首先构建深度卷积神经网络模型,然后建立铁路案例数据库,将影响新建铁路限制坡度决策的各项因素表征成灰度图,并融合成多通道图像用于训练网络模型;最后提出一种滑动扫描技术,结合训练完成的深度卷积神经网络模型进行新建铁路限制坡度决策。In summary, the present invention provides a new railway limit slope optimization decision-making method, first constructs a deep convolutional neural network model, then establishes a railway case database, and represents various factors that affect the new railway limit slope decision into a grayscale map, And fused into a multi-channel image for training the network model; finally, a sliding scanning technology is proposed, combined with the trained deep convolutional neural network model to make a new railway limit slope decision.

以上所述仅为本发明的实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等同变换,或直接或间接运用在相关的技术领域,均同理包括在本发明的专利保护范围内。The above description is only an embodiment of the present invention, and does not limit the patent scope of the present invention. All equivalent transformations made by using the description of the present invention and the contents of the accompanying drawings, or directly or indirectly used in related technical fields, are all included in the same principle. Within the scope of patent protection of the present invention.

Claims (9)

1. a kind of newly built railway ruling grade Study on Decision-making Method for Optimization, it is characterised in that: the following steps are included:
S1: building is used for the depth convolutional neural networks model of newly built railway ruling grade Optimal Decision-making;
S2: establish the training dataset D for training depth convolutional neural networkstrainWith validation data set Dvalidate
S2-1: collect N1Item uses the built mixed passenger and freight railway case of different ruling grades, establishes railway case data collection D1
S2-2: it is based on the railway case data collection D1In the start, end position of each rail track divide the square of each railway case Shape survey region, and the grid altitude data information in each rectangle survey region is extracted, establish railway case altitude data collection D2
S2-3: it is based on D2In each railway case study region grid altitude data information, draw the elevation of each rectangle survey region Grayscale image Pelevation, establish the elevation gray scale atlas for characterizing each railway case study region landform altitude variation characteristic Delevation
S2-4: it is based on D2In each railway case study region grid altitude data information, draw the gradient of each rectangle survey region Grayscale image Pslope, establish the gradient gray scale atlas D for characterizing each railway case study region terrain slope featureslope
S2-5: different classifications of rail are characterized as the different grayscale image of gray value, according to D1In each railway case actual grade, Draw classification of rail grayscale image P corresponding with each railway caseclassification, establish classification of rail gray scale atlas Dclassification
S2-6: different type of locomotive are characterized as the different grayscale image of gray value, according to D1In it is real used in each railway case Border type of locomotive draws type of locomotive grayscale image P corresponding with each railway caselocomotive, establish type of locomotive gray scale Atlas Dlocomotive
S2-7: the elevation gray scale atlas D based on foundationelevation, gradient gray scale atlas Dslope, classification of rail gray scale atlas Dclassification, type of locomotive gray scale atlas Dlocomotive, merge D1In each railway case elevation grayscale image Pelevation、 Gradient grayscale image Pslope, classification of rail grayscale image PclassificationWith type of locomotive grayscale image Plocomotive, formation can characterize respectively The four-way figure P of railway case informationmerge, and establish data set Dmerge
S2-8: by data set DmergeIt is 333 × 333 pixels that the four-way figure of each railway case information of middle characterization, which is cut into size, Picture, and assign label data, the ruling grade value that label data is actually used by each railway case;
S2-9: by S2-8Middle gained tape label datagram presses the ratio cut partition of 4:1, establishes for training depth convolutional neural networks Training dataset DtrainWith validation data set Dvalidate
S3: use S2The training dataset D of foundationtrainThe constructed network model of training, and use S2The validation data set of foundation DvalidateModel accuracy is verified, is obtained by training and the depth convolutional neural networks model verified;
S4: in addition collect N2Item and data set D1Middle different built mixed passenger and freight railway case, and according to step S2-2To S2-7It is raw At the four-way figure P of characterization railway case informationmerge, establish test data set Dtest
S5: a kind of slip sweep is proposed, by trained depth convolutional neural networks model by from left to right, from top to bottom Sequential scan data set DtestMiddle characterization each railway case elevation information, grade information, classification of rail information, type of locomotive The four-way figure of information determines D according to the output times of each ruling grade valuetestIn the ruling grade of each railway case push away Recommend value.
2. newly built railway ruling grade Study on Decision-making Method for Optimization according to claim 1, it is characterised in that: the step S1In, The depth convolutional neural networks model includes 5 convolutional layers, 3 pond layers, 2 full articulamentums and 1 Softmax output Layer:
1) the convolution kernel size that first convolutional layer uses is 33 × 33 × 3, step size 4, and convolution kernel number is 96, first Connection amendment linear unit makes model have nonlinear characteristic as nonlinear activation function after a convolutional layer;
2) first convolutional layer connects first pond layer, the pond core size that first pond layer uses after Nonlinear Processing It is 4 × 4, step size 2;
3) second convolutional layer is connected after first pond layer, the convolution kernel size that second convolutional layer uses for 3 × 3 × 96, Step size is 1, and convolution kernel number is 256, and connection amendment linear unit carries out Nonlinear Processing after second convolutional layer;
4) second convolutional layer connects second pond layer, the pond core size that second pond layer uses after Nonlinear Processing It is 3 × 3, step size 2;
5) third convolutional layer is connected after second pond layer, the convolution kernel size that third convolutional layer uses for 3 × 3 × 256, Step size is 1, and convolution kernel number is 384, and connection amendment linear unit carries out Nonlinear Processing after third convolutional layer;
6) third convolutional layer connects the 4th convolutional layer, the convolution kernel size that the 4th convolutional layer uses after Nonlinear Processing It is 3 × 3 × 384, step size 1, convolution kernel number is 384, and connection amendment linear unit carries out non-after the 4th convolutional layer Linear process;
7) the 4th convolutional layer connects the 5th convolutional layer, the convolution kernel size that the 5th convolutional layer uses after Nonlinear Processing It is 3 × 3 × 384, step size 1, convolution kernel number is 256, and connection amendment linear unit carries out non-after the 5th convolutional layer Linear process;
8) the 5th convolutional layer connects third pond layer, the pond core size that third pond layer uses after Nonlinear Processing It is 3 × 3, step size 2;
9) first full articulamentum is connected after the layer of third pond, to prevent over-fitting, layer is arrived in third pond First full articulamentum connection uses dropout function, and connection amendment linear unit carries out non-linear after first full articulamentum Processing;
10) first full articulamentum connects second full articulamentum after Nonlinear Processing, and prevents from producing using dropout function Over-fitting is given birth to, connection amendment linear unit carries out Nonlinear Processing after second full articulamentum;
11) second full articulamentum connects Softmax output layer after Nonlinear Processing, for exporting newly built railway ruling grade Recommendation.
3. newly built railway ruling grade Study on Decision-making Method for Optimization according to claim 1, it is characterised in that: the step S2-1 In, the railway case of collection covers different brackets railway and different type of locomotive.
4. newly built railway ruling grade Study on Decision-making Method for Optimization according to claim 1, it is characterised in that: the step S2-2 In, the method for dividing rectangle survey region based on rail track start, end position is as follows:
If certain railway case route start, end are respectively Si: (xSi,ySi) and Ei: (xEi,yEi), then the research of the railway case Region is with SiAnd EiFor angle steel joint, with | xEi-xSi| to grow, | yEi-ySi| it is wide rectangular area.
5. newly built railway ruling grade Study on Decision-making Method for Optimization according to claim 1, it is characterised in that: the step S2-5 In, classification of rail grayscale image PclassificationSize it is identical as the rectangle survey region size of the railway case.
6. newly built railway ruling grade Study on Decision-making Method for Optimization according to claim 1, it is characterised in that: the step S2-6 In, type of locomotive grayscale image PlocomotiveSize with using the type of locomotive railway case rectangle survey region size phase Together.
7. newly built railway ruling grade Study on Decision-making Method for Optimization according to claim 1, it is characterised in that: the step S2-7 In each railway case four-way figure PmergeTo use the merge function in the OpenCV of computer vision library to each railway case The elevation grayscale image P of exampleelevation, gradient grayscale image Pslope, classification of rail grayscale image PclassificationWith locomotive type gray scale Scheme PlocomotiveIt is obtained after being merged.
8. newly built railway ruling grade Study on Decision-making Method for Optimization according to claim 1, it is characterised in that: the step S3In The constructed network model of training is based on S2The label data collection D of foundationtrain, network is constantly updated by gradient descent algorithm The connection weight of each interlayer in model, specific as follows:
1) Softmax layers of connection weight update:
The Softmax layers of ruling grade value recommended for output model, this layer are calculated according to the output valve of each neuron of preceding layer The output probability of each ruling grade value, thus the ruling grade value for selecting the maximum value of slope of output probability to recommend as model, Shown in its function representation such as formula (2):
In formula: P (y(i)=j | x(i);It W) is to select j-th of value as limit in Softmax layer choosing using the i-th picture as input data The probability of the gradient processed, x(i)For Softmax layers of input data, W is the connection weight of Softmax layers with preceding layer;
Model loss function E is established based on Softmax function, shown in function expression such as formula (3):
In formula: 1 { y(i)=j } it is logical expression, if i-th input picture mark is j-th of ruling grade value, 1 { y(i)=j }=1, otherwise 1 { y(i)=j }=0, λ be weight attenuation coefficient;
Based on loss function E, the residual error of Softmax layers of each neuron is calculated by formula (4):
The connection weight of Softmax layers of each neuron is updated by formula (5), formula (6):
2) full articulamentum connection weight updates:
Each neuron of full articulamentum is connected with upper one layer of all neurons, and connection weight more new formula is as follows:
In formula: WlFor the connection weight matrix of each neuron of current layer, blFor the connection bias vector of each neuron of current layer, α is Learning rate;
Partial derivative of the loss function to each neuron connection weight of full articulamentumIt is connected partially with to each neuron of full articulamentum The partial derivative setIt is calculated respectively by formula (9) and formula (10);
In formula: xl-1For the output vector of an articulamentum on current layer, δlIt, can be according to connecting thereafter for the residual error of each neuron of current layer Meet the residual error δ of each neuron of layerl+1It calculates;
δl=(Wl+1l+1⊙f′(Wlxl-1+bl) (11)
In formula: Wl+1For the connection weight matrix of each neuron of articulamentum after current layer, f () is ReLU activation primitive;
3) convolutional layer connection weight updates:
Each neuron of convolutional layer is connected by convolution kernel with preceding layer, and each convolution kernel connection weight more new formula is as follows:
In formula:For the connection weight matrix of d-th of convolution kernel of current layer,It is biased for the connection of d-th of convolution kernel of current layer Vector, α are learning rate;
Loss function is to each connection weight partial derivative of d-th of convolution kernel of current layerCalculation formula it is as follows:
In formula:For the output valve of the previous a characteristic pattern of articulamentum d ' of current layer, Dl-1For the spy of the previous articulamentum of current layer Map number is levied,For the residual matrix of d-th of characteristic pattern of current layer;
Loss function respectively connects biasing partial derivative to d-th of convolution kernel of current layerCalculation formula it is as follows:
In formula:For the connection bias vector of d-th of characteristic pattern in current layer,WithD-th of feature respectively in current layer The line number and columns of figure,For i row in d-th of characteristic pattern in current layer, the residual values of j column;
The residual error of current layer is the layer residual computations by backpropagation, based on latter connection;If latter linked current layer is pond Change layer, then the residual matrix of d-th of characteristic pattern of current layer is calculated by formula (17);
In formula: Xl-1For the output matrix of the previous articulamentum of current layer,D-th characteristic pattern is residual in the latter articulamentum of current layer Poor matrix;
If latter linked current layer is convolutional layer, the weight matrix of current layer is calculated by formula (18):
In formula:For the residual matrix of a characteristic pattern of d ' in the latter articulamentum of current layer,For the latter company of current layer D layers of weight matrix of a convolution kernel of d " of layer are connect,The output matrix of d-th of characteristic pattern of current layer.
9. newly built railway ruling grade Study on Decision-making Method for Optimization according to claim 1, it is characterised in that: the step S5In, As scan test data collection DtestIn certain four-way figure when, it is big to scan 333 × 333 pixels in the exportable four-way figure every time The ruling grade recommendation of zonule selects the value of slope conduct for then taking output times most after completing whole four-way figure scanning The ruling grade recommendation of the characterized railway case of the four-way figure.
CN201810658482.8A 2018-06-25 2018-06-25 Newly-built railway slope-limiting optimization decision method Active CN109033521B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810658482.8A CN109033521B (en) 2018-06-25 2018-06-25 Newly-built railway slope-limiting optimization decision method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810658482.8A CN109033521B (en) 2018-06-25 2018-06-25 Newly-built railway slope-limiting optimization decision method

Publications (2)

Publication Number Publication Date
CN109033521A true CN109033521A (en) 2018-12-18
CN109033521B CN109033521B (en) 2021-04-20

Family

ID=64610700

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810658482.8A Active CN109033521B (en) 2018-06-25 2018-06-25 Newly-built railway slope-limiting optimization decision method

Country Status (1)

Country Link
CN (1) CN109033521B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118070217A (en) * 2024-01-29 2024-05-24 中南大学 Railway main technical standard recommendation method, computer storage medium and equipment
CN118643578A (en) * 2024-08-14 2024-09-13 兰州交通大学 Railway line design method in plateau areas based on deep convolutional generative adversarial network

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105243233A (en) * 2015-11-04 2016-01-13 中南大学 Line-station collaborative optimization method for railway in complicated mountain area
CN106777752A (en) * 2016-12-30 2017-05-31 华东交通大学 A kind of bullet train follows the trail of operation curve Optimal Setting method
CN106844949A (en) * 2017-01-18 2017-06-13 清华大学 A kind of training method for realizing the controllable two-way LSTM models of locomotive section
CN106910185A (en) * 2017-01-13 2017-06-30 陕西师范大学 A kind of DBCC disaggregated models and construction method based on CNN deep learnings
CN107133960A (en) * 2017-04-21 2017-09-05 武汉大学 Image crack dividing method based on depth convolutional neural networks
CN107689035A (en) * 2017-08-30 2018-02-13 广州华多网络科技有限公司 A kind of homography matrix based on convolutional neural networks determines method and device
US20180068216A1 (en) * 2015-03-13 2018-03-08 Institute Of Acoustics, Chinese Academy Of Sciences Big data processing method based on deep learning model satisfying k-degree sparse constraint

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180068216A1 (en) * 2015-03-13 2018-03-08 Institute Of Acoustics, Chinese Academy Of Sciences Big data processing method based on deep learning model satisfying k-degree sparse constraint
CN105243233A (en) * 2015-11-04 2016-01-13 中南大学 Line-station collaborative optimization method for railway in complicated mountain area
CN106777752A (en) * 2016-12-30 2017-05-31 华东交通大学 A kind of bullet train follows the trail of operation curve Optimal Setting method
CN106910185A (en) * 2017-01-13 2017-06-30 陕西师范大学 A kind of DBCC disaggregated models and construction method based on CNN deep learnings
CN106844949A (en) * 2017-01-18 2017-06-13 清华大学 A kind of training method for realizing the controllable two-way LSTM models of locomotive section
CN107133960A (en) * 2017-04-21 2017-09-05 武汉大学 Image crack dividing method based on depth convolutional neural networks
CN107689035A (en) * 2017-08-30 2018-02-13 广州华多网络科技有限公司 A kind of homography matrix based on convolutional neural networks determines method and device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118070217A (en) * 2024-01-29 2024-05-24 中南大学 Railway main technical standard recommendation method, computer storage medium and equipment
CN118643578A (en) * 2024-08-14 2024-09-13 兰州交通大学 Railway line design method in plateau areas based on deep convolutional generative adversarial network
CN118643578B (en) * 2024-08-14 2024-10-18 兰州交通大学 Railway line design method in plateau areas based on deep convolutional generative adversarial network

Also Published As

Publication number Publication date
CN109033521B (en) 2021-04-20

Similar Documents

Publication Publication Date Title
Yu et al. Prediction of highway tunnel pavement performance based on digital twin and multiple time series stacking
CN107230113A (en) A kind of house property appraisal procedure of multi-model fusion
Qi et al. Knowledge discovery from soil maps using inductive learning
CN109146831A (en) Remote sensing image fusion method and system based on double branch deep learning networks
CN106991374A (en) Handwritten Digit Recognition method based on convolutional neural networks and random forest
CN109636150A (en) A kind of method for building up and its system of smart city " more rule unifications " appraisement system
CN109558902A (en) A kind of fast target detection method
CN109493147A (en) House property automatic evaluation method and system based on multi-layer Model Fusion
CN116757347B (en) A railway line selection method and system based on deep learning
CN107203752A (en) A kind of combined depth study and the face identification method of the norm constraint of feature two
CN118333571A (en) An intelligent management system for landscaping engineering projects
CN108665093A (en) Traffic accidents Severity forecasting method based on deep learning
CN118297775A (en) Urban planning management and control system based on digital twin technology
CN108876487A (en) A kind of industrial plot estimation method based on big data and intelligent decision mechanism
CN109033521A (en) A kind of newly built railway ruling grade Study on Decision-making Method for Optimization
CN117314266A (en) Novel intelligent scientific and technological talent evaluation method based on hypergraph attention mechanism
Owens et al. Taking digital soil mapping to the field: Lessons learned from the Water Smart Agriculture soil mapping project in Central America
CN116485029A (en) Oilfield development dynamic data accuracy detection method, device, equipment and medium
CN116468994A (en) Village and town shrinkage simulation method, system and device based on street view data
CN111311302A (en) A real estate intelligent evaluation method based on multi-source data-driven
CN108876144B (en) A substation pre-location method based on deep learning algorithm
CN110443303B (en) Intelligent recognition method of coal rock microcomponents based on image segmentation and classification
CN117876514A (en) CGAN-based virtual land block and building texture generation method
Duwal Modelling urban growth in Kathmandu Valley
Niu et al. [Retracted] Analysis of the 3D Application Evaluation System of Landscape Based on Hybrid Cooperation of VR and AI

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant