CN116363603A

CN116363603A - Deep learning-based car ladder handrail detection method, storage medium and equipment

Info

Publication number: CN116363603A
Application number: CN202310271676.3A
Authority: CN
Inventors: 张光伟
Original assignee: Harbin Kejia General Mechanical and Electrical Co Ltd
Current assignee: Harbin Kejia General Mechanical and Electrical Co Ltd
Priority date: 2023-03-20
Filing date: 2023-03-20
Publication date: 2023-06-30

Abstract

A car ladder handrail detection method, storage medium and equipment based on deep learning belong to the technical field of railway wagon detection. The method aims at solving the problem that the detection effect is not ideal due to poor image quality of the escalator handrail when the escalator handrail is detected by the existing method. Firstly, building a training set of the ladder handrail images of the vehicle, and carrying out degradation and quality improvement to build image pairs; and then adding an adaptively adjusted convolutional network before the characteristic extraction network of the Faster-RCNN to obtain an improved Faster-RCNN network, and training the network by using the image constructed by the degradation and the quality improvement based on the loss function of the two parts of sub-networks to obtain the parameter weight. And finally detecting the handrail part of the car ladder according to the trained network.

Description

Deep learning-based car ladder handrail detection method, storage medium and equipment

Technical Field

The invention belongs to the technical field of railway wagon detection, and particularly relates to a car ladder handrail detection method, a storage medium and equipment.

Background

In order to ensure safe operation of the railway wagon, periodic inspection of the railway wagon parts is required. The early-stage train fault detection mode mainly comprises manual image checking or field checking, and is low in efficiency, so that conditions such as missing of faults and false detection are easily caused, and the running safety of the train is influenced. Therefore, the fault automatic identification technology is significant in motor car detection, and the method uses a deep learning technology to take the high-definition image obtained by shooting as data, so that automatic fault alarm is realized, and the motor car operation efficiency is improved.

With the development of deep learning technology, the existing deep learning technology can realize accurate detection of most railway wagon components. In general, the closer the data quality is to the data quality in the prediction and training, the higher the accuracy of the recognition. Therefore, in general, training data with different qualities is added to train the recognition network model, that is, the recognition network can recognize the data with different qualities by including as much image data with different qualities as possible on the training data layer, so as to ensure the accuracy of detection. However, in some image data with poor quality, the effect of improving the accuracy in this way is very limited, and the recognition network still cannot well judge whether the recognition area has a fault or not. Especially, the detection of the car ladder handrail has very many conditions of poor image quality, so that the situation of poor recognition effect is often caused by poor image quality when the detection is performed by utilizing the existing deep learning technology.

Disclosure of Invention

The invention aims to solve the problem that the detection effect is not ideal due to the poor image quality of the escalator handrail when the escalator handrail is detected by using the existing mode.

The car ladder handrail detection method based on deep learning comprises the following steps:

positioning a car ladder handrail part area on the side of the car body through the acquired side part image; loading an improved Faster-RCNN model and weight, carrying out convolutional network image quality improvement processing of self-adaptive adjustment on an image to be identified, and then predicting, wherein the prediction results comprise handrails, handrail breakage, rivets and rivet loss; according to the detected handrails and the detected handrail breaking classes in the images to be identified, carrying out logic judgment by combining the detected rivets and the detected rivet breaking classes, so as to realize fault prediction after improving the image quality;

the improved Faster-RCNN is obtained by the following steps:

s1, acquiring an image of a handrail of a car ladder on the side part of a car body, and constructing an image data set;

carrying out data pair construction on each image in the image data set, namely carrying out quality reduction treatment on the images, and then carrying out quality improvement treatment on the images before quality reduction to obtain an image pair of a low-quality image and a high-quality image;

s2, building an improved Faster-RCNN identification network: adding an adaptive adjustment convolution network before a Faster-RCNN feature extraction network, and inputting data to be identified into the Faster-RCNN feature extraction network for processing after obtaining a result through the adaptive adjustment convolution network;

the self-adaptive adjustment convolution network structure is as follows:

firstly, a convolution module of 3*3 is connected with a Swin-Conv module after the convolution module of 3*3, and then is connected with an improved cascade structure network, wherein the improved cascade structure network adopts cascade structure units of SwinT+BN+ReLU; setting a Swin-Conv network after the cascade structure network, and finally connecting a convolution module of 3*3;

s3, training the improved Faster-RCNN by using the low-quality image and the high-quality image:

sending the reduced quality image into a self-adaptive convolutional network, and performing Loss with a high quality image correspondingly generated when a data set is established, and calculating to obtain Loss _deblur ；

The self-adaptive convolutional network generation result is sent to the recognition network Faster-RCNN to generate the Loss of the corresponding detection part _dec The method comprises the steps of carrying out a first treatment on the surface of the And then the Loss of the whole network is obtained:

Loss _actually ＝αLoss _deblur +(1-α)Loss _dec

wherein, loss _actually For the actual total Loss, α is the Loss weight;

thereby obtaining the trained improved Faster-RCNN.

Further, in the process of constructing the image dataset in step S1, it is necessary to perform fault marking on the car ladder handrail image.

Further, when the image is subjected to the quality degradation process, the image quality degradation formula is as follows:

where y is the reduced quality image, x is the original image, k is the blur kernel,

representing convolution ∈ _s For downsampling, s is the downsampling multiple and n is noise.

Further, when the image is subjected to quality reduction processing, different parameters are manually set based on an image quality reduction formula to perform quality reduction.

Further, in the process of manually setting different parameters to degrade based on an image degradation formula, the degradation process is performed by setting different noise types, and the noise n comprises Gaussian noise and photon noise.

Further, in the self-adaptive convolutional network, the cascade structure network comprises 17 cascade structure units, and the Swin-Conv network arranged behind the cascade structure network comprises 4 Swin-Conv modules.

Further, the structure of the Swin-Conv module is as follows:

first, X is obtained by convolution of 1*1 and processing of split layer ₁ ，X ₂ Two features:

X ₁ ，X ₂ ＝Split(Conv1*1(X))

wherein X represents the input of the 1*1 convolution;

X ₁ treatment by SwinT, X ₂ Processing by RConv to obtain two processed results Y ₁ ,Y ₂ ；

Y ₁ ,Y ₂ ＝SwinT(X ₁ ),RConv(X ₂ )

Then for the obtained Y ₁ ,Y ₂ After feature fusion and convolution of 1*1, add to the original input, namely:

Z＝Conv1*1(Concat(Y ₁ ,Y ₂ ))+X。

further, the process of logically determining, based on the detected handrail and handrail break class in the image to be identified, in combination with the detected rivet and rivet break class, includes the steps of:

when the handrails are detected, if the detected result also contains rivets or the rivets are broken, calculating the length-width ratio of a detection frame where the handrails are positioned, and judging that faults exist when the length-width ratio is larger than a threshold value 1;

when the handrail breakage is detected but the rivet breakage or the rivet loss is not contained, calculating the length-width ratio of a detection frame where the handrail breakage is located, and judging that a fault exists when the length-width ratio is larger than a threshold value 2.

A computer storage medium having stored therein at least one instruction loaded and executed by a processor to implement a deep learning based car ladder handrail detection method.

A deep learning based car ladder handrail detection apparatus, the apparatus comprising a processor and a memory having stored therein at least one instruction loaded and executed by the processor to implement a deep learning based car ladder handrail detection method.

The beneficial effects are that:

1. the automatic detection replaces manual detection, the detection efficiency and accuracy are improved, the influence of the physiology and the psychology of personnel is avoided, and the operation quality is greatly improved.

2. The present invention provides a method for improving a network so that network detection is more accurate. The invention adopts the mode of combining training by utilizing the low-quality image and the high-quality image, can still ensure the detection effect under the condition of more poor image quality of the car ladder handrail, i.e. improves the overall detection accuracy. Meanwhile, the invention modifies the Faster-RCNN identification network to a certain extent, so that the improved network can better express the image characteristics of one of the image pairs on the basis of better detection capability, thereby better promoting the matching treatment of the image pairs in the follow-up process and further ensuring the detection effect of the invention.

Drawings

Fig. 1 is a flow chart of a car ladder handrail detection.

Fig. 2 is a block diagram of a constructed neural network.

Fig. 3 is a low quality image obtained by degradation.

Fig. 4 is a high quality image obtained by upgrading.

Detailed Description

The first embodiment is as follows:

the embodiment is a vehicle ladder handrail detection method based on deep learning, which classifies collected vehicle ladder handrails by adopting a convolutional neural network, wherein a detection flow chart is shown in fig. 1, and specifically comprises the following steps:

1. building training data sets

The line images are acquired through the motor car shooting equipment, the car ladder handrail images on the side of the car body are collected, the diversity and the complexity of the images are guaranteed, the brightness of various images are changed, the images on rainy and snowy days and sunny days are distinguished, the quality of the images is good, and the like, and the images are contained in the data set.

The fault marking is carried out on the car ladder handrail image, and the car ladder handrail image comprises four types of handrails, namely handrails broken, rivets and lost rivets. When the images are marked, because the failed images are few, manual manufacturing failure can be selected, namely PS can be carried out on the images according to the actual loss condition, so that the effect of sample equalization is achieved, and a low-quality and high-quality data pair needs to be constructed no matter the failed images of the PS or the original normal images.

The image is subjected to data pair construction, and these images are first subjected to noise addition and image processing with a blur kernel, so that the image quality is reduced as low-quality image data, as shown in fig. 3. Then, noise reduction and deblurring are carried out on the images before the quality is reduced, so that another group of relatively clear high-quality image data is obtained, as shown in fig. 4; an image pair of a low-quality image and a high-quality image is obtained by degrading the image and enhancing the image.

The formula for image degradation is as follows:

where y is the degraded image, x is the original image, k is the blur kernel,

representing convolution ∈ _s For downsampling, s is a downsampling multiple, and n is noise;

the image degradation can be performed by manually setting parameters by using the formula, wherein n mainly uses Gaussian noise and photon noise, and besides, some system tools can be used for degradation or promotion to construct a data pair.

Different parameters are used as much as possible when constructing a low-quality image and a high-quality image pair, so that the diversity of a degradation mode and a quality improvement mode can be ensured, and the network effect is better, wherein the low-quality image obtained by the degradation mode is simulated data needing to be improved, and the improved image is a desired image suitable for identification.

2. Building an identification network and obtaining a weight file

The built identification network is mainly used for detecting faults, and the noise level, the brightness and darkness degree and the blurring condition of the image can seriously influence the identification condition of the faults. In some image data with poor quality, the mode of containing image data with different quality as much as possible on the aspect of training data and enabling an identification network to recognize the data with different quality has limited effect of improving accuracy, and the problem that whether an identification area has faults or not still cannot be well judged by the identification network is solved. On the basis of using a low-quality image and high-quality image pair, the invention also modifies the Faster-RCNN identification network to a certain extent, so that the improved network can better express the image characteristics of one of the image pairs on the basis of better detection capability, thereby better promoting the matching processing of the image pair in the follow-up process; the self-adaptive adjustment convolution network is added before the characteristic extraction network of the fast-RCNN, so that the data to be identified is input into the characteristic extraction network of the fast-RCNN after the result is obtained through the self-adaptive adjustment convolution network.

The added convolutional neural network plays a role in self-adaptive adjustment of the image, so that the image to be identified can reach a high-quality state more suitable for identification after self-adaptive adjustment, and the image is improved in noise level and blur level.

In the self-adaptive convolutional network, a Swin-Conv module is used, and the specific structure of the Swin-Conv module is as follows:

firstly, a convolution module of 3*3 is connected with a Swin-Conv module after the convolution module of 3*3, and then a modified cascade structure network is connected, wherein the modified cascade structure network adopts SwinT (Swin Transformer) cascade structure units, namely, 17 cascade structure units of SwinT+BN+ReLU; the cascade structure network is followed by a Swin-Conv network comprising 4 Swin-Conv modules, and finally a convolution module of 3*3 is connected.

The specific structure of the Swin-Conv module is as follows:

X ₁ ，X ₂ ＝Split(Conv1*1(X))

wherein X represents the input of the 1*1 convolution;

the two features are then treated separately as follows:

operation X ₁ Treatment by SwinT, X ₂ Processing by RConv to obtain two processed results Y ₁ ,Y ₂ ；

Y ₁ ,Y ₂ ＝SwinT(X ₁ ),RConv(X ₂ )

Then for the obtained Y ₁ ,Y ₂ After feature fusion and convolution of 1*1, add to the original input:

Z＝Conv1*1(Concat(Y ₁ ,Y ₂ ))+X

training the improved Faster-RCNN with images of low quality images and high quality images: sending the degraded image into a Faster-RCNN (convolutional network) added with self-adaptive adjustment, obtaining a result, and carrying out Loss with a high-quality image correspondingly generated when a data set is established, wherein the Loss of the part is selected to be a conventional L1Loss, and the Loss is obtained by calculation _deblur The result of the generation after passing through the part of the network is sent to the recognition network Faster-RCNN, and the Loss of the corresponding detection part is generated according to the mmdetection framework _dec The two Loss add up to the Loss of the entire network:

Loss _actually ＝αLoss _deblur +(1-α)Loss _dec

wherein, loss _actually For the actual total Loss, it is sent to the optimizer for optimization, and α is the weight occupied by Loss, where a fixed value of 0.3 is used.

Compared with the conventional Faster-RCNN detection algorithm, the network can adaptively adjust the image to a certain extent before detection, so that the image is more suitable for detection, the accuracy of the detection algorithm is further improved, and the program can more accurately identify faults by improving the accuracy of the identification algorithm.

3. Prediction

Positioning a car ladder handrail part area on the side part of the car body through the acquired side part image;

loading improved Faster-RCNN model weight, carrying out Faster-RCNN processing added with a self-adaptive convolution network on an image to be identified (an image of a car ladder handrail part area positioned to the side part of a car body and a car ladder handrail part area intercepted), carrying out logic judgment according to detected handrails and handrail breaking classes in the image to be identified and combined with detected rivets and rivet breaking classes, so as to realize fault prediction after improving the image quality, and uploading alarm information to a platform if the image is considered to need alarm through logic judgment.

In some embodiments, when the handrail class is detected, if the detected image also contains rivets or rivet broken classes, calculating the length-width ratio of a detection frame where the handrail is positioned, alarming when the length-width ratio is larger than a threshold value 1, and not alarming when the length-width ratio is smaller than the threshold value 1. When the handrail breakage is detected but the two types of rivet breakage or rivet loss are not contained, calculating the length-width ratio of a detection frame where the handrail breakage is located, if the length-width ratio is lower than a threshold value 2, not alarming, and if the length-width ratio is higher than the threshold value 2, alarming.

Through research and analysis, the two types of rivet breaking and rivet losing are found to occur along with breaking, and in order to avoid misjudgment aiming at the condition of detecting the handrail, the rivet breaking or rivet losing is used as the misjudgment of the handrail to be further judged by the misjudgment of the handrail through further research, and the occurrence of misjudgment can be greatly reduced; as for the invention, through research and analysis, the fact that no rivet is lost or the rivet is broken is used as a further judgment for misjudging the handrail as the handrail is broken.

The second embodiment is as follows:

the embodiment is a computer storage medium, in which at least one instruction is stored, where the at least one instruction is loaded and executed by a processor to implement the vehicle ladder handrail detection method based on deep learning.

It should be understood that the instructions comprise a computer program product, software, or computerized method corresponding to any of the methods described herein; the instructions may be used to program a computer system, or other electronic device. Computer storage media may include readable media having instructions stored thereon and may include, but is not limited to, magnetic storage media, optical storage media; magneto-optical storage media include read-only memory ROM, random-access memory RAM, erasable programmable memory (e.g., EPROM and EEPROM), and flash memory layers, or other types of media suitable for storing electronic instructions.

And a third specific embodiment:

the embodiment is a car ladder handrail detection device based on deep learning, the device comprises a processor and a memory, and it should be understood that the device comprising any device comprising the processor and the memory described in the invention can also comprise other units and modules for displaying, interacting, processing, controlling and other functions through signals or instructions;

the memory stores at least one instruction, and the at least one instruction is loaded and executed by the processor to realize the car ladder handrail detection method based on deep learning.

The above examples of the present invention are only for describing the calculation model and calculation flow of the present invention in detail, and are not limiting of the embodiments of the present invention. Other variations and modifications of the above description will be apparent to those of ordinary skill in the art, and it is not intended to be exhaustive of all embodiments, all of which are within the scope of the invention.

Claims

1. The car ladder handrail detection method based on deep learning is characterized by comprising the following steps of:

the improved Faster-RCNN is obtained by the following steps:

the self-adaptive adjustment convolution network structure is as follows:

Lossa _ctually ＝αLoss _deblur +(1-α)Loss _dec

wherein, loss _actually For the actual total Loss, α is the Loss weight;

thereby obtaining the trained improved Faster-RCNN.

2. The deep learning-based car ladder handrail detection method according to claim 1, wherein in the process of constructing the image dataset in step S1, fault marking is required for car ladder handrail images.

3. The method for detecting the handrail of the car ladder based on the deep learning according to claim 2, wherein when the image is subjected to the quality reduction processing, an image quality reduction formula is as follows:

4. The method for detecting a car ladder handrail based on deep learning according to claim 3, wherein when the image is subjected to the quality degradation process, different parameters are manually set based on an image quality degradation formula to degrade.

5. The method for detecting the car ladder handrail based on deep learning according to claim 4, wherein in the process of manually setting different parameters to degrade based on an image degradation quality formula, the degradation processing is performed by setting different noise types, and the noise n comprises Gaussian noise and photon noise.

6. The method for detecting the car ladder handrail based on deep learning according to claim 5, wherein in the self-adaptive adjustment convolutional network, the cascade structure network comprises 17 cascade structure units, and the Swin-Conv network arranged behind the cascade structure network comprises 4 Swin-Conv modules.

7. The deep learning-based car ladder handrail detection method as claimed in claim 6, wherein the Swin-Conv module has the following structure:

X ₁ ，X ₂ ＝Split(Conv1*1(X))

wherein X represents the input of the 1*1 convolution;

Y ₁ ,Y ₂ ＝SwinT(X ₁ ),RConv(X ₂ )

Z＝Conv1*1(Concat(Y ₁ ,Y ₂ ))+X。

8. the deep learning based car ladder handrail detection method according to any one of claims 1 to 7, wherein the process of performing logic judgment in combination with the detected rivets and rivet break classes according to the detected handrail and handrail break classes in the image to be identified comprises the steps of:

9. A computer storage medium having stored therein at least one instruction that is loaded and executed by a processor to implement the deep learning based car ladder handrail detection method of any one of claims 1 to 8.

10. A deep learning based car ladder handrail detection apparatus, the apparatus comprising a processor and a memory having stored therein at least one instruction loaded and executed by the processor to implement the deep learning based car ladder handrail detection method of any one of claims 1 to 8.