CN112766260A - Image identification method and system for positioning air reservoir for accelerating and relieving railway train - Google Patents

Image identification method and system for positioning air reservoir for accelerating and relieving railway train Download PDF

Info

Publication number
CN112766260A
CN112766260A CN202110056516.8A CN202110056516A CN112766260A CN 112766260 A CN112766260 A CN 112766260A CN 202110056516 A CN202110056516 A CN 202110056516A CN 112766260 A CN112766260 A CN 112766260A
Authority
CN
China
Prior art keywords
accelerated release
release reservoir
positioning
frame
reservoir
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110056516.8A
Other languages
Chinese (zh)
Other versions
CN112766260B (en
Inventor
于婷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Kejia General Mechanical and Electrical Co Ltd
Original Assignee
Harbin Kejia General Mechanical and Electrical Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Kejia General Mechanical and Electrical Co Ltd filed Critical Harbin Kejia General Mechanical and Electrical Co Ltd
Priority to CN202110056516.8A priority Critical patent/CN112766260B/en
Publication of CN112766260A publication Critical patent/CN112766260A/en
Application granted granted Critical
Publication of CN112766260B publication Critical patent/CN112766260B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/255Detecting or recognising potential candidate objects based on visual cues, e.g. shapes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)
  • Train Traffic Observation, Control, And Security (AREA)

Abstract

The invention discloses an image recognition method and system for positioning an accelerated release reservoir of a railway train, which solve the technical problem of low efficiency and poor stability of the conventional manual detection and belong to the technical field of image recognition, wherein the method comprises the following steps: constructing a sample data set for accelerating the relieving of the reservoir; training an improved SSD network by using a sample data set to obtain an accelerated release reservoir positioning model, wherein the improved SSD model has the structure as follows: resnet50 is used as a backbone network, a BN layer, a pooling layer and a classification output layer at the top layer are deleted, two convolution layers are additionally added, the size of a convolution kernel is 3 x 3, and the step length is 2; and acquiring an actual image, inputting the actual image into the accelerated release reservoir positioning model, and acquiring the position information of the accelerated release reservoir. Compared with manual maintenance, the method can improve the detection efficiency and stability, and can improve the detection accuracy.

Description

Image identification method and system for positioning air reservoir for accelerating and relieving railway train
Technical Field
The invention relates to the technical field of image recognition, in particular to an image recognition method and system for positioning an acceleration relieving air reservoir of a railway train.
Background
At present, the accelerated release reservoir is mainly positioned by manually checking images or checking on the spot, so that the mode has low efficiency, faults are easy to miss, and further, serious railway traffic accidents are easy to cause. Therefore, it is highly desirable to provide an automatic image recognition method for fault detection of the acceleration mitigation air cylinder, but since the acceleration mitigation air cylinders of different vehicle types are located at different positions on the vehicle body, an automatic image recognition method is required to first locate the position of the acceleration mitigation air cylinder, and then further perform subsequent fault detection.
Disclosure of Invention
The present invention is directed to solving, at least to some extent, one of the technical problems in the related art.
Therefore, the invention aims to provide an image recognition method for positioning an acceleration release reservoir of a railway train, which improves the detection efficiency, stability and accuracy.
Another object of the present invention is to provide an image recognition system for positioning an accelerated release reservoir of a railroad train.
In order to achieve the above object, an embodiment of the present invention provides an image recognition method for positioning an accelerated release reservoir of a railway train, including the following steps: step S1, constructing a sample data set of the accelerated release reservoir; step S2, training the improved SSD network by using the sample data set to obtain an accelerated release reservoir positioning model, wherein the improved SSD model has the structure that: resnet50 is used as a backbone network, a BN layer, a pooling layer and a classification output layer at the top layer are deleted, two convolution layers are additionally added, the size of a convolution kernel is 3 x 3, and the step length is 2; and step S3, acquiring an actual image, inputting the actual image into the accelerated release reservoir positioning model, and acquiring the position information of the accelerated release reservoir.
According to the image recognition method for positioning the accelerated release reservoir of the railway train, the position of the accelerated release reservoir is positioned in an automatic image recognition mode, on one hand, the detection efficiency and stability can be improved compared with manual maintenance, on the other hand, the detection accuracy can be improved along with the continuous maturity of deep learning and artificial intelligence technology, and meanwhile, through improving the SSD algorithm matching strategy, each real frame ground channel box can obtain equal number of predicted frames anchor, so that the problem of unbalanced positive and negative samples is solved, the convergence of loss functions in the training process is accelerated, and the accuracy of the algorithm for positioning smaller accelerated release reservoirs is also improved.
In addition, the image recognition method for positioning the accelerated release reservoir of the railway train according to the embodiment of the invention can also have the following additional technical characteristics:
further, in an embodiment of the present invention, step S1 specifically includes: acquiring coarse positioning images of a plurality of accelerated release air cylinders as an original image set; marking the area where the accelerated release reservoir is located in the original image set as a marking information set, wherein the original image set corresponds to the marking information set one by one; and amplifying the original image set and the marked information set by using an amplification algorithm to obtain a sample data set.
Further, in an embodiment of the present invention, step S2 specifically includes: inputting the sample data set into an improved SSD network to obtain a 5-layer characteristic diagram; respectively constructing two convolution kernels with the size of 3 x 3 on the 5 layers of feature graphs as a regression predictor and a multi-classification predictor; predicting the anchor of each position on a preset feature map by using a regression predictor and a multi-classification predictor, generating a default frame for each feature map by using a default frame generation rule to obtain a fixed number of output predictions, and screening the output predictions by using a non-maximum value to inhibit NMS and a top _ k threshold value to obtain an output frame; sampling the output frame by a random sampling or difficult sampling method to calculate the loss function, calculating the gradient value of the loss function, updating the parameters according to the gradient value, and performing iterative training until the maximum iteration times is reached to obtain the accelerated release reservoir positioning model.
Further, in an embodiment of the present invention, the IOUs corresponding to each real frame ground channel box are sorted in a descending order, then predicted frames anchor corresponding to the first M IOUs are taken as samples, each predicted frame anchor has a chance of matching with the real frame ground channel only once, M is a positive integer, and the IOU is an intersection ratio of the predicted frame and the real frame, wherein the input of the matching mechanism of the improved SSD network is: each predicted frame anchor and the IOU set corresponding to the real frame, and a parameter N; the matching mechanism output of the improved SSD network is: each predicted frame anchor matches the index of the resulting real frame.
Further, in an embodiment of the present invention, step S3 specifically includes: collecting an actual image; and zooming the actual image into an image with the size of 1600 × 256, inputting the zoomed image into an accelerated release reservoir positioning model for positioning detection of an accelerated release reservoir, judging a threshold value of an output result, and if the length-width ratio of a position frame is greater than 0.8 and less than 1.2, determining that the positioned reservoir is the accelerated release reservoir, and further outputting the position information of the accelerated release reservoir.
In order to achieve the above object, an embodiment of another aspect of the present invention provides an image recognition system for positioning an acceleration release reservoir of a railway train, including: the construction module is used for constructing a sample data set for accelerating the relieving of the reservoir; the training module is used for training the improved SSD network by utilizing the sample data set to obtain an accelerated release reservoir positioning model, wherein the improved SSD model has the structure that: resnet50 is used as a backbone network, a BN layer, a pooling layer and a classification output layer at the top layer are deleted, two convolution layers are additionally added, the size of a convolution kernel is 3 x 3, and the step length is 2; and the acquisition module is used for acquiring an actual image, inputting the actual image into the accelerated release reservoir positioning model and acquiring the accelerated release reservoir position information.
According to the image recognition system for positioning the accelerated release reservoir of the railway train, the position of the accelerated release reservoir is positioned in an automatic image recognition mode, on one hand, the detection efficiency and stability can be improved compared with manual maintenance, on the other hand, the detection accuracy can be improved along with the continuous maturity of deep learning and artificial intelligence technology, and meanwhile, through improving the SSD algorithm matching strategy, each real frame ground channel box can obtain equal number of predicted frames anchor, so that the problem of unbalanced positive and negative samples is solved, the convergence of loss functions in the training process is accelerated, and the accuracy of the algorithm for positioning smaller accelerated release reservoirs is also improved.
In addition, the image recognition system for positioning the accelerated release reservoir of the railway train according to the embodiment of the invention can also have the following additional technical characteristics:
further, in one embodiment of the present invention, the building module comprises: the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is used for acquiring rough positioning images of a plurality of accelerated release air cylinders as an original image set; the marking unit is used for marking an area where the accelerated release reservoir is located in the original image set as a marking information set, wherein the original image set corresponds to the marking information set one by one; and the amplification unit is used for amplifying the original image set and the marked information set by using an amplification algorithm to obtain a sample data set.
Further, in one embodiment of the present invention, the training module comprises: the input unit is used for inputting the sample data set into an improved SSD network to obtain a 5-layer characteristic diagram; the construction unit is used for respectively constructing two convolution kernels with the size of 3 x 3 on the 5 layers of feature maps as a regression predictor and a multi-classification predictor; the prediction and screening unit is used for predicting the prediction frame anchor of each position on a preset feature map by using the regression predictor and the multi-classification predictor, generating a default frame for each feature map by using a default frame generation rule to obtain a fixed number of output predictions, and screening the output predictions by adopting a non-maximum value to inhibit NMS and a top _ k threshold value to obtain an output frame; and the iteration unit is used for sampling the output frame by a random sampling or difficult sampling method to calculate the loss function, calculating the gradient value of the loss function, updating the parameter according to the gradient value, and performing iterative training until the maximum iteration number is reached to obtain the accelerated release reservoir positioning model.
Further, in one embodiment of the present invention, the algorithm matching policy in the improved SSD network is: the IOU corresponding to each real frame ground truth box is subjected to descending order, then, the predicted frames anchor corresponding to the first M IOUs are taken as samples, each predicted frame anchor has the chance of being matched with the real frame ground truth box only once, M is a positive integer, and the IOU is the intersection and comparison of the predicted frames and the real frames, wherein the input of a matching mechanism of the improved SSD network is as follows: an IOU set and a parameter N corresponding to each predicted frame anchor and a real frame ground channel box; the matching mechanism output of the improved SSD network is: and the index of the real frame group truth box is obtained by matching each predicted frame anchor.
Further, in one embodiment of the present invention, the obtaining module includes: the acquisition unit is used for acquiring an actual image; and the positioning and output unit is used for zooming the actual image into an image with the size of 1600 × 256, inputting the zoomed image into the accelerated release reservoir positioning model for positioning detection of the accelerated release reservoir, judging a threshold value of an output result, and if the length-width ratio of the position frame is greater than 0.8 and smaller than 1.2, determining that the positioned reservoir is the accelerated release reservoir and further outputting the position information of the accelerated release reservoir.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The foregoing and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
FIG. 1 is a flow chart of an image recognition method for positioning a rail train accelerated release reservoir in accordance with one embodiment of the present invention;
FIG. 2 is a flow chart of a specific accelerated mitigation reservoir location identification of the present invention;
fig. 3 is a schematic structural diagram of an image recognition system for positioning an acceleration mitigation reservoir of a railway train according to an embodiment of the present invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
The following describes an image recognition method and system for positioning an acceleration release reservoir of a railway train according to an embodiment of the invention with reference to the accompanying drawings.
FIG. 1 is a flow chart of an image recognition method for positioning a rail train acceleration mitigation reservoir in accordance with an embodiment of the present invention.
As shown in fig. 1, the image recognition method for positioning the accelerated release reservoir of the railway train comprises the following steps:
in step S1, a sample data set of accelerated release reservoirs is constructed.
Specifically, as shown in fig. 2, a high-definition image of the train can be acquired through a high-definition imaging device built around the train track. It can be understood that, because the form and the position of the acceleration relieving air cylinder of different vehicle types are not fixed, the image of the middle part of the vehicle needs to be taken for positioning, and the positioned acceleration relieving air cylinder is further taken as a sub-image. Due to the fact that shooting conditions of different stations are different, images shot by the equipment are different, the camera can be influenced by natural conditions such as rainwater and mud stains, and the acquired accelerated release reservoir images are different. In order to ensure the diversity of the data set, images of acceleration mitigation air cylinders of different vehicle types, which are shot by each station under different conditions, need to be collected completely, that is, the rough positioning images of a plurality of acceleration mitigation air cylinders are used as an original image set.
After the original image set is obtained, the area where the accelerated release reservoir is located in the original image set is marked manually to serve as a marking information set, namely the marking information set is information of a rectangular frame of a target area. Wherein, the original image set and the mark information set are required to be in one-to-one correspondence.
In order to improve the stability and adaptation habit of the algorithm, the data set is augmented by utilizing an augmentation algorithm, which can include rotation, translation, scaling, mirror image, brightness adjustment and the like of the original image. And after amplification, the original image set and the marking information set form a sample data set for accelerating and relieving the air cylinder.
In step S2, the improved SSD network is trained using the sample data set to obtain an accelerated release reservoir positioning model, where the improved SSD model has a structure: resnet50 is used as a backbone network, the top BN layer, pooling layer and classification output layer are deleted, two convolution layers are added, the convolution kernel size is 3 × 3, and the step size is 2.
Specifically, the vehicle bottom image containing the accelerated release reservoir is input into the network, firstly, resnet50 is selected as a main network to replace an original VGG cascade network, a BN layer, a pooling layer and a classification output layer at the top layer are deleted, and then two convolution layers are additionally added, wherein the convolution kernel size is 3 x 3, and the step length is 2. The 3-layer profile from resnet plus the expanded two-layer profile form all profiles. The output of the 5-layer residual block or convolutional layer on top of the network is used as a feature map for the SSD model. Then, two convolution kernels with the size of 3 x 3 are respectively constructed on the 5 layers of feature maps to serve as a regression predictor and a multi-classification predictor. The two convolution kernels predict the prediction frame anchor of each position on a specific characteristic diagram so as to obtain a fixed number of predictions, and further adopt non-maximum suppression (NMS) and top _ k thresholds to screen the output predictions so as to obtain a final output frame of the target detection model. And then sampling an output frame by a random sampling or difficult sampling method for calculating a loss function, wherein the classification loss and the position loss need to be calculated respectively, calculating the gradient value of the loss function, updating the parameters according to the gradient value, and performing iterative training until the maximum iteration number is reached to obtain an accelerated release reservoir positioning model.
It should be noted that in the existing model matching process, firstly, each real frame group channel will select a predicted frame anchor with the largest IOU (intersection ratio between the predicted frame and the real frame) to match, so as to ensure that each real frame group channel is at least matched to one predicted frame anchor, then, in all the IOUs of the predicted frame anchor and the real frame group channel, a part with a threshold value greater than 0.5 is selected for a subsequent prediction link, and the predicted frame anchor is matched to an object corresponding to the largest IOU. However, for smaller or larger objects, the matching strategy has a very limited number of positive samples, which is much smaller than the number of samples matched by medium-sized objects. The excessive difference of the sample quantity is not beneficial to the convergence of the loss function in the training process, and has certain influence on the detection effect.
Therefore, the SSD algorithm is improved, specifically, because the accelerated release reservoir occupies a small area at the bottom of the whole vehicle body, in order to improve the detection effect of the SSD on small targets, a new algorithm matching strategy is selected: and performing descending order on the IOU corresponding to each real frame ground channel, and then taking the predicted frames anchor corresponding to the first M IOUs as samples, wherein each predicted frame anchor has a matching chance only once, and M is a positive integer. The matching mechanism inputs are: and an IOU set and a parameter N corresponding to each predicted frame anchor and the real frame ground channel box. The matching mechanism outputs the index of the real frame group truth box matched by each predicted frame anchor. The convergence of the loss function in the training process also improves the accuracy of the algorithm for positioning the small accelerated release reservoir. Through an improved algorithm matching strategy, each real frame ground truth box can obtain equal number of predicted frames anchor, the problem of imbalance of positive and negative samples is solved, convergence of loss functions in the training process is accelerated, and the accuracy of the algorithm on smaller accelerated release air cylinders is improved.
In step S3, an actual image is acquired and input to the accelerated release reservoir positioning model, and accelerated release reservoir position information is obtained.
Specifically, as shown in fig. 2, an actual image of a station passing through the vehicle is collected, the actual image is scaled to 1600 × 256, the actual image is input into a trained positioning network to perform positioning detection of the accelerated release reservoir, an output result is judged, if the aspect ratio of a position frame is greater than 0.8 and smaller than 1.2 and the type is the accelerated release reservoir, the positioned reservoir is considered to be the accelerated release reservoir, and further, position information of a partial sub-image of the accelerated release reservoir and the position information of the accelerated release reservoir at the bottom of the vehicle are output.
According to the image recognition method for positioning the accelerated release reservoir of the railway train, provided by the embodiment of the invention, the position of the accelerated release reservoir is positioned in an automatic image recognition mode, so that on one hand, compared with manual maintenance, the detection efficiency and stability can be improved, on the other hand, along with the continuous maturity of deep learning and artificial intelligence technology, the detection accuracy can be improved, meanwhile, through improving the SSD algorithm matching strategy, each real frame group channel box can obtain equal number of predicted frames anchor, the problem of imbalance of positive and negative samples is solved, the convergence of loss functions in the training process is accelerated, and the accuracy of the algorithm for positioning smaller accelerated release reservoirs is also improved.
Next, an image recognition system for positioning an acceleration release reservoir of a railway train according to an embodiment of the present invention will be described with reference to the drawings.
Fig. 3 is a schematic structural diagram of an image recognition system for positioning an acceleration mitigation reservoir of a railway train according to an embodiment of the present invention.
As shown in fig. 3, the system 10 includes: a building module 100, a training module 200 and an acquisition module 300.
The building module 100 is configured to build a sample data set of the accelerated release reservoir. The training module 200 is configured to train the improved SSD network by using the sample data set, to obtain an accelerated release reservoir positioning model, where the improved SSD model has a structure: resnet50 is used as a backbone network, the top BN layer, pooling layer and classification output layer are deleted, two convolution layers are added, the convolution kernel size is 3 × 3, and the step size is 2. The obtaining module 300 is configured to collect an actual image, input the actual image into the accelerated release reservoir positioning model, and obtain the accelerated release reservoir position information.
Further, in one embodiment of the present invention, the building module 100 comprises: the acquiring unit 101 is configured to acquire coarse positioning images of multiple accelerated release air cylinders as an original image set; the marking unit 102 is configured to mark an area where the accelerated release reservoir is located in the original image set as a marking information set, where the original image set corresponds to the marking information set one to one; and the amplification unit 103 is configured to amplify the original image set and the labeled information set by using an amplification algorithm to obtain a sample data set.
Further, in one embodiment of the present invention, the training module 200 comprises: an input unit 201, configured to input the sample data set into an improved SSD network to obtain a 5-layer feature map; a constructing unit 202, configured to respectively construct two convolution kernels with a size of 3 × 3 on the 5-layer feature map as a regression predictor and a multi-class predictor; the prediction and screening unit 203 is used for predicting the prediction frame anchor of each position on a preset feature map by using a regression predictor and a multi-classification predictor, generating a default frame for each feature map by using a default frame generation rule to obtain a fixed number of output predictions, and screening the output predictions by using a non-maximum value to inhibit NMS and a top _ k threshold value to obtain an output frame; and the iteration unit 204 is configured to sample the output frame by using a random sampling or difficult sampling method to calculate the loss function, calculate a gradient value of the loss function, update parameters according to the gradient value, and perform iterative training until the maximum iteration number is reached, so as to obtain an accelerated release reservoir positioning model.
Further, in an embodiment of the present invention, the IOUs corresponding to each real frame ground channel box are sorted in a descending order, then predicted frames anchor corresponding to the first M IOUs are taken as samples, each predicted frame anchor has a chance of matching with the real frame ground channel only once, M is a positive integer, and the IOU is an intersection ratio of the predicted frame and the real frame, wherein the input of the matching mechanism of the improved SSD network is: an IOU set and a parameter N corresponding to each predicted frame anchor and a real frame ground channel box; the matching mechanism output of the improved SSD network is: and the index of the real frame group truth box is obtained by matching each predicted frame anchor.
Further, in an embodiment of the present invention, the obtaining module 300 includes: an acquisition unit 301 for acquiring an actual image; and the positioning and output unit 302 is configured to scale the actual image into an image with a size of 1600 × 256, input the scaled image into an accelerated release reservoir positioning model to perform positioning detection on the accelerated release reservoir, perform threshold judgment on an output result, and if the aspect ratio of the position frame is greater than 0.8 and smaller than 1.2, determine that the positioned reservoir is an accelerated release reservoir, and further output position information of the accelerated release reservoir.
It should be noted that the explanation of the foregoing embodiment of the image identification method for positioning the accelerated release reservoir of the railway train is also applicable to the system, and is not repeated here.
According to the image recognition system for positioning the accelerated release reservoir of the railway train, provided by the embodiment of the invention, the position of the accelerated release reservoir is positioned in an automatic image recognition mode, so that the detection efficiency and stability can be improved compared with manual maintenance on one hand, and the detection accuracy can be improved along with the continuous maturity of deep learning and artificial intelligence technology on the other hand.
Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims (10)

1. An image identification method for positioning an accelerated release reservoir of a railway train is characterized by comprising the following steps:
step S1, constructing a sample data set of the accelerated release reservoir;
step S2, training an improved SSD network by using the sample data set to obtain an accelerated release reservoir positioning model, wherein the improved SSD model has the structure that: resnet50 is used as a backbone network, a BN layer, a pooling layer and a classification output layer at the top layer are deleted, two convolution layers are additionally added, the size of a convolution kernel is 3 x 3, and the step length is 2;
and step S3, acquiring an actual image, inputting the actual image into the accelerated release reservoir positioning model, and acquiring the position information of the accelerated release reservoir.
2. The image recognition method for positioning the accelerated release reservoir of a railroad train according to claim 1, wherein the step S1 specifically comprises:
acquiring coarse positioning images of a plurality of accelerated release air cylinders as an original image set;
marking the area where the accelerated release reservoir is located in the original image set as a marked information set, wherein the original image set corresponds to the marked information set one by one;
and amplifying the original image set and the marked information set by using an amplification algorithm to obtain the sample data set.
3. The image recognition method for positioning the accelerated release reservoir of a railroad train according to claim 1, wherein the step S2 specifically comprises:
inputting the sample data set into the improved SSD network to obtain a 5-layer characteristic diagram;
respectively constructing two convolution kernels with the size of 3 x 3 on the 5 layers of feature maps as a regression predictor and a multi-classification predictor;
predicting a prediction frame anchor of each position on a preset feature map by using the regression predictor and the multi-classification predictor, generating a default frame for each feature map by using a default frame generation rule to obtain a fixed number of output predictions, and screening the output predictions by using a non-maximum value to inhibit NMS and a top _ k threshold value to obtain an output frame;
sampling the output frame by a random sampling or difficult sampling method to calculate a loss function, calculating a gradient value of the loss function, updating parameters according to the gradient value, and performing iterative training until the maximum iteration number is reached to obtain the accelerated release reservoir positioning model.
4. The method of claim 3, wherein the algorithm matching strategy in the improved SSD network is: the IOU corresponding to each real frame ground channel box is subjected to descending order, then predicted frames anchor corresponding to the first M IOUs are taken as samples, each predicted frame anchor has the chance of being matched with the real frame ground channel only once, M is a positive integer, and the IOU is the intersection and comparison of the predicted frames and the real frames, wherein the input of a matching mechanism of the improved SSD network is as follows: an IOU set and a parameter N corresponding to each predicted frame anchor and a real frame ground channel box; the matching mechanism output of the improved SSD network is: and the index of the real frame group truth box is obtained by matching each predicted frame anchor.
5. The image recognition method for positioning the accelerated release reservoir of a railroad train according to claim 1, wherein the step S3 specifically comprises:
collecting the actual image;
and zooming the actual image into an image with the size of 1600 × 256, inputting the zoomed image into the accelerated release reservoir positioning model for positioning detection of the accelerated release reservoir, judging a threshold value of an output result, and if the length-width ratio of a position frame is larger than 0.8 and smaller than 1.2, determining that the positioned reservoir is the accelerated release reservoir, and further outputting the position information of the accelerated release reservoir.
6. An image recognition system for accelerated release reservoir positioning of a railroad train, comprising:
the construction module is used for constructing a sample data set for accelerating the relieving of the reservoir;
a training module, configured to train an improved SSD network by using the sample data set, to obtain an accelerated release reservoir location model, where the improved SSD model has a structure: resnet50 is used as a backbone network, a BN layer, a pooling layer and a classification output layer at the top layer are deleted, two convolution layers are additionally added, the size of a convolution kernel is 3 x 3, and the step length is 2;
and the acquisition module is used for acquiring an actual image, inputting the actual image into the accelerated release reservoir positioning model and acquiring the accelerated release reservoir position information.
7. The image recognition system for railroad train accelerated release reservoir location of claim 6, wherein the building module comprises:
the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is used for acquiring rough positioning images of a plurality of accelerated release air cylinders as an original image set;
the marking unit is used for marking the area where the accelerated release reservoir is located in the original image set as a marking information set, wherein the original image set corresponds to the marking information set in a one-to-one mode;
and the amplification unit is used for amplifying the original image set and the marked information set by using an amplification algorithm to obtain the sample data set.
8. The image recognition system for accelerated release reservoir positioning of a railroad train of claim 6, wherein the training module comprises:
an input unit, configured to input the sample data set into the improved SSD network, to obtain a 5-layer feature map;
the construction unit is used for respectively constructing two convolution kernels with the size of 3 x 3 on the 5 layers of feature maps as a regression predictor and a multi-classification predictor;
the prediction and screening unit is used for predicting a prediction frame anchor of each position on a preset feature map by using the regression predictor and the multi-classification predictor, generating a default frame for each feature map by using a default frame generation rule to obtain a fixed number of output predictions, and screening the output predictions by using a non-maximum value to inhibit NMS and a top _ k threshold value to obtain an output frame;
and the iteration unit is used for sampling the output frame by a random sampling or difficult sampling method to calculate a loss function, calculating a gradient value of the loss function, updating parameters according to the gradient value, performing iterative training until the maximum iteration number is reached, and obtaining the accelerated release reservoir positioning model.
9. The image recognition system for accelerated release reservoir location of a railroad train of claim 8, wherein the algorithm matching strategy in the modified SSD network is: the IOU corresponding to each real frame ground channel box is subjected to descending order, then predicted frames anchor corresponding to the first M IOUs are taken as samples, each predicted frame anchor has the chance of being matched with the real frame ground channel only once, M is a positive integer, and the IOU is the intersection and comparison of the predicted frames and the real frames, wherein the input of a matching mechanism of the improved SSD network is as follows: an IOU set and a parameter N corresponding to each predicted frame anchor and a real frame ground channel box; the matching mechanism output of the improved SSD network is: and the index of the real frame group truth box is obtained by matching each predicted frame anchor.
10. The image recognition system for accelerated release reservoir positioning of a railroad train of claim 1, wherein the acquisition module comprises:
the acquisition unit is used for acquiring the actual image;
and the positioning and output unit is used for zooming the actual image into an image with the size of 1600 × 256, inputting the zoomed image into the accelerated release reservoir positioning model for positioning detection of the accelerated release reservoir, judging a threshold value of an output result, and if the length-width ratio of the position frame is greater than 0.8 and smaller than 1.2, determining that the positioned reservoir is the accelerated release reservoir, and further outputting the position information of the accelerated release reservoir.
CN202110056516.8A 2021-01-15 2021-01-15 Image identification method and system for positioning air reservoir for accelerating and relieving railway train Active CN112766260B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110056516.8A CN112766260B (en) 2021-01-15 2021-01-15 Image identification method and system for positioning air reservoir for accelerating and relieving railway train

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110056516.8A CN112766260B (en) 2021-01-15 2021-01-15 Image identification method and system for positioning air reservoir for accelerating and relieving railway train

Publications (2)

Publication Number Publication Date
CN112766260A true CN112766260A (en) 2021-05-07
CN112766260B CN112766260B (en) 2021-09-14

Family

ID=75701955

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110056516.8A Active CN112766260B (en) 2021-01-15 2021-01-15 Image identification method and system for positioning air reservoir for accelerating and relieving railway train

Country Status (1)

Country Link
CN (1) CN112766260B (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109300114A (en) * 2018-08-30 2019-02-01 西南交通大学 The minimum target components of high iron catenary support device hold out against missing detection method
CN110197113A (en) * 2019-03-28 2019-09-03 杰创智能科技股份有限公司 A kind of method for detecting human face of high-precision anchor point matching strategy
CN110796643A (en) * 2019-10-18 2020-02-14 四川大学 Rail fastener defect detection method and system
CN111027260A (en) * 2019-12-24 2020-04-17 大连圣力来监测技术有限公司 Reciprocating compressor fault diagnosis method based on neural network
US20200124581A1 (en) * 2018-10-19 2020-04-23 The Climate Corporation Detecting infection of plant diseases by classifying plant photos
CN111079627A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Railway wagon brake beam body breaking fault image identification method
CN111080608A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Method for recognizing closing fault image of automatic brake valve plug handle of railway wagon in derailment
CN111079819A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Method for judging state of coupler knuckle pin of railway wagon based on image recognition and deep learning
CN111079746A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Railway wagon axle box spring fault image identification method
CN111091551A (en) * 2019-12-12 2020-05-01 哈尔滨市科佳通用机电股份有限公司 Method for detecting loss fault of brake beam strut opening pin of railway wagon
CN111126472A (en) * 2019-12-18 2020-05-08 南京信息工程大学 Improved target detection method based on SSD
CN111860623A (en) * 2020-07-03 2020-10-30 北京林业大学 Method and system for counting tree number based on improved SSD neural network

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109300114A (en) * 2018-08-30 2019-02-01 西南交通大学 The minimum target components of high iron catenary support device hold out against missing detection method
US20200393435A1 (en) * 2018-10-19 2020-12-17 The Climate Corporation Detecting infection of plant diseases by classifying plant photos
US20200124581A1 (en) * 2018-10-19 2020-04-23 The Climate Corporation Detecting infection of plant diseases by classifying plant photos
CN110197113A (en) * 2019-03-28 2019-09-03 杰创智能科技股份有限公司 A kind of method for detecting human face of high-precision anchor point matching strategy
CN110796643A (en) * 2019-10-18 2020-02-14 四川大学 Rail fastener defect detection method and system
CN111091551A (en) * 2019-12-12 2020-05-01 哈尔滨市科佳通用机电股份有限公司 Method for detecting loss fault of brake beam strut opening pin of railway wagon
CN111080608A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Method for recognizing closing fault image of automatic brake valve plug handle of railway wagon in derailment
CN111079819A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Method for judging state of coupler knuckle pin of railway wagon based on image recognition and deep learning
CN111079746A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Railway wagon axle box spring fault image identification method
CN111079627A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Railway wagon brake beam body breaking fault image identification method
CN111126472A (en) * 2019-12-18 2020-05-08 南京信息工程大学 Improved target detection method based on SSD
CN111027260A (en) * 2019-12-24 2020-04-17 大连圣力来监测技术有限公司 Reciprocating compressor fault diagnosis method based on neural network
CN111860623A (en) * 2020-07-03 2020-10-30 北京林业大学 Method and system for counting tree number based on improved SSD neural network

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
WEI LIU ET AL.: "SSD: Single Shot MultiBox Detector", 《HTTPS://ARXIV.ORG/ABS/1512.02325》 *
李萍 等: "基于深度学习的铁路列车关键零部件图像故障检测", 《铁道科学与工程学报》 *
王士凯: "《铁路货车站修技术问答》", 31 October 1998, 中国铁道出版社 *

Also Published As

Publication number Publication date
CN112766260B (en) 2021-09-14

Similar Documents

Publication Publication Date Title
CN110827251B (en) Power transmission line locking pin defect detection method based on aerial image
CN111353413B (en) Low-missing-report-rate defect identification method for power transmission equipment
CN109977812B (en) Vehicle-mounted video target detection method based on deep learning
CN108647655B (en) Low-altitude aerial image power line foreign matter detection method based on light convolutional neural network
CN107609525B (en) Remote sensing image target detection method for constructing convolutional neural network based on pruning strategy
CN109767423B (en) Crack detection method for asphalt pavement image
CN109559302A (en) Pipe video defect inspection method based on convolutional neural networks
CN110163213B (en) Remote sensing image segmentation method based on disparity map and multi-scale depth network model
CN113160062B (en) Infrared image target detection method, device, equipment and storage medium
CN109446925A (en) A kind of electric device maintenance algorithm based on convolutional neural networks
CN108921875A (en) A kind of real-time traffic flow detection and method for tracing based on data of taking photo by plane
CN108039044B (en) Vehicle intelligent queuing system and method based on multi-scale convolutional neural network
CN109214308A (en) A kind of traffic abnormity image identification method based on focal loss function
CN108648210B (en) Rapid multi-target detection method and device under static complex scene
CN112232351A (en) License plate recognition system based on deep neural network
CN112733742A (en) Deep learning-based fault detection method for round pin of lower pull rod of railway wagon
CN111091551A (en) Method for detecting loss fault of brake beam strut opening pin of railway wagon
CN116342894A (en) GIS infrared feature recognition system and method based on improved YOLOv5
CN114612803B (en) Improved CENTERNET transmission line insulator defect detection method
CN115719475A (en) Three-stage trackside equipment fault automatic detection method based on deep learning
CN112766260B (en) Image identification method and system for positioning air reservoir for accelerating and relieving railway train
CN114782915A (en) Intelligent automobile end-to-end lane line detection system and equipment based on auxiliary supervision and knowledge distillation
CN114359167A (en) Insulator defect detection method based on lightweight YOLOv4 in complex scene
CN103605960A (en) Traffic state identification method based on fusion of video images with different focal lengths
CN107705330A (en) Visibility intelligence estimating and measuring method based on road camera

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant