CN116091870A - Network training and detection method, system and medium for identifying and detecting damage faults of the slave plate seat


Info

Publication number: CN116091870A (granted as CN116091870B)
Application number: CN202310184867.6A
Authority: CN (China)
Original language: Chinese (zh)
Inventor: 王璐 (Wang Lu)
Applicant and current assignee: Harbin Kejia General Mechanical and Electrical Co., Ltd.
Prior art keywords: shape, training, image, loss function, plate seat
Legal status: Active, granted (the legal status listed is an assumption by Google Patents, not a legal conclusion)

Classifications

    • G06V10/774 Generating sets of training patterns; bootstrap methods, e.g. bagging or boosting
    • G06N3/08 Computing arrangements based on biological models; neural networks; learning methods
    • G06V10/26 Image preprocessing; segmentation of patterns in the image field
    • G06V10/82 Image or video recognition or understanding using neural networks
    • G06V20/56 Context or environment of the image exterior to a vehicle, by using sensors mounted on the vehicle
    • G06V2201/07 Target detection
    • Y02T10/40 Engine management systems


Abstract

The invention provides a network training and detection method, system and medium for identifying damage faults of the slave plate seat, and relates to the technical field of image processing. Existing slave plate seat images are subject to various natural interferences, such as illumination, rain and mud stains, which prevent the model from concentrating on the overall shape during training; this degrades the training effect and leaves the trained model with a low recognition accuracy. The improved loss function filters out the influence of individual pixel points on recognition, so the model focuses more on the overall shape; a more accurate model is thus obtained with the same training set and the same number of training iterations, and the accuracy of slave plate seat damage fault recognition is improved. The improved loss function also alleviates, to a certain extent, the problem caused by the imbalance of positive and negative samples during training.

Description

Network training and detecting method, system and medium for identifying and detecting damage faults of slave plate seat
Technical Field
The invention relates to the technical field of image processing, and in particular to a network training and detection method, system and medium for identifying and detecting slave plate seat breakage faults.
Background
The slave plate seat fixes the coupler draft gear, which realizes the coupling and traction between a locomotive and a vehicle, or between two vehicles, and must bear the traction and impact forces generated while the train runs or during shunting operations. To ensure stable and safe train operation, the validity and integrity of the slave plate seat must be inspected, and once faults such as breakage or rivet loss are found they must be handled quickly. At present, slave plate seat faults are checked by manually reviewing images; inspectors easily become fatigued and overlook defects during this work, leading to missed and false detections that affect driving safety, and manual inspection is inefficient, requiring a large amount of time for fault checking.
With the development of deep learning technology, automated detection can be realized in many fields. When deep learning is applied to detecting slave plate seat damage faults, various natural interferences such as illumination, rain and mud stains make some pixel points in the acquired slave plate seat images abnormal; these pixels prevent the model from concentrating on the overall shape, which degrades training and leaves the trained model with a low recognition accuracy.
Disclosure of Invention
The purpose of the invention is to solve the problem that various natural interferences in existing slave plate seat images, such as illumination, rain and mud stains, prevent the model from concentrating on the overall shape during training, which degrades the training effect and leaves the trained model with a low recognition accuracy.
The technical scheme adopted by the invention to solve the above technical problem is as follows:
A network training method for identifying and detecting slave plate seat breakage faults, comprising:
acquiring railway train slave plate seat area images and constructing a training set from the slave plate seat area images, and
training a Mask-RCNN network with the training set;
the loss function of the Mask-RCNN network is expressed as:
L = θ·L_Focal + (1 - θ)·L_shape
where θ is an adjustable parameter between 0 and 1; L_Focal denotes the Focal Loss function and L_shape denotes the shape loss function (an improved Dice-type loss);
the Focal Loss function is expressed as:
FL(p_t) = -α_t·(1 - p_t)^γ·log(p_t)
where p_t is the confidence with which the model predicts a certain class t, 0 < p_t < 1; α_t and γ are adjustable parameters, with γ ≥ 1.
Further, the shape loss function is expressed as:
L_shape = ( (1/|B-A|)·Σ_{x∈B-A} ‖x - x̄_A‖ - (1/|A-B|)·Σ_{x∈A-B} ‖x - x̄_B‖ )²
where A is the set of actual target pixels, i.e. the shape of the actual target, shape A for short; B is the set of pixels the model predicts as target, i.e. the shape of the predicted target, shape B for short; x is the coordinate of each pixel point within the given range; x̄_A = (1/|A|)·Σ_{x∈A} x is the average of the coordinates of the points of shape A, i.e. the geometric center of shape A, and x̄_B is defined likewise for shape B; B-A is the set of points in B that are not also in A, i.e. B minus A∩B, and A-B likewise, with |B-A| and |A-B| their cardinalities. The first term is the average distance from the geometric center of shape A of the points of shape B that are not in shape A, and the second term is the analogous average for the points of shape A that are not in shape B; the difference of the two terms, squared, gives the improved shape loss function, and its value characterizes the deviation of the model's prediction from the actual target.
Further, the steps of acquiring railway train slave plate seat area images, constructing a training set from the slave plate seat area images, and training the Mask-RCNN network with the training set are specifically:
Step 1: capture side and bottom images of the railway train;
Step 2: crop the slave plate seat area from the side and bottom images of the railway train, and simulate fault images based on the slave plate seat area images;
Step 3: augment the fault images, and build the training set from the augmented fault images and the slave plate seat area images;
Step 4: train the Mask-RCNN network with the training set.
Further, when cropping the slave plate seat area image in Step 2, the start and end positions of the slave plate seat area are determined from the wheelbase information, and the area is cropped according to these positions.
Further, the image augmentation includes: translation, scaling, and sharpening.
A method for identifying and detecting slave plate seat damage faults, comprising:
acquiring a railway train slave plate seat area image as the image to be detected, performing segmentation detection on the image to be detected with the trained Mask-RCNN network, and judging whether a fault exists according to the detection result;
the loss function of the Mask-RCNN network is expressed as:
L = θ·L_Focal + (1 - θ)·L_shape
where θ is an adjustable parameter between 0 and 1; L_Focal denotes the Focal Loss function and L_shape denotes the shape loss function (an improved Dice-type loss);
the Focal Loss function is expressed as:
FL(p_t) = -α_t·(1 - p_t)^γ·log(p_t)
where p_t is the confidence with which the model predicts a certain class t, 0 < p_t < 1; α_t and γ are adjustable parameters, with γ ≥ 1.
Further, the shape loss function is expressed as:
L_shape = ( (1/|B-A|)·Σ_{x∈B-A} ‖x - x̄_A‖ - (1/|A-B|)·Σ_{x∈A-B} ‖x - x̄_B‖ )²
where A is the set of actual target pixels, i.e. the shape of the actual target, shape A for short; B is the set of pixels the model predicts as target, i.e. the shape of the predicted target, shape B for short; x is the coordinate of each pixel point within the given range; x̄_A = (1/|A|)·Σ_{x∈A} x is the average of the coordinates of the points of shape A, i.e. the geometric center of shape A, and x̄_B is defined likewise for shape B; B-A is the set of points in B that are not also in A, i.e. B minus A∩B, and A-B likewise, with |B-A| and |A-B| their cardinalities. The first term is the average distance from the geometric center of shape A of the points of shape B that are not in shape A, and the second term is the analogous average for the points of shape A that are not in shape B; the difference of the two terms, squared, gives the improved shape loss function, and its value characterizes the deviation of the model's prediction from the actual target.
Further, the training steps of the Mask-RCNN network are as follows:
Step 1: capture side and bottom images of the railway train;
Step 2: crop the slave plate seat area from the side and bottom images of the railway train, and simulate fault images based on the slave plate seat area images;
Step 3: augment the fault images, and build the training set from the augmented fault images and the slave plate seat area images;
Step 4: train the Mask-RCNN network with the training set.
A system for identifying and detecting slave plate seat breakage faults, comprising: an acquisition module, a training module and a detection module;
the acquisition module is used for acquiring a railway train slave plate seat area image and taking the acquired slave plate seat area image as the image to be detected;
the training module is used for training the Mask-RCNN network;
the detection module is used for performing segmentation detection on the image to be detected with the trained Mask-RCNN network and judging whether a fault exists according to the detection result;
the loss function of the Mask-RCNN network is expressed as:
L = θ·L_Focal + (1 - θ)·L_shape
where θ is an adjustable parameter between 0 and 1; L_Focal denotes the Focal Loss function and L_shape denotes the shape loss function.
A computer-readable storage medium in which a computer-readable program is stored for performing any one of the methods described above.
The beneficial effects of the invention are as follows:
the improved loss function can filter the influence of individual pixel points on the recognition, so that the model is more focused on the overall shape, a more accurate model can be obtained under the same training set and the same training times, and the accuracy of the recognition of the damaged faults of the slave plate seat is improved. The problem that the unbalance of positive and negative samples caused in the training process can also be solved to a certain extent to the loss function after the application is improved, so that the application deep learning network identifies the slave plate seat state and realizes fault alarm, thereby improving the detection efficiency and the stability and the safety of train operation. The method replaces manual detection by utilizing the mode of automatic image identification, and reduces labor cost. The deep learning algorithm is applied to automatic identification of the fault of the slave board seat, so that the stability and the accuracy of the whole algorithm are improved, and the fault detection time is effectively shortened. In the model training process, a method of combining the shape correlation coefficient and the Focal Loss is adopted as a new Loss function, optimization calculation is carried out, the training effect is improved, and the accuracy of the trained model in identifying the slave plate seat area image is further improved.
Drawings
Fig. 1 is a flow chart of the present application.
Detailed Description
It should be particularly noted that, in the absence of conflict, the embodiments disclosed herein may be combined with each other.
Embodiment 1: the network training method for identifying and detecting slave plate seat breakage faults according to this embodiment comprises:
acquiring railway train slave plate seat area images and constructing a training set from the slave plate seat area images, and
training a Mask-RCNN network with the training set;
the loss function of the Mask-RCNN network is expressed as:
L = θ·L_Focal + (1 - θ)·L_shape
where θ is an adjustable parameter between 0 and 1; L_Focal denotes the Focal Loss function and L_shape denotes the shape loss function.
The Focal Loss function is expressed as:
FL(p_t) = -α_t·(1 - p_t)^γ·log(p_t)
where p_t is the confidence with which the model predicts a certain class t, 0 < p_t < 1; α_t and γ are adjustable parameters used, respectively, to balance the number of positive and negative samples and to adjust the imbalance between hard and easy samples, with γ ≥ 1.
However, railway freight cars have a particularity: their probability of failure is low, so normal pictures make up a very large proportion of the acquired data set while fault (negative) samples are sparse. The loss function of the model is improved for this characteristic. When replacing and improving the loss function, the Focal Loss function is often used to address the problems caused by sample imbalance during training.
The Focal Loss function can be expressed as:
FL(p_t) = -α_t·(1 - p_t)^γ·log(p_t)
where p_t is the confidence with which the model predicts a certain class t, 0 < p_t < 1; α_t and γ are adjustable parameters used, respectively, to balance the number of positive and negative samples and to adjust the imbalance between hard and easy samples, with γ ≥ 1.
For a class with a high probability of occurrence, the model identifies targets as that class with higher probability, i.e. p_t is closer to 1; such a class is usually given a smaller α_t value to reduce its influence on the overall loss computation. At the same time, the larger p_t is, the smaller 1 - p_t becomes, and since γ ≥ 1, (1 - p_t)^γ shrinks the loss of easy samples further still by a power law. Combined, these two factors alleviate, to a certain extent, the imbalance of positive and negative samples during training.
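As a concrete illustration of this down-weighting, the Focal Loss term can be sketched in a few lines of NumPy (the library choice and the defaults α_t = 0.25, γ = 2 are illustrative assumptions, not values taken from the patent):

```python
import numpy as np

def focal_loss(p_t, alpha_t=0.25, gamma=2.0):
    """Focal Loss FL(p_t) = -alpha_t * (1 - p_t)**gamma * log(p_t).

    p_t is the model's confidence for the true class t (0 < p_t < 1);
    alpha_t and gamma (gamma >= 1) are the adjustable parameters from the text.
    The defaults are illustrative, not the patent's values.
    """
    p_t = np.asarray(p_t, dtype=float)
    return -alpha_t * (1.0 - p_t) ** gamma * np.log(p_t)

# An easy, well-classified sample (p_t close to 1) is down-weighted far
# more than a hard, uncertain one, which is how the imbalance is mitigated.
easy = focal_loss(0.95)   # confident, correct prediction
hard = focal_loss(0.30)   # uncertain prediction
```

With these defaults the loss of the confident prediction is orders of magnitude smaller than that of the uncertain one, which is exactly the re-weighting the paragraph above describes; raising γ shrinks easy-sample losses even further.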
Embodiment 2: this embodiment further describes Embodiment 1; the difference from Embodiment 1 is that the shape loss function is expressed as:
L_shape = ( (1/|B-A|)·Σ_{x∈B-A} ‖x - x̄_A‖ - (1/|A-B|)·Σ_{x∈A-B} ‖x - x̄_B‖ )²
where A is the set of actual target pixels, i.e. the shape of the actual target, shape A for short; B is the set of pixels the model predicts as target, i.e. the shape of the predicted target, shape B for short; x is the coordinate of each pixel point within the given range; x̄_A = (1/|A|)·Σ_{x∈A} x is the average of the coordinates of the points of shape A, i.e. the geometric center of shape A, and x̄_B is defined likewise for shape B; B-A is the set of points in B that are not also in A, i.e. B minus A∩B, and A-B likewise, with |B-A| and |A-B| their cardinalities. The first term is the average distance from the geometric center of shape A of the points of shape B that are not in shape A, and the second term is the analogous average for the points of shape A that are not in shape B; the difference of the two terms, squared, gives the improved shape loss function, and its value characterizes the deviation of the model's prediction from the actual target.
Since segmentation detection is performed, a judgment of shape similarity is introduced into the loss function on top of the Focal Loss. Conventional losses such as Dice Loss are designed for pixel-level shape similarity, but for slave plate seat fault recognition the target morphology is not completely fixed, and the image quality may be degraded, for example by shadows, occlusion or noise; with a conventional shape-similarity analysis method, changes in a few key pixels can distort the training result. A new shape-similarity loss function is therefore proposed, which can be expressed as:
L_shape = ( (1/|B-A|)·Σ_{x∈B-A} ‖x - x̄_A‖ - (1/|A-B|)·Σ_{x∈A-B} ‖x - x̄_B‖ )²
where A is the set of actual target pixels, i.e. the shape of the actual target, hereafter shape A for short; B is the set of pixels the model predicts as target, i.e. the shape of the predicted target, hereafter shape B for short; x is the coordinate of each pixel point within the given range; x̄_A = (1/|A|)·Σ_{x∈A} x is the average of the coordinates of the points of shape A, i.e. the geometric center of shape A, and x̄_B is defined likewise for shape B; B-A is the set of points in B that are not also in A, i.e. B minus A∩B, and A-B likewise; the first term is the average distance from the geometric center of shape A of the points of shape B that are not in shape A, and the second term is the analogous average for the points of shape A that are not in shape B. The difference of the two terms, squared, gives the improved shape loss function.
From this formula it can be seen that if shape A and shape B are similar overall, the two terms should be close and the loss function is close to 0; conversely, if the shapes differ greatly, the difference between the two terms is large, the loss function is large, the deviation between the model's prediction and the actual target is excessive, and the model requires further optimization or training. It can also be seen that if only a few pixels change, the overall value is not greatly affected, which reduces the influence of individual key pixels on the loss function computation.
Embodiment 3: this embodiment further describes Embodiment 2; the difference from Embodiment 2 is that the steps of acquiring railway train slave plate seat area images and constructing a training set from them, and of training the Mask-RCNN network with the training set, are specifically:
Step 1: capture side and bottom images of the railway train;
Step 2: crop the slave plate seat area from the side and bottom images of the railway train, and simulate fault images based on the slave plate seat area images;
Step 3: augment the fault images, and build the training set and the test set from the augmented fault images and the slave plate seat area images;
Step 4: train the Mask-RCNN network with the training set and the test set.
Embodiment 4: this embodiment further describes Embodiment 3; the difference from Embodiment 3 is that, when cropping the slave plate seat area image in Step 2, the start and end positions of the slave plate seat area are determined from the wheelbase information, and the area is cropped according to these positions.
Embodiment 5: this embodiment further describes Embodiment 4; the difference from Embodiment 4 is that the image augmentation includes: translation, scaling, and sharpening.
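The three augmentations named above (translation, scaling, sharpening) can each be sketched in plain NumPy; a real pipeline would typically use an image library, and the zero fill, nearest-neighbour resampling and sharpening kernel here are illustrative assumptions:

```python
import numpy as np

def translate(img, dy, dx):
    """Shift the image by (dy, dx), filling the vacated area with zeros."""
    h, w = img.shape
    out = np.zeros_like(img)
    ys = slice(max(dy, 0), h + min(dy, 0))
    xs = slice(max(dx, 0), w + min(dx, 0))
    ys0 = slice(max(-dy, 0), h - max(dy, 0))
    xs0 = slice(max(-dx, 0), w - max(dx, 0))
    out[ys, xs] = img[ys0, xs0]
    return out

def zoom(img, factor):
    """Nearest-neighbour rescale by `factor` (a crude stand-in for scaling)."""
    h, w = img.shape
    nh, nw = max(1, int(h * factor)), max(1, int(w * factor))
    rows = (np.arange(nh) / factor).astype(int).clip(0, h - 1)
    cols = (np.arange(nw) / factor).astype(int).clip(0, w - 1)
    return img[np.ix_(rows, cols)]

def sharpen(img):
    """3x3 sharpening: 5x centre minus the 4-neighbourhood, edge-padded."""
    p = np.pad(img.astype(float), 1, mode="edge")
    return (5 * p[1:-1, 1:-1]
            - p[:-2, 1:-1] - p[2:, 1:-1]
            - p[1:-1, :-2] - p[1:-1, 2:])

img = np.arange(16.0).reshape(4, 4)   # tiny stand-in for a grayscale crop
```

Because the sharpening kernel sums to 1, flat regions are left unchanged while edges are amplified; the zero fill after translation is one common convention among several.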
Embodiment 6: the method for identifying and detecting slave plate seat damage faults according to this embodiment comprises the following steps:
acquiring a railway train slave plate seat area image as the image to be detected, performing segmentation detection on the image to be detected with the trained Mask-RCNN network, and judging whether a fault exists according to the detection result;
the loss function of the Mask-RCNN network is expressed as:
L = θ·L_Focal + (1 - θ)·L_shape
where θ is an adjustable parameter between 0 and 1; L_Focal denotes the Focal Loss function and L_shape denotes the shape loss function.
The overall idea of this application is as follows. High-definition line-scan imaging equipment is installed on both sides of the railway line, and a sensor triggers image acquisition when a train passes. Using the wheelbase information and prior knowledge of the slave plate seat position, the target sub-image is cropped and fault images are augmented to construct a data set. A deep learning network model is trained on this data set, the acquired images are recognized with the model, and the fault form and position are determined. The recognition result is mapped back into the original image according to the mapping relation and an alarm is uploaded; the staff handle the result accordingly to ensure safe train operation. In actual use, a complete picture of the train side is acquired and the target image to be detected is cropped out; detection through the network model yields a result, which is analyzed to judge whether a fault exists, and if so, an alarm must be raised.
Specifically, the step of acquiring the railway train slave plate seat area image as the image to be detected generally comprises image acquisition and region cropping:
The image acquisition step is specifically:
Build high-speed imaging equipment to acquire high-definition line-scan grayscale images of each part of the freight car, and stitch them into complete pictures of the side and bottom of the train. Images are collected in different environments and at different times to expand the samples; the data samples thus contain various natural interferences such as illumination, rain and mud stains, which strengthens the robustness of the algorithm so that it can be applied under different operating conditions of the train.
The region cropping step is specifically:
The slave plate seat targets to be detected are distributed only on the side of the train, and their distribution has certain characteristics. The possible start and end positions of the slave plate seat targets are therefore determined from prior knowledge such as the wheelbase information, and the target region is cropped accordingly to obtain the target image. This reduces the size of the image to be recognized, increases the relative proportion of the target in the recognition image, and reduces interference, which benefits model training and can improve the recognition rate.
In the algorithm models used in deep learning, a loss function is needed to measure the model's performance and characterize the error between the model's prediction and the true value; the weight parameters are adjusted according to the value computed by the loss function in order to obtain a more accurate model.
For slave plate seat fault recognition, the Mask-RCNN segmentation algorithm is used, which is a two-stage target detection algorithm; commonly used loss functions include the cross entropy loss function and the Smooth L1 loss function.
The cross entropy loss function can be expressed as:
H(p, q) = -Σ_i p(x_i)·log q(x_i)
where x_i is the i-th sample, p(x_i) is the true distribution of the samples, and q(x_i) is the distribution predicted by the model.
The Smooth L1 loss function can be expressed as:
SmoothL1(x) = 0.5·x², if |x| < 1; |x| - 0.5, otherwise.
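For reference, the two conventional losses just quoted are easy to write down directly; a NumPy sketch:

```python
import numpy as np

def cross_entropy(p, q):
    """H(p, q) = -sum_i p(x_i) * log q(x_i), for distributions p and q."""
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    return float(-np.sum(p * np.log(q)))

def smooth_l1(x):
    """Smooth L1: 0.5*x**2 where |x| < 1, and |x| - 0.5 elsewhere."""
    x = np.asarray(x, dtype=float)
    return np.where(np.abs(x) < 1, 0.5 * x ** 2, np.abs(x) - 0.5)
```

The two branches of Smooth L1 meet at |x| = 1 with the common value 0.5, which is what makes the loss smooth there while staying quadratic near zero and linear for large errors.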
the present application combines the Focal Loss with the proposed shape similarity Loss function, resulting in a new Loss function as follows:
L=θL Focal +(1-θ)L shape
wherein θ is an adjustable parameter, and can be adjusted according to the actual application scene, and is usually between 0 and 1.
The improved loss function can solve the problem caused by unbalance of positive and negative samples in the training process to a certain extent, and can filter the influence of individual pixel points on recognition, so that the model is more focused on the whole shape, a more accurate segmentation model can be obtained under the same training set and the same training times, and the research and development efficiency and the accuracy of plate seat damage fault recognition are improved.
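The combination itself is a one-line weighted sum; θ = 0.5 below is an arbitrary illustrative default, not a value given by the patent:

```python
def combined_loss(l_focal, l_shape, theta=0.5):
    """L = theta * L_Focal + (1 - theta) * L_shape, with theta in (0, 1).

    theta = 1 recovers the pure Focal Loss, theta = 0 the pure shape loss;
    intermediate values trade class-imbalance handling against shape focus.
    """
    return theta * l_focal + (1.0 - theta) * l_shape
```

In practice θ would be tuned for the actual application scenario, as the text notes.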
The Focal Loss function is expressed as:
FL(p_t) = -α_t·(1 - p_t)^γ·log(p_t)
where p_t is the confidence with which the model predicts a certain class t, 0 < p_t < 1; α_t and γ are adjustable parameters, with γ ≥ 1.
seventh embodiment: this embodiment is a further description of the sixth embodiment, and the difference between this embodiment and the sixth embodiment is that the Dice Loss function is expressed as:
Figure BDA0004103366320000091
wherein A represents a set of actual target pixels, namely the shape of an actual target, which is simply called shape A; b represents a set of model prediction target pixels, namely the shape of a prediction target, which is called shape B for short; x represents the coordinates of each pixel point within a given range;
Figure BDA0004103366320000092
representing the average value of the coordinates of each point on the shape A, namely the geometric center coordinate of the shape A; B-A represents the set of points in shape B that are not in shape A at the same time, i.e., B-A n B; i A-B I are the same; />
Figure BDA0004103366320000093
Representing the distance of each point in shape B that is not in shape a from the geometric center of shape a; />
Figure BDA0004103366320000094
And the same is done; the two are subjected to difference and square, and an improved shape loss function is obtained;
Figure BDA0004103366320000095
and->
Figure BDA0004103366320000096
The difference value of (2) is used for representing the deviation between the predicted value of the model and the actual target;
Eighth embodiment: this embodiment further describes the seventh embodiment; the difference from the seventh embodiment is that the Mask-RCNN network is trained as follows:
Step one: capture side and bottom images of the railway train;
Step two: crop the slave board seat area image from the side and bottom images of the railway train, and simulate fault images based on the slave board seat area image;
Step three: perform image amplification on the fault images, and determine a training set based on the amplified fault images and the slave board seat area images;
Step four: train the Mask-RCNN network with the training set.
Detailed description nine: this embodiment provides a slave board seat breakage fault recognition and detection system, comprising: an acquisition module, a training module, and a detection module;
the acquisition module is used for acquiring an image of a slave plate seat area of the railway train and taking the acquired image of the slave plate seat area as an image to be detected;
the training module is used for training the Mask-RCNN network;
the detection module is used for carrying out segmentation detection on the image to be detected by utilizing the trained Mask-RCNN network, and judging whether faults exist or not according to the detection result;
the loss function of the Mask-RCNN network is expressed as:
L=θL Focal +(1-θ)L shape
wherein θ represents an adjustable parameter, θ is between 0 and 1, L Focal Representing the Focal Loss function, L shape Representing the Dice Loss function.
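A hypothetical sketch of how the two loss terms are blended (the helper name and the default value of θ are illustrative, not taken from the patent):

```python
def combined_loss(l_focal, l_shape, theta=0.5):
    """L = theta * L_Focal + (1 - theta) * L_shape, with theta in (0, 1).

    theta trades off the classification-focused Focal term against the
    geometry-focused shape (Dice) term; 0.5 is only a placeholder.
    """
    if not 0.0 < theta < 1.0:
        raise ValueError("theta must lie strictly between 0 and 1")
    return theta * l_focal + (1.0 - theta) * l_shape
```

Choosing θ closer to 1 emphasizes per-pixel classification; choosing it closer to 0 emphasizes the overall shape of the segmented region.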
Detailed description ten: the computer-readable storage medium of this embodiment stores a computer-readable program, and the computer-readable program is used to execute the method of any one of the first to sixth embodiments.
It should be noted that the detailed description is merely for explaining and describing the technical solution of the present invention, and the scope of protection of the claims should not be limited thereto. All changes which come within the meaning and range of equivalency of the claims and the specification are to be embraced within their scope.

Claims (10)

1. A network training method for slave board seat breakage fault identification and detection, characterized by comprising the following steps:
acquiring railway train slave board seat area images and constructing a training set using the slave board seat area images, and
training a Mask-RCNN network using the training set;
the loss function of the Mask-RCNN network is expressed as:

L = θ·L_Focal + (1 − θ)·L_shape

wherein θ represents an adjustable parameter with 0 < θ < 1, L_Focal represents the Focal Loss function, and L_shape represents the Dice Loss function;

the Focal Loss function is expressed as:

FL(p_t) = −α_t (1 − p_t)^γ × log(p_t)

wherein p_t represents the model's predicted confidence for a class t, 0 < p_t < 1; α_t and γ represent adjustable parameters, with γ ≥ 1.
2. The network training method for slave board seat breakage fault identification and detection according to claim 1, wherein the Dice Loss function is expressed as:

L_shape = ( (1/|B−A|) Σ_{x∈B−A} ‖x − c_A‖ − (1/|A−B|) Σ_{x∈A−B} ‖x − c_A‖ )²

wherein A represents the set of actual target pixels, i.e., the shape of the actual target, referred to as shape A for short; B represents the set of pixels the model predicts as target, i.e., the shape of the predicted target, referred to as shape B for short; x represents the coordinates of each pixel point within the given range; c_A = (1/|A|) Σ_{x∈A} x represents the average of the coordinates of the points of shape A, i.e., the geometric center of shape A; B−A represents the set of points that are in shape B but not in shape A, and A−B likewise represents the set of points that are in shape A but not in shape B; (1/|B−A|) Σ_{x∈B−A} ‖x − c_A‖ represents the mean distance, from the geometric center of shape A, of the points of shape B that are not in shape A, and (1/|A−B|) Σ_{x∈A−B} ‖x − c_A‖ the analogous mean for the points of shape A that are not in shape B; the squared difference of these two means constitutes the improved shape loss and characterizes the deviation between the model's prediction and the actual target.
3. The network training method for slave board seat breakage fault identification and detection according to claim 2, wherein the steps of acquiring railway train slave board seat area images, constructing a training set using the slave board seat area images, and training the Mask-RCNN network using the training set are specifically:
Step one: capture side and bottom images of the railway train;
Step two: crop the slave board seat area image from the side and bottom images of the railway train, and simulate fault images based on the slave board seat area image;
Step three: perform image amplification on the fault images, and determine the training set based on the amplified fault images and the slave board seat area images;
Step four: train the Mask-RCNN network with the training set.
4. The network training method for slave board seat breakage fault identification and detection according to claim 3, wherein, in step two, when the slave board seat area image is cropped, the start and end positions of the slave board seat area are determined according to the wheelbase information, and the slave board seat area is cropped according to those start and end positions.
5. The network training method for slave board seat breakage fault identification and detection according to claim 4, wherein the image amplification comprises: panning, zooming, and sharpening.
6. A slave board seat breakage fault identification and detection method, characterized by comprising the following steps:
acquiring a railway train slave board seat area image as the image to be detected, performing segmentation detection on the image to be detected using a trained Mask-RCNN network, and judging whether a fault exists according to the detection result;
the loss function of the Mask-RCNN network is expressed as:

L = θ·L_Focal + (1 − θ)·L_shape

wherein θ represents an adjustable parameter with 0 < θ < 1, L_Focal represents the Focal Loss function, and L_shape represents the Dice Loss function;

the Focal Loss function is expressed as:

FL(p_t) = −α_t (1 − p_t)^γ × log(p_t)

wherein p_t represents the model's predicted confidence for a class t, 0 < p_t < 1; α_t and γ represent adjustable parameters, with γ ≥ 1.
7. The slave board seat breakage fault identification and detection method according to claim 6, wherein the Dice Loss function is expressed as:

L_shape = ( (1/|B−A|) Σ_{x∈B−A} ‖x − c_A‖ − (1/|A−B|) Σ_{x∈A−B} ‖x − c_A‖ )²

wherein A represents the set of actual target pixels, i.e., the shape of the actual target, referred to as shape A for short; B represents the set of pixels the model predicts as target, i.e., the shape of the predicted target, referred to as shape B for short; x represents the coordinates of each pixel point within the given range; c_A = (1/|A|) Σ_{x∈A} x represents the average of the coordinates of the points of shape A, i.e., the geometric center of shape A; B−A represents the set of points that are in shape B but not in shape A, and A−B likewise represents the set of points that are in shape A but not in shape B; (1/|B−A|) Σ_{x∈B−A} ‖x − c_A‖ represents the mean distance, from the geometric center of shape A, of the points of shape B that are not in shape A, and (1/|A−B|) Σ_{x∈A−B} ‖x − c_A‖ the analogous mean for the points of shape A that are not in shape B; the squared difference of these two means constitutes the improved shape loss and characterizes the deviation between the model's prediction and the actual target.
8. The slave board seat breakage fault identification and detection method according to claim 7, wherein the Mask-RCNN network is trained as follows:
Step one: capture side and bottom images of the railway train;
Step two: crop the slave board seat area image from the side and bottom images of the railway train, and simulate fault images based on the slave board seat area image;
Step three: perform image amplification on the fault images, and determine the training set based on the amplified fault images and the slave board seat area images;
Step four: train the Mask-RCNN network with the training set.
9. A failure recognition and detection system for breakage of a slave board seat, characterized by comprising: the device comprises an acquisition module, a training module and a detection module;
the acquisition module is used for acquiring an image of a slave plate seat area of the railway train and taking the acquired image of the slave plate seat area as an image to be detected;
the training module is used for training the Mask-RCNN network;
the detection module is used for carrying out segmentation detection on the image to be detected by utilizing the trained Mask-RCNN network, and judging whether faults exist or not according to detection results;
the loss function of the Mask-RCNN network is expressed as:

L = θ·L_Focal + (1 − θ)·L_shape

wherein θ represents an adjustable parameter with 0 < θ < 1, L_Focal represents the Focal Loss function, and L_shape represents the Dice Loss function.
10. A computer readable storage medium, characterized in that the medium has stored therein a computer readable program for executing the method according to any one of claims 1 to 6.
CN202310184867.6A 2023-03-01 2023-03-01 Network training and detecting method, system and medium for identifying and detecting damage faults of slave plate seat Active CN116091870B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310184867.6A CN116091870B (en) 2023-03-01 2023-03-01 Network training and detecting method, system and medium for identifying and detecting damage faults of slave plate seat


Publications (2)

Publication Number Publication Date
CN116091870A true CN116091870A (en) 2023-05-09
CN116091870B CN116091870B (en) 2023-09-12

Family

ID=86204515

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310184867.6A Active CN116091870B (en) 2023-03-01 2023-03-01 Network training and detecting method, system and medium for identifying and detecting damage faults of slave plate seat

Country Status (1)

Country Link
CN (1) CN116091870B (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111080612A (en) * 2019-12-12 2020-04-28 哈尔滨市科佳通用机电股份有限公司 Truck bearing damage detection method
CN111091547A (en) * 2019-12-12 2020-05-01 哈尔滨市科佳通用机电股份有限公司 Railway wagon brake beam strut fracture fault image identification method
CN112489031A (en) * 2020-12-10 2021-03-12 哈尔滨市科佳通用机电股份有限公司 Mask-rcnn-based oil leakage detection method for snake-shaped-resistant shock absorber
US20210090247A1 (en) * 2019-09-23 2021-03-25 Samsung Sds Co., Ltd. Apparatus and method for medical diagnostic
CN112990085A (en) * 2021-04-08 2021-06-18 海南长光卫星信息技术有限公司 Method and device for detecting change of culture pond and computer readable storage medium
CN113240014A (en) * 2021-05-18 2021-08-10 长春理工大学 Class II segmentation loss function and construction method and application thereof
WO2021179692A1 (en) * 2020-10-22 2021-09-16 平安科技(深圳)有限公司 Head ct image segmentation method and apparatus, electronic device and storage medium
CN114092487A (en) * 2021-10-13 2022-02-25 山东师范大学 Target fruit instance segmentation method and system
CN114511850A (en) * 2021-12-30 2022-05-17 广西慧云信息技术有限公司 Method for identifying image of fruit size and granule of sunshine rose grape
CN115616674A (en) * 2022-10-12 2023-01-17 广州肖宁道路工程技术研究事务所有限公司 Ground penetrating radar training data set simulation amplification and road nondestructive testing method and system
WO2023284340A1 (en) * 2021-07-15 2023-01-19 北京小蝇科技有限责任公司 Method for classifying species and development stage of parasite and classifying image pixel from bottom to top


Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
RUPAL R. AGRAVAT et al.: "3D Semantic Segmentation of Brain Tumor for Overall Survival Prediction", ResearchGate, pages 1 - 11 *
SONG Haoran: "Research on Visual Intelligent Recognition Algorithms for Foreign Objects on Ballastless Track of High-Speed Railway", China Master's Theses Full-text Database, Engineering Science and Technology I, no. 1, pages 026 - 223 *

Also Published As

Publication number Publication date
CN116091870B (en) 2023-09-12

Similar Documents

Publication Publication Date Title
CN115272334B (en) Method for detecting tiny defects on surface of steel rail under complex background
CN109489724B (en) Tunnel train safe operation environment comprehensive detection device and detection method
CN111080598B (en) Bolt and nut missing detection method for coupler yoke key safety crane
US11827228B2 (en) Road abnormality detection apparatus, road abnormality detection method and road abnormality detection program
CN102759347B (en) Online in-process quality control device and method for high-speed rail contact networks and composed high-speed rail contact network detection system thereof
CN109238756B (en) Dynamic image detection equipment and detection method for freight car operation fault
CN111091546B (en) Railway wagon coupler tail frame breaking fault identification method
CN113112501B (en) Vehicle-mounted track inspection device and method based on deep learning
CN109242035B (en) Vehicle bottom fault detection device and method
CN113516629A (en) Intelligent detection system for TFDS passing operation
CN108508023B (en) Defect detection system for contact end jacking bolt in railway contact network
CN115331086A (en) Brake shoe breaking and rivet losing fault detection method
CN114972177A (en) Road disease identification management method and device and intelligent terminal
CN113639685B (en) Displacement detection method, device, equipment and storage medium
CN113487561B (en) Pantograph foreign matter detection method and device based on gray gradient abnormal voting
CN116091870B (en) Network training and detecting method, system and medium for identifying and detecting damage faults of slave plate seat
CN112435289B (en) Method and device for detecting width of rail joint gap of steel rail
CN113191239A (en) Vehicle overall dimension dynamic detection system based on computer vision
KR101560810B1 (en) Space controled method and apparatus for using template image
CN107194923B (en) Ultraviolet image diagnosis method for defect inspection of contact network power equipment
Li et al. Road Damage Evaluation via Stereo Camera and Deep Learning Neural Network
CN110189301B (en) Foreign matter detection method for generator stator core steel sheet stacking platform
CN109064417B (en) Subway tunnel image processing method and device
CN116129374B (en) Multifunctional flaw detection cluster device beside rail and control method thereof
CN113640307B (en) Rail condition monitoring method adopting machine vision

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant