CN117372942A

CN117372942A - Reservoir floater automatic identification method based on improved SegNet model

Info

Publication number: CN117372942A
Application number: CN202310069796.5A
Authority: CN
Inventors: 王龙宝; 张珞弦; 储洪强; 徐淑芳; 毛莺池; 栾茵琪; 杨青青; 徐荟华
Original assignee: Hohai University HHU
Current assignee: Hohai University HHU
Priority date: 2023-02-03
Filing date: 2023-02-03
Publication date: 2024-01-09

Abstract

The invention discloses a reservoir floater identification method based on an improved SegNet model, which comprises the following steps: acquiring a reservoir float data set by using an unmanned aerial vehicle; performing data preprocessing operation on the acquired data set; labeling the data set and dividing the data set into a training set and a testing set; training and improving a SegNet algorithm model, providing a combination of maximum pooling index and residual error connection, more comprehensively retaining a large amount of detail information contained in the multi-scale image, retaining the characteristics of colors, textures, boundaries and the like of the original image to the greatest extent, and improving the identification precision of reservoir floaters; detecting and identifying reservoir floaters; and pushing the reservoir image recognition result to relevant responsible persons and management staff. The beneficial effects of the invention are as follows: the improved SegNet algorithm model is adopted to detect and identify reservoir floaters, the training parameter quantity is moderate, the detection precision is good, and the automatic high-precision detection and identification of reservoir water surface floaters are realized.

Description

Reservoir floater automatic identification method based on improved SegNet model

Technical Field

The invention belongs to the field of image analysis and identification, and particularly relates to an automatic reservoir floater identification method based on an improved SegNet model.

Background

A large amount of floaters in the rainwater period flow into the reservoir area along with flood, so that not only is the water quality polluted, but also the flood discharge safety of spillways and water delivery holes is affected. The large-area floaters bring great hidden trouble to the safe operation of the reservoir in the flood season. At present, the floating objects and water quality change conditions of reservoir operation management mainly depend on manual field inspection, and the manual cost is high and the inspection efficiency is low. In recent 20 years, image monitoring and video monitoring systems are increasingly widely built in reservoirs, but currently, manual interpretation is basically adopted for image and video monitoring. Although image and video monitoring can be used to timely understand the condition of the floating objects in the reservoir site, the potential effectiveness is not exerted far enough, and is mainly realized in the following aspects: the manual interpretation workload is large. Because of the large number of reservoirs and the large number of image and video monitoring sites, the workload of manually judging floats one by one is too great for workers of a county level, a city level and even a provincial level reservoir management unit. Therefore, most of image and video information is only taken as a necessary auxiliary means, and important attention is paid when a reservoir is in a problem; the manual interpretation is not in time. Because the number of sites is large, the problem of large-area floaters is difficult to find in time by manual interpretation, evidence cannot be obtained in time and early warning is carried out, so that the normalized float cleaning work efficiency is low, and the ecological supervision capability of water quality is weak. In the last ten years, the development and application of the deep learning bring great influence to various scientific and technological fields and life fields, and the application start of the deep learning image recognition segmentation in the water conservancy industry is late, and is not mature enough compared with other fields. With the development of the deep learning algorithm and the Internet of things, cloud computing capacity is fully developed, potential of video monitoring is developed, intelligent monitoring of reservoir floaters is achieved, and the method has very important significance for reducing cost and enhancing reservoir safety monitoring management.

Disclosure of Invention

The invention aims to: the intelligent detection inspection level of the water surface floaters of the reservoir is improved, an effective intelligent recognition method of the water surface floaters is provided by utilizing an artificial intelligence emerging technology, the labor cost is reduced, and the standardized intelligent management level of the reservoir is greatly improved.

The technical scheme is as follows: in order to achieve the above purpose, the invention provides an automatic reservoir float identification method based on an improved SegNet model, which comprises the following steps:

s1: acquiring reservoir floater image data under different scenes acquired by aerial photography by using an unmanned aerial vehicle, classifying and screening the image data, and removing invalid images to obtain a reservoir floater image data set for training an improved SegNet model;

s2: preprocessing a reservoir float image data set: the data set formed in the step S1 is subjected to noise reduction processing by utilizing a data noise reduction algorithm, so that the accuracy of reservoir floater image information is improved, and meanwhile, the data set is subjected to data enhancement by utilizing a data enhancement algorithm, and the scale of the training data set is expanded;

s3: labeling the data sets, and dividing the data sets into training sets and verification sets according to the number of the labeled data sets in a ratio of 7:3;

s4: performing model training on the improved SegNet algorithm model by utilizing the training set and the verification set determined in the step S3, and outputting an optimal network structure model and various parameters;

s5: detecting and identifying reservoir floaters: detecting the water surface image of the reservoir acquired by the unmanned aerial vehicle by utilizing an improved SegNet algorithm model based on the optimal network structure model parameters, judging whether reservoir floaters needing to be cleaned exist, and if so, calculating and marking the position information and coverage area of the floaters;

s6: and (5) pushing the detection and identification result of the water reservoir floaters in the step (S5) to relevant responsible persons and management staff to help timely clean the reservoir floaters.

Further, the acquisition of the image data in the step S1 is as follows:

the image data distribution needs to comprise different illumination degrees, such as early, middle, late and the like in the day; the image data distribution also needs to include different weather disturbances such as sunny, rainy, cloudy, fog, wind, etc.; the image data distribution also needs to include the influence of surrounding environment and other objects on the water surface, such as surrounding environment reflection of houses, trees, bridges and the like, weeds, water colors, fishing net fish boxes, fishing boats and the like.

Further, the specific content of the data preprocessing in the step S2 is as follows:

the data noise reduction adopts a Gaussian filtering algorithm; the data enhancement adopts a geometric transformation method, and mainly adopts five transformation methods of turning, rotating, cutting, scaling and translation.

Further, the specific algorithm flow of the gaussian filtering is as follows:

s21: performing zero padding on the image;

s22: generating a Gaussian filter template according to the kernel size and standard deviation size of the set Gaussian filter;

s23: gaussian filtering of the input image: scanning each pixel in the image by using the template generated in the step S22, and replacing the value of the central pixel point of the template by using the weighted average gray value of the pixels in the neighborhood determined by the template;

s24: and outputting the Gaussian filtered image.

Further, the specific formula of the two-dimensional gaussian function used to generate the gaussian filter template in step S22 is as follows:

wherein (x, y) is a point coordinate, sigma is a standard deviation, coordinates of each position of the image are brought into a two-dimensional Gaussian function, and the obtained value is a coefficient corresponding to the (x, y) coordinate in the template.

Further, the step S3 performs semantic annotation on the data set as follows:

and carrying out pixel-level semantic segmentation labeling on the reservoir floater image by using a dataset labeling tool LabelMe, continuously clicking and drawing a series of points at the edge of the floater, wherein the starting point and the end point of the points are the same point, and the points are connected in sequence and are connected end to form a closed polygon.

Further, the specific steps of training the model in step S4 are as follows:

s41: the training set is input into an improved SegNet network model of initialization parameters, and the model is specifically improved as follows: the network structure combines the maximum pooling index and residual connection, shallow layer feature mapping extracted in the encoder stage is input into the nonlinear upsampling stage of the decoder, and the characteristics such as color, texture and boundary of an original image are reserved to the greatest extent through dense feature mapping images generated by deconvolution, so that the identification precision of reservoir floaters is effectively improved;

s42: according to the principle of minimum cross entropy loss of the segmentation result, continuously iterating the network update model parameters by using the verification set until convergence and minimum loss are reached, and setting a cross entropy loss verification function as follows:

introducing a balance factor beta, wherein the value is in the interval of [0,1 ]:

the modified Cross Entropy Loss equation (B-CE) is designed as follows:

CE(p，y)＝-βlog(p _t )

where p represents the predicted probability that the sample is in that category and y represents the sample label.

S43: and the model parameters reach the minimum loss, the model training is completed, and the optimal network structure model and various parameters are output.

Further, the reservoir float coverage area calculation method in step S5 is as follows:

s51: the image is converted from BGR to HSV, so that color distinction is facilitated;

s52: extracting a specified color by using a cv2.inRange function;

s53: the contours are found and the areas of the contours are calculated.

Further, the specific steps of pushing information in step S6 are as follows:

s61: calculating coordinates of a center point A of a reservoir floating object area: acquiring the circumscribed rectangle of the region, namely acquiring the maximum and minimum X, Y of the region (namely, the coordinates of four corner points of the circumscribed rectangle: XMax, XMin, YMax, YMin), the center point A coordinate is calculated as follows:

CenterX＝(XMax+XMin)/2

CenterY＝(YMax+YMin)/2

s62: placing the areas responsible for all responsible persons into the same coordinate system, and marking the range responsible for all responsible persons;

s63: judging whether the central point A is in a region which is responsible for a certain responsible person by using a ray method, namely taking the point A as a starting point, taking the x-axis direction as the positive direction of rays, judging the number of intersection points of the straight line and a specific region, if the point is an odd point, the point is in the specific region, and if the point is an even point, the point is not in the specific region;

s64: if the central point A is judged to be in the area which is responsible for a responsible person, the information such as the identified coverage area of the reservoir floaters is pushed to the relevant responsible person.

The beneficial effects are that: compared with the prior art, the invention has the following advantages:

1. the improved algorithm model greatly reduces training parameters aiming at the application scene and improves training efficiency.

2. The improved algorithm introduces a multi-residual error connection strategy, more comprehensively reserves a large amount of detail information contained in the multi-scale image, and reduces information loss caused by reduction and downsampling. Meanwhile, a cross entropy loss function with a balance factor is designed, so that the convergence efficiency of network training is further accelerated, and the imbalance problem of samples is improved.

3. The intelligent monitoring of reservoir floaters is realized, so that the cost is reduced and the reservoir safety monitoring management is enhanced.

Drawings

FIG. 1 is a flow chart of the method of the present invention;

FIG. 2 is a network training flow diagram;

FIG. 3 is a schematic diagram of a ray method for dividing the area to which the reservoir float belongs;

Detailed Description

The present invention is further illustrated in the accompanying drawings and detailed description which are to be understood as being merely illustrative of the invention and not limiting of its scope, and various modifications of the invention, which are equivalent to those skilled in the art upon reading the invention, will fall within the scope of the invention as defined in the appended claims.

The invention provides an automatic reservoir floater identification method based on an improved SegNet model, which is shown in figure 1 and comprises the following steps:

s1: and acquiring reservoir floater image data under different scenes acquired by aerial photography by using an unmanned aerial vehicle, classifying and screening the image data, and removing invalid images to obtain a reservoir floater image data set for training an improved SegNet model. The specific requirements for image data acquisition are as follows:

S2: preprocessing a reservoir float image data set: the data set formed in the step S1 is subjected to noise reduction treatment by utilizing a data noise reduction algorithm, so that the accuracy of the image information of the reservoir floaters is improved; and meanwhile, data enhancement is carried out on the data set by utilizing a data enhancement algorithm, and the scale of the training data set is expanded on the data. The dataset preprocessing uses a specific algorithm:

the data noise reduction adopts a Gaussian filtering algorithm; the data enhancement adopts a geometric transformation method, and mainly adopts five transformation methods of turning, rotating, cutting, scaling and translation. The specific algorithm flow of Gaussian filter is as follows:

s21: performing zero padding on the image;

s22: and generating a Gaussian filter template according to the set kernel size and standard deviation size of the Gaussian filter. The specific formula of the two-dimensional gaussian function used to generate the gaussian filter template is as follows:

s24: and outputting the Gaussian filtered image.

S3: labeling the data sets, and dividing the data sets into training sets and verification sets according to the number of the labeled data sets in a ratio of 7:3. The specific content of semantic annotation on the data set is as follows:

S4: and (3) performing model training on the improved SegNet algorithm model by using the training set and the verification set determined in the step (S3), and outputting an optimal network structure model and various parameters. The specific steps of training the model are shown in fig. 2:

the modified Cross Entropy Loss equation (B-CE) is designed as follows:

CE(p，y)＝-βlog(p _t )

S5: reservoir floater picture detection and identification and floater coverage area calculation: and detecting the water surface image of the reservoir acquired by the unmanned aerial vehicle by utilizing an improved SegNet algorithm model based on the optimal network structure model parameters, judging whether reservoir floaters needing to be cleaned exist, and if so, calculating and marking the position information and the coverage area of the floaters. When the improved SegNet model is used for reservoir floater image identification, the method specifically comprises the following steps:

(1) Convolving the input reservoir float image to obtain characteristic images of H.W.64 channels, denoted as F ₁ ；

(2) Will F ₁ Downsampling to obtain H/2*W/2×64, and convolving to obtain H/2*W/2×128, denoted as F ₂ ；

(3) Will F ₂ Downsampling to obtain H/4*W/4×128, and convolving to obtain H/4*W/4×256, denoted as F ₃ ；

(4) Will F ₃ Downsampling to obtain H/8*W/8×256, denoted as F ₄ ；

(5) Will F ₄ Upsampling to obtain H/4*W/4X 256, denoted as F' ₃ Then F 'is carried out' ₃ Characteristic images of H/4*W/4 x 128, denoted De (F) ₃ ′)，F′ ₃ The calculation formula of (2) is as follows:

F′ ₃ ＝Fuse(PI(F ₃ )，F ₄ )

wherein PI represents the pooling index and Fuse represents the fusion of feature maps;

(6) De (F) ₃ ') up-sampling to obtain H/2*W/2 x 128, denoted as F' ₂ Then F 'is carried out' ₂ H/2*W/2 x 64, denoted De (F' ₂ )，F′ ₂ The calculation formula of (2) is as follows:

F′ ₂ ＝Fuse(PI(F ₂ )，De(F′ ₃ ))

(7) De (F' ₂ ) Restoring resolution to H×W after upsampling, combining F ₁ H.W.128, denoted F, is obtained by cascade operation _c The calculation formula is as follows:

F _c ＝Conc(Fuse(PI(F ₁ )，De(F′ ₂ ))，F ₁ )

(8) And finally, assigning a value to the category to which each pixel belongs through a softmax function, outputting a corresponding semantic segmentation result, and calculating the reservoir float coverage area according to the output reservoir float semantic segmentation result. The calculation method of the reservoir float coverage area comprises the following steps:

(1) The image is converted from RGB to HSV, so that color distinction is facilitated;

(2) Extracting a specified color by using a cv2.inRange function;

(3) The contours are found and the areas of the contours are calculated.

S6: and (5) pushing the detection and identification result of the water reservoir floaters in the step (S5) to relevant responsible persons and management staff to help timely clean the reservoir floaters. The specific steps of pushing information are shown in fig. 3:

s61: calculating coordinates of a center point A of a reservoir float area according to reservoir float position information: acquiring the circumscribed rectangle of the region, namely acquiring the maximum and minimum X, Y of the region (namely, the coordinates of four corner points of the circumscribed rectangle: XMax, XMin, YMax, YMin), the center point A coordinate is calculated as follows:

CenterX＝(XMax+XMin)/2

CenterY＝(YMax+YMin)/2

Claims

1. The reservoir floater automatic identification method based on the improved SegNet model is characterized by comprising the following steps of:

s2: reservoir floater picture detection and identification and floater coverage area calculation: the data set formed in the step S1 is subjected to noise reduction treatment by utilizing a data noise reduction algorithm, so that the accuracy of the image information of the reservoir floaters is improved; meanwhile, the data enhancement algorithm is utilized to carry out data enhancement on the data set, and the scale of the training data set is expanded on the data;

2. The reservoir float identification method based on the improved SegNet model of claim 1, wherein the image data acquisition in step S1 is as follows:

3. The reservoir float identification method based on the improved SegNet model according to claim 1, wherein the specific content of the data preprocessing in the step S2 is as follows:

4. The reservoir float automatic identification method based on the improved SegNet model according to claim 3, wherein the specific algorithm flow of Gaussian filtering is as follows:

s21: performing zero padding on the image;

s24: and outputting the Gaussian filtered image.

5. The method for automatically identifying reservoir floats based on an improved SegNet model according to claim 4, wherein the specific formula of the two-dimensional Gaussian function used for generating the Gaussian filter template in the step S22 is as follows:

6. The automatic reservoir float identification method based on the improved SegNet model according to claim 1, wherein the semantic annotation of the data set in step S3 is as follows:

7. The method for automatically identifying reservoir floaters based on the improved SegNet model as claimed in claim 1, wherein the training model in step S4 specifically comprises the following steps:

the modified Cross Entropy Loss equation (B-CE) is designed as follows:

CE(p，y)＝-βlog(p _t )

8. The automatic reservoir float identification method based on the improved SegNet model as claimed in claim 1, wherein the reservoir float coverage area calculation method in step S5 is as follows:

s51: the image is converted from RGB to HSV, so that color distinction is facilitated;

s52: extracting a specified color by using a cv2.inRange function;

s53: the contours are found and the areas of the contours are calculated.

9. The automatic reservoir floater identification method based on the improved SegNet model according to claim 1, wherein the specific steps of pushing information in step S6 are as follows:

CenterX＝(XMax+XMin)/2

CenterY＝(YMax+YMin)/2