CN111027511A - Remote sensing image ship detection method based on region of interest block extraction - Google Patents


Publication number
CN111027511A
CN201911338663.3A (application) · CN111027511A (publication) · CN111027511B (granted publication)
Authority
CN
China
Prior art keywords
image
detection
area
land
remote sensing
Prior art date
Legal status
Granted
Application number
CN201911338663.3A
Other languages
Chinese (zh)
Other versions
CN111027511B (en)
Inventor
侯彪
刘佳丽
焦李成
马文萍
马晶晶
杨淑媛
Current Assignee
Xidian University
Original Assignee
Xidian University
Priority date
Filing date
Publication date
Application filed by Xidian University
Priority to CN201911338663.3A
Publication of CN111027511A
Application granted
Publication of CN111027511B
Status: Active
Anticipated expiration

Classifications

    • G06V 20/13: Scenes; terrestrial scenes; satellite images
    • G06F 18/214: Pattern recognition; generating training patterns; bootstrap methods, e.g. bagging or boosting
    • G06F 18/241: Pattern recognition; classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06T 5/30: Image enhancement or restoration by the use of local operators; erosion or dilatation, e.g. thinning
    • G06T 7/11: Image analysis; region-based segmentation
    • G06T 7/136: Image analysis; segmentation or edge detection involving thresholding
    • G06T 7/187: Image analysis; segmentation involving region growing, region merging, or connected component labelling
    • G06T 7/62: Image analysis; analysis of geometric attributes of area, perimeter, diameter or volume
    • G06T 7/80: Image analysis; analysis of captured images to determine intrinsic or extrinsic camera parameters (camera calibration)
    • G06V 10/25: Image preprocessing; determination of region of interest [ROI] or volume of interest [VOI]
    • G06T 2207/10032: Image acquisition modality; satellite or aerial image; remote sensing
    • G06T 2207/20081: Special algorithmic details; training, learning
    • G06T 2207/20084: Special algorithmic details; artificial neural networks [ANN]
    • G06T 2207/20092: Interactive image processing based on input by user
    • G06T 2207/20104: Interactive definition of region of interest [ROI]

Abstract

The invention discloses an optical remote sensing image ship detection method based on region-of-interest block extraction, which mainly addresses the low detection precision and frequent false alarms of the prior art. The scheme is implemented as follows: construct an optical remote sensing image ship detection data set; downsample and defog-enhance the wide remote sensing image, then perform land-water segmentation using context information and global image features; train an SCRDet-based target detection model on the constructed data set; according to the land-water segmentation result, scan the original wide remote sensing image with partially overlapping sliding windows to extract region-of-interest blocks as regions to be detected, and feed each region image into the detection model to obtain per-region detection results; map the region results back to the original wide-image scale and apply improved non-maximum suppression to optimize the preliminary detections; finally, optimize the detection results again according to the structural characteristics of ships. The method achieves high detection precision with a low false-alarm rate, and can be used to locate ship targets of interest in large-format remote sensing images.

Description

Remote sensing image ship detection method based on region of interest block extraction
Technical Field
The invention belongs to the technical field of image processing, and particularly relates to a ship detection method for optical remote sensing images, which can be used for target recognition in large-format remote sensing images.
Background
Target detection in optical remote sensing images is one of the important problems in remote sensing research, and ship target detection, owing to its particularity and criticality, has extremely important application value in fishery management, military reconnaissance, strategic deployment, and other areas. Ship target detection determines whether ships exist in a water area or along the shore in a complex scene and locates them.
Traditional ship detection methods rely mainly on sea-land segmentation and prior geographic information. They require a large number of hand-designed features to locate ships, and they adapt poorly when ships in two different scenes, onshore and offshore, must be detected simultaneously.
In recent years deep learning has developed rapidly. Deep convolutional neural networks automatically extract both shallow and deep image features, avoiding the manual feature engineering of traditional methods, and have achieved state-of-the-art results in image processing. Deep-learning-based methods are currently the mainstream for target detection.
Deep-learning-based ship detection methods can be divided into two types according to the form of the detection frame: methods based on horizontal detection frames and methods based on inclined (rotated) detection frames. Among them:
methods based on horizontal detection frames include the classic two-stage detector Faster R-CNN and the single-stage detectors YOLO, RetinaNet, and SSD;
methods based on inclined detection frames include R2CNN, RoI-Transformer, SCRDet, and others.
Because ship orientations are arbitrary and ships moored near the shore are densely packed, inclined detection frames can locate ships more accurately. Among the methods adopting inclined detection frames, the SCRDet model (SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects) fully fuses low-level and high-level features and achieves good results on dense objects. However, existing deep-learning-based methods do not perform land-water segmentation on the image; they feed wide-image slices directly into a trained model. Feeding complex land areas that contain no water into the detection model not only reduces detection efficiency but may also cause obvious false alarms on land; in addition, hulls truncated by image slicing cause obvious missed detections, so detection precision remains low.
Disclosure of Invention
The purpose of the invention is to provide an optical remote sensing image ship detection method based on region-of-interest block extraction that addresses the above deficiencies of the prior art, so as to optimize the localization of ship targets in remote sensing images and improve detection efficiency and precision.
To achieve this purpose, the technical scheme of the invention comprises the following steps:
(1) constructing an optical remote sensing image ship detection data set G:
1a) downloading Gaofen-2 (GF-2) optical remote sensing data, manually screening areas containing ship targets, cropping them in a partially overlapping manner, and saving the crops;
1b) randomly flipping or rotating all images obtained in step 1a) vertically and horizontally to obtain augmented images, and saving them;
1c) annotating the augmented images with inclined rectangular frames, saving the annotation information as xml files, and forming the optical remote sensing image ship detection data set G from all augmented images and their corresponding annotations;
(2) downsampling the wide optical remote sensing image and reducing cloud and fog occlusion with a defogging algorithm based on the dark channel prior, to obtain the enhanced image I;
(3) performing land-water segmentation on the enhanced image I using context information and global image features, and extracting water region-of-interest blocks to obtain a binary map:
(3a) performing initial threshold segmentation on the enhanced image I using the contextual feature information of its pixels, to obtain the initial land-water segmentation binary map I3;
(3b) marking the set W = {w1, …, wi, …, wn} of water connected domains in the initial land-water segmentation binary map I3, where wi denotes the i-th water connected domain; marking the set L = {l1, …, lj, …, lm} of land connected domains in I3, where lj denotes the j-th land connected domain; extracting the regional features of each water and land connected domain and re-calibrating their classes according to decision rules, to obtain the optimized land-water segmentation binary map I4 of the enhanced image I;
(3c) applying a morphological dilation operation to the optimized land-water segmentation binary map I4, then upsampling it back to the original wide-image size, to obtain the final land-water segmentation binary map I5;
(4) training the convolutional-neural-network-based SCRDet target detection model M0 with a random multi-scale strategy, to obtain the trained detection model M;
(5) using the final land-water segmentation binary map I5, extracting region-of-interest blocks on the original wide image with partially overlapping sliding windows as regions to be detected, forming the region image set F = {f1, …, fi, …, fn} and the position set S = {s1, …, si, …, sn}, where fi denotes the i-th detection region and si denotes the upper-left corner coordinates of fi; inputting each region image fi into the detection model M to obtain its region detection result;
(6) mapping the region detection results to the original wide-image scale according to the position set S, to obtain the preliminary detection result set A = {P1, …, Pi, …, Pn}, where Pi records the category, position coordinates, and confidence score of the i-th preliminary detection frame; performing improved non-maximum suppression on the preliminary detection result set A, to obtain the once-optimized detection result set B = {Q1, …, Qi, …, Qm}, where Qi records the category, position coordinates, and confidence score of the i-th once-optimized detection frame;
(7) performing secondary optimization on the once-optimized detection result set B according to the structural characteristics of the ship, to obtain the final detection result set {R1, …, Ri, …, Rk}, where Ri records the category, position coordinates, and confidence score of the i-th finally generated detection frame.
Compared with the prior art, the invention has the following advantages:
1. In view of the complex scenes and large size of wide remote sensing images, the method performs land-water segmentation using the context information and global features of the downsampled, defogged enhanced image, followed by morphological operations; this optimizes the land-water segmentation result more effectively and preserves ship targets in the region of interest.
2. The method scans the whole image with partially overlapping sliding windows and extracts regions of interest according to the land-water segmentation result as regions to be detected, which reduces false alarms on land and effectively improves detection efficiency. In addition, improved non-maximum suppression applied to the detections at the wide-image scale effectively reduces missed detections caused by image slicing and improves detection precision.
Drawings
FIG. 1 is a schematic flow chart of an implementation of the present invention;
FIG. 2 is a simulation result image of land and water segmentation of a wide remote sensing image using the present invention;
FIG. 3 is a simulation result image of ship detection on a wide remote sensing image using an existing method;
FIG. 4 is a simulation result image of ship detection on a wide remote sensing image using the present invention.
Detailed Description
The embodiments and effects of the present invention will be further explained below with reference to the drawings.
Referring to fig. 1, the implementation steps of this embodiment are as follows:
step 1, constructing an optical remote sensing image ship detection data set G.
1.1) downloading high-score second-order optical remote sensing data, manually screening areas containing ship targets, cutting the areas by using a part of overlapped sliding windows with the size of 832 multiplied by 832 and the step length of 416, and storing the areas;
1.2) randomly turning or rotating all the images obtained in the step 1.1 up, down, left and right to obtain amplified images and storing the amplified images;
1.3) labeling all the amplification images obtained in the step 1.2) by using inclined rectangular frames, saving the labeling information as an xml format file, and forming an optical remote sensing image ship detection data set G by using all the amplification images and the corresponding labeling information.
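The flip part of the augmentation in step 1.2) can be sketched as follows. This is a minimal numpy illustration (function and parameter names are not from the patent), where each inclined-rectangle annotation is represented by its four (x, y) corner points as stored in the xml files:

```python
import numpy as np

def random_flip(img, corners, rng):
    """Randomly flip an image chip and its inclined-rectangle annotations.

    img:     (H, W[, C]) image array.
    corners: (N, 4, 2) array of the four (x, y) corner points per box.
    rng:     any object with a random() method returning a float in [0, 1).
    """
    h, w = img.shape[:2]
    out, pts = img, corners.astype(float).copy()
    if rng.random() < 0.5:                  # horizontal (left-right) flip
        out = out[:, ::-1]
        pts[..., 0] = w - 1 - pts[..., 0]
    if rng.random() < 0.5:                  # vertical (up-down) flip
        out = out[::-1, :]
        pts[..., 1] = h - 1 - pts[..., 1]
    return out, pts
```

Rotations by multiples of 90 degrees can be added the same way with `np.rot90` and a matching corner-point transform.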
Step 2: downsample and defog-enhance the wide optical remote sensing image.
Downsample the original wide optical remote sensing image by a factor of 8, and reduce cloud and fog occlusion using the dark-channel-prior defogging algorithm proposed by He Kaiming et al. in "Single Image Haze Removal Using Dark Channel Prior", obtaining the enhanced image I.
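The dark-channel-prior defogging of step 2 can be sketched as below. This is a simplified numpy illustration of He et al.'s method (local-minimum dark channel, atmospheric light from the brightest 0.1% of dark-channel pixels, transmission floor t0), not the patent's exact implementation; the guided-filter transmission refinement of the original paper is omitted:

```python
import numpy as np

def dehaze_dark_channel(img, patch=15, omega=0.95, t0=0.1):
    """Simplified single-image dehazing via the dark channel prior.

    img: float RGB image in [0, 1], shape (H, W, 3).
    """
    h, w, _ = img.shape
    pad = patch // 2
    # Dark channel: per-pixel channel minimum, then a local minimum filter.
    min_rgb = img.min(axis=2)
    padded = np.pad(min_rgb, pad, mode="edge")
    dark = np.empty((h, w))
    for i in range(h):
        for j in range(w):
            dark[i, j] = padded[i:i + patch, j:j + patch].min()
    # Atmospheric light A: mean colour of the brightest 0.1% dark-channel pixels.
    n_top = max(1, int(h * w * 0.001))
    idx = np.argsort(dark.ravel())[-n_top:]
    A = img.reshape(-1, 3)[idx].mean(axis=0)
    # Transmission estimate and scene-radiance recovery.
    min_norm = (img / np.maximum(A, 1e-6)).min(axis=2)
    padded_n = np.pad(min_norm, pad, mode="edge")
    t = np.empty((h, w))
    for i in range(h):
        for j in range(w):
            t[i, j] = 1.0 - omega * padded_n[i:i + patch, j:j + patch].min()
    t = np.maximum(t, t0)[..., None]
    return np.clip((img - A) / t + A, 0.0, 1.0)
```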
Step 3: perform land-water segmentation using context information and global image features.
3.1) perform preliminary threshold segmentation on the enhanced image I using the contextual feature information of its pixels:
3.1.a) mirror-pad the edges of the enhanced image I with a width of 3 to obtain the expanded image I'; taking each pixel of I as the center of a 7 × 7 sliding window, extract the corresponding window region on I' and compute its regional variance v as the contextual feature value of the center pixel, obtaining the feature map V of the enhanced image I, whose feature value at coordinate (i, j) is V(i, j);
3.1.b) compute the distribution histogram of the feature map V with bin width 1, determine the peak variance vmax at the histogram peak, and compute the valley variance gt corresponding to the first valley of the histogram within [vmax, +∞);
3.1.c) compare each feature value V(i, j) of the enhanced image I with the valley variance gt:
if V(i, j) ≤ gt, update V(i, j) = 1;
otherwise, update V(i, j) = 0;
3.1.d) apply morphological dilation to the updated feature map V to obtain the feature binary map I1;
3.1.e) apply an improved Laplace operator to perform a Laplacian gray-scale transformation on the enhanced image I, then segment the transformed image with the OTSU adaptive threshold method to obtain the gray-scale binary map I2;
3.1.f) perform a pixel-wise logical AND of the feature binary map I1 and the gray-scale binary map I2 to obtain the preliminary land-water segmentation binary map I3, in which pixel value 0 denotes land and pixel value 1 denotes water;
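Steps 3.1.a) through 3.1.f) can be sketched as follows. This is an illustrative numpy version under simplifying assumptions: the morphological dilation of step 3.1.d) is omitted, a standard 8-neighbour Laplacian kernel stands in for the patent's improved operator, and water candidates are taken as the AND of the low-variance and weak-Laplacian-response masks:

```python
import numpy as np

def local_variance(img, win=7):
    """Per-pixel variance over a win x win neighbourhood, mirrored borders
    (the window variance is the pixel's contextual feature)."""
    pad = win // 2
    p = np.pad(img.astype(float), pad, mode="reflect")
    h, w = img.shape
    out = np.empty((h, w))
    for i in range(h):
        for j in range(w):
            out[i, j] = p[i:i + win, j:j + win].var()
    return out

def valley_threshold(values, bin_width=1.0):
    """Unit-bin histogram: locate the peak, then the first valley to its right."""
    edges = np.arange(0.0, values.max() + 2 * bin_width, bin_width)
    hist, edges = np.histogram(values, bins=edges)
    peak = int(np.argmax(hist))
    for k in range(peak + 1, len(hist) - 1):
        if hist[k] <= hist[k - 1] and hist[k] <= hist[k + 1]:
            return edges[k]
    return edges[-1]

def otsu_threshold(gray):
    """Classic Otsu threshold for a uint8 gray image."""
    hist = np.bincount(gray.ravel(), minlength=256).astype(float)
    total, mean_all = hist.sum(), (hist * np.arange(256)).sum()
    w0 = cum = 0.0
    best_t, best_var = 0, -1.0
    for t in range(256):
        w0 += hist[t]
        cum += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue
        var = w0 * w1 * (cum / w0 - (mean_all - cum) / w1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t

def coarse_land_water(img):
    """Low local variance AND weak Laplacian response mark candidate water
    (1 = water, 0 = land); the dilation of step 3.1.d is skipped for brevity."""
    V = local_variance(img, 7)
    gt = valley_threshold(V.ravel())
    feat = (V <= gt).astype(np.uint8)                      # feature binary map
    k = np.array([[1, 1, 1], [1, -8, 1], [1, 1, 1]], float)
    p = np.pad(img.astype(float), 1, mode="reflect")
    h, w = img.shape
    lap = np.zeros((h, w))
    for a in range(3):
        for b in range(3):
            lap += k[a, b] * p[a:a + h, b:b + w]
    lap = np.abs(lap)
    lap = (255 * lap / max(lap.max(), 1e-6)).astype(np.uint8)
    gray = (lap <= otsu_threshold(lap)).astype(np.uint8)   # gray-scale binary map
    return feat & gray                                      # preliminary land-water map
```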
3.2) re-calibrate the classes of the connected domains using the features of the water and land connected domains:
3.2.a) mark the set W = {w1, …, wi, …, wn} of water connected domains in the preliminary land-water segmentation binary map I3, where wi denotes the i-th water connected domain; compute the areas of the n water connected domains, take the three with the largest areas as the wide-image water feature template, and compute the variance σ² of the water feature template;
3.2.b) initialize the area threshold t1 = 35 × 35;
3.2.c) traverse the water connected domain set W and compute the area Swi and variance σwi² of each wi in W; if Swi exceeds the area threshold t1 and σwi² is consistent with the template variance σ², re-calibrate wi as water; otherwise, re-calibrate wi as land;
3.2.d) mark the set L = {l1, …, lj, …, lm} of land connected domains in the preliminary land-water segmentation binary map I3, where lj denotes the j-th land connected domain;
3.2.e) traverse the land connected domain set L and compute the area Slj of each lj in L; if Slj is smaller than the area threshold t1, re-calibrate lj as water; otherwise, re-calibrate lj as land;
3.2.f) assign 0 to all pixels in the re-calibrated land connected domains and 1 to all pixels in the re-calibrated water connected domains, obtaining the optimized land-water segmentation binary map I4;
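The connected-domain re-calibration of step 3.2) can be sketched as below. The labelling is a plain 4-connected flood fill, and the thresholds t1 and var_tol are illustrative stand-ins for the patent's t1 = 35 × 35 and its variance-consistency rule:

```python
import numpy as np

def connected_components(mask):
    """4-connected component labelling via iterative flood fill."""
    h, w = mask.shape
    labels = np.zeros((h, w), dtype=int)
    cur = 0
    for i in range(h):
        for j in range(w):
            if mask[i, j] and labels[i, j] == 0:
                cur += 1
                labels[i, j] = cur
                stack = [(i, j)]
                while stack:
                    y, x = stack.pop()
                    for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        ny, nx = y + dy, x + dx
                        if 0 <= ny < h and 0 <= nx < w and mask[ny, nx] and labels[ny, nx] == 0:
                            labels[ny, nx] = cur
                            stack.append((ny, nx))
    return labels, cur

def recalibrate(binary, image, t1=9, var_tol=200.0):
    """Re-label water/land connected domains by area and variance rules.

    binary: 1 = water, 0 = land; image: the enhanced gray image.
    t1 and var_tol are illustrative (the patent uses t1 = 35*35 on full-size imagery).
    """
    out = binary.copy()
    wl, wn = connected_components(binary == 1)
    # Water template variance from the (up to) three largest water domains.
    areas = [(wl == k).sum() for k in range(1, wn + 1)]
    order = np.argsort(areas)[::-1][:3]
    tmpl = np.concatenate([image[wl == k + 1].ravel() for k in order])
    sigma2 = tmpl.var()
    for k in range(1, wn + 1):
        region = wl == k
        if region.sum() <= t1 or abs(image[region].var() - sigma2) > var_tol:
            out[region] = 0          # re-calibrate as land
    ll, ln = connected_components(binary == 0)
    for k in range(1, ln + 1):
        region = ll == k
        if region.sum() < t1:
            out[region] = 1          # small enclosed land (e.g. ships) becomes water
    return out
```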
3.3) obtain the final land-water segmentation binary map I5 from the optimized land-water segmentation binary map I4:
3.3.a) copy the optimized land-water segmentation binary map I4 to obtain the duplicate map I4';
3.3.b) scan the optimized land-water segmentation binary map I4 pixel by pixel with a 15 × 15 elliptical structuring element, performing a logical AND between the structuring element and all pixels of the corresponding region of I4;
3.3.c) according to the result at each pixel, update the center pixel of the corresponding region in I4':
if the result at every pixel is 0, update the center pixel of the corresponding region to 0;
otherwise, update the center pixel of the corresponding region to 1;
3.3.d) upsample the updated I4' by a factor of 8, restoring it to the original wide-image size, to obtain the final land-water segmentation binary map I5.
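Steps 3.3.b) through 3.3.d) amount to a binary dilation with an elliptical structuring element followed by nearest-neighbour upsampling. A minimal numpy sketch (helper names are illustrative):

```python
import numpy as np

def elliptical_kernel(size=15):
    """Boolean ellipse (here a disc) inscribed in a size x size square."""
    r = size // 2
    yy, xx = np.mgrid[-r:r + 1, -r:r + 1]
    return (xx / r) ** 2 + (yy / r) ** 2 <= 1.0

def dilate(binary, kernel):
    """Binary dilation: the centre becomes 1 if any kernel pixel covers a 1."""
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    p = np.pad(binary, ((ph, ph), (pw, pw)), mode="constant")
    h, w = binary.shape
    out = np.zeros_like(binary)
    for i in range(h):
        for j in range(w):
            if (p[i:i + kh, j:j + kw] & kernel).any():
                out[i, j] = 1
    return out

def upsample_nn(binary, factor=8):
    """Nearest-neighbour upsampling back to the original wide-image scale."""
    return np.repeat(np.repeat(binary, factor, axis=0), factor, axis=1)
```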
Step 4: train the convolutional-neural-network-based SCRDet model M0 with a random multi-scale strategy.
4.1) take 90% of the optical remote sensing image ship detection data set G constructed in step 1 as training samples and the remaining 10% as test samples;
4.2) use a ResNet-50 network as the backbone of the SCRDet model M0; pre-train the ResNet-50 on the ImageNet data set to obtain pre-trained model parameters, then use these parameters to initialize the backbone network of the SCRDet model M0;
4.3) randomly select a training image and its corresponding annotation from the training samples, and randomly flip both vertically or horizontally to obtain the transformed image X and the ground-truth annotation Y;
4.4) randomly select a scale from [600, 700, 832] as the short-edge length of the image, scale the transformed image X and the ground-truth annotation Y to the selected scale, and input them into the SCRDet model M0 to obtain the prediction Y';
4.5) compute the error between the prediction Y' and the ground-truth annotation Y, and minimize it with the Adam optimizer to update the weight parameters of the SCRDet model M0;
4.6) repeat 4.3)-4.5); when the number of training iterations reaches 200000, the trained target detection model M is obtained.
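The random multi-scale selection of step 4.4) can be sketched as follows, assuming rotated boxes stored as [cx, cy, w, h, angle]; resize_nn is a nearest-neighbour stand-in for a real resize routine:

```python
import numpy as np

SCALES = (600, 700, 832)   # candidate short-edge lengths from step 4.4)

def resize_nn(img, out_h, out_w):
    """Nearest-neighbour resize with pure numpy."""
    h, w = img.shape[:2]
    rows = (np.arange(out_h) * h / out_h).astype(int)
    cols = (np.arange(out_w) * w / out_w).astype(int)
    return img[rows][:, cols]

def random_multiscale(img, boxes, rng):
    """Scale the short edge to a randomly chosen training scale.

    boxes: (N, 5) rotated boxes [cx, cy, w, h, angle]; centres and sizes
    scale with the image, the angle does not.
    """
    s = SCALES[rng.integers(len(SCALES))]
    h, w = img.shape[:2]
    ratio = s / min(h, w)
    out = resize_nn(img, round(h * ratio), round(w * ratio))
    scaled = boxes.copy().astype(float)
    scaled[:, :4] *= ratio
    return out, scaled
```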
Step 5: extract the regions to be detected on the original wide image and input them into the trained target detection model M for detection.
5.1) initialize the window parameter p = 832, and initialize the region image set F and the position set S as empty sets;
5.2) scan the original wide remote sensing image with a partially overlapping sliding window of size p × p and stride p/2, and compute the water area Sw of the region in the final land-water segmentation binary map I5 corresponding to each window region f of the original wide remote sensing image;
5.3) judge from the water area Sw whether the window region f is a region-of-interest block:
if Sw > (p/8)², judge the window region f to be a region-of-interest block, add f to the region image set F and the upper-left corner coordinates s of f to the position set S, obtaining the region image set F = {f1, …, fi, …, fn} and the position set S = {s1, …, si, …, sn};
otherwise, judge the window region f to be a non-interest block and perform no operation;
5.4) input the region image set F = {f1, …, fi, …, fn} into the detection model M to obtain the category, position coordinates, and confidence score of every detection frame in each region fi.
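The region-of-interest extraction of steps 5.1) through 5.3) can be sketched as below; the water-area test Sw > (p/8)² follows the text, and the function name is illustrative:

```python
import numpy as np

def extract_rois(wide_img, water_mask, p=832):
    """Slide a p x p window with stride p/2 over the wide image and keep
    windows whose water area in the land-water binary mask exceeds (p/8)^2.

    Returns the ROI images and their upper-left (x, y) positions.
    """
    step = p // 2
    rois, positions = [], []
    H, W = water_mask.shape
    for y in range(0, max(H - p, 0) + 1, step):
        for x in range(0, max(W - p, 0) + 1, step):
            if water_mask[y:y + p, x:x + p].sum() > (p / 8) ** 2:
                rois.append(wide_img[y:y + p, x:x + p])
                positions.append((x, y))
    return rois, positions
```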
Step 6: perform improved non-maximum suppression on the preliminary detections at the original wide-image scale.
6.1) according to the coordinates si of each region fi in the position set S = {s1, …, si, …, sn}, map the detection results of each region fi to the original wide-image scale, obtaining the preliminary detection result set A = {P1, …, Pi, …, Pn}, where Pi records the category, position coordinates, and confidence score ci of the i-th preliminary detection frame;
6.2) initialize the once-optimized detection result set B as empty;
6.3) treating all detection frames in the preliminary detection result set A as the same class, let r be the index of the detection result with the highest confidence in A; remove the detection Pr from A according to the index r and add it to the set B;
6.4) compute the intersection-over-union (IoU) of each detection frame in the preliminary detection result set A with the highest-confidence detection frame, i.e., the ratio of the overlap area of the two frames to the area of their union;
6.5) compare each IoU with the IoU threshold t2:
if IoU > t2, update (decay) the confidence score ci of that detection frame;
otherwise, ci is not updated;
6.6) repeat steps 6.3)-6.5) until the preliminary detection result set A is empty, obtaining the once-optimized detection result set B = {Q1, …, Qi, …, Qm}, where Qi records the category, position coordinates, and confidence score of the i-th once-optimized detection frame.
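The improved non-maximum suppression of step 6 decays confidences rather than hard-deleting overlapping boxes, which keeps ships split across adjacent slices alive. The patent's exact decay formula is not reproduced here, so this sketch uses a soft-NMS-style linear decay and axis-aligned IoU as simplifying stand-ins (the patent operates on inclined detection frames):

```python
def iou(a, b):
    """IoU of two axis-aligned boxes [x1, y1, x2, y2]."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    union = area(a) + area(b) - inter
    return inter / union if union > 0 else 0.0

def soft_nms(dets, iou_thr=0.5, score_thr=0.001):
    """Confidence-decaying NMS.

    dets: list of [x1, y1, x2, y2, score]; returns the boxes in the order
    they were selected, heavily overlapped boxes with decayed scores.
    """
    dets = [list(map(float, d)) for d in dets]
    keep = []
    while dets:
        r = max(range(len(dets)), key=lambda i: dets[i][4])
        best = dets.pop(r)
        keep.append(best)
        for d in dets:
            o = iou(best[:4], d[:4])
            if o > iou_thr:
                d[4] *= 1.0 - o          # decay instead of hard deletion
        dets = [d for d in dets if d[4] >= score_thr]
    return keep
```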
Step 7: perform secondary optimization on every detection result in the once-optimized detection result set B according to the structural characteristics of ships in optical remote sensing images.
7.1) judge the confidence score of each detection frame: if it is below the confidence threshold, optimize the class li of the i-th detection frame to background; otherwise, keep it as a ship target;
7.2) from the position coordinates of the detection frame, obtain its longest edge x and judge its size: if x > 450, optimize the class of the i-th detection frame Qi to background; otherwise, keep it as a ship target;
7.3) from the position coordinates of the detection frame, compute the ratio y of its longest edge to its shortest edge and judge its size: if y > 11, optimize the class of the i-th detection frame Qi to background; otherwise, keep it as a ship target.
The effect of the invention can be further illustrated by the following simulation experiment:
1. Simulation conditions:
The simulation uses panchromatic images of major Chinese cities captured by an optical remote sensing satellite, with a ground resolution of 1 meter or 4 meters.
The simulation uses an Intel(R) Core(TM) i7-8750H CPU with a main frequency of 2.20 GHz and an 8 GB GeForce GTX 1080 GPU; the platform is the Ubuntu 16.04 operating system with the TensorFlow deep learning framework, and the experiments use Python 3.6.
2. Simulation content and results:
Simulation 1: land-water segmentation is performed on a 10000 × 10000 wide remote sensing image with the present method, and land is masked according to the segmentation result. The result is shown in Fig. 2, where black denotes land and the rest is the water region of interest. As can be seen from Fig. 2, ships moored along the shore are well preserved.
Simulation 2: a 10000 × 10000 wide remote sensing image is detected directly, slice by slice, with an existing convolutional-neural-network-based method. The result is shown in Fig. 3, where green frames mark detected ships and blue numbers are confidence scores. As can be seen from Fig. 3, directly slicing the wide remote sensing image causes obvious missed detections due to truncated hulls, because the existing method does not consider the distribution characteristics of ships.
Simulation 3: ship target detection is performed on a 10000 × 10000 wide remote sensing image with the present invention. The result is shown in Fig. 4, where green frames mark detected ships and blue numbers are confidence scores.
Comparing simulation 2 with simulation 3 shows that the present method effectively performs land-water segmentation while retaining the information of ships along the shore, extracts regions of interest according to the land-water segmentation result, optimizes the detections with a partial-overlap detection strategy and improved non-maximum suppression, and finally optimizes the detection results again by combining ship structural information.

Claims (7)

1.A remote sensing image ship detection method based on region of interest block extraction is characterized by comprising the following steps:
(1) constructing an optical remote sensing image ship detection data set G:
1a) downloading high-grade second optical remote sensing data, manually screening areas containing ship targets, cutting the areas in a partially overlapped mode and storing the areas;
1b) randomly turning or rotating all the images obtained in the step 1a up, down, left and right to obtain amplified images and storing the amplified images;
1c) marking the amplified images by inclined rectangular frames, storing marking information as an xml format file, and forming an optical remote sensing image ship detection data set G by using all the amplified images and the corresponding marking information;
(2) downsampling the wide optical remote sensing image, and reducing cloud and fog shielding on the image by using a defogging algorithm based on dark channel prior to obtain an enhanced image I;
(3) performing land-water segmentation on the enhanced image I using context information and global image features, and extracting water-area blocks of interest to obtain a binary map:
(3a) performing initial threshold segmentation on the enhanced image I using the context feature information of the pixels to obtain the initial land-water segmentation binary map I3;
(3b) marking the set of water connected regions W = {w1, …, wi, …, wn} in the initial land-water segmentation binary map I3, where wi denotes the i-th water connected region, and the set of land connected regions L = {l1, …, lj, …, lm}, where lj denotes the j-th land connected region; extracting the region features of each water and land connected region and re-labelling their classes according to a decision rule, to obtain the optimized land-water segmentation binary map I4 of the enhanced image I;
(3c) applying a morphological dilation operation to the optimized land-water segmentation binary map I4 and then upsampling it back to the original wide-image size to obtain the final land-water segmentation binary map I5;
(4) training the convolutional-neural-network-based SCRDet target detection model M0 with a random multi-scale strategy to obtain the trained detection model M;
(5) using the final land-water segmentation binary map I5, extracting region-of-interest blocks from the original wide image with partially overlapping sliding windows as regions to be detected, forming the region-to-be-detected image set F = {f1, …, fi, …, fn} and the position set S = {s1, …, si, …, sn}, where fi denotes the i-th detection region and si denotes the upper-left-corner coordinates of fi; inputting each region fi of the image set F into the detection model M to obtain its region detection result;
(6) mapping the region detection results to the original wide-image scale according to the position set S to obtain the preliminary detection result set A = {(P1, x1, c1), …, (Pi, xi, ci), …}, where Pi, xi and ci respectively denote the category, position coordinates and confidence score of the i-th preliminary detection box; applying improved non-maximum suppression to the preliminary detection result set A to obtain the once-optimized detection result set B = {(Q1, x1, c1), …, (Qi, xi, ci), …}, where Qi, xi and ci respectively denote the category, position coordinates and confidence score of the i-th once-optimized detection box;
(7) performing secondary optimization on the once-optimized detection result set B according to the structural characteristics of ships to obtain the final detection result set C = {(R1, x1, c1), …, (Ri, xi, ci), …}, where Ri, xi and ci respectively denote the category, position coordinates and confidence score of the i-th finally generated detection box.
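The dark-channel-prior defogging mentioned in step (2) can be sketched as follows. This is a simplified reading of He et al.'s method without the guided-filter transmission refinement; the patch size, omega and t0 values are illustrative assumptions, not taken from the patent:

```python
import numpy as np

def dark_channel(img, patch=7):
    """Per-pixel minimum over the colour channels, followed by a
    patch-wise minimum (a min-filter). img: HxWx3 float array in [0, 1]."""
    m = img.min(axis=2)
    pad = patch // 2
    padded = np.pad(m, pad, mode='edge')
    h, w = m.shape
    out = np.empty_like(m)
    for i in range(h):
        for j in range(w):
            out[i, j] = padded[i:i + patch, j:j + patch].min()
    return out

def dehaze(img, omega=0.95, t0=0.1, patch=7):
    """Dark-channel-prior dehazing, simplified (no guided-filter
    refinement of the transmission). Returns the recovered radiance."""
    dark = dark_channel(img, patch)
    # atmospheric light A: mean colour of the brightest 0.1% dark-channel pixels
    n = max(1, int(dark.size * 0.001))
    idx = np.unravel_index(np.argsort(dark, axis=None)[-n:], dark.shape)
    A = img[idx].mean(axis=0)
    # transmission estimate t(x) = 1 - omega * dark_channel(I / A)
    t = 1.0 - omega * dark_channel(img / A, patch)
    t = np.clip(t, t0, 1.0)[..., None]
    return np.clip((img - A) / t + A, 0.0, 1.0)
```

In the pipeline above, the wide image would be downsampled first and `dehaze` applied to the result to produce the enhanced image I.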
2. The method according to claim 1, wherein the preliminary threshold segmentation of the enhanced image I in (3a) using the context feature information of the pixels is implemented as follows:
3a1) performing mirror-image padding of width 3 on the edges of the enhanced image I to obtain the expanded image I'; taking each pixel of the enhanced image I as the centre of a 7 × 7 sliding window, extracting the corresponding window region on the expanded image I', and computing the regional variance v as the context feature value of the centre pixel, to obtain the feature map V of the enhanced image I, whose feature value at coordinate (i, j) is V(i, j);
3a2) computing the distribution histogram of the feature map V with bin width 1, and determining the variance vmax corresponding to the histogram peak; computing the variance gt corresponding to the first valley of the histogram in [vmax, +∞); if V(i, j) ≤ gt, updating V(i, j) = 1, otherwise updating V(i, j) = 0; applying a morphological dilation operation to the updated feature map V to obtain the feature binary map I1;
3a3) performing a grayscale transformation of the enhanced image I with an improved Laplacian operator, and segmenting it with the Otsu adaptive threshold method to obtain the grayscale binary map I2;
3a4) performing a pixel-wise logical AND of the feature binary map I1 and the grayscale binary map I2 to obtain the preliminary land-water segmentation binary map I3, in which pixel value 0 represents land and pixel value 1 represents water.
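Steps 3a1)-3a2) can be sketched as below. The 7 × 7 window, mirror padding of width 3 and unit bin width follow the claim; the valley search (first local minimum to the right of the peak) is one plausible reading of "the first valley of the histogram":

```python
import numpy as np

def variance_feature_map(img, win=7):
    """Context feature V of step 3a1): the variance of the win x win
    neighbourhood around each pixel, computed on a mirror-padded copy."""
    pad = win // 2                      # width-3 mirror padding for win = 7
    p = np.pad(img.astype(np.float64), pad, mode='reflect')
    h, w = img.shape
    V = np.empty((h, w))
    for i in range(h):
        for j in range(w):
            V[i, j] = p[i:i + win, j:j + win].var()
    return V

def valley_threshold(V, bin_width=1.0):
    """Step 3a2): histogram V with bin width 1, locate the peak, then
    return the first valley (local minimum) to its right as g_t."""
    edges = np.arange(V.min(), V.max() + 2 * bin_width, bin_width)
    hist, edges = np.histogram(V, bins=edges)
    peak = int(np.argmax(hist))
    for k in range(peak + 1, len(hist) - 1):
        if hist[k] <= hist[k - 1] and hist[k] <= hist[k + 1]:
            return edges[k]
    return edges[-1]                    # fallback: no valley found
```

Pixels whose feature value falls at or below the returned threshold would then be set to 1 (water candidates) and the rest to 0, before the dilation of step 3a2).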
3. The method according to claim 1, wherein in (3c) the optimized land-water segmentation binary map I4 is used to obtain the final land-water segmentation binary map I5, implemented as follows:
3c1) duplicating the optimized land-water segmentation binary map I4 to obtain the copied result map I4';
3c2) scanning the optimized land-water segmentation binary map I4 pixel by pixel with a 15 × 15 elliptical structuring element, and performing a logical AND of the structuring element with all pixels of the corresponding region of I4;
3c3) judging the result at each pixel and updating the centre pixel of the corresponding region in I4':
if the result at a pixel is 0, updating the centre pixel of the corresponding region to 0;
otherwise, updating the centre pixel of the corresponding region to 1;
3c4) upsampling the updated I4' by a factor of 8 to restore it to the original wide-image size, obtaining the final land-water segmentation binary map I5.
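The scan-and-AND update of steps 3c2)-3c3) is, in effect, a binary erosion with the elliptical structuring element, and step 3c4) is an 8x nearest-neighbour upsampling. A minimal numpy sketch (the exact ellipse shape is an assumption; the claim only gives its 15 × 15 size):

```python
import numpy as np

def elliptical_se(size=15):
    """Binary elliptical (here circular) structuring element; an assumed
    shape for the 15 x 15 element of claim 3."""
    r = size // 2
    y, x = np.ogrid[-r:r + 1, -r:r + 1]
    return (x * x / float(r * r) + y * y / float(r * r)) <= 1.0

def erode(mask, se):
    """Steps 3c2)-3c3): the centre pixel survives only if the AND of the
    mask over the structuring-element footprint is 1 (binary erosion)."""
    r = se.shape[0] // 2
    p = np.pad(mask.astype(bool), r, mode='constant', constant_values=False)
    h, w = mask.shape
    out = np.zeros_like(mask, dtype=bool)
    for i in range(h):
        for j in range(w):
            win = p[i:i + se.shape[0], j:j + se.shape[1]]
            out[i, j] = bool(np.all(win[se]))
    return out.astype(mask.dtype)

def upsample8(mask):
    """Step 3c4): nearest-neighbour upsampling by a factor of 8."""
    return np.kron(mask, np.ones((8, 8), dtype=mask.dtype))
```

Note that claim 1 step (3c) calls this a dilation while the AND rule here is an erosion of the water class; the sketch follows the AND rule spelled out in claim 3.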
4. The method according to claim 1, wherein in (4) a random multi-scale strategy is adopted to train the deep-convolutional-neural-network-based SCRDet model M0 and obtain the detection model M, implemented as follows:
4a) taking 90% of the constructed optical remote sensing image ship detection data set G as training samples and the remaining 10% as test samples;
4b) randomly selecting a training image and its corresponding annotation from the training samples, and randomly flipping the image and annotation vertically or horizontally to obtain the transformed image X and the ground-truth annotation Y;
4c) randomly selecting one scale from [600, 700, 832] as the length of the short side of the image, scaling the transformed image X and the ground-truth annotation Y to the selected scale, and inputting them into the SCRDet model M0 to obtain the prediction result Y';
4d) computing the error between the prediction result Y' and the ground-truth annotation Y, and minimizing it with the Adam optimizer to update the weight parameters of the SCRDet model M0;
4e) repeating 4b)-4d); when the number of training iterations reaches 200000, training is complete and the detection model M is obtained.
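The random-multi-scale part of steps 4b)-4c) can be sketched as below; the nearest-neighbour resize is a stand-in for whatever interpolation the training code uses, and the SCRDet model itself is omitted:

```python
import random
import numpy as np

SCALES = [600, 700, 832]   # short-side lengths from step 4c)

def sample_training_scale(rng=random):
    """Step 4c): draw one short-side scale uniformly at random
    per training iteration."""
    return rng.choice(SCALES)

def resize_short_side(img, short):
    """Resize so the shorter image side equals `short`, preserving the
    aspect ratio (nearest-neighbour; a simplified stand-in for the
    resize applied to X and Y before they enter the model)."""
    h, w = img.shape[:2]
    scale = short / min(h, w)
    nh, nw = int(round(h * scale)), int(round(w * scale))
    rows = (np.arange(nh) / scale).astype(int).clip(0, h - 1)
    cols = (np.arange(nw) / scale).astype(int).clip(0, w - 1)
    return img[rows][:, cols]
```

Per iteration one would call `resize_short_side(X, sample_training_scale())` (and scale the box annotations Y by the same factor) before the forward pass.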
5. The method according to claim 1, wherein in (5) the final land-water segmentation binary map I5 is used to extract region-of-interest blocks from the original wide image as regions to be detected, and the region-to-be-detected image set F is input into the detection model M to obtain the region detection results, implemented as follows:
5a) initializing the region-to-be-detected image set F and the position set S as empty sets;
5b) scanning the original wide remote sensing image with partially overlapping windows of size p × p and step p/2, and computing, for each window region f of the original wide remote sensing image, the water area Sw of the corresponding region in the final land-water segmentation binary map I5;
5c) judging from Sw whether the window region f is a region-of-interest block:
if Sw > (p/8)², judging the window region f to be a region-of-interest block, adding the window region f to the region-to-be-detected image set F, and adding the upper-left-corner coordinates s of the window region f to the position set S;
otherwise, judging the window region f to be a non-region-of-interest block and performing no operation;
5d) inputting the region-to-be-detected image set F = {f1, …, fi, …, fn} into the detection model M to obtain the category, position coordinates and confidence score of all detection boxes in each region fi.
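Steps 5a)-5c) amount to a stride-p/2 sliding-window scan with a water-area test. A minimal sketch; the window size p is not fixed by the claim, so p = 800 here is an assumption:

```python
import numpy as np

def extract_roi_blocks(image, water_mask, p=800):
    """Scan the wide image with p x p windows at stride p/2; a window is
    a region-of-interest block when its water area S_w in the land-water
    binary map exceeds (p/8)^2 pixels (claim 5, step 5c).
    Returns the block images F and their top-left coordinates S."""
    F, S = [], []
    step = p // 2
    h, w = water_mask.shape
    for top in range(0, max(h - p, 0) + 1, step):
        for left in range(0, max(w - p, 0) + 1, step):
            win = water_mask[top:top + p, left:left + p]
            if win.sum() > (p / 8) ** 2:       # water-area test S_w > (p/8)^2
                F.append(image[top:top + p, left:left + p])
                S.append((top, left))
    return F, S
```

Each block in F would then be fed to the detector, with S retained so the per-block boxes can be shifted back to wide-image coordinates in step (6).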
6. The method according to claim 1, wherein in (6) the region detection results are mapped to the original wide-image scale and improved non-maximum suppression is applied to the preliminary detection result set A, implemented as follows:
6a) according to the coordinates si of each region to be detected fi in the position set S = {s1, …, si, …, sn}, mapping the detection results of each region fi to the original wide-image scale to obtain the preliminary detection result set A = {(P1, x1, c1), …, (Pi, xi, ci), …}, where Pi, xi and ci respectively denote the category, position coordinates and confidence score of the i-th preliminary detection box;
6b) initializing the once-optimized detection result set B as empty;
6c) treating the categories of all detection boxes in the preliminary detection result set A as the same category; letting r be the index of the detection result with the highest confidence in A, taking the detection result (Pr, xr, cr) out of A according to the index r, adding it to set B, and deleting it from set A;
6d) computing the intersection-over-union IoUi between each detection box remaining in the preliminary detection result set A and the highest-confidence detection box;
6e) comparing the intersection-over-union IoUi with the intersection-over-union threshold t2: if IoUi > t2, updating the confidence score ci; otherwise leaving ci unchanged;
6f) repeating steps 6c)-6e) until the preliminary detection result set A is empty, obtaining the once-optimized detection result set B = {(Q1, x1, c1), …, (Qi, xi, ci), …}, where Qi, xi and ci respectively denote the category, position coordinates and confidence score of the i-th once-optimized detection box.
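The loop of steps 6b)-6f) resembles soft-NMS: instead of discarding overlapping boxes outright, their scores are updated. A sketch using axis-aligned IoU (the patent works with inclined boxes) and an assumed linear score decay, since the claim does not give the update formula:

```python
import numpy as np

def iou(a, b):
    """Axis-aligned intersection-over-union of boxes (x1, y1, x2, y2);
    a simplification of the inclined-box IoU used in the patent."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    union = area(a) + area(b) - inter
    return inter / union if union > 0 else 0.0

def improved_nms(dets, t2=0.5):
    """Claim 6 loop: repeatedly move the highest-scoring detection to B
    and update the scores of remaining boxes whose IoU with it exceeds
    t2. The linear decay s *= (1 - IoU) is an assumption; the claim only
    states that the score 'is updated'. Each det: [x1, y1, x2, y2, s]."""
    A = [list(d) for d in dets]
    B = []
    while A:
        r = max(range(len(A)), key=lambda k: A[k][4])  # argmax score
        best = A.pop(r)
        B.append(best)
        for d in A:
            o = iou(best[:4], d[:4])
            if o > t2:
                d[4] *= (1.0 - o)      # decay instead of delete
    return B
```

Keeping decayed boxes instead of deleting them helps in harbour scenes where densely moored ships overlap heavily.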
7. The method according to claim 1, wherein in (7) all detection results (Qi, xi, ci) in the once-optimized detection result set B are subjected to secondary optimization according to the structural characteristics of ships, implemented as follows:
7a) judging the confidence score ci: if ci is below the confidence threshold, optimizing the category Qi of the i-th detection box to background, otherwise to a ship target;
7b) obtaining the longest side x of the detection box from its position coordinates xi and judging its size: if x > 450, optimizing the category Qi of the i-th detection box to background, otherwise to a ship target;
7c) computing the ratio y of the longest side to the shortest side of the detection box from its position coordinates xi and judging its size: if y > 11, optimizing the category Qi of the i-th detection box to background, otherwise to a ship target.
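The three rules of claim 7 can be sketched as a single filter. The confidence threshold (0.3) is an assumption, since the claim's numeric value is hidden in the original formula image; the side-length limit (450) and aspect-ratio limit (11) come from steps 7b) and 7c):

```python
def secondary_optimize(dets, score_thresh=0.3, max_side=450, max_aspect=11):
    """Claim 7 ship-structure rules: a detection is re-labelled as
    background when (a) its confidence is below a threshold (0.3 is an
    assumed value), (b) its longest side exceeds 450 pixels, or (c) its
    longest/shortest side ratio exceeds 11; otherwise it stays a ship.
    Each det: dict with keys 'w', 'h', 'score'."""
    out = []
    for d in dets:
        d = dict(d)                              # do not mutate the input
        long_side = max(d['w'], d['h'])
        short_side = max(min(d['w'], d['h']), 1e-6)
        if (d['score'] < score_thresh
                or long_side > max_side
                or long_side / short_side > max_aspect):
            d['cls'] = 'background'
        else:
            d['cls'] = 'ship'
        out.append(d)
    return out
```

The side-length and aspect-ratio limits encode the physical plausibility of ships at the imaging resolution: extremely long or extremely elongated boxes are more likely wakes, breakwaters or coastline fragments than vessels.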
CN201911338663.3A 2019-12-23 2019-12-23 Remote sensing image ship detection method based on region of interest block extraction Active CN111027511B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911338663.3A CN111027511B (en) 2019-12-23 2019-12-23 Remote sensing image ship detection method based on region of interest block extraction

Publications (2)

Publication Number Publication Date
CN111027511A true CN111027511A (en) 2020-04-17
CN111027511B CN111027511B (en) 2022-04-29

Family

ID=70212653

Country Status (1)

Country Link
CN (1) CN111027511B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111626176A (en) * 2020-05-22 2020-09-04 中国科学院空天信息创新研究院 Ground object target detection method and system of remote sensing image
CN112949389A (en) * 2021-01-28 2021-06-11 西北工业大学 Haze image target detection method based on improved target detection network
CN112967335A (en) * 2021-03-10 2021-06-15 精英数智科技股份有限公司 Bubble size monitoring method and device
CN113205469A (en) * 2021-06-04 2021-08-03 中国人民解放军国防科技大学 Single image defogging method based on improved dark channel
CN113781502A (en) * 2021-09-30 2021-12-10 浪潮云信息技术股份公司 Method for preprocessing image training data with ultra-large resolution
CN115294066A (en) * 2022-08-09 2022-11-04 重庆科技学院 Sandstone particle size detection method
CN116091451A (en) * 2023-01-09 2023-05-09 中国科学院苏州生物医学工程技术研究所 Retinal pigment epithelial cell image segmentation method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150243032A1 (en) * 2014-02-26 2015-08-27 Raytheon Company False alarm rejection for boat detection candidates
CN107784655A (en) * 2016-12-28 2018-03-09 中国测绘科学研究院 A kind of visual attention model SAR naval vessels detection algorithm of adaptive threshold
CN108133468A (en) * 2017-12-25 2018-06-08 南京理工大学 Auto-adaptive parameter enhances and the constant false alarm rate Ship Detection of tail auxiliary detection
CN108171193A (en) * 2018-01-08 2018-06-15 西安电子科技大学 Polarization SAR Ship Target Detection method based on super-pixel local message measurement
CN108921066A (en) * 2018-06-22 2018-11-30 西安电子科技大学 Remote sensing image Ship Detection based on Fusion Features convolutional network
CN109409285A (en) * 2018-10-24 2019-03-01 西安电子科技大学 Remote sensing video object detection method based on overlapping slice

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHANG LIN: "Research and Implementation of Ship Target Detection in Gaofen-3 SAR Images", China Master's Theses Full-text Database, Information Science and Technology Series *


Also Published As

Publication number Publication date
CN111027511B (en) 2022-04-29

Similar Documents

Publication Publication Date Title
CN111027511B (en) Remote sensing image ship detection method based on region of interest block extraction
CN109977812B (en) Vehicle-mounted video target detection method based on deep learning
CN108961235B (en) Defective insulator identification method based on YOLOv3 network and particle filter algorithm
CN109583425B (en) Remote sensing image ship integrated recognition method based on deep learning
CN108573276B (en) Change detection method based on high-resolution remote sensing image
CN109409263B (en) Method for detecting urban ground feature change of remote sensing image based on Siamese convolutional network
CN109934200B (en) RGB color remote sensing image cloud detection method and system based on improved M-Net
CN107871119B (en) Target detection method based on target space knowledge and two-stage prediction learning
CN113160192B (en) Visual sense-based snow pressing vehicle appearance defect detection method and device under complex background
CN108596055B (en) Airport target detection method of high-resolution remote sensing image under complex background
CN108615226B (en) Image defogging method based on generation type countermeasure network
CN103049763B (en) Context-constraint-based target identification method
WO2017071160A1 (en) Sea-land segmentation method and system for large-size remote-sensing image
CN110276264B (en) Crowd density estimation method based on foreground segmentation graph
CN112084869B (en) Compact quadrilateral representation-based building target detection method
CN109255350B (en) New energy license plate detection method based on video monitoring
CN109840483B (en) Landslide crack detection and identification method and device
CN111126335B (en) SAR ship identification method and system combining significance and neural network
CN113076871A (en) Fish shoal automatic detection method based on target shielding compensation
CN114677554A (en) Statistical filtering infrared small target detection tracking method based on YOLOv5 and Deepsort
CN109829423B (en) Infrared imaging detection method for frozen lake
CN112560675B (en) Bird visual target detection method combining YOLO and rotation-fusion strategy
CN112818905B (en) Finite pixel vehicle target detection method based on attention and spatio-temporal information
CN111091095A (en) Method for detecting ship target in remote sensing image
CN112836650A (en) Semantic analysis method and system for quality inspection report scanning image table

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant