CN112862748B

CN112862748B - Multi-dimensional domain feature combined SAR ship intelligent detection method

Info

Publication number: CN112862748B
Application number: CN202011562298.7A
Authority: CN
Inventors: 李东; 梁全欢; 李秀琴; 万俊; 陈展野; 程婉儒; 唐先慧
Original assignee: Chongqing University
Current assignee: Chongqing University
Priority date: 2020-12-25
Filing date: 2020-12-25
Publication date: 2023-05-30
Anticipated expiration: 2040-12-25
Also published as: CN112862748A

Abstract

The invention belongs to the technical field of SAR ship target detection, and discloses a multi-dimensional domain feature combined SAR ship intelligent detection method, wherein the existing Synthetic Aperture Radar (SAR) ship detection method based on deep learning only utilizes space feature information of a ship target, and can not obtain satisfactory detection effects for multi-scale targets and rotating targets under a complex background. To overcome these problems, the present invention employs a feature pyramid network to learn multi-scale features of a vessel target space; and secondly, obtaining the rotation invariant feature of the SAR vessel target on the frequency domain by utilizing polar coordinate Fourier transformation. On this basis, a new space-frequency feature fusion network is proposed to obtain a more compact cross-domain feature representation. The invention effectively overcomes the scale transformation and rotation behavior of the SAR vessel target and improves the detection performance of the SAR vessel detection algorithm.

Description

Multi-dimensional domain feature combined SAR ship intelligent detection method

Technical Field

The invention belongs to the technical field of SAR vessel target detection, and particularly relates to an intelligent detection method for a multi-dimensional domain feature combined SAR vessel.

Background

At present, the synthetic aperture radar can generate high-resolution microwave images of an observation scene under all-weather conditions throughout the day, and is widely applied to many civil and military fields. In recent years, SAR ship detection has been increasingly valued for its important value in practical applications such as offshore management, offshore traffic control, marine environmental protection, and the like. With the rapid development of SAR imaging technology, the observation level of a ship target is higher and higher, so that SAR ship detection based on fine information and high-level characteristics is possible.

The traditional SAR vessel target detection method mostly depends on manual participation, and target characteristics need to be selected according to manual experience; and the problems of motion blur, speckle noise and the like of the SAR image are caused, so that the traditional ship detection method is difficult to extract effective characteristics, and the target detection performance of the SAR ship is limited.

In recent years, with the development of artificial intelligence and SAR imaging technologies, more and more scholars apply a deep learning technology to the field of SAR ship target detection, and the deep learning technology can autonomously learn the refined characteristics and high-level semantic information of the ship target, so that the artificial participation degree is effectively reduced, and the ship target detection precision is improved. However, only the space characteristic information of the SAR vessel target is utilized, and the complementary characteristics of other dimension fields of the vessel target are ignored, so that the vessel target detection reliability and the classification recognition accuracy are limited.

Through the above analysis, the problems and defects existing in the prior art are as follows: the existing Shan Weiyu SAR ship detection method only utilizes the space characteristic information of ship targets, and particularly cannot obtain satisfactory detection effects under multi-scale or rotating targets and complex backgrounds.

The difficulty of solving the problems and the defects is as follows: 1. how to accurately extract the frequency domain rotation invariant feature of the vessel target in deep learning; 2. how to accurately align the spatial features and the frequency domain features so as to ensure the effectiveness of feature fusion; 3. how to effectively realize cross-domain fusion of multidimensional domain features and obtain better feature expression.

The meaning of solving the problems and the defects is as follows: 1. the SAR ship detection network integrating multidimensional domain features and deep learning is provided; 2. the rotation problem of the ship target is solved by utilizing the frequency domain characteristics; 3. the ship detection performance of the method is superior to that of other mainstream target detection algorithms.

Disclosure of Invention

Aiming at the problems existing in the prior art, the invention provides an intelligent detection method for a multi-dimensional domain feature combined SAR vessel. According to the invention, by exploring and utilizing the spatial multi-scale characteristics and the frequency domain rotation invariant characteristics of the SAR vessel target, more compact cross-domain characteristic representation is obtained through a spatial-frequency domain characteristic fusion network, so that the detection performance is improved.

The invention is realized in such a way that the intelligent detection method of the multi-dimensional domain feature combined SAR vessel comprises the following steps:

step one, analyzing multi-scale problems and rotation problems existing in the existing SAR vessel targets, acquiring layered space characteristics from bottom to top through a backbone network, and acquiring the multi-scale characteristics by adopting FPN;

step two, the spatial multi-scale characteristics obtained in the step one are obtained through an RPN network, coordinate information and category information of a candidate region of the SAR image are obtained, and the coordinate information and the category information are mapped to an original image and a multi-scale characteristic image respectively;

step three, processing is carried out by combining a time-frequency domain dimension domain aiming at the rotation behavior of the ship; according to the candidate region of the original image obtained in the second step, rotating invariant information of the SAR vessel target is obtained through Fourier-polar coordinate transformation, and the rotating invariant feature of the SAR vessel target frequency domain is obtained through feature extraction;

and fourthly, designing a finer feature fusion network aiming at space-frequency domain feature fusion, and obtaining more compact feature expression in different modes by interactively updating parameters of different networks so as to comprehensively characterize the features of each dimension domain.

Further, in the first step, the neural network Ω is convolved through a spatial channel _S The spatial feature phi can be easily obtained from the SAR image _S (x, y), which can be expressed as

φ _S (x,y)＝Ω _S (I(x,y),θ _s )；

Wherein I (x, y) represents an input SAR vessel image, θ _s Representing omega _S Related parameter sets in (a); upsampling high-level features by FPN and enhancement via cross-connect to yield spatial multi-scale features φ _{S_MS} (x,y)。

Further, in the second step, the proposed area of the image is obtained by sharing the spatial feature to the RPN, and the specific process is as follows:

the first step, a 3×3 convolution check is used to convolve the shared feature map, so as to make the extracted features more robust;

secondly, using a convolution kernel of 1 multiplied by 1 twice to obtain category information and position information of the candidate region;

third, mapping the candidate region position information to the original image and the multi-scale feature map to obtain I respectively _Sub (x, y) and phi _{S_MS_Sub} (x,y)。

Further, in the third step, the obtained original image candidate region I _Sub (x, y) performing polar coordinate transformation, wherein the transformation relation is as follows:

where r represents the distance from the point to the origin in a rectangular coordinate system, θ is the angle between the x-axis and the line connecting the point and the origin, and is positive in the counterclockwise direction.

Further, the obtained polar coordinate image is subjected to Fourier transformation and the amplitude spectrum is taken

Wherein F is _Sub (mu, v) represents candidate region frequency domain information, mu, v are corresponding two-dimensional frequency domain variables respectively, and the amplitude spectrum is taken to obtain rotation invariant information M _Sub (u,v)。

Further, the rotation invariant information M obtained by the convolution neural network pair _Sub (u, v) feature extraction

φ _{F_RI_Sub} (x,y)＝Ω _F (M _Sub (u,v),θ _F )；

In phi _{F_RI_Sub} (x, y) represents the rotation invariant feature of the SAR vessel target, Ω _F Extracting network for frequency domain characteristics, θ _F For its associated parameter set.

Further, in the third step, the rotation behavior of the SAR vessel target is processed, and the rotation invariant feature is extracted, and the specific process is as follows:

the rotation behavior under the rectangular coordinates of the image is represented as translation of the target under the polar coordinates, so that the rotation transformation under the rectangular coordinates is converted into translation transformation by utilizing the polar coordinate transformation; and simultaneously, combining the characteristic of Fourier transformation, and taking the magnitude spectrum of the spectrum information of the SAR ship target so as to adapt to any rotation angle of the SAR ship target, thereby obtaining the rotation invariant information of the SAR ship target.

In the third step, the rotation invariant feature of the SAR vessel target is obtained by extracting the feature of the SAR vessel target through a convolutional neural network.

In the fourth step, a finer feature fusion network is designed, and the spatial-frequency domain features are fused by interactively updating parameters of different networks, which comprises the following specific processes:

expanding the spatial features into one-dimensional vectors, and expanding the frequency domain features through the full connection layer to obtain feature vectors with the same dimension, wherein the obtained output feature size is 1024; the representation is

a _S ＝FC(Flatten(φ _{S_MS_Sub} (x,y)))；

a _F ＝Flatten(φ _{F_RI_Sub} (x,y))；

Wherein a is _S And a _F Output feature vectors representing the spatial domain and the frequency domain, respectively, and FC represents the fully connected layer.

Further, the cross-domain fusion process of the obtained one-dimensional spatial feature and the frequency domain feature vector is as follows:

a _SF,1 ＝[FC ₁ (a _S )，FC ₁ (a _F )]

a _SF,2 ＝[FC ₂ (a _F )，FC ₂ (a _S )]

where [ ·, ] denotes stitching together two feature vectors, the intersection between the network and the feature makes the output feature more compact while preserving the domain features of each dimension.

By combining all the technical schemes, the invention has the advantages and positive effects that: the invention starts from the angle of multi-dimensional domain feature combination, and excavates the space multi-scale feature and the frequency domain rotation invariant feature of the SAR vessel target by further excavating the multi-dimensional domain complementary information of the SAR vessel target, thereby providing a novel multi-dimensional domain feature combination SAR vessel intelligent detection network, effectively overcoming the rotating behavior of the scale transformation of the SAR vessel target and improving the detection performance of the SAR vessel detection algorithm.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present application, the following description will briefly explain the drawings needed in the embodiments of the present application, and it is obvious that the drawings described below are only some embodiments of the present application, and that other drawings can be obtained according to these drawings without inventive effort for a person skilled in the art.

Fig. 1 is a flowchart of a multi-dimensional domain feature combined SAR ship intelligent detection method provided by an embodiment of the invention.

Fig. 2 is a schematic diagram of a multi-dimensional domain feature combined SAR ship intelligent detection network provided by an embodiment of the present invention.

Fig. 3 is a schematic diagram of a spatial multi-scale feature extraction network according to an embodiment of the present invention.

Fig. 4 is a schematic diagram of a proposed network structure according to an embodiment of the present invention.

Fig. 5 is a schematic diagram of image rotation behavior under rectangular coordinates and polar coordinates according to an embodiment of the present invention.

Fig. 6 is a schematic diagram of a frequency domain feature extraction network according to an embodiment of the present invention.

Fig. 7 is a schematic diagram of a spatial-frequency domain feature fusion network according to an embodiment of the present invention.

Fig. 8 and 9 show the results of offshore and onshore vessel target detection, respectively, wherein fig. (a), (b), (c) show the results of real vessel targets, the detection result of the fast RCNN algorithm, and the detection result of the present invention, respectively.

Detailed Description

The present invention will be described in further detail with reference to the following examples in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.

Aiming at the problems existing in the prior art, the invention provides an intelligent detection method for a multi-dimensional domain feature combined SAR vessel, and the invention is described in detail below with reference to the accompanying drawings.

As shown in fig. 1, the multi-dimensional domain feature combined SAR ship intelligent detection method provided by the embodiment of the invention includes:

s101: analyzing the multi-scale problem and the rotation problem existing in the existing SAR vessel targets, acquiring layered spatial characteristics from bottom to top through a backbone network, and acquiring the multi-scale characteristics by adopting FPN.

S102: and (3) acquiring coordinate information and category information of the SAR image candidate region through the RPN according to the spatial multi-scale feature acquired in the step (S101), and mapping the coordinate information and the category information to an original image and a multi-scale feature map respectively.

S103: processing is carried out by combining a time-frequency domain and a dimension domain aiming at the rotation behavior of the ship; and (3) according to the candidate region of the original image obtained in the step (S102), obtaining rotation invariant information of the SAR vessel target by carrying out Fourier-polar coordinate transformation on the candidate region, and obtaining the rotation invariant feature of the SAR vessel target frequency domain by carrying out feature extraction on the rotation invariant information.

S104: aiming at space-frequency domain feature fusion, a finer feature fusion network is designed, and by interactively updating parameters of different networks, more compact feature expression is obtained in different modes, so that the features of each dimension domain can be comprehensively represented.

In S101 provided by the embodiment of the present invention, the neural network Ω is convolved through a spatial channel _S The spatial feature phi can be easily obtained from the SAR image _S (x, y), which can be expressed as

φ _S (x,y)＝Ω _S (I(x,y),θ _s )；

In S102 provided by the embodiment of the present invention, the spatial feature is shared to the RPN to obtain the proposed area of the image, and the specific process is as follows:

s201: the shared feature map is convolved using a 3 x 3 convolution kernel in order to make the extracted features more robust.

S202: the category information and the position information of the candidate region are obtained using a convolution kernel of 1×1 twice.

S203: mapping the candidate region position information to the original image and the multi-scale feature map to obtain I respectively _Sub (x, y) and phi _{S_MS_Sub} (x,y)。

In S203 provided by the embodiment of the present invention, for the obtained original image candidate region I _Sub (x, y) performing polar coordinate transformation, wherein the transformation relation is as follows:

The obtained polar coordinate image is subjected to Fourier transformation and the amplitude spectrum is taken

The rotation invariant information M obtained by convolution neural network pair _Sub (u, v) feature extraction

φ _{F_RI_Sub} (x,y)＝Ω _F (M _Sub (u,v),θ _F )；

In S103 provided by the embodiment of the present invention, the rotation behavior of the SAR vessel target is processed, and the rotation invariant feature is extracted, which specifically includes the following steps:

the rotation behavior under the rectangular coordinates of the image is represented as translation of the target under the polar coordinates, so that the rotation transformation under the rectangular coordinates is converted into translation transformation by utilizing the polar coordinate transformation; and simultaneously, combining the characteristic of Fourier transformation, and taking the magnitude spectrum of the spectrum information of the SAR ship target so as to adapt to any rotation angle of the SAR ship target, thereby obtaining the rotation invariant information of the SAR ship target. And extracting the characteristics of the SAR vessel target through a convolutional neural network to obtain the rotation invariant characteristics of the SAR vessel target.

In S104 provided by the embodiment of the present invention, a finer feature fusion network is designed, and the spatial-frequency domain features are fused by interactively updating parameters of different networks, which is specifically based on the following principle:

in order to ensure the dimension consistency of the spatial features and the frequency features, the spatial features are unfolded into one-dimensional vectors, the size of the obtained output features is 1024 through the full connection layer, and the frequency domain features are unfolded to obtain feature vectors with the same dimension; the representation is

a _S ＝FC(Flatten(φ _{S_MS_Sub} (x,y)))；

a _F ＝Flatten(φ _{F_RI_Sub} (x,y))；

The cross-domain fusion process of the obtained one-dimensional spatial feature and the frequency domain feature vector comprises the following steps:

a _SF,1 ＝[FC ₁ (a _S )，FC ₁ (a _F )]

a _SF,2 ＝[FC ₂ (a _F )，FC ₂ (a _S )]

where [ ·, ] denotes stitching together two feature vectors, the intersection between the network and the features makes the output feature more compact while preserving the features of the dimension domain, with which a better, more efficient spatial frequency feature expression is achieved.

The technical scheme of the invention is further described below with reference to specific embodiments.

The intelligent detection network for the multi-dimensional domain feature combined SAR vessel provided by the invention supplements and expands the target space information by exploring and utilizing the multi-dimensional domain complementary feature, and deeply excavates the space multi-scale feature and the frequency domain rotation invariant feature, so that the intelligent detection network for the multi-dimensional domain feature combined SAR vessel is provided, and the detection performance of the SAR vessel is improved.

And (1) analyzing the multi-scale problem and the rotation problem existing in the existing SAR vessel target, and extracting the multi-scale characteristics of the SAR vessel target by using a characteristic pyramid, wherein the specific implementation is shown in figure 3.

And (2) acquiring candidate areas by utilizing an RPN (remote procedure network) according to the multi-scale features obtained in the step (1), and respectively mapping coordinate information onto the feature images and the original images so as to realize the alignment of space frequency information, wherein the specific implementation is shown in figure 4.

And (3) aiming at the rotation behavior of the ship target, obtaining rotation invariant information of the SAR ship target by utilizing Fourier-polar coordinate transformation, and extracting characteristics, wherein the network structure is shown in figure 6.

And (4) aiming at the spatial multi-scale characteristics and the frequency domain rotation invariant characteristics obtained in the step (2) and the step (3), utilizing a cross-fusion mode to perform cross-domain fusion of experimental spatial-frequency domain characteristics, and particularly realizing the cross-domain fusion shown in the figure 7.

The experimental results are shown in the figures, wherein figures 8 and 9 respectively show the offshore and onshore vessel target detection results, and figures (a), (b) and (c) respectively show the real vessel target, the detection result of the fast RCNN algorithm and the detection result of the invention. From the figure, the SAR vessel detection network provided by the invention can effectively reduce false alarm and omission under the background of offshore and onshore, and improve detection performance.

In the description of the present invention, unless otherwise indicated, the meaning of "a plurality" is two or more; the terms "upper," "lower," "left," "right," "inner," "outer," "front," "rear," "head," "tail," and the like are used as an orientation or positional relationship based on that shown in the drawings, merely to facilitate description of the invention and to simplify the description, and do not indicate or imply that the devices or elements referred to must have a particular orientation, be constructed and operated in a particular orientation, and therefore should not be construed as limiting the invention. Furthermore, the terms "first," "second," "third," and the like are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.

The foregoing is merely illustrative of specific embodiments of the present invention, and the scope of the invention is not limited thereto, but any modifications, equivalents, improvements and alternatives falling within the spirit and principles of the present invention will be apparent to those skilled in the art within the scope of the present invention.

Claims

1. The intelligent detection method for the multi-dimensional domain feature combined SAR vessel is characterized by comprising the following steps of:

analyzing multi-scale problems and rotation problems existing in the existing SAR vessel targets, acquiring layered space characteristics from bottom to top through a backbone network, and acquiring the multi-scale characteristics by adopting a characteristic pyramid FPN;

step two, the spatial multi-scale characteristics obtained in the step one are obtained through an RPN network, coordinate information and category information of candidate areas of SAR images are obtained, and the coordinate information is mapped to an original image and a multi-scale characteristic image respectively;

step three, processing is carried out by combining a time-frequency domain dimension domain aiming at the rotation behavior of the ship; obtaining rotation invariant information of the SAR vessel target by carrying out Fourier-polar coordinate transformation on the candidate region of the original image obtained in the second step, and obtaining the rotation invariant feature of the SAR vessel target frequency domain by feature extraction;

and step four, designing a finer feature fusion network aiming at space-frequency domain feature fusion, and obtaining more compact feature expression by interactively updating parameters of different networks so as to comprehensively characterize the features of each dimension domain.

2. The intelligent detection method for the multi-dimensional domain feature combined SAR vessel according to claim 1, wherein in the first step, the neural network Ω is convolved through a spatial channel _S The spatial feature phi can be easily obtained from the SAR image _S (x, y), which can be expressed as

φ _S (x,y)＝Ω _S (I(x,y),θ _s )；

Wherein I (x, y) represents an input SAR vessel image, θ _s Representing omega _S Related parameter sets in (a); upsampling high-level features via feature pyramid FPN and enhancement via lateral join to obtain spatial multi-scale features φ _{S_MS} (x,y)。

3. The intelligent detection method of multi-dimensional domain feature combined SAR vessel as set forth in claim 1, wherein in the second step, the proposed area of the image is obtained by sharing the spatial feature to the RPN, and the specific process is as follows:

4. The intelligent detection method for multi-dimensional domain feature combined SAR vessel as set forth in claim 3, wherein in said third step, the obtained original image candidate region I _Sub (x, y) performing polar coordinate transformation, wherein the transformation relation is as follows:

where r represents the distance from the point (x, y) to the origin in a rectangular coordinate system, θ is the angle between the x-axis and the line connecting the point with the origin, and is positive in the counterclockwise direction.

5. The intelligent detection method for multi-dimensional domain feature combined SAR vessel as set forth in claim 4, wherein the obtained polar coordinate image is subjected to Fourier transform and the amplitude spectrum thereof is taken

Wherein F is _Sub (mu, v) represents candidate region frequency domain information, mu, v are corresponding two-dimensional frequency domain variables respectively, and the amplitude spectrum is taken to obtain rotation invariant information M _Sub (μ,ν)。

6. The intelligent detection method for the multi-dimensional domain feature combined SAR vessel according to claim 5, wherein the rotation invariant information M is obtained through convolution neural network pair _Sub (μV) performing feature extraction

φ _{F_RI_Sub} (x,y)＝Ω _F (M _Sub (u,v),θ _F )；

7. The intelligent detection method of the multi-dimensional domain feature combined SAR vessel according to claim 1, wherein in the third step, the rotation behavior of the SAR vessel target is processed, and the rotation invariant feature is extracted, and the specific process is as follows:

8. The intelligent detection method of the multi-dimensional domain feature combined SAR vessel according to claim 1, wherein in the third step, the rotation invariant feature of the SAR vessel target is obtained by feature extraction through a convolutional neural network.

9. The intelligent detection method of multi-dimensional domain feature combined SAR vessel as set forth in claim 6, wherein in the fourth step, a finer feature fusion network is designed, and the spatial-frequency domain features are fused by interactively updating parameters of different networks, wherein the specific process is as follows:

α _S ＝FC(Flatten(φ _{S_MS_Sub} (x,y)))；

α _F ＝Flatten(φ _{F_RI_Sub} (x,y))；

10. The intelligent detection method of multi-dimensional domain feature combined SAR vessel according to claim 9, wherein the cross-domain fusion process of the obtained one-dimensional spatial feature and the frequency domain feature vector is:

α _SF,1 ＝[FC ₁ (α _S ),FC ₁ (α _F )]

α _SF,2 ＝[FC ₂ (α _F ),FC ₂ (α _S )]

where [.] means that the two feature vectors are stitched together and the intersection between the network and the features makes the output feature more compact while maintaining the features of the various dimensions.