CN113989632A

CN113989632A - Bridge detection method and device for remote sensing image, electronic equipment and storage medium

Info

Publication number: CN113989632A
Application number: CN202111070328.7A
Authority: CN
Inventors: 唐睿; 董刚刚
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2021-09-13
Filing date: 2021-09-13
Publication date: 2022-01-28

Abstract

The invention discloses a bridge detection method and device for remote sensing images, electronic equipment and a storage medium, wherein the method comprises the following steps: acquiring a remote sensing image to be detected; inputting a remote sensing image to be detected into a bridge detector network model obtained by pre-training to obtain bridge position information in the remote sensing image; the bridge detector network model comprises an FPN network and an improved Faster R-CNN network; the improved Faster R-CNN network comprises an improved backhaul network, an RPN network and a RoiHead network; the improved backhaul network is formed by introducing a modulation deformable convolution module into a conventional backhaul network on the basis of the conventional backhaul network; the bridge detector network model is obtained by pre-training according to a training image set, wherein each training image in the training image set is provided with position marking information. The invention improves the bridge detection performance.

Description

Bridge detection method and device for remote sensing image, electronic equipment and storage medium

Technical Field

The invention belongs to the technical field of signal and information processing, and particularly relates to a remote sensing image bridge detection method and device, electronic equipment and a storage medium.

Background

With the development of electronic and information technologies, the acquisition of large-scale airborne satellite-borne remote sensing data is easier, and the quality of remote sensing images is higher. The bridge is a key node in a transportation network, and the accurate position and the number of the bridge in the remote sensing image are obtained, so that the method has important significance for city monitoring, disaster assessment and military reconnaissance.

Aiming at the bridge detection problem, a large amount of research is carried out by scholars at home and abroad, and the existing achievements can be summarized into two types, namely a knowledge-driven detection method and a data-driven detection method. The knowledge-driven detection method relies on artificially set prior knowledge about a target, and generally the method firstly analyzes a scene, artificially defines the occurrence condition of a bridge and the characteristics of the bridge, and takes the situation as a judgment rule. And then, processing the remote sensing image by using technologies such as mathematical morphology and the like, extracting corresponding characteristics according to a judgment rule, and obtaining a final bridge target. The method has clear physical significance, high real-time performance and low requirement on data quantity. However, in practical application scenarios, the bridge has different form and size and different background environments, and the artificially defined prior knowledge is difficult to cover all practical situations. Data-driven detection methods typically first extract features extensively from a dataset. And then matching the extracted characteristics with the target class through an optimization algorithm, wherein the process is called a training process. The trained model has the capability of extracting relevant bridge features from various features and realizing bridge detection. The method does not need to artificially define the prior knowledge about the bridge, but whether the characteristic extraction is sufficient or not and whether the optimization algorithm is proper or not directly influences the expression of the model. A target detection method based on deep learning which is popular in recent years belongs to a data-driven detection method, and a typical bridge detector network model comprises a two-stage fast R-CNN network model and a single-stage YOLO algorithm family.

However, most of bridge detector network models based on deep learning rely on a conventional convolutional neural network to extract features, whereas the conventional convolutional neural network receptive field is a fixed square, and when the conventional convolutional neural network receptive field is applied to a bridge detection task, the problems of insufficient bridge feature extraction and irrelevant background information interference exist.

Disclosure of Invention

In order to solve the problems in the prior art, the invention provides a remote sensing image bridge detection method, a remote sensing image bridge detection device, electronic equipment and a storage medium. The technical problem to be solved by the invention is realized by the following technical scheme:

in a first aspect, an embodiment of the present invention provides a remote sensing image bridge detection method, including:

acquiring a remote sensing image to be detected;

inputting the remote sensing image to be detected into a bridge detector network model obtained by pre-training to obtain bridge position information of the remote sensing image;

wherein the bridge detector network model comprises an FPN network and an improved Faster R-CNN network; the improved Faster R-CNN network comprises an improved backhaul network, an RPN network and a RoiHead network; the improved Backbone network is formed by introducing a modulation deformable convolution module into a conventional Backbone network on the basis of the conventional Backbone network; the bridge detector network model is obtained by pre-training according to a training image set, wherein each training image in the training image set is provided with position marking information.

In one embodiment of the invention, in the bridge detector network model,

the improved backhaul network is used for extracting the characteristics of the input remote sensing image to be detected to obtain a plurality of characteristic information;

the FPN network is used for carrying out feature fusion on the feature information to obtain a multi-scale feature map;

the RPN is used for selecting a candidate region of the multi-scale feature map to obtain a candidate recommendation region;

and the RoiHead network is used for adjusting position information according to the multi-scale feature map and the candidate recommendation area and outputting the bridge position information of the remote sensing image.

In one embodiment of the invention, the improved backhaul network comprises a preprocessing convolution layer and 4 stage modules which are connected in sequence; each stage module comprises a plurality of block modules which are connected in sequence, the input end and the output end of each block module are connected, and each block module comprises a first convolution layer, a second convolution layer and a third convolution layer which are connected in sequence; the preprocessing convolution layer, the first convolution layer and the third convolution layer in all the stage modules are conventional convolution layers; in the first stage module, all of the second convolutional layers are conventional convolutional layers, and in the other three stage modules, all of the second convolutional layers are modulated deformable convolutional modules.

In one embodiment of the present invention, the convolution kernels of the preprocessed convolutional layers have a size of 7 × 7, and the convolution kernels of all of the first convolutional layers and the third convolutional layers have a size of 3 × 3.

In one embodiment of the invention, the size of the convolution kernel of the modulated deformable convolution module is 3 x 3.

In one embodiment of the present invention, the computation process of the modulation deformable convolution module is expressed as:

wherein, N is the total number of sampling points in the convolution process, x and f respectively represent an input characteristic diagram and an output characteristic diagram, k represents the central position of all the sampling points, and k + k_nN-th sample point, k, representing a conventional convolution_n∈(-1,1),(-1,0),...,(1,1)，Δk_nAnd Δ m_nRespectively representing the adaptive deviation and the adaptive modulation coefficient added by the modulation deformable convolution module at the nth sampling point, x (k + k)_n+Δk_n) Obtained by a bilinear interpolation method.

In a second aspect, an embodiment of the present invention provides a remote sensing image bridge detection apparatus, including:

the data acquisition module is used for acquiring a remote sensing image to be detected;

the data detection module is used for inputting the remote sensing image to be detected into a bridge detector network model obtained by pre-training to obtain bridge position information of the remote sensing image;

In a third aspect, an embodiment of the present invention provides an electronic device, including a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory complete mutual communication through the communication bus;

the memory is used for storing a computer program;

the processor is used for realizing the steps of the remote sensing image bridge detection method provided by the embodiment of the invention when the program stored in the memory is executed.

In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the remote sensing image bridge detection method provided in the embodiment of the present invention are implemented.

The invention has the beneficial effects that:

according to the remote sensing image bridge detection method provided by the invention, the improved Faster R-CNN with the FPN network is used for bridge detection, and the improved Faster R-CNN is introduced with the modulation deformable convolution module, so that the receptive field of the improved Faster R-CNN network is more diverse, the receptive field shape can be self-adaptively fitted to various bridge shapes, the bridge characteristic extraction is more sufficient, the bridge detection performance is effectively improved, and the bridge detection method is more adaptive to the actual bridge detection application scene.

The present invention will be described in further detail with reference to the accompanying drawings and examples.

Drawings

FIG. 1 is a schematic flow chart of a bridge detection method using remote sensing images according to an embodiment of the present invention;

FIG. 2 is a schematic structural diagram of a bridge detector network model provided by an embodiment of the invention;

FIG. 3 is a schematic diagram of a specific structure of a bridge detector network model according to an embodiment of the present invention;

fig. 4 is a schematic structural diagram of each block module in an improved backhaul network according to an embodiment of the present invention;

FIG. 5 is a diagram illustrating a specific convolution process of a modulated deformable convolution module in each block module according to an embodiment of the present invention;

FIGS. 6a to 6e are schematic diagrams of bridge detection results of an original model and a model of the invention in a multi-target scene provided by an embodiment of the invention;

FIGS. 7a to 7e are schematic diagrams of bridge detection results of an original model and a model of the present invention in a dense target scene according to an embodiment of the present invention;

FIG. 8 is a schematic structural diagram of a bridge detection device using remote sensing images according to an embodiment of the present invention;

fig. 9 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to specific examples, but the embodiments of the present invention are not limited thereto.

Example one

The main task of object detection is to find an object of a specified category in an input scene image and to obtain its precise location. The embodiment of the invention focuses on the bridge detection problem in the remote sensing image, introduces the modulation deformable convolution module, improves the existing data-driven detection method based on deep learning, and focuses on solving the problem that the main network receptive field of a bridge detector network model is fixed and various bridge characteristics cannot be fully extracted.

In a first aspect, an embodiment of the present invention provides a remote sensing image bridge detection method, including the following steps:

s101, obtaining a remote sensing image to be detected.

S102, inputting the remote sensing image to be detected into a bridge detector network model obtained through pre-training to obtain bridge position information of the remote sensing image.

Specifically, referring to fig. 2, when detecting a bridge in a remote sensing image according to an embodiment of the present invention, a Network model of a bridge detector is a Network structure including a Feature Pyramid Network (FPN for short) and an improved fast R-CNN Network; the improved Faster R-CNN Network comprises an improved backhaul Network, a regional selection Network (RPN for short) and a RoiHead Network; the improved Backbone network is formed by introducing a modulation deformable convolution module into a conventional Backbone network on the basis of the conventional Backbone network.

In the bridge detector network model, specifically: the improved backhaul network is used for extracting the characteristics of an input remote sensing image to be detected to obtain a plurality of characteristic information; the FPN network adopts a top-down path and is used for carrying out feature fusion on a plurality of feature information to obtain a multi-scale feature map; the RPN is used for selecting a candidate region for the multi-scale feature map to obtain a candidate recommended region; and the RoiHead network is used for adjusting the position information according to the multi-scale characteristic diagram and the candidate recommended region and outputting the bridge position information of the remote sensing image.

Referring to fig. 3, an embodiment of the present invention provides a structure of an optional improved backhaul network, where the improved backhaul network includes a preprocessing convolutional layer and 4 stage modules connected in sequence; each stage module comprises a plurality of block modules which are connected in sequence, the input end and the output end of each block module are connected, and the input end and the output end of each block module are subjected to addition operation through an adder; referring to fig. 4, each block module includes a first convolution layer, a second convolution layer, and a third convolution layer connected in sequence, where the first convolution layer, the second convolution layer, and the third convolution layer are respectively connected to an activation layer, for example, by using a Relu activation function; preprocessing the convolution layer, wherein the first convolution layer and the third convolution layer in all the stage modules are conventional convolution layers; in the first stage Module, all the second convolution layers are normal convolution layers, and in the other three stage modules, all the second convolution layers are Modulated Deformable convolution modules (md conv).

Preferably, the size of the convolution kernel of the preprocessed convolutional layer is 7 × 7, and the size of the convolution kernels of all the first convolutional layers and the third convolutional layers is 3 × 3.

Preferably, the size of the convolution kernel of the modulated deformable convolution module is 3 x 3.

The calculation process of the modulation deformable convolution module of the embodiment of the invention can be expressed as follows:

wherein, N is the total number of sampling points in the convolution process, x and f respectively represent an input characteristic diagram and an output characteristic diagram, k represents the central position of all the sampling points, and k + k_nN-th sample point, k, representing a conventional convolution_n∈(-1,1),(-1,0),...,(1,1)，Δk_nAnd Δ m_nRespectively representing the adaptive deviation and the adaptive modulation coefficient added by the modulation deformable convolution module at the nth sampling point. Wherein, x (k + k)_n+Δk_n) Can be obtained by using a bilinear interpolation method.

The convolution implementation process of the modulation deformable convolution module is illustrated in fig. 5, and fig. 5 shows a specific implementation process of the k × k modulation deformable convolution module when k is 3: inputting an input feature map with dimensions H multiplied by W multiplied by C into a modulation deformable convolution module, wherein H, W, C respectively represents the height, width and channel number of the input feature map, obtaining a three-dimensional tensor H multiplied by W multiplied by 3k through an additional conventional convolution branch, controlling the range of a modulation deformable convolution operation receptive field with (H, W) as the central position of a sampling point by the 3k variable values corresponding to any (H, W) position of the tensor, and 0< H < H < W. Wherein 2k variables are adaptive offsets (offsets) of k sampling points, and the remaining k variables are adaptive modulation coefficients (modulation scales) of k sampling points.

According to the embodiment of the invention, the modulation deformable convolution branch is introduced, the deviation and the modulation coefficient of each sampling point are acquired in a self-adaptive manner, the sensing field shape can be self-adaptively fitted to various bridge shapes, the bridge characteristics are more fully extracted, and the bridge detection performance is effectively improved.

Meanwhile, please refer to fig. 3 again, the specific network structures of the FPN network, the RPN network, and the RoiHead network are respectively schematically given, and here, the network structures are only schematically given, and the corresponding specific network structures can be respectively implemented by using the existing FPN network, RPN network, and RoiHead network, and the network structures of the FPN network, RPN network, and RoiHead network are not described in detail herein.

For the bridge detector network model, the embodiment of the invention performs training to determine according to the training image set, each training image in the training image set has position labeling information, and the labeling can be performed by using a rectangular frame, but is not limited to the rectangular frame. In the training process, a random Gradient Descent (SGD) optimizer is used for optimizing parameters of the bridge detector network model, and the bridge detector network model is trained to obtain a trained bridge detector network model. The loss function adopted in the training process can be a cross entropy loss function and a mean square error loss function.

And after training is completed, obtaining a trained bridge detector network model, inputting the remote sensing image to be detected into the trained bridge detector network model, so that the bridge target position information of the remote sensing image can be obtained, and bridge detection is realized.

In order to verify the effectiveness of the remote sensing image bridge detection method provided by the embodiment of the invention, the following experiment is used for explanation.

1. Conditions of the experiment

The method provided by the invention is verified by using the disclosed bridge remote sensing image set, the resolution of each bridge remote sensing image in the bridge remote sensing image set is 1-4 m, each bridge remote sensing image at least comprises one bridge target, and the bridge remote sensing image set comprises 2000 bridge remote sensing images and 3706 bridge instances. Carrying out division of a bridge remote sensing image training set and a bridge remote sensing image testing set on the bridge remote sensing image set: and randomly selecting 80% for training the bridge detector network model and 20% for testing the bridge detector network model. The method provided by the invention is verified by taking a Faster R-CNN network model with an FPN network as a basic framework. The experimental operating platform used NVIDIA GTX1660SUPER GPU and 16GB RAM for experimental validation based on python3.8 and pytorch 1.7.

2. Analysis of experimental content and results

The embodiment of the invention mainly solves the problem that the main network receptive field in the existing bridge detector network model based on deep learning is fixed and various bridge characteristics can not be effectively extracted. By comparing whether a modulation deformable convolution module is used in several types of backhaul networks or not, the training conditions of the bridge detector network models of all groups are ensured to be consistent, the detection performance of the bridge detector network models of all groups is analyzed quantitatively and qualitatively, and the effectiveness of the method provided by the invention is verified.

The first set of experiments: based on a Faster R-CNN network framework with an FPN network, specifically using ResNet as a backhaul network, respectively training a model using original ResNet as a conventional backhaul network and a model using ResNet introduced with mdconv as an improved backhaul network, testing all test images in a bridge remote sensing image test set (remote sensing images to be detected) by using the pre-trained models, and respectively using two indexes of Average Precision (AP for short) and Recall rate (Recall) to quantitatively analyze test results, wherein the test results are shown in Table 1.

TABLE 1 evaluation comparison of test results of the first set of experiments

Model (model)	AP(％)	Recall(％)
			Original model	93.7	73.7
Model of the invention	95.7	74.4

From the experimental results in table 1, it can be seen that both the AP and the Recall indexes of the method provided in the embodiment of the present invention are superior to the original model, and particularly, the improvement range of the AP index is 2%, which indicates that in a bridge detector network model using a ResNet network as a backhaul network, the detection accuracy of a bridge can be effectively improved by using md conv, and the extraction capability of the bridge target feature reflecting the improved backhaul network is effectively improved.

The second set of experiments: based on a Faster R-CNN network framework with an FPN network, specifically using ResNeXt as a Backbone network, respectively training a model using original ResNeXt as a conventional Backbone network and a model using ResNeXt with md conv introduced as an improved Backbone network, testing all test images in a test image set by using a trained bridge detector network model, and respectively using AP and Recall indexes to perform quantitative analysis on test results, wherein the test results are shown in Table 2.

TABLE 2 evaluation comparison of test results of the second set of experiments

Model (model)	AP(％)	Recall(％)
			Original model	94.4	74.5
Model of the invention	95.8	75.6

As can be seen from the experimental results in table 2, the method provided in the embodiment of the present invention can significantly improve the performance of the bridge detector network model in the bridge detector network model using ResNeXt as a backhaul. The AP index is improved by 1.4%, the Recall index is improved by 1.1%, which shows that the detection precision of the bridge detector network model is obviously improved, the missed detection condition of the bridge detector network model is less, and the extraction capability of the bridge target characteristics of the improved backhaul network is reflected laterally and effectively improved.

Referring to fig. 6a to 6e, fig. 6a to 6e respectively show a real bridge distribution and a corresponding bridge detection result in a multi-target scene, specifically, a marking condition of the real bridge distribution and the bridge position in the multi-target scene, a bridge detection result using an original model in a first set of experiments, a bridge detection result using the model of the present invention in the first set of experiments, a bridge detection result using the original model in a second set of experiments, and a bridge detection result using the model of the present invention in the second set of experiments are sequentially marked from left to right. The black rectangular boxes in fig. 6a are marked with a plurality of targets existing in the original scene, and the gray rectangular boxes in fig. 6b to 6e are marked with a plurality of targets detected by the detector network model, so that it can be seen that in the multi-target scene, the positions of the bounding boxes of the bridge targets corresponding to the bridge detection results of fig. 6c and 6e are more fitted with the real marking conditions by using the bridge detector network model provided by the invention in two groups of experiments. It should be noted that the bridge detection results obtained by using the original model in the second set of experiments are shown in fig. 6d, which shows that the missing detection condition occurs, but the model of the present invention can better detect all bridge targets in the multi-target scene.

Referring to fig. 7a to 7e, fig. 7a to 7e respectively show the real bridge distribution and the corresponding bridge detection results in the dense target scene, specifically, the real bridge distribution and position labeling conditions in the dense target scene, the bridge detection results using the original model in the first set of experiments, the bridge detection results using the model of the present invention in the first set of experiments, the bridge detection results using the original model in the second set of experiments, and the bridge detection results using the model of the present invention in the second set of experiments are sequentially shown from left to right. Like fig. 6a to 6e, the black rectangular boxes in fig. 7a mark a plurality of objects existing in the original dense object scene,

the gray rectangular boxes in fig. 7 b-7 e mark a plurality of targets detected by the detector network model, and it can be seen that in a dense multi-target scene, the bridge detector network model provided by the invention is used in two groups of experiments, and the model of the invention can better detect all bridge targets and the position regression is more accurate. It should also be noted that the bridge detection results obtained by using the original model in the first set of experiments are missing as shown in fig. 7b, but the model of the present invention can better detect all bridge targets in a dense target scene.

As can be seen from FIGS. 6a to 6e and FIGS. 7a to 7e, the bridge detector model provided by the present invention can better detect all bridge targets regardless of the multi-target bridge detection scenario or the dense multi-target bridge detection scenario, and is more suitable for the actual bridge detection application scenario.

In summary, the remote sensing image bridge detection method provided by the embodiment of the invention utilizes the improved Faster R-CNN with the FPN network to perform bridge detection, and the improved Faster R-CNN is introduced with the modulated deformable convolution module, so that the sensing fields of the improved Faster R-CNN network are more diverse, the shapes of the sensing fields can be self-adaptively fitted to various bridge shapes, the bridge characteristics are more fully extracted, the bridge detection performance is effectively improved, and the bridge detection method is more suitable for the actual bridge detection application scene.

In addition, the method provided by the embodiment of the invention does not need the prior knowledge about the target set by people, and can self-adaptively complete the bridge detection task; the bridge detection improvement method (introducing modulation deformable convolution) provided by the embodiment of the invention can be used for various detection models based on deep learning, including a two-stage detector network model and a single-stage detector network model, and has strong universality and easy embedding.

In a second aspect, an embodiment of the present invention provides a remote sensing image bridge detection apparatus, please refer to fig. 7, including:

and the data acquisition module 801 is used for acquiring the remote sensing image to be detected.

The data detection module 802 is configured to input the remote sensing image to be detected into a bridge detector network model obtained through pre-training, so as to obtain bridge position information of the remote sensing image;

the bridge detector network model comprises an FPN network and an improved Faster R-CNN network; the improved Faster R-CNN network comprises an improved backhaul network, an RPN network and a RoiHead network; the improved backhaul network is formed by introducing a modulation deformable convolution module into a conventional backhaul network on the basis of the conventional backhaul network; the bridge detector network model is obtained by pre-training according to a training image set, wherein each training image in the training image set is provided with position marking information.

Further, in the data detection module 802 according to the embodiment of the present invention, particularly in the bridge detector network model,

the improved backhaul network is used for extracting the characteristics of an input remote sensing image to be detected to obtain a plurality of characteristic information;

the RPN is used for selecting a candidate region for the multi-scale feature map to obtain a candidate recommended region;

and the RoiHead network is used for adjusting the position information according to the multi-scale characteristic diagram and the candidate recommended region and outputting the bridge position information of the remote sensing image.

Further, in the data detection module 802 according to the embodiment of the present invention, the specifically improved backhaul network includes a preprocessing convolutional layer and 4 stage modules connected in sequence; each stage module comprises a plurality of block modules which are connected in sequence, the input end and the output end of each block module are connected, and each block module comprises a first convolution layer, a second convolution layer and a third convolution layer which are connected in sequence; preprocessing the convolution layer, wherein the first convolution layer and the third convolution layer in all the stage modules are conventional convolution layers; in the first stage module, all of the second convolutional layers are conventional convolutional layers, and in the other three stage modules, all of the second convolutional layers are modulated deformable convolutional modules.

Further, in the data detection module 802 according to the embodiment of the present invention, the size of the convolution kernel of the preprocessed convolutional layer is 7 × 7, and the sizes of the convolution kernels of all the first convolutional layer and the third convolutional layer are 3 × 3.

Further, in the data detection module 802 according to the embodiment of the present invention, the size of the convolution kernel of the modulation deformable convolution module is 3 × 3.

Further, in the data detection module 802 according to the embodiment of the present invention, the calculation process of the specific modulation deformable convolution module is represented as:

wherein, N is the total number of sampling points in the convolution process, x and f respectively represent an input characteristic diagram and an output characteristic diagram, k represents the central position of all the sampling points, and k + k_nRepresents a conventionSample point n of convolution, k_n∈(-1,1),(-1,0),...,(1,1)，Δk_nAnd Δ m_nRespectively representing the adaptive deviation and the adaptive modulation coefficient added by the modulation deformable convolution module at the nth sampling point, x (k + k)_n+Δk_n) Obtained by a bilinear interpolation method.

In a third aspect, an embodiment of the present invention further provides an electronic device, please refer to fig. 8, which includes a processor 901, a communication interface 902, a memory 903 and a communication bus 904, where the processor 901, the communication interface 902, and the memory 903 complete communication with each other through the communication bus 904,

a memory 903 for storing computer programs;

the processor 901 is configured to implement the steps of the remote sensing image bridge detection method according to the first aspect when executing the program stored in the memory 903.

The electronic device may be: desktop computers, laptop computers, intelligent mobile terminals, servers, and the like. Without limitation, any electronic device that can implement the present invention is within the scope of the present invention.

The communication bus mentioned in the electronic device may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown, but this does not mean that there is only one bus or one type of bus.

The communication interface is used for communication between the electronic equipment and other equipment.

The Memory may include a Random Access Memory (RAM) or a Non-Volatile Memory (NVM), such as at least one disk Memory. Optionally, the memory may also be at least one memory device located remotely from the processor.

The Processor may be a general-purpose Processor, and includes a Central Processing Unit (CPU), a Network Processor (NP), and the like; the Integrated Circuit may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, a discrete Gate or transistor logic device, or a discrete hardware component.

In a fourth aspect, corresponding to the remote sensing image bridge detection method provided in the first aspect, an embodiment of the present invention further provides a computer readable storage medium, where a computer program is stored in the computer readable storage medium, and when the computer program is executed by a processor, the steps of the remote sensing image bridge detection method provided in the embodiment of the present invention are implemented.

It should be noted that the apparatus, the electronic device and the storage medium according to the embodiments of the present invention are respectively an apparatus, an electronic device and a storage medium to which the remote sensing image bridge detection method is applied, and all embodiments of the remote sensing image bridge detection method are applicable to the apparatus, the electronic device and the storage medium, and can achieve the same or similar beneficial effects.

In the description of the present invention, it is to be understood that the terms "first", "second" and the like are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implying any number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of that feature. In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.

In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples described in this specification can be combined and combined by those skilled in the art.

While the present application has been described in connection with various embodiments, other variations to the disclosed embodiments can be understood and effected by those skilled in the art in practicing the claimed application, from a review of the drawings, the disclosure, and the appended claims. In the claims, the word "comprising" does not exclude other elements or steps, and the word "a" or "an" does not exclude a plurality. A single processor or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.

The foregoing is a more detailed description of the invention in connection with specific preferred embodiments and it is not intended that the invention be limited to these specific details. For those skilled in the art to which the invention pertains, several simple deductions or substitutions can be made without departing from the spirit of the invention, and all shall be considered as belonging to the protection scope of the invention.

Claims

1. A bridge detection method based on remote sensing images is characterized by comprising the following steps:

acquiring a remote sensing image to be detected;

inputting the remote sensing image to be detected into a bridge detector network model obtained by pre-training to obtain bridge position information in the remote sensing image;

2. The remote sensing image bridge inspection method of claim 1, wherein, in the bridge inspection network model,

the RPN is used for selecting a candidate region based on the multi-scale feature map to obtain a candidate recommendation region;

3. The remote sensing image bridge detection method according to claim 1, wherein the improved backhaul network comprises a preprocessing convolutional layer and 4 stage modules which are connected in sequence; each stage module comprises a plurality of block modules which are connected in sequence, the input end and the output end of each block module are connected, and each block module comprises a first convolution layer, a second convolution layer and a third convolution layer which are connected in sequence; the preprocessing convolution layer, the first convolution layer and the third convolution layer in all the stage modules are conventional convolution layers; in the first stage module, all of the second convolutional layers are conventional convolutional layers, and in the other three stage modules, all of the second convolutional layers are modulated deformable convolutional modules.

4. The remote sensing image bridge detection method of claim 3, wherein the size of the convolution kernel of the preprocessed convolutional layer is 7 x 7; the convolution kernels of all the first convolution layers and the third convolution layers are 3 × 3 in size.

5. The remote sensing image bridge detection method of claim 3, wherein the size of the convolution kernel of the modulated deformable convolution module is 3 x 3.

6. The remote sensing image bridge detection method of claim 1, wherein the computation process of the modulation deformable convolution module is expressed as:

wherein, N is the total number of sampling points in the convolution process, x and f respectively represent an input characteristic diagram and an output characteristic diagram, k represents the central position of all the sampling points and represents the nth sampling point of the conventional convolution, and k represents the nth sampling point of the conventional convolution_n∈(-1,1),(-1,0),...,(1,1)，Δk_nAnd Δ m_nRespectively representing the adaptive deviation and the adaptive modulation coefficient added by the modulation deformable convolution module at the nth sampling point, x (k + k)_n+Δk_n) Obtained by a bilinear interpolation method.

7. A remote sensing image bridge detection device is characterized by comprising:

8. An electronic device, comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory complete communication with each other through the communication bus;

the memory is used for storing a computer program;

the processor is used for realizing the steps of the remote sensing image bridge detection method according to any one of claims 1 to 6 when executing the program stored in the memory.

9. A computer-readable storage medium, characterized in that,

the computer readable storage medium stores a computer program which when executed by a processor implements the steps of the remote sensing image bridge detection method of any one of claims 1 to 6.