CN115761449A - High-resolution image small target detection method - Google Patents

High-resolution image small target detection method Download PDF

Info

Publication number
CN115761449A
CN115761449A CN202211546527.5A CN202211546527A CN115761449A CN 115761449 A CN115761449 A CN 115761449A CN 202211546527 A CN202211546527 A CN 202211546527A CN 115761449 A CN115761449 A CN 115761449A
Authority
CN
China
Prior art keywords
target
image
target detection
training
resolution image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211546527.5A
Other languages
Chinese (zh)
Inventor
邹尔博
王南
雷波
谭海
舒朗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Huazhong Tianjing Tongshi Technology Co ltd
Original Assignee
Wuhan Huazhong Tianjing Tongshi Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Huazhong Tianjing Tongshi Technology Co ltd filed Critical Wuhan Huazhong Tianjing Tongshi Technology Co ltd
Priority to CN202211546527.5A priority Critical patent/CN115761449A/en
Publication of CN115761449A publication Critical patent/CN115761449A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses a high-resolution image small target detection method, which adopts a blocking scheme to process aiming at high resolution, divides an original image into a plurality of subgraphs by off-line multi-scale cutting in a training stage, and trains a network by utilizing the subgraphs; in the inference stage, prediction is carried out in the form of an online multi-scale sliding window. The method is suitable for the target detection problem of high-resolution images such as aerial images, satellite remote sensing images, array images and the like.

Description

High-resolution image small target detection method
Technical Field
The invention relates to the technical field of image processing, in particular to a method for detecting a small target of a high-resolution image.
Background
The high-resolution image cannot be directly used for deep neural network input training or detection, because image information is lost due to down-sampling, the detection performance of the small target is influenced, on one hand, the down-sampling causes the small target to have the pixel size of single digit basically in the size of a characteristic diagram, and the designed target detection classifier has poor classification effect on the small target; on the other hand, the scope of the small target features mapped back to the original image due to the downsampling process may be larger than the size of the small target on the original image, which results in poor detection effect.
Disclosure of Invention
The invention aims to solve the technical problem of detecting a small target of a high-resolution image and provides a method for detecting the small target of a deep neural network based on image segmentation.
The technical scheme adopted by the invention for solving the technical problem is as follows: a high-resolution image small target detection method comprises the following steps:
s1, training process
S11, constructing an algorithm training set: cutting the original high-resolution image into pictures according to 1/4, 1/8 and 1/16, blocking according to the proportion of an overlapping area of 0.2, and generating a multi-scale blocked image offline as a training image to perform algorithm training with a target detection label file; intercepting an area image by adopting a sliding window from left to right and from top to bottom, intercepting the intersection ratio IOU of a target/a complete target to be more than 0.5, generating a new marked coordinate of the target, and storing a multi-scale block diagram and a target detection label file to obtain a new training sample;
s12, initializing a deep learning target detection model randomly, importing a training sample into the deep learning target detection model to carry out training data random sampling, predicting to obtain a detection frame pixel coordinate and detection frame classification information, calculating a regression and classification loss function and a gradient, updating algorithm model parameters, judging whether set iteration times are reached, if so, finishing training, otherwise, repeatedly carrying out training data random sampling;
s13, obtaining a trained deep learning target detection model;
s2, reasoning process
S21, a high-resolution image online sliding window reasoning algorithm: generating sliding windows of the original high-resolution image according to 1/4, 1/8, 1/16 and the proportion of the overlapping area of 0.2, and sliding the windows from left to right and from top to bottom;
s22, sending the image in the sliding window area into a deep learning target detection model to obtain an area image detection result;
s23, resolving the target pixel coordinates in the sliding window back to the pixel coordinates corresponding to the original high-resolution image, predicting to obtain the pixel coordinates of a detection frame and classification information of the detection frame, outputting the pixel coordinates and classification information of the detection target, and loading the trained model weight file to a deep learning target detection model;
and S24, inhibiting redundant detection frames by adopting a global NMS algorithm and an In-box NMS algorithm to obtain a final result.
The beneficial effects of the invention are: the detection method adopts an image blocking mode to process in an algorithm training stage, a training set is cut into small blocks of images in an off-line mode and is input into a deep neural network for training, online reasoning is carried out in a sliding window mode in a reasoning stage, the proper image blocking size and the proportion of an overlapped area between the blocks can be determined according to the size of a target in an original image, and compared with the current method of directly zooming a high-resolution image and inputting a deep learning target detection model, the detection performance of the small target in the high-resolution image is better. The method is suitable for the target detection problem of high-resolution images such as aerial images, satellite remote sensing images, array images and the like.
Drawings
FIG. 1 is an overall framework of the algorithm of the present invention;
FIG. 2 is a flow chart of a high-resolution image block training and online reasoning algorithm.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
As shown in FIG. 1, the present invention provides an algorithm overall framework of a high resolution image small target detection method based on image blocking. The whole algorithm framework consists of a training phase and an inference phase.
And processing the original high-resolution image by adopting a blocking scheme, in a training stage, intercepting the image of the area by adopting an offline multi-scale blocking scheme according to the configuration of the original resolution ratio of 1/4, 1/8 and 1/16 and the proportion of the overlapping area of 0.2 and adopting a sliding window from left to right and from top to bottom, generating a new target labeling coordinate if the intersection ratio (IOU) of the intercepted target/the complete target is greater than 0.5, and storing the multi-scale blocking diagram and the label file to obtain a new training sample. And inputting the subgraph and the label file into a deep learning target detection algorithm model for training to obtain a trained model weight file.
And training a deep learning target detection model by using the block images to obtain a trained target detection model.
In the step, an original image is split into a plurality of sub-images through off-line multi-scale cutting, the sub-images and corresponding label files are generated, and a network is trained by using the sub-images.
And in the inference stage, the original high-resolution image adopts online multi-scale sliding window inference, a sliding window is generated according to the settings of 1/4, 1/8 and 1/16 of the original resolution and the proportion of 0.2 of the overlapping area in sequence, and the sliding window slides on the high-resolution image from left to right and from top to bottom in sequence. And predicting in the form of an online multi-scale sliding window.
And loading the trained model weight file by the deep learning target detection algorithm model, and sending the image in the sliding window area into the target detection model to obtain the area image detection result.
And the sliding window slides in the original high-resolution image from left to right and from top to bottom sequentially, the image in the sliding window is input into the target detection model, and the pixel coordinate information of the detected target is output.
And resolving the coordinates of the target pixels in the sliding window back to the coordinates of the corresponding pixels of the original high-resolution image. That is, the detection result is converted back to the original image according to the pixel coordinates of the sliding window on the original image. And adopting a global NMS and In-box NMS algorithm to inhibit redundant detection frames to obtain a final detection target coordinate information result.
As shown in FIG. 2, the invention provides an algorithm training and reasoning process of a high-resolution image small target detection method based on image segmentation, and pseudo codes are described as follows.
Algorithm 1: high-resolution image blocking training algorithm
Figure DEST_PATH_IMAGE001
And 2, algorithm: high-resolution image online sliding window reasoning algorithm
Figure 138455DEST_PATH_IMAGE002
The above-described embodiments are merely illustrative of the principles and effects of the present invention, and some embodiments may be applied, and it will be apparent to those skilled in the art that various changes and modifications may be made without departing from the inventive concept of the present invention, and these embodiments are within the scope of the present invention.

Claims (2)

1. A method for detecting a small target of a high-resolution image is characterized by comprising the following steps: comprises the following steps
S1, training process
S11, cutting the original high-resolution image into pictures according to 1/4, 1/8 and 1/16, blocking according to the overlapped area, and generating a multi-scale blocked image in an off-line manner to serve as a training image and a target detection label file for algorithm training; intercepting an area image by adopting a sliding window from left to right and from top to bottom, intercepting the intersection ratio IOU of a target/a complete target to be more than 0.5, generating a new marked coordinate of the target, and storing a multi-scale block diagram and a target detection label file to obtain a new training sample;
s12, initializing a deep learning target detection model randomly, importing a training sample into the deep learning target detection model to carry out training data random sampling, predicting to obtain a detection frame pixel coordinate and detection frame classification information, calculating a regression and classification loss function and a gradient, updating algorithm model parameters, judging whether set iteration times are reached, if so, finishing training, otherwise, repeatedly carrying out training data random sampling;
s13, obtaining a trained deep learning target detection model;
s2, reasoning process
S21, generating sliding windows of the original high-resolution image according to 1/4, 1/8, 1/16 and the overlapping area, and sliding the windows from left to right and from top to bottom;
s22, sending the image in the sliding window area into a deep learning target detection model to obtain an area image detection result;
s23, resolving the target pixel coordinates in the sliding window back to the pixel coordinates corresponding to the original high-resolution image, outputting the pixel coordinates and category information of the detection target, and loading the weight file of the trained model to the deep learning target detection model;
and S24, inhibiting redundant detection frames by adopting a global NMS algorithm and an In-box NMS algorithm to obtain a final result.
2. The method as claimed in claim 1, wherein the ratio of the overlapping area is 0.2.
CN202211546527.5A 2022-12-05 2022-12-05 High-resolution image small target detection method Pending CN115761449A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211546527.5A CN115761449A (en) 2022-12-05 2022-12-05 High-resolution image small target detection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211546527.5A CN115761449A (en) 2022-12-05 2022-12-05 High-resolution image small target detection method

Publications (1)

Publication Number Publication Date
CN115761449A true CN115761449A (en) 2023-03-07

Family

ID=85343097

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211546527.5A Pending CN115761449A (en) 2022-12-05 2022-12-05 High-resolution image small target detection method

Country Status (1)

Country Link
CN (1) CN115761449A (en)

Similar Documents

Publication Publication Date Title
US11475660B2 (en) Method and system for facilitating recognition of vehicle parts based on a neural network
EP3673417B1 (en) System and method for distributive training and weight distribution in a neural network
WO2019089578A1 (en) Font identification from imagery
CN110570433B (en) Image semantic segmentation model construction method and device based on generation countermeasure network
CN111615702B (en) Method, device and equipment for extracting structured data from image
US11314989B2 (en) Training a generative model and a discriminative model
CN115885289A (en) Modeling dependency with global self-attention neural networks
CN113159120A (en) Contraband detection method based on multi-scale cross-image weak supervision learning
US20200349393A1 (en) Optimizing Supervised Generative Adversarial Networks via Latent Space Regularizations
CN115937655B (en) Multi-order feature interaction target detection model, construction method, device and application thereof
CN110826457B (en) Vehicle detection method and device under complex scene
US20240119697A1 (en) Neural Semantic Fields for Generalizable Semantic Segmentation of 3D Scenes
JP7310718B2 (en) Road obstacle detection device, road obstacle detection method, and road obstacle detection program
CN111027576A (en) Cooperative significance detection method based on cooperative significance generation type countermeasure network
CN114049280A (en) Image erasing and repairing method and device, equipment, medium and product thereof
CA3193958A1 (en) Processing images using self-attention based neural networks
CN116645592B (en) Crack detection method based on image processing and storage medium
CN112183649A (en) Algorithm for predicting pyramid feature map
CN115019314A (en) Commodity price identification method, device, equipment and storage medium
CN115995042A (en) Video SAR moving target detection method and device
CN111476226B (en) Text positioning method and device and model training method
CN113344784A (en) Optimizing supervised generation countermeasure networks through latent spatial regularization
CN115761449A (en) High-resolution image small target detection method
CN114462490A (en) Retrieval method, retrieval device, electronic device and storage medium of image object
Podduturi Data augmentation for supervised learning with generative adversarial networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination