CN112581470A

CN112581470A - Small target object detection method

Info

Publication number: CN112581470A
Application number: CN202011639587.2A
Authority: CN
Inventors: 伏广伟; 张珍竹; 崔绮嫦; 王斌; 罗桂莲; 郑少锋; 何南坚; 李伟才; 黄慧宇; 王文; 余娟; 毕兴忠; 王军
Original assignee: Guangzhou Guantu Vision Technology Co ltd; Foshan Zhongfanglian Inspection Technology Service Co ltd
Current assignee: CHINA TEXTILE ENGINEERING SOCIETY; FOSHAN ZHONGFANGLIAN INSPECTION TECHNOLOGY SERVICE Co.,Ltd.; Guangzhou guantu Vision Technology Co.,Ltd.
Priority date: 2020-09-15
Filing date: 2020-12-31
Publication date: 2021-03-30
Also published as: CN112184641A

Abstract

The invention discloses a small target object detection method, which relates to the technical field of computers, and is characterized in that each time of convolution and downsampling is carried out to collect a characteristic image, each time of characteristic image is superposed with a characteristic output by downsampling next time, so that the small target object is predicted by using a large-size characteristic image with less lost information, a large target object is predicted by using a small-size characteristic image with more abstract semantic information obtained through multiple times of convolution, and the accuracy of a prediction result is improved.

Description

Small target object detection method

Technical Field

The invention relates to the technical field of computers, in particular to a small target object detection method.

Background

For the detection of small target objects such as colonies, the traditional scheme is to culture the colonies on a culture dish, observe the colonies with human eyes, mark the colonies on the culture dish with a black marker pen, and count the colonies. The whole process is large in workload, tedious and tedious, and much in repetitive labor, and because many bacterial colonies are very fine, human eyes can easily see the bacterial colonies in the counting process, the bacterial colonies need to be gathered to see mentally, fatigue is easily caused, the efficiency is low, and the labor cost is high.

In order to improve efficiency and reduce labor cost, the conventional detection of small target objects mainly performs convolution and pooling operations on original images through a convolution neural network model to obtain feature images with different sizes, predicts the feature image of the last layer, ignores the feature images of other layers and is low in accuracy.

Disclosure of Invention

In order to overcome the defects in the prior art, the embodiment of the invention provides a small target object detection method, which comprises the following steps:

s1, continuously and repeatedly carrying out down-sampling on the original image where the small target object is located by using the convolutional neural network model to obtain a plurality of characteristic images;

s2, the convolutional neural network model sorts the plurality of feature images according to the time sequence of the down sampling to obtain a feature image group;

s3 selecting the last n characteristic images in the characteristic image group by the convolutional neural network model, and sequentially marking the n characteristic images as C according to the size sequence₁、C₂、C₃…C_n；

S4 characteristic image C is processed by using convolution neural network model_nPerforming convolution operation by 1 to obtain a characteristic image M_n；

S5 performing M on the characteristic image by using the convolutional neural network model_nUp-sampling to obtain size and characteristic image C₄Consistent feature image M_n ^-；

S6 characteristic image C is processed by using convolution neural network model_n-1Performing convolution operation by 1 to obtain a characteristic image C_n-1 ^-；

S7 using convolution nerve network model to analyze the characteristic image M_n ^-And the characteristic image C_n-1 ^-Overlapping to obtain a characteristic image M_n-1；

S8 repeating the above steps S6-S7 to obtain the feature image M_n-2… characteristic image M₂Characteristic image M₁；

S9 using convolution neural network model to respectively process the characteristic images M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nPerforming convolution operation to make the characteristic image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nThe number of output channels is the same;

the S10 convolutional neural network model adopts the sizes of [8 × 8, 16 × 16, 32 × 32, 64 × 64, 128 × 128 … 2 [ ]ⁿ⁺²*2ⁿ⁺²]n groups of prediction frames respectively predict characteristic image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nSmall target object in (1).

Preferably, the feature image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nThe number of output channels is 256.

Preferably, n has a value of 5.

The small target object detection method provided by the embodiment of the invention has the following beneficial effects:

the method fully utilizes the characteristic images acquired in each convolution and downsampling process, superposes each characteristic with the characteristic output by downsampling next time, realizes the prediction of small target objects by using large-size characteristic images with less lost information, and predicts large target objects by using small-size characteristic images with more abstract semantic information obtained through multiple convolutions, so that the final prediction result is more accurate.

Drawings

Fig. 1 is a schematic flow chart of a small target object detection method according to an embodiment of the present invention;

fig. 2a to fig. 2d are schematic diagrams of effects obtained by using the method for detecting a small target object according to the embodiment of the present invention.

Detailed Description

The invention is described in detail below with reference to the figures and the embodiments.

As shown in fig. 1, the method for detecting a small target object provided by the embodiment of the present invention includes the following steps:

s101, continuously and repeatedly carrying out down-sampling on an original image where a small target object is located by using a convolutional neural network model to obtain a plurality of characteristic images;

s102, sequencing a plurality of feature images by the convolutional neural network model according to the time sequence of the down sampling to obtain a feature image group;

s103, selecting the last n characteristic images in the characteristic image group by the convolutional neural network model, and sequentially marking the n characteristic images as C according to the size sequence₁、C₂、C₃…C_n；

S104, utilizing the convolution neural network model to perform comparison on the characteristic image C_nPerforming convolution operation by 1 to obtain a characteristic image M_n；

S105, performing M on the characteristic image by using the convolutional neural network model_nUp-sampling to obtain size and characteristic image C₄Consistent feature image M_n ^-；

S106, using the convolution neural network model to perform characteristic image C_n-1Performing convolution operation by 1 to obtain a characteristic image C_n-1 ^-；

S107, utilizing the convolution neural network model to carry out feature image M_n ^-And the characteristic image C_n-1 ^-Overlapping to obtain a characteristic image M_n-1The fusion of information among different characteristic images is realized through pixel-by-pixel addition;

s108, repeating the steps S106-S107 to obtain the characteristic image M_n-2… characteristic image M₂Characteristic image M₁；

S109, respectively carrying out comparison on the characteristic images M by using the convolutional neural network model₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nPerforming convolution operation to make the characteristic image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nThe number of output channels is the same;

s110, adopting the sizes of the convolutional neural network model as [8 × 8, 16 × 16, 32 × 32, 64 × 64, 128 × 128 … 2ⁿ ⁺²*2ⁿ⁺²]n groups of prediction frames respectively predict characteristic image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nSmall target object in (1).

And (4) sliding the prediction frame on the characteristic image, classifying the result after each sliding, and realizing the detection on the small target object in the characteristic image.

Optionally, the characteristic image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nThe number of output channels of (2) is 256.

Wherein the number of output channels is related to the parameters and the computing power of the convolutional neural network model.

Optionally, n has a value of 5.

Generally, the larger the value of n, the more accurate the final prediction result, but since up-sampling is performed once each time, the more the calculation amount is increased later, and therefore, 5 times are values that are preferably selected after being weighted between the accuracy and the calculation amount.

According to the small target object detection method provided by the embodiment of the invention, each feature is superposed with the feature output by the next downsampling by using the feature image acquired in each convolution and downsampling process, so that the small target object is predicted by using the large-size feature image with less lost information, the large target object is predicted by using the small-size feature image with more abstract semantic information obtained through multiple convolutions, and the accuracy of the prediction result is improved.

In the foregoing embodiments, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.

It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.

As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

The above are merely examples of the present application and are not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.

Claims

1. A small target object detection method, comprising:

s1, continuously and repeatedly carrying out down-sampling on the original image where the small target object is located to obtain a plurality of characteristic images;

s2, sequencing the plurality of feature images according to the time sequence of the down-sampling to obtain a feature image group;

s3, selecting the last n characteristic images in the characteristic image group, and sequentially recording the n characteristic images as the characteristic images according to the size sequenceC₁、C₂、C₃…C_n；

S4 pairs of characteristic images C_nPerforming convolution operation by 1 to obtain a characteristic image M_n；

S5 performing M on the characteristic image_nUp-sampling to obtain size and characteristic image C₄Consistent feature image M_n ^-；

S6 pairs of characteristic images C_n-1Performing convolution operation by 1 to obtain a characteristic image C_n-1 ^-；

S7 feature image M_n ^-And the characteristic image C_n-1 ^-Overlapping to obtain a characteristic image M_n-1；

S8 repeating the steps S6-S7 to obtain the characteristic image M_n-2… characteristic image M₂Characteristic image M₁；

S9 corresponding to the feature image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nPerforming convolution operation to make the characteristic image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nThe number of output channels is the same;

s10 adopts the size of [8 × 8, 16 × 16, 32 × 32, 64 × 64, 128 × 128 … 2 [ ]ⁿ⁺²*2ⁿ⁺²]n groups of prediction frames respectively predict characteristic image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nSmall target object in (1).

2. The small target object detection method according to claim 1, characterized in that the feature image M₁Characteristic image M₂Characteristic image M₃… characteristic image M_n-1Characteristic image M_nThe number of output channels is 256.

3. The small target object detection method according to claim 1, characterized in that the value of n is 5.

4. A computer arrangement comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the steps of claims 1-2 are implemented when the computer program is executed by the processor.