CN112183579B - Method, medium and system for detecting micro target - Google Patents


Info

Publication number
CN112183579B
CN112183579B (application CN202010905792.2A)
Authority
CN
China
Prior art keywords
target
tiny
network
resnet
micro
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010905792.2A
Other languages
Chinese (zh)
Other versions
CN112183579A (en)
Inventor
赵欣洋
叶涛
王玄之
崔鹏
杜巍
赵希洋
朱颖
刘亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Ningxia Electric Power Co Ltd
Original Assignee
State Grid Ningxia Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Ningxia Electric Power Co Ltd filed Critical State Grid Ningxia Electric Power Co Ltd
Priority to CN202010905792.2A priority Critical patent/CN112183579B/en
Publication of CN112183579A publication Critical patent/CN112183579A/en
Application granted granted Critical
Publication of CN112183579B publication Critical patent/CN112183579B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a method, a medium and a system for detecting a tiny target. The method comprises the following steps: inputting an image containing a tiny target into a deep learning target detection network and outputting the tiny target extracted from the image, wherein the area of the tiny target is smaller than a preset area; inputting the extracted tiny target into an amplifying deep learning network and outputting the tiny target amplified by a preset multiple; and inputting the amplified tiny target into a deep learning classification network and outputting the category of the tiny target. The invention can effectively improve the detection accuracy of tiny targets and reduce the false detection rate, thereby improving the performance and stability of the whole computer vision system, and has good practical application value and economic benefits.

Description

Method, medium and system for detecting micro target
Technical Field
The present invention relates to the field of target detection technologies, and in particular, to a method, medium, and system for detecting a micro target.
Background
In recent years, with the continuous development of deep learning and breakthroughs in computer processing power, deep learning has achieved results in computer vision fields such as classification, target detection and semantic segmentation that traditional algorithms cannot match, and has been applied in many industries.
In the field of target detection, the current mainstream algorithms are SSD, Faster R-CNN, YOLO and the like. These algorithms work well in practical applications, but they share a common problem: their detection accuracy is generally low for tiny targets. There are two main reasons. First, a tiny target contains few pixels and therefore carries little data information. Second, the general rule in deep learning is that the deeper the network, the higher the detection accuracy; when a tiny target passes through a deep network, the little information it carries almost disappears in the deeper feature maps under the effect of downsampling, so the detection accuracy for tiny targets drops.
Disclosure of Invention
The embodiment of the invention provides a method, a medium and a system for detecting a micro target, which are used for solving the problem of low precision of detecting the micro target in the prior art.
In a first aspect, a method for detecting a minute object is provided, including: inputting an image containing a micro target into a deep learning target detection network, and outputting the micro target extracted from the image, wherein the area of the micro target is smaller than a preset area; inputting the extracted micro target into an amplifying deep learning network, and outputting the micro target amplified by a preset multiple; and inputting the tiny targets amplified by a preset multiple into a deep learning classification network, and outputting the categories of the tiny targets.
In a second aspect, there is provided a computer readable storage medium having computer program instructions stored thereon; the computer program instructions, when executed by a processor, implement the minute object detection method as described in the embodiment of the first aspect.
In a third aspect, there is provided a minute object detection system comprising: the computer readable storage medium as in the second aspect embodiment.
Therefore, the embodiment of the invention can effectively improve the detection precision of the micro target and reduce the false detection rate, thereby improving the performance and stability of the whole computer vision system and having better practical application value and economic benefit.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the description of the embodiments of the present invention will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a minute object detection method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of an enlarged deep learning network according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of the partial structure of a residual ResNet network of an embodiment of the present invention;
FIG. 4 is a schematic diagram of a period shuffle operator in accordance with an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The embodiment of the invention discloses a micro target detection method. As shown in fig. 1, the minute object detection method includes the steps of:
step S1: after an image containing a micro target is input into a deep learning target detection network, the micro target extracted from the image is output.
The deep learning object detection network employs a well-known neural network, such as a YoloV3 object detection network, an SSD object detection network, or a FasterRcnn object detection network. It should be appreciated that images input to a deep learning object detection network are generally scaled to a fixed size. Therefore, before step S1, the method according to the embodiment of the present invention further includes: scaling the original image to an image of preset pixels. The preset pixels may be determined by the selected deep learning object detection network. For example, for a YoloV3 target detection network, the scaled image is a 608×608 (px) image. Specifically, the original image can be read with the imread method of OpenCV and then scaled to 608×608 with the resize method of OpenCV.
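The preprocessing step above can be sketched as follows. The 608×608 YoloV3 input size and the OpenCV calls come from the description; the helper name `scale_to_network_input` and the pure-Python scale-factor computation are illustrative, not from the patent:

```python
def scale_to_network_input(orig_w, orig_h, net_size=608):
    """Compute the per-axis scale factors used when resizing an
    original image to the square network input (608x608 for YoloV3).
    A point (x, y) in the original image maps to (x * sx, y * sy)
    in the network input."""
    return net_size / orig_w, net_size / orig_h

# With OpenCV, the resize itself (as described in the text) would be:
#   img = cv2.imread(path)
#   img = cv2.resize(img, (608, 608))
sx, sy = scale_to_network_input(1920, 1080)
print(sx, sy)  # factors mapping 1920x1080 coordinates into the 608x608 input
```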
The area of a tiny target in the embodiment of the invention is smaller than a preset area. For example, in the embodiment of the present invention, the preset area is 20×20 (px). It should be appreciated that the tiny-target criterion applies to the image input to the deep learning target detection network, not to the original image.
Specifically, the deep learning object detection network may output the width and height of each detected object, and the area can be calculated as the product of the width and the height, thereby determining whether the object is a tiny target.
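The area test can be written directly from the 20×20 px threshold given above (the function name `is_tiny_target` is illustrative):

```python
def is_tiny_target(w, h, preset_area=20 * 20):
    """A detection counts as a tiny target when its area
    (width x height, in pixels of the scaled network input)
    is below the preset area of 20x20 px from the description."""
    return w * h < preset_area

print(is_tiny_target(15, 12))  # True: 180 px^2 < 400 px^2
print(is_tiny_target(30, 25))  # False: 750 px^2 >= 400 px^2
```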
Step S2: and inputting the extracted micro target into an amplifying deep learning network, and outputting the micro target amplified by a preset multiple.
Specifically, as shown in fig. 2, the structure of the amplifying deep learning network is composed of a tiny target ResNet network structure block and a sub-pixel convolution layer which are connected in sequence.
The tiny target ResNet network structure block consists of a residual ResNet network cascaded a preset number of times, forming a deeper network structure that can fully extract deep semantic information of the tiny target. In the embodiment of the invention, the residual ResNet network is cascaded 9 times. As shown in fig. 3, the two weight layers in the residual ResNet network are replaced with a dense DenseNet network. The residual ResNet network and the dense DenseNet network are well-known network structures and are not described in detail here.
The output of the residual ResNet network is X_l = H_l(X_{l-1}, w_i, b_i) + X_{l-1}, where X_{l-1} represents the input of the current residual ResNet network, H_l represents the function of the dense DenseNet network, and w_i and b_i represent the parameters of the dense DenseNet network, which can be obtained from model training on concrete examples.
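The residual update X_l = H_l(X_{l-1}) + X_{l-1} can be illustrated with a toy stand-in for the branch H_l. In the patent, H_l is the learned dense DenseNet branch; the element-wise doubling used below is purely illustrative:

```python
def residual_step(x, H):
    """One residual unit: output = H(x) + x (skip connection).
    In the patent, H is the dense DenseNet branch that replaces
    the two weight layers of a standard ResNet unit."""
    return [hx + xi for hx, xi in zip(H(x), x)]

# Toy H: element-wise doubling stands in for the learned branch.
toy_H = lambda v: [2 * e for e in v]
print(residual_step([1.0, -2.0, 3.0], toy_H))  # [3.0, -6.0, 9.0]
```

The skip connection is what lets the little information a tiny target carries bypass the branch and survive into deeper layers.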
The tiny target feature map output by the tiny target ResNet network structure block is F_{l-1} = R(X_{lr}, W, B), where X_{lr} represents the tiny target input to the tiny target ResNet network structure block, R represents the nonlinear function of the block (generally the ReLU nonlinear function), and W and B represent the parameter weights and bias values of the block, which can be obtained from model training on concrete examples. The size of the tiny target feature map is w×h×(c·r²), where r represents the magnification of the tiny target feature map, and w×h×c represents the size of the tiny target input to the block: w is its width, h its height, and c the number of image channels, typically 3, that is, the RGB three channels.
By combining the residual ResNet network with the dense DenseNet network, a new network structure block is provided, the residual ResNet network ensures the depth of the network, and the dense DenseNet network ensures that more tiny target information is transferred to the deep feature map.
Because a tiny target has few pixels and thus carries little information, the embodiment of the invention amplifies it with sub-pixel convolution so that the amplified target carries more target feature information. Specifically, the sub-pixel convolution formula of the sub-pixel convolution layer is I_sr = PS(W_l × F_{l-1} + B_l), where I_sr represents the output of the sub-pixel convolution layer, F_{l-1} represents the tiny target feature map output by the tiny target ResNet network structure block, and PS represents the period shuffle operator. As shown in fig. 4, the low-pixel feature map is convolved with r² convolution kernels to obtain r² feature maps, and the sub-pixel points of the r² feature maps are then arranged in order, left to right and top to bottom, to obtain the final amplified image. W_l and B_l represent the sub-pixel convolution weight and bias values respectively, which can be obtained from model training on concrete examples. In the embodiment of the invention, the preset multiple is 8, and the output tiny target has size 8w_i × 8h_i × 3, where 3 represents the R, G and B channels.
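The period shuffle operator PS rearranges an h × w feature map with r² channels per location into an rh × rw image (shown here for c = 1). Frameworks expose this as, e.g., PyTorch's `nn.PixelShuffle`; the pure-Python list-of-lists version below and the name `periodic_shuffle` are only a sketch for illustration:

```python
def periodic_shuffle(fmap, r):
    """fmap: h x w grid (list of lists); each cell holds its r*r
    channel values in row-major order. Returns the (h*r) x (w*r)
    image in which the r*r channels of cell (y, x) fill the r x r
    block at (y*r, x*r), left to right, top to bottom."""
    h, w = len(fmap), len(fmap[0])
    out = [[0] * (w * r) for _ in range(h * r)]
    for y in range(h):
        for x in range(w):
            for k, v in enumerate(fmap[y][x]):
                out[y * r + k // r][x * r + k % r] = v
    return out

# One 1x1 cell with r = 2 becomes a 2x2 image.
print(periodic_shuffle([[[1, 2, 3, 4]]], 2))  # [[1, 2], [3, 4]]
```

No parameters are learned in PS itself; the r² convolution kernels before it do the learning, which is why sub-pixel convolution is cheaper than deconvolution at the same magnification.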
Step S3: and inputting the tiny targets amplified by a preset multiple into a deep learning classification network, and outputting the categories of the tiny targets.
The deep learning classification network adopts a well-known neural network, such as an InceptionV4 classification network, a VGG classification network, or a ResNet50 classification network.
In one embodiment of the present invention, an InceptionV4 classification network is used, and the output is (p_i, c_i), where p_i represents the confidence of the i-th tiny target and c_i represents the classification result of the i-th tiny target. Generally, the set of classes is determined by the specific application and may include, for example, pedestrians and vehicles.
In addition to the above classification result of the micro target, the method according to the embodiment of the present invention may further detect the position information of the micro target, and preferably, the micro target detection method according to the embodiment of the present invention further includes the following steps:
and inputting the image containing the micro target into a deep learning target detection network, and outputting the position information of the micro target.
The deep learning object detection network is the one described in the foregoing step S1 and is not described again here. The deep learning object detection network outputs the position information (x_i, y_i, w_i, h_i) of all tiny targets, where x_i represents the abscissa of the center of the i-th tiny target, y_i the ordinate of its center, w_i its width, and h_i its height. w_i and h_i can be used to calculate the area of an output object to determine whether it is a tiny target.
Therefore, the tiny target detection method of the embodiment of the invention can not only detect the category of a tiny target but also obtain its position information, and (p_i, c_i, x_i, y_i, w_i, h_i) can be used to represent the combined result containing the tiny target category and position information.
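Merging the detector's position output with the classifier's output into the combined tuple is then straightforward (the function and variable names here are illustrative, not from the patent text):

```python
def combine_results(det, cls):
    """Merge detector output (x, y, w, h) with classifier output
    (p, c) into the combined tuple (p, c, x, y, w, h) described
    above."""
    p, c = cls
    x, y, w, h = det
    return (p, c, x, y, w, h)

print(combine_results((100, 120, 14, 12), (0.93, "pedestrian")))
# (0.93, 'pedestrian', 100, 120, 14, 12)
```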
The embodiment of the invention also discloses a computer readable storage medium, wherein the computer readable storage medium is stored with computer program instructions; the computer program instructions, when executed by a processor, implement the minute object detection method as described in the above embodiments.
The embodiment of the invention also discloses a micro target detection system, which comprises: the computer-readable storage medium as in the above embodiments.
In summary, the embodiment of the invention can effectively improve the detection precision of the micro target and reduce the false detection rate, thereby improving the performance and stability of the whole computer vision system and having better practical application value and economic benefit.
The foregoing is merely illustrative of the present invention, and the present invention is not limited thereto, and any person skilled in the art will readily recognize that variations or substitutions are within the scope of the present invention. Therefore, the protection scope of the invention is subject to the protection scope of the claims.

Claims (9)

1. A method for detecting a minute object, comprising:
inputting an image containing a micro target into a deep learning target detection network, and outputting the micro target extracted from the image, wherein the area of the micro target is smaller than a preset area;
inputting the extracted micro target into an amplifying deep learning network, and outputting the micro target amplified by a preset multiple;
inputting the tiny targets amplified by a preset multiple into a deep learning classification network, and outputting the categories of the tiny targets;
the structure of the amplifying deep learning network consists of tiny target ResNet network structure blocks and sub-pixel convolution layers which are sequentially connected, wherein the tiny target ResNet network structure blocks consist of residual ResNet networks with cascading preset times, two weight layers in the residual ResNet networks are replaced by dense DenseNet networks, the residual ResNet networks ensure the depth of the networks, and the dense DenseNet networks ensure that more tiny target information is transferred to a deep feature map.
2. The minute object detection method according to claim 1, characterized in that: the output of the residual ResNet network is X_l = H_l(X_{l-1}, w_i, b_i) + X_{l-1}, wherein X_{l-1} represents the input of the residual ResNet network, H_l represents the function of the dense DenseNet network, and w_i and b_i represent parameters of the dense DenseNet network.
3. The minute object detection method according to claim 1, characterized in that: the tiny target feature map output by the tiny target ResNet network structure block is F_{l-1} = R(X_{lr}, W, B), wherein X_{lr} represents the tiny target input into the tiny target ResNet network structure block, R represents a nonlinear function of the tiny target ResNet network structure block, and W and B represent the parameter weights and bias values of the tiny target ResNet network structure block.
4. The minute object detection method according to claim 3, wherein: the size of the tiny target feature map is w×h×(c·r²), wherein r represents the magnification of the tiny target feature map, and w×h×c represents the size of the tiny target input to the tiny target ResNet network structure block.
5. The minute object detection method according to claim 1, characterized in that: the sub-pixel convolution formula of the sub-pixel convolution layer is I_sr = PS(W_l × F_{l-1} + B_l), wherein I_sr represents the output of the sub-pixel convolution layer, F_{l-1} represents the tiny target feature map output by the tiny target ResNet network structure block, PS represents the period shuffle operator, and W_l and B_l respectively represent the sub-pixel convolution weight and bias values.
6. The minute object detection method according to claim 1, characterized by further comprising:
and inputting the image containing the micro target into a deep learning target detection network, and outputting the position information of the micro target.
7. The minute object detection method according to claim 1 or 6, wherein before the step of inputting the image containing the minute object into the deep learning object detection network, the method further comprises:
scaling the original image to the image of the preset pixels.
8. A computer-readable storage medium, characterized by: the computer readable storage medium has stored thereon computer program instructions; the computer program instructions, when executed by a processor, implement the minute object detection method according to any of claims 1 to 7.
9. A minute object detection system, characterized by comprising: the computer-readable storage medium of claim 8.
CN202010905792.2A 2020-09-01 2020-09-01 Method, medium and system for detecting micro target Active CN112183579B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010905792.2A CN112183579B (en) 2020-09-01 2020-09-01 Method, medium and system for detecting micro target


Publications (2)

Publication Number Publication Date
CN112183579A CN112183579A (en) 2021-01-05
CN112183579B true CN112183579B (en) 2023-05-30

Family

ID=73924095

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010905792.2A Active CN112183579B (en) 2020-09-01 2020-09-01 Method, medium and system for detecting micro target

Country Status (1)

Country Link
CN (1) CN112183579B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108564097A (en) * 2017-12-05 2018-09-21 华南理工大学 A kind of multiscale target detection method based on depth convolutional neural networks
CN109344821A (en) * 2018-08-30 2019-02-15 西安电子科技大学 Small target detecting method based on Fusion Features and deep learning
CN109635666A (en) * 2018-11-16 2019-04-16 南京航空航天大学 A kind of image object rapid detection method based on deep learning
CN111179212A (en) * 2018-11-10 2020-05-19 杭州凝眸智能科技有限公司 Method for realizing micro target detection chip integrating distillation strategy and deconvolution


Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Image super-resolution reconstruction with a dual-path feedback network; 陶状 et al.; Computer Systems & Applications; 2020-04-05; pp. 181–186 *
Single-image super-resolution reconstruction based on a multi-scale dense convolutional network; 唐家福 et al.; Packaging Engineering; 2020-07-10 (No. 13); full text *

Also Published As

Publication number Publication date
CN112183579A (en) 2021-01-05

Similar Documents

Publication Publication Date Title
CN109447994A (en) In conjunction with the remote sensing image segmentation method of complete residual error and Fusion Features
CN114627052A (en) Infrared image air leakage and liquid leakage detection method and system based on deep learning
CN113706581B (en) Target tracking method based on residual channel attention and multi-level classification regression
CN107844743A (en) A kind of image multi-subtitle automatic generation method based on multiple dimensioned layering residual error network
CN111640089A (en) Defect detection method and device based on feature map center point
CN111046917B (en) Object-based enhanced target detection method based on deep neural network
CN106650615A (en) Image processing method and terminal
CN112487915B (en) Pedestrian detection method based on Embedded YOLO algorithm
CN109447897B (en) Real scene image synthesis method and system
CN112364931A (en) Low-sample target detection method based on meta-feature and weight adjustment and network model
CN112418165B (en) Small-size target detection method and device based on improved cascade neural network
CN116310693A (en) Camouflage target detection method based on edge feature fusion and high-order space interaction
CN111611925A (en) Building detection and identification method and device
CN116403127A (en) Unmanned aerial vehicle aerial image target detection method, device and storage medium
CN115311550B (en) Remote sensing image semantic change detection method and device, electronic equipment and storage medium
CN115880495A (en) Ship image target detection method and system under complex environment
CN106600613A (en) Embedded GPU-based improved LBP infrared target detection method
CN116977674A (en) Image matching method, related device, storage medium and program product
CN115170978A (en) Vehicle target detection method and device, electronic equipment and storage medium
CN115792913A (en) Radar echo extrapolation method and system based on time-space network
CN113554656B (en) Optical remote sensing image example segmentation method and device based on graph neural network
CN113592885B (en) SegNet-RS network-based large obstacle contour segmentation method
Zhao et al. ST-YOLOA: a Swin-transformer-based YOLO model with an attention mechanism for SAR ship detection under complex background
CN111340139B (en) Method and device for judging complexity of image content
CN112183579B (en) Method, medium and system for detecting micro target

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant