CN109766887A - A kind of multi-target detection method based on cascade hourglass neural network - Google Patents

A kind of multi-target detection method based on cascade hourglass neural network Download PDF

Info

Publication number
CN109766887A
CN109766887A CN201910038893.1A CN201910038893A CN109766887A CN 109766887 A CN109766887 A CN 109766887A CN 201910038893 A CN201910038893 A CN 201910038893A CN 109766887 A CN109766887 A CN 109766887A
Authority
CN
China
Prior art keywords
confidence level
network
hourglass
target
cascade
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910038893.1A
Other languages
Chinese (zh)
Other versions
CN109766887B (en
Inventor
胡钦涛
段倩文
毛耀
刘琼
吴水琴
周翕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Optics and Electronics of CAS
Original Assignee
Institute of Optics and Electronics of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Optics and Electronics of CAS filed Critical Institute of Optics and Electronics of CAS
Priority to CN201910038893.1A priority Critical patent/CN109766887B/en
Publication of CN109766887A publication Critical patent/CN109766887A/en
Application granted granted Critical
Publication of CN109766887B publication Critical patent/CN109766887B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Image Analysis (AREA)
  • Radar Systems Or Details Thereof (AREA)

Abstract

The present invention provides a kind of multi-target detection methods based on cascade hourglass neural network, it is intended to the technical issues of it is excessively slow to solve existing detection method speed, and is difficult to for Small object.The present invention is the following steps are included: step 1, acquisition training sample;Step 2 builds deep learning frame and constructs the backbone network cascade hourglass network of target detection;Step 3, project training sample label be confidence level thermal map;The loss function that step 4, design cascade hourglass network optimizes it;Step 5, training cascade hourglass network obtain detection model;Step 6, multi-target detection.The beneficial technical effect of the present invention lies in: it can quickly and accurately identify the target of plurality of classes, improve the recognition capability to Small object.

Description

A kind of multi-target detection method based on cascade hourglass neural network
Technical field
The present invention relates to multi-target detection technical fields, and in particular to a kind of multiple target based on cascade hourglass neural network Detection method.
Background technique
Multi-target detection is an important directions in computer vision field, and main task is that sense is oriented from image The target of interest and the specific category for judging each target.It is driven automatically in medical target detection, intelligent video monitoring, vehicle It sails, pedestrian detection, vehicle flowrate etc. are widely applied.Traditional method is using histograms of oriented gradients, part two The feature that the methods of value pattern feature extracts image uses support vector machines, random forest and neural network to classify again.But Being it, there are speed is excessively slow and not high two disadvantages of precision.
Convolutional neural networks obtain extensive success in the picture in recent years.Girshick proposes RCNN and Fast- RCNN greatly improves the detection accuracy of traditional detection method.Ren et al. has also been proposed Faster R-CNN and further improves Detect speed.But speed is still undesirable;Joseph et al. proposes YOLO and YOLO9000 greatly and improves speed can be real When handle picture to be detected, but for Small object, this method be easy to cause missing inspection and erroneous detection.
Summary of the invention
The technical problem to be solved in the present invention is to provide it is a kind of based on cascade hourglass neural network multi-target detection method, Effect is poor and slow when solving the problems, such as that existing detection model is applied to identification Small object.
In order to solve the above technical problems, the present invention adopts the following technical scheme: a kind of based on cascade hourglass neural network Multi-target detection method, comprising the following steps:
Step 1, acquisition training sample: by image capture device target image to be detected, target image is marked And to pre-processing, image is made to meet call format, constructs training sample set;Image preprocessing uses image enhancement, including (- 180 °, 180 °) of random angles rotations and random scaled (0.5 times -2 times).
Step 2 builds deep learning frame and constructs the backbone network cascade hourglass network of target detection: used grade Connection hourglass neural network is formed by 4 hourglass cascades, and each hourglass network includes 4 up-sampling layers and 4 down-sampling layers Constitute, have including 12 convolutional layers, 12 ReLU layer, 12 BatchNorm2d layers with 4 MaxPool2d layers.
Step 3, project training sample label be confidence level thermal map: the training sample label that step 1 is acquired and marked It generates confidence level thermal map and is used for network training, the target of each type is in same layer confidence level thermal map.Confidence level generated Thermal map generates M × 64 × 64 sizes confidence level thermal map according to the species number M of required detection, and the target of each type only exists Affiliated layer maps a Gauss confidence level.
The loss function that step 4, design cascade hourglass network optimizes it: losing letter using least mean-square error Number, carries out study optimization to network with Adam optimizer.
Step 5, training cascade hourglass network obtain detection model: the collected training sample of step 1 is raw by step 3 Detection model is obtained by cascading hourglass network training at after confidence level thermal map.
Step 6, multi-target detection: target detection, the confidence level heat of the different layers of output are carried out using cascade hourglass network Figure represents different types, and confidence level position mapping in each layer represents the position of target to be detected.To confidence level heat Figure carries out non-maxima suppression, and at its frame, will map back picture to be detected on confidence level thermal map with minimum rectangle is to detect As a result.
The invention has the following advantages over the prior art:
(1) present invention does backbone network by cascade hourglass neural network, is detected by way of confidence level thermal map, is A kind of relatively new detection method.
(2) present invention uses more cascade structures, can identify smaller target, improves detection accuracy.
(3) the invention belongs to the methods of one-stage, and the speed of service is faster.
Detailed description of the invention
Fig. 1 is acquired example images, wherein Fig. 1 (a) is multiple unmanned plane images that scene once acquires, Fig. 1 (b) For the lower multiple unmanned plane images acquired of scene two, Fig. 1 (c) is the lower multiple unmanned plane images acquired of scene three, and Fig. 1 (d) is The lower multiple unmanned plane images acquired of scene four;
Fig. 2 is cascade hourglass neural network schematic diagram;
Fig. 3 is that data set label switchs to Gauss confidence level thermal map schematic diagram, wherein Fig. 3 (a) is to be generated by data set label First Gauss confidence level thermal map, Fig. 3 (b) is the second Gauss confidence level thermal map generated by data set label, Fig. 3 (c) For the third Zhang Gaosi confidence level thermal map generated by data set label, Fig. 3 (d) is the 4th Zhang Gaosi generated by data set label Confidence level thermal map, Fig. 3 (e) are the 5th Gauss confidence level thermal map generated by data set label, and Fig. 3 (f) is by data set mark The 6th Gauss confidence level thermal map generated is signed, Fig. 3 (g) is the correspondence image data of data set;
Fig. 4 is overall flow figure of the present invention;
Fig. 5 is general frame figure of the present invention.
Specific embodiment
Specific embodiments of the present invention will be described in detail with reference to the accompanying drawing.But following embodiment is used only in detail Illustrate the present invention, does not limit the scope of the invention in any way.
As shown in figure 4, a kind of multi-target detection method based on cascade hourglass neural network of the present invention, including walk as follows It is rapid:
Step 1, as shown in Figure 1, acquisition training sample: by image capture device target image to be detected, by target figure As being marked and to pre-processing, image is made to meet call format, constructs training sample set;Image preprocessing uses image Enhancing, including (- 180 °, 180 °) of random angles rotations and random scaled (0.5 times -2 times).
Step 2 builds deep learning frame and constructs the backbone network cascade hourglass network of target detection: specific such as Fig. 2 Its shown network, used cascade hourglass neural network are formed by 4 hourglass cascades, and each hourglass network includes 4 Up-sample layer and 4 down-sampling layers constituted, have including 12 convolutional layers, 12 ReLU layer, 12 BatchNorm2d layers with 4 MaxPool2d layers, each network hourglass module is connected using residual error;
Step 3, project training sample label be confidence level thermal map: as shown in figure 3, the instruction that step 1 is acquired and marked Practice sample label and generate confidence level thermal map for network training, the target of each type is in same layer confidence level thermal map.It gives birth to At confidence level thermal map according to the species number M of required detection, generate M × 64 × 64 sizes confidence level thermal map, each type Target only map a Gauss confidence level in affiliated layer.
Gaussion=(x-centerx) ^2/boxx+ (y-centery) ^2/boxy
Wherein, x, y are the coordinate of confidence map respectively, and centerx, centery are the center x of sample label respectively, and y is sat Mark, boxx, boxy are the length and width of label respectively.
The loss function that step 4, design cascade hourglass network optimizes it: losing letter using least mean-square error Number, carries out study optimization to network with Adam optimizer.Specifically, network is formed using 4 hourglass cascades, and one shares 4 A output is y_pred1-y_pred4 respectively, loss function be four outputs and;
Loss1=mse (y_pred1-label);
Loss2=mse (y_pred2-label);
Loss3=mse (y_pred3-label);
Loss4=mse (y_pred4-label);
Loss=loss1+loss2+loss3+loss4;
Step 5, training cascade hourglass network obtain detection model: as shown in figure 4, according to this method frame diagram, by step 1 Collected training sample obtains detection model by cascading hourglass network training after step 3 generates confidence level thermal map.
Step 6, multi-target detection: as shown in figure 3, target detection is carried out using cascade hourglass network, it is defeated with the last layer The confidence level thermal map of different layers out represents different types, and confidence level position mapping in each layer represents mesh to be detected Target position.Non-maxima suppression is carried out to confidence level thermal map, is mapped back on confidence level thermal map by its frame with minimum rectangle Picture to be detected is testing result.
In order to verify effectiveness of the invention, selected from Faster-RCNN and YOLO as comparative example, using shown in Fig. 1 Data set come comparison result, the Average Accuracy (mAP) and real-time of comparison algorithm, comparison result such as table one.
Method MAP (%) Real-time (frame/second)
Faster-RCNN 82.4 5
YOLO 81.52 53
The present invention 89.36 32
Table one: the result of embodiment and comparative example on data set compares.
As shown in Table 1, algorithm improves Average Accuracy and real-time compared to Faster-RCNN, although comparing YOLO reality When property decreases, but precision is very high.The result reflects the validity of this algorithm.
Program that is involved or relying on is the conventional program or simple program of the art, this field skill in embodiment Art personnel can make conventional selection or are adaptively adjusted according to concrete application scene.

Claims (5)

1. a kind of multi-target detection method based on cascade hourglass neural network, which comprises the following steps:
Step 1, acquisition training sample: by image capture device target image to be detected, target image is marked and right It pre-processes, image is made to meet call format, construct training sample set;
Step 2 builds deep learning frame and constructs the backbone network cascade hourglass network of target detection: cascade hourglass nerve net Network is formed by multiple hourglass cascades, and each hourglass network contains multiple up-sampling layers, down-sampling layer, trans-regional connection Layer and convolutional layer;
Step 3, project training sample label be confidence level thermal map: training sample label that step 1 is acquired and marked generates Confidence level thermal map is used for network training, and the target of each type is in same layer confidence level thermal map;
The loss function that step 4, design cascade hourglass network optimizes it: using least mean-square error loss function, using Adam optimizer carries out study optimization to network;
Step 5, training cascade hourglass network obtain detection model: the collected training sample of step 1 is set by step 3 generation Detection model is obtained by cascading hourglass network training after reliability thermal map;
Step 6, multi-target detection: target detection, the confidence level thermal map generation of the different layers of output are carried out using cascade hourglass network The different type of table, in each layer confidence level position mapping represents the position of target to be detected, to confidence level thermal map into Row non-maxima suppression obtains obtaining mapping in final confidence level thermal map obtaining detection target.
2. a kind of multi-target detection method based on cascade hourglass neural network according to claim 1, which is characterized in that The image preprocessing of step 1 uses image enhancement, including (- 180 °, 180 °) of random angles rotations and random scaled (0.5 times -2 times).
3. a kind of multi-target detection method based on cascade hourglass neural network according to claim 1, which is characterized in that Cascade hourglass neural network used by step 2 is formed by 4 hourglass cascades, and each hourglass network includes 4 up-samplings Layer and 4 down-sampling layers are constituted, and are had including 12 convolutional layers, 12 ReLU layer, 12 BatchNorm2d layers with 4 MaxPool2d layers.
4. a kind of multi-target detection method based on cascade hourglass neural network according to claim 1, which is characterized in that Step 3 confidence level thermal map generated generates M × 64 × 64 sizes confidence level thermal map according to the species number M of required detection, The target of each type only maps a Gauss confidence level in affiliated layer.
5. a kind of multi-target detection method based on cascade hourglass neural network according to claim 1, which is characterized in that Step 6 predicted to confidence level thermal map need using non-maximizations inhibition handle after, with minimum rectangle in confidence level thermal map On by its frame, mapping back picture to be detected is testing result.
CN201910038893.1A 2019-01-16 2019-01-16 Multi-target detection method based on cascaded hourglass neural network Active CN109766887B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910038893.1A CN109766887B (en) 2019-01-16 2019-01-16 Multi-target detection method based on cascaded hourglass neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910038893.1A CN109766887B (en) 2019-01-16 2019-01-16 Multi-target detection method based on cascaded hourglass neural network

Publications (2)

Publication Number Publication Date
CN109766887A true CN109766887A (en) 2019-05-17
CN109766887B CN109766887B (en) 2022-11-11

Family

ID=66452239

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910038893.1A Active CN109766887B (en) 2019-01-16 2019-01-16 Multi-target detection method based on cascaded hourglass neural network

Country Status (1)

Country Link
CN (1) CN109766887B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110503063A (en) * 2019-08-28 2019-11-26 东北大学秦皇岛分校 Fall detection method based on hourglass convolution autocoding neural network
CN110826403A (en) * 2019-09-27 2020-02-21 深圳云天励飞技术有限公司 Tracking target determination method and related equipment
CN110889858A (en) * 2019-12-03 2020-03-17 中国太平洋保险(集团)股份有限公司 Automobile part segmentation method and device based on point regression
CN111062981A (en) * 2019-12-13 2020-04-24 腾讯科技(深圳)有限公司 Image processing method, device and storage medium
CN111105590A (en) * 2019-12-27 2020-05-05 深圳前海微众银行股份有限公司 Alarm method and device
CN111523480A (en) * 2020-04-24 2020-08-11 北京嘀嘀无限科技发展有限公司 Method and device for detecting face obstruction, electronic equipment and storage medium
CN111913435A (en) * 2020-07-30 2020-11-10 浙江科技学院 Single/multi-target key point positioning method based on stacked hourglass network
CN112132839A (en) * 2020-08-27 2020-12-25 杭州英歌智达科技有限公司 Multi-scale rapid face segmentation method based on deep convolution cascade network
CN112598055A (en) * 2020-12-21 2021-04-02 电子科技大学 Helmet wearing detection method, computer-readable storage medium and electronic device
CN112966546A (en) * 2021-01-04 2021-06-15 航天时代飞鸿技术有限公司 Embedded attitude estimation method based on unmanned aerial vehicle scout image
CN113011365A (en) * 2021-03-31 2021-06-22 中国科学院光电技术研究所 Target detection method combined with lightweight network
WO2021217937A1 (en) * 2020-04-27 2021-11-04 平安国际智慧城市科技股份有限公司 Posture recognition model training method and device, and posture recognition method and device

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090161912A1 (en) * 2007-12-21 2009-06-25 Raviv Yatom method for object detection
CN102665062A (en) * 2012-03-16 2012-09-12 华为技术有限公司 Method and device for stabilizing target object image in video
CN106295678A (en) * 2016-07-27 2017-01-04 北京旷视科技有限公司 Neural metwork training and construction method and device and object detection method and device
US20170124409A1 (en) * 2015-11-04 2017-05-04 Nec Laboratories America, Inc. Cascaded neural network with scale dependent pooling for object detection
CN108416394A (en) * 2018-03-22 2018-08-17 河南工业大学 Multi-target detection model building method based on convolutional neural networks
CN108647742A (en) * 2018-05-19 2018-10-12 南京理工大学 Fast target detection method based on lightweight neural network
CN108875595A (en) * 2018-05-29 2018-11-23 重庆大学 A kind of Driving Scene object detection method merged based on deep learning and multilayer feature
CN108921225A (en) * 2018-07-10 2018-11-30 深圳市商汤科技有限公司 A kind of image processing method and device, computer equipment and storage medium
CN109003267A (en) * 2017-08-09 2018-12-14 深圳科亚医疗科技有限公司 From the computer implemented method and system of the automatic detected target object of 3D rendering
CN109190537A (en) * 2018-08-23 2019-01-11 浙江工商大学 A kind of more personage's Attitude estimation methods based on mask perceived depth intensified learning

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090161912A1 (en) * 2007-12-21 2009-06-25 Raviv Yatom method for object detection
CN102665062A (en) * 2012-03-16 2012-09-12 华为技术有限公司 Method and device for stabilizing target object image in video
US20170124409A1 (en) * 2015-11-04 2017-05-04 Nec Laboratories America, Inc. Cascaded neural network with scale dependent pooling for object detection
CN106295678A (en) * 2016-07-27 2017-01-04 北京旷视科技有限公司 Neural metwork training and construction method and device and object detection method and device
CN109003267A (en) * 2017-08-09 2018-12-14 深圳科亚医疗科技有限公司 From the computer implemented method and system of the automatic detected target object of 3D rendering
CN108416394A (en) * 2018-03-22 2018-08-17 河南工业大学 Multi-target detection model building method based on convolutional neural networks
CN108647742A (en) * 2018-05-19 2018-10-12 南京理工大学 Fast target detection method based on lightweight neural network
CN108875595A (en) * 2018-05-29 2018-11-23 重庆大学 A kind of Driving Scene object detection method merged based on deep learning and multilayer feature
CN108921225A (en) * 2018-07-10 2018-11-30 深圳市商汤科技有限公司 A kind of image processing method and device, computer equipment and storage medium
CN109190537A (en) * 2018-08-23 2019-01-11 浙江工商大学 A kind of more personage's Attitude estimation methods based on mask perceived depth intensified learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ANIL ATVAR等: "Classification of human poses and orientations with deep learning", 《2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU)》 *
伍鹏瑛等: "多层卷积特征的真实场景下行人检测研究", 《智能系统学报》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110503063A (en) * 2019-08-28 2019-11-26 东北大学秦皇岛分校 Fall detection method based on hourglass convolution autocoding neural network
CN110503063B (en) * 2019-08-28 2021-12-17 东北大学秦皇岛分校 Falling detection method based on hourglass convolution automatic coding neural network
WO2021057309A1 (en) * 2019-09-27 2021-04-01 深圳云天励飞技术股份有限公司 Tracked target determination method and related device
CN110826403A (en) * 2019-09-27 2020-02-21 深圳云天励飞技术有限公司 Tracking target determination method and related equipment
CN110889858A (en) * 2019-12-03 2020-03-17 中国太平洋保险(集团)股份有限公司 Automobile part segmentation method and device based on point regression
CN111062981A (en) * 2019-12-13 2020-04-24 腾讯科技(深圳)有限公司 Image processing method, device and storage medium
CN111062981B (en) * 2019-12-13 2023-05-05 腾讯科技(深圳)有限公司 Image processing method, device and storage medium
CN111105590A (en) * 2019-12-27 2020-05-05 深圳前海微众银行股份有限公司 Alarm method and device
CN111523480A (en) * 2020-04-24 2020-08-11 北京嘀嘀无限科技发展有限公司 Method and device for detecting face obstruction, electronic equipment and storage medium
WO2021217937A1 (en) * 2020-04-27 2021-11-04 平安国际智慧城市科技股份有限公司 Posture recognition model training method and device, and posture recognition method and device
CN111913435A (en) * 2020-07-30 2020-11-10 浙江科技学院 Single/multi-target key point positioning method based on stacked hourglass network
CN112132839A (en) * 2020-08-27 2020-12-25 杭州英歌智达科技有限公司 Multi-scale rapid face segmentation method based on deep convolution cascade network
CN112132839B (en) * 2020-08-27 2024-04-30 杭州英歌智达科技有限公司 Multi-scale rapid face segmentation method based on deep convolution cascade network
CN112598055A (en) * 2020-12-21 2021-04-02 电子科技大学 Helmet wearing detection method, computer-readable storage medium and electronic device
CN112598055B (en) * 2020-12-21 2022-06-17 电子科技大学 Helmet wearing detection method, computer-readable storage medium and electronic device
CN112966546A (en) * 2021-01-04 2021-06-15 航天时代飞鸿技术有限公司 Embedded attitude estimation method based on unmanned aerial vehicle scout image
CN113011365A (en) * 2021-03-31 2021-06-22 中国科学院光电技术研究所 Target detection method combined with lightweight network

Also Published As

Publication number Publication date
CN109766887B (en) 2022-11-11

Similar Documents

Publication Publication Date Title
CN109766887A (en) A kind of multi-target detection method based on cascade hourglass neural network
CN112380952B (en) Power equipment infrared image real-time detection and identification method based on artificial intelligence
Song et al. Kiwifruit detection in field images using Faster R-CNN with VGG16
CN109285179A (en) A kind of motion target tracking method based on multi-feature fusion
CN109902806A (en) Method is determined based on the noise image object boundary frame of convolutional neural networks
CN108053419A (en) Inhibited and the jamproof multiscale target tracking of prospect based on background
CN111611874B (en) Face mask wearing detection method based on ResNet and Canny
CN107993215A (en) A kind of weather radar image processing method and system
CN112084869B (en) Compact quadrilateral representation-based building target detection method
CN112200143A (en) Road disease detection method based on candidate area network and machine vision
CN109543676B (en) Character wheel type water meter digital character recognition method based on image processing
CN103839265A (en) SAR image registration method based on SIFT and normalized mutual information
CN106056619A (en) Unmanned aerial vehicle vision wire patrol method based on gradient constraint Radon transform
CN104408482A (en) Detecting method for high-resolution SAR (Synthetic Aperture Radar) image object
CN104050681A (en) Road vanishing point detection method based on video images
CN116091951A (en) Method and system for extracting boundary line between farmland and tractor-ploughing path
CN109712071A (en) Unmanned plane image mosaic and localization method based on track constraint
CN109447036A (en) A kind of segmentation of image digitization and recognition methods and system
CN112861870B (en) Pointer instrument image correction method, system and storage medium
CN107563299A (en) A kind of pedestrian detection method using ReCNN integrating context informations
CN110084743A (en) Image mosaic and localization method based on more air strips starting track constraint
CN112949510A (en) Human detection method based on fast R-CNN thermal infrared image
Nair et al. Flood water depth estimation—A survey
CN113191204B (en) Multi-scale blocking pedestrian detection method and system
CN114581307A (en) Multi-image stitching method, system, device and medium for target tracking identification

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant