CN109255320B - Improved non-maximum suppression method - Google Patents

Improved non-maximum suppression method Download PDF

Info

Publication number
CN109255320B
CN109255320B CN201811018713.5A CN201811018713A CN109255320B CN 109255320 B CN109255320 B CN 109255320B CN 201811018713 A CN201811018713 A CN 201811018713A CN 109255320 B CN109255320 B CN 109255320B
Authority
CN
China
Prior art keywords
rectangular
rectangular frames
frames
processed
rectangular frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811018713.5A
Other languages
Chinese (zh)
Other versions
CN109255320A (en
Inventor
李宏亮
廖加竞
孙旭
刘玮
何慕威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201811018713.5A priority Critical patent/CN109255320B/en
Publication of CN109255320A publication Critical patent/CN109255320A/en
Application granted granted Critical
Publication of CN109255320B publication Critical patent/CN109255320B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/94Hardware or software architectures specially adapted for image or video understanding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses an improved non-maximum suppression method. The invention carries out inhibition treatment by a mode of multiple iterative screening: sorting the rectangular frames to be processed according to the confidence coefficient from high to low, reserving the rectangular frame with the highest current confidence coefficient, sequentially traversing whether the proportion of the sum of the overlapping areas of the remaining rectangular frames and all the reserved rectangular frames to the whole image area is greater than a threshold value, and if so, deleting the rectangular frames; and then, the rectangular frames which are not deleted and not reserved are used as a new round of rectangular frames to be processed to continue the suppression processing until only one rectangular frame to be processed exists. Through the method, the technical problems that the traditional non-maximum value inhibition method is small in quantity of rectangular frames output by irregular targets, high in overlapping rate and low in fitting degree of the rectangular frames to the targets are solved.

Description

Improved non-maximum suppression method
Technical Field
The invention belongs to the technical field of target detection, and particularly relates to an improved non-maximum suppression method in deep learning target detection.
Background
In deep learning target detection, non-maximum suppression processing is generally performed on an output result (rectangular frame) to remove redundant repetitive frames. However, the conventional non-maximum suppression method has the following 3 problems when used for detecting irregular targets:
(1) in each comparison, the currently selected box in the conventional method is compared with only one box with the highest confidence level. The method has the advantages of reducing the number of rectangular frames and obtaining local maximum values for the rule target. However, in the detection of irregular targets, the number of rectangular frames obtained by such a method is too small to sufficiently fit the shape of the target object;
(2) if the threshold of the traditional method is just reduced, although the number of output rectangular frames is increased, the side effect of high overlapping rate of the rectangular frames is caused;
(3) for irregular targets, a plurality of rectangular frames with large size difference are often needed to fit the shapes of the irregular targets, and when the irregular targets are compared with a threshold value in a traditional method, the proportion of the overlapping area of the two frames to the total area of the two frames is calculated. Such a method is greatly affected by the area of the rectangular frame. The smaller rectangular frame is small in area and can be easily deleted in the method.
Disclosure of Invention
The invention aims to: aiming at the technical problems of the traditional non-maximum suppression method that the number of rectangular frames output by an irregular target is small, the overlapping rate is high and the fitting degree of the rectangular frames to the target is low, an improved non-maximum suppression method is provided.
The improved non-maxima suppression method of the present invention comprises the steps of:
step S1: sorting all the rectangular frames to be processed, which are to be subjected to non-maximum suppression, from high to low according to confidence coefficients to obtain a first set T1;
step S2: placing the first element in the first set T1 into a second set T2, wherein the initial value of the set T2 is an empty set;
step S3: judging whether the number of the elements in the first set T1 is 1, if not, deleting the first element in the first set T1, and then executing the step S4; if yes, go to step S8;
step S4: sequentially traversing all the rectangular boxes in the first set T1;
step S5: judging whether the traversal is finished, if so, jumping to step S2; otherwise, executing step S6;
step S6: judging whether the ratio of the sum of the overlapping areas of the traversed current rectangular frame and all the rectangular frames in the second set T2 to the area of the whole graph is larger than a preset threshold (the preferable value range is 0.01-0.03), if so, executing a step S7; otherwise, jumping to step S4;
wherein, the full map area refers to the full map area of the image to be detected where the rectangular frame to be processed is located.
Deeply learning the area of an input image of a target detection network;
step S7: deleting the traversed current rectangular box from the set T1, and jumping to step S4;
step S8: the union of the sets T1 and T2 is output.
In summary, due to the adoption of the technical scheme, compared with the prior art, the invention has the beneficial effects that:
(1) by adjusting the threshold in step S6, the number of output rectangular frames after the suppression processing can be increased, that is, if the threshold is increased, the number of output rectangular frames can be increased;
(2) in step S6, the currently traversed rectangular frame and all the retained rectangular frames are compared, whereas in the conventional non-maximum suppression method, the current rectangular frame is only compared with one rectangular frame with the highest confidence, and the improvement method makes the overlapping rate between the output rectangular frames lower;
(3) in step S6, the ratio of the total overlapping area of the current rectangular frame and all the reserved rectangular frames to the total area of the whole image is calculated, while the ratio of the overlapping area of the two rectangular frames to the total area of the two rectangular frames is calculated by the conventional non-maximum suppression method.
Drawings
FIG. 1 is a process flow diagram of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail with reference to the following embodiments and accompanying drawings.
Referring to fig. 1, the specific processing steps of the present invention are as follows:
step S1: acquiring all rectangular frames output by a deep learning target detection network, namely all rectangular frames to be subjected to non-maximum suppression;
step S2: sorting all the rectangular frames from high to low according to the confidence coefficient to obtain a first set T1;
step S3: the first element (the rectangle box with the highest current confidence) in the first set T1 is placed into the second set T2 (the initial value is an empty set);
step S4: judging whether the rectangular box with the highest current confidence coefficient is the last element of the first set T1 (namely judging whether the number of the elements of the first set T1 is 1), if not, deleting the first element (the rectangular box with the highest current confidence coefficient) in the first set T1, and then executing the step S5; if yes, go to step S9;
step S5: sequentially traversing all the rectangular boxes in the first set T1;
step S6: judging whether the traversal is finished, if so, jumping to step S3; otherwise, executing step S7;
step S7: judging whether the ratio of the sum of the overlapping areas of the traversed current rectangular frame and all the rectangular frames in the second set T2 to the area of the full image (the image to be detected input into the deep learning target detection network) is greater than 0.02, if so, executing step S8; otherwise, jumping to step S5;
step S8: deleting the traversed current rectangular box from the set T1 (i.e., deleting the rectangular box whose proportion of the sum of the overlapping areas of all the rectangular boxes in the second set T2 to the total map area is greater than 0.02), and jumping to step S5;
step S9: all undeleted rectangular boxes are output, i.e., the union of sets T1 and T2 is output.
While the invention has been described with reference to specific embodiments, any feature disclosed in this specification may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise; all of the disclosed features, or all of the method or process steps, may be combined in any combination, except mutually exclusive features and/or steps.

Claims (3)

1. An improved non-maxima suppression method for deep learning target detection, comprising the steps of:
step S1: acquiring all output results output by the deep learning target detection network, taking each rectangular frame corresponding to the output results as a rectangular frame to be processed for non-maximum suppression, and sequencing all rectangular frames to be processed from high to low according to confidence coefficients to obtain a first set T1;
step S2: placing the first element in the first set T1 into a second set T2, wherein the initial value of the set T2 is an empty set;
step S3: judging whether the number of the elements in the first set T1 is 1, if not, deleting the first element in the first set T1, and then executing the step S4; if yes, go to step S8;
step S4: sequentially traversing all the rectangular boxes in the first set T1;
step S5: judging whether the traversal is finished, if so, jumping to step S2; otherwise, executing step S6;
step S6: judging whether the ratio of the sum of the overlapping areas of the traversed current rectangular frame and all the rectangular frames in the second set T2 to the total image area is greater than a preset threshold, if so, executing a step S7; otherwise, jumping to step S4;
wherein, the whole image area refers to the area of the input image of the deep learning target detection network;
step S7: deleting the traversed current rectangular box from the set T1, and jumping to step S4;
step S8: the union of the sets T1 and T2 is output.
2. The method according to claim 1, wherein in step S6, the preferred value range of the preset threshold is: 0.01 to 0.03.
3. The method of claim 2, wherein in step S6, the threshold is preferably set to 0.02.
CN201811018713.5A 2018-09-03 2018-09-03 Improved non-maximum suppression method Active CN109255320B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811018713.5A CN109255320B (en) 2018-09-03 2018-09-03 Improved non-maximum suppression method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811018713.5A CN109255320B (en) 2018-09-03 2018-09-03 Improved non-maximum suppression method

Publications (2)

Publication Number Publication Date
CN109255320A CN109255320A (en) 2019-01-22
CN109255320B true CN109255320B (en) 2020-09-25

Family

ID=65050475

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811018713.5A Active CN109255320B (en) 2018-09-03 2018-09-03 Improved non-maximum suppression method

Country Status (1)

Country Link
CN (1) CN109255320B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109948480A (en) * 2019-03-05 2019-06-28 中国电子科技集团公司第二十八研究所 A kind of non-maxima suppression method for arbitrary quadrilateral
CN110930420B (en) * 2019-11-11 2022-09-30 中科智云科技有限公司 Dense target background noise suppression method and device based on neural network
CN112069874B (en) * 2020-07-17 2022-07-05 中山大学 Method, system, equipment and storage medium for identifying cells in embryo light microscope image

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107563412A (en) * 2017-08-09 2018-01-09 浙江大学 A kind of infrared image power equipment real-time detection method based on deep learning
CN107742093A (en) * 2017-09-01 2018-02-27 国网山东省电力公司电力科学研究院 A kind of infrared image power equipment component real-time detection method, server and system
CN107909027A (en) * 2017-11-14 2018-04-13 电子科技大学 It is a kind of that there is the quick human body target detection method for blocking processing
CN108427912A (en) * 2018-02-05 2018-08-21 西安电子科技大学 Remote sensing image object detection method based on the study of dense target signature
CN108460403A (en) * 2018-01-23 2018-08-28 上海交通大学 The object detection method and system of multi-scale feature fusion in a kind of image

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10169661B2 (en) * 2014-03-28 2019-01-01 International Business Machines Corporation Filtering methods for visual object detection

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107563412A (en) * 2017-08-09 2018-01-09 浙江大学 A kind of infrared image power equipment real-time detection method based on deep learning
CN107742093A (en) * 2017-09-01 2018-02-27 国网山东省电力公司电力科学研究院 A kind of infrared image power equipment component real-time detection method, server and system
CN107909027A (en) * 2017-11-14 2018-04-13 电子科技大学 It is a kind of that there is the quick human body target detection method for blocking processing
CN108460403A (en) * 2018-01-23 2018-08-28 上海交通大学 The object detection method and system of multi-scale feature fusion in a kind of image
CN108427912A (en) * 2018-02-05 2018-08-21 西安电子科技大学 Remote sensing image object detection method based on the study of dense target signature

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
A search based approach to non maximum suppression in face detection;E. Zaytseva 等;《2012 19th IEEE International Conference on Image Processing》;20130221;第1469-1472页 *
Faster R-CNN 模型在车辆检测中的应用;王林 等;《计算机应用》;20180310;第666-670页 *

Also Published As

Publication number Publication date
CN109255320A (en) 2019-01-22

Similar Documents

Publication Publication Date Title
CN109255320B (en) Improved non-maximum suppression method
CN112132852B (en) Automatic image matting method and device based on multi-background color statistics
CN109308460B (en) Article detection method, system and computer readable storage medium
CN103942758A (en) Dark channel prior image dehazing method based on multiscale fusion
CN113642576B (en) Method and device for generating training image set in target detection and semantic segmentation tasks
CN112508923A (en) Weak and small target detection method
CN111091122B (en) Training and detecting method and device for multi-scale characteristic convolutional neural network
CN109886980B (en) Infrared image cirrus cloud detection method based on neighborhood intensity texture coding
CN108596985B (en) Color correction method for rare earth solution by actively optimizing Grey Edge parameters
CN109636822A (en) A kind of improvement Canny adaptive edge extracting method based on new building subordinating degree function
CN105427276A (en) Camera detection method based on image local edge characteristics
CN110288540B (en) Carbon fiber wire X-ray image online imaging standardization method
CN112037137A (en) Method and device for eliminating fuzzy region of insulator disc surface edge in infrared image
CN111476810B (en) Image edge detection method and device, electronic equipment and storage medium
CN111951254B (en) Edge-guided weighted-average-based source camera identification method and system
CN115880181A (en) Method, device and terminal for enhancing image contrast
US8811741B2 (en) Differentiated processing method of image zones
US8903189B2 (en) Image processing method
CN108961290B (en) Adaptive image edge detection method based on Ratio operator of Otsu
CN107491746B (en) Face pre-screening method based on large gradient pixel analysis
JP2005258728A (en) Method and program for supporting extraction of changing region between geographic images
CN110766712A (en) Method for acquiring detail layer image based on guide filter and related equipment thereof
CN104392453A (en) Method of matching and optimizing ransac features based on polar-line insertion image
KR101179500B1 (en) Adaptive noise reduction method and device
CN104680539B (en) A kind of digital picture over-segmentation edge removing method based on color similarity

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant