WO2018035768A1 - Method for acquiring dimension of candidate frame and device - Google Patents

Method for acquiring dimension of candidate frame and device Download PDF

Info

Publication number
WO2018035768A1
WO2018035768A1 PCT/CN2016/096598 CN2016096598W WO2018035768A1 WO 2018035768 A1 WO2018035768 A1 WO 2018035768A1 CN 2016096598 W CN2016096598 W CN 2016096598W WO 2018035768 A1 WO2018035768 A1 WO 2018035768A1
Authority
WO
WIPO (PCT)
Prior art keywords
candidate frame
information
image
original
search target
Prior art date
Application number
PCT/CN2016/096598
Other languages
French (fr)
Chinese (zh)
Inventor
覃剑
肖婷
王美华
Original Assignee
深圳天珑无线科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳天珑无线科技有限公司 filed Critical 深圳天珑无线科技有限公司
Priority to PCT/CN2016/096598 priority Critical patent/WO2018035768A1/en
Publication of WO2018035768A1 publication Critical patent/WO2018035768A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning

Definitions

  • the invention relates to the field of object detection in computer vision and pattern recognition, and in particular to a method and a device for acquiring candidate frame scales.
  • Target detection has become a basic problem in the field of computer vision and pattern recognition, and the determination of the candidate frame size of the detection target is an important preliminary work of the target recognition classification.
  • the existing method for generating a target candidate frame is generally a sliding window search mode, and when the target search is performed, a candidate frame of a plurality of scales is set to search through the entire scan window.
  • the candidate frames of many scales are set to search in the entire scan window, the number of candidate frames is large, the target search time is too long, and the detection rate is low.
  • the embodiment of the present invention provides a method and a device for acquiring a candidate frame size, which can determine a candidate frame size according to the scale information of the search target in the region.
  • an embodiment of the present invention provides a method for obtaining a candidate frame size, where the method includes:
  • the candidate frame size information in each of the original partitions is determined based on the scale information of the search target.
  • an embodiment of the present invention provides a device for acquiring a candidate frame size, and the device includes:
  • a first acquiring unit configured to acquire an image to be searched
  • a second acquiring unit configured to acquire image information of each original partition in the image to be searched and a Gaussian distribution function corresponding to each original partition
  • a first determining unit configured to determine, according to image information of each original partition and a Gaussian distribution function, scale information of a search target in each original partition;
  • a second determining unit configured to determine candidate frame size information in each original partition according to the scale information of the search target.
  • a method and a device for acquiring a candidate frame scale according to an embodiment of the present invention, by establishing a Gaussian model by partitioning, the scale information of a specific target in the detection area of each block can be obtained, and the specific target candidate frame can be reasonably set by using the method.
  • the size of the candidate box and the real target have a large coverage, and can achieve a higher detection rate when less specific target candidate frames are set.
  • FIG. 1 is a flowchart of a method for acquiring candidate frame scales according to an embodiment of the present invention
  • FIG. 2 is a flowchart of another method for acquiring candidate frame sizes according to an embodiment of the present invention
  • FIG. 3 is a flowchart of another method for acquiring candidate frame sizes according to an embodiment of the present invention.
  • FIG. 4 is a block diagram of a component of a candidate frame size acquiring apparatus according to an embodiment of the present invention.
  • FIG. 5 is a block diagram showing the composition of another candidate frame size acquiring apparatus according to an embodiment of the present invention.
  • FIG. 6 is a block diagram showing the composition of another candidate frame size acquiring apparatus according to an embodiment of the present invention.
  • the word “if” as used herein may be interpreted as “when” or “when” or “in response to determining” or “in response to detecting.”
  • the phrase “if determined” or “if detected (conditions or events stated)” may be interpreted as “when determined” or “in response to determination” or “when detected (stated condition or event) “Time” or “in response to a test (condition or event stated)”.
  • the embodiment of the invention provides a method for acquiring a candidate frame length, which can be applied to a target detection process such as pedestrian detection and vehicle detection in a scene including static monitoring video and vehicle monitoring video, as shown in FIG. 1 .
  • a target detection process such as pedestrian detection and vehicle detection in a scene including static monitoring video and vehicle monitoring video, as shown in FIG. 1 .
  • the image to be searched refers to all the images to be detected in the target detection process.
  • the size of the target in a certain area of the image approximates a Gaussian normal distribution.
  • each of the original partitions refers to an area that blocks the detection area.
  • the image information of each of the original partitions refers to the scale information of the search target in each block area.
  • the scale information of the search target in each block refers to the size of the search target, and is also the size of the search target in each frame of the image.
  • the search target refers to an object to be detected, such as a person, a vehicle, an object, and the like in the target detection process.
  • the Gaussian distribution function is based on a mathematical Gaussian model and is suitable for description
  • the size information of the candidate frame refers to the size of the candidate frame.
  • the determining the candidate frame size information in each of the original partitions is a process of adjusting the size of the candidate frame area according to the target scale information based on the sliding window search mode.
  • a method for acquiring a candidate frame size by establishing a Gaussian model by partitioning, can obtain the scale information of a specific target in the detection area of each block, and thereby appropriately setting the area of the specific target candidate frame.
  • the size ensures the coverage of the candidate frame to the real target, and can achieve a higher detection rate when less specific target candidate frames are set.
  • the embodiment of the present invention provides another possible implementation manner. As shown in FIG. 2, before the acquiring the image to be searched, the method further includes:
  • the original image refers to an n-frame image in the detection area.
  • n is an integer greater than zero.
  • the original partition is divided according to the size and characteristics of the detection area.
  • the number of the original partitions is determined according to the size and characteristics of the search area.
  • the collecting the scale information of the search target in the original partition refers to collecting the scale size of the search target in each original partition of the n-frame image.
  • the Gaussian distribution function obtains an initial value by performing parameter estimation by collecting scale information of the n-frame image search target.
  • the Gaussian distribution function is trained in the process of detecting the search target, and dynamically acquires the scale information of the search target in each region during the learning process.
  • step 103 determining, based on the scale information of the search target, determining that the candidate frame size information in each original partition is implemented
  • the candidate frame adjustment ratio ranges from (0.6 to 2.6).
  • the area size of the candidate frame is determined according to the adjusted scale range on the basis of the target reference scale, and has three candidate frame areas: a first candidate frame area, a second candidate frame area, and a third candidate. Frame area.
  • the target reference scale is a scale size determined to be closest to an actual target scale size according to the Gaussian distribution function.
  • the first candidate frame area refers to the product of the area of the target reference scale and the candidate frame adjustment ratio of 0.6.
  • the second candidate frame area refers to the product of the area of the target reference scale and the candidate frame adjustment ratio 1.
  • the third candidate frame area refers to the product of the area of the target reference scale and the candidate frame adjustment ratio of 2.6.
  • a method for acquiring a candidate frame size according to an embodiment of the present invention by setting a Gaussian model by partitioning, can obtain a scale information of a specific target in a detection area of each block, and thereby appropriately setting a size of a specific target candidate frame. To ensure the coverage of the candidate frame to the real target, it is possible to achieve a higher detection rate when less specific target candidate frames are set.
  • An embodiment of the present invention provides a device for acquiring a candidate frame size, which can be used to implement the foregoing method flows.
  • the composition thereof is as shown in FIG. 4, and the device includes:
  • the first obtaining unit 41 is configured to acquire an image to be searched.
  • the second obtaining unit 42 is configured to acquire image information of each original partition in the image to be searched and a Gaussian distribution function corresponding to each of the original partitions.
  • a first determining unit 43 configured to use image information and a Gaussian distribution function of each original partition, Determine the scale information of the search target in each raw partition.
  • the second determining unit 44 is configured to determine candidate frame size information in each original partition according to the scale information of the search target.
  • the device further includes:
  • the third obtaining unit 45 is configured to acquire a raw partition within the original image.
  • the collecting unit 46 is configured to collect the scale information of the search target in the original partition.
  • the third determining unit 47 is configured to determine a Gaussian distribution function of each original partition according to the scale information of the search target.
  • the second determining unit 44 includes:
  • the obtaining module 441 is configured to obtain a candidate frame adjustment ratio.
  • the determining module 442 is configured to determine candidate frame size information in each original partition according to the candidate frame adjustment ratio and the scale information of the search target.
  • a candidate frame size acquiring apparatus provided by an embodiment of the present invention can obtain a scale information of a specific target in a detection area of each block by establishing a Gaussian model by partitioning, and thereby appropriately setting a size of a specific target candidate frame. To ensure the coverage of the candidate frame to the real target, it is possible to achieve a higher detection rate when less specific target candidate frames are set.
  • the disclosed system, apparatus, and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • multiple units or components may be combined. Or it can be integrated into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
  • the above-described integrated unit implemented in the form of a software functional unit can be stored in a computer readable storage medium.
  • the above software functional unit is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor to perform the methods of the various embodiments of the present invention. Part of the steps.
  • the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

A method for acquiring the dimension of a candidate frame, related to the field of target detection in computer vision and pattern recognition, capable of determining, by means of establishing a Gaussian distribution model, dimension information of a target candidate frame in a video to be detected. The method comprises: acquiring an image to be searched (101); acquiring image information of each raw partition in the image to be searched and a Gaussian distribution function corresponding to each raw partition (102); determining dimension information of a search target in each raw partition on the basis of the image information and the Gaussian distribution function of each raw partition (103); and determining dimension information of a candidate frame in each raw partition on the basis of the dimension information of the search target (104). The technical solution provided is applicable in target detection processes such as pedestrian detection and vehicle detection in scenarios such as static surveillance video and vehicle-mounted surveillance video.

Description

一种候选框尺度的获取方法及装置Method and device for acquiring candidate frame scale 技术领域Technical field
本发明涉及计算机视觉及模式识别中的目标检测领域,尤其涉及一种候选框尺度的获取方法及装置。The invention relates to the field of object detection in computer vision and pattern recognition, and in particular to a method and a device for acquiring candidate frame scales.
背景技术Background technique
随着计算机图像处理技术的迅速发展和广泛应用,对于目标检测技术的需求也逐渐上升。目标检测已经成为计算机视觉和模式识别领域的基本问题,而检测目标的候选框尺度的确定是目标识别分类的一项重要的前期工作。目前现有的生成目标候选框的方法一般是滑动窗搜索方式,滑动窗搜索方式在进行目标搜索时,设置众多尺度的候选框在整个扫描窗口进行搜索。With the rapid development and wide application of computer image processing technology, the demand for target detection technology has gradually increased. Target detection has become a basic problem in the field of computer vision and pattern recognition, and the determination of the candidate frame size of the detection target is an important preliminary work of the target recognition classification. At present, the existing method for generating a target candidate frame is generally a sliding window search mode, and when the target search is performed, a candidate frame of a plurality of scales is set to search through the entire scan window.
在实现本发明过程中,发明人发现现有技术中至少存在如下问题:In the process of implementing the present invention, the inventors have found that at least the following problems exist in the prior art:
按照现有的目标搜索方法,在对目标进行搜索的过程中,设置众多尺度的候选框在整个扫描窗口进行搜索,候选框的数目较多,目标搜索的时间过长,检测率低。According to the existing target search method, in the process of searching for the target, the candidate frames of many scales are set to search in the entire scan window, the number of candidate frames is large, the target search time is too long, and the detection rate is low.
发明内容Summary of the invention
有鉴于此,本发明实施例提供了一种候选框尺度的获取方法及装置,可以根据区域内搜索目标的尺度信息确定候选框大小。In view of this, the embodiment of the present invention provides a method and a device for acquiring a candidate frame size, which can determine a candidate frame size according to the scale information of the search target in the region.
一方面,本发明实施例提供了一种候选框尺度的获取方法,所述方法包括:In one aspect, an embodiment of the present invention provides a method for obtaining a candidate frame size, where the method includes:
获取待搜索图像;Obtain an image to be searched;
获取所述待搜索图像中各个原始分区的图像信息以及各个原始分区各自对应的高斯分布函数;Obtaining image information of each original partition in the image to be searched and a Gaussian distribution function corresponding to each original partition;
根据每个原始分区的图像信息和高斯分布函数,确定每个原始分区中搜索目标的尺度信息;Determining the scale information of the search target in each original partition according to the image information of each original partition and the Gaussian distribution function;
根据搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。The candidate frame size information in each of the original partitions is determined based on the scale information of the search target.
另一方面,本发明实施例提供了一种候选框尺度的获取装置,所述装置包括:On the other hand, an embodiment of the present invention provides a device for acquiring a candidate frame size, and the device includes:
第一获取单元,用于获取待搜索图像; a first acquiring unit, configured to acquire an image to be searched;
第二获取单元,用于获取所述待搜索图像中各个原始分区的图像信息以及各个原始分区各自对应的高斯分布函数;a second acquiring unit, configured to acquire image information of each original partition in the image to be searched and a Gaussian distribution function corresponding to each original partition;
第一确定单元,用于根据每个原始分区的图像信息和高斯分布函数,确定每个原始分区中搜索目标的尺度信息;a first determining unit, configured to determine, according to image information of each original partition and a Gaussian distribution function, scale information of a search target in each original partition;
第二确定单元,用于根据搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。And a second determining unit, configured to determine candidate frame size information in each original partition according to the scale information of the search target.
本发明实施例提供的一种候选框尺度的获取方法及装置,通过分区建立高斯模型,可以得出每个分块的检测区域内特定目标的尺度信息,并以此来合理设置特定目标候选框的大小,候选框与真实目标的覆盖度较大,能够在设置较少特定目标候选框的情况下实现较高的检测率。A method and a device for acquiring a candidate frame scale according to an embodiment of the present invention, by establishing a Gaussian model by partitioning, the scale information of a specific target in the detection area of each block can be obtained, and the specific target candidate frame can be reasonably set by using the method. The size of the candidate box and the real target have a large coverage, and can achieve a higher detection rate when less specific target candidate frames are set.
附图说明DRAWINGS
为了更清楚地说明本发明实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其它的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the embodiments will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the present invention. One of ordinary skill in the art can also obtain other drawings based on these drawings without paying for inventive labor.
图1是本发明实施例提供的一种候选框尺度的获取方法流程图;FIG. 1 is a flowchart of a method for acquiring candidate frame scales according to an embodiment of the present invention;
图2是本发明实施例提供的另一种候选框尺度的获取方法流程图;2 is a flowchart of another method for acquiring candidate frame sizes according to an embodiment of the present invention;
图3是本发明实施例提供的另一种候选框尺度的获取方法流程图;3 is a flowchart of another method for acquiring candidate frame sizes according to an embodiment of the present invention;
图4是本发明实施例提供的一种候选框尺度的获取装置的组成框图;4 is a block diagram of a component of a candidate frame size acquiring apparatus according to an embodiment of the present invention;
图5是本发明实施例提供的另一种候选框尺度的获取装置的组成框图;FIG. 5 is a block diagram showing the composition of another candidate frame size acquiring apparatus according to an embodiment of the present invention; FIG.
图6是本发明实施例提供的另一种候选框尺度的获取装置的组成框图。FIG. 6 is a block diagram showing the composition of another candidate frame size acquiring apparatus according to an embodiment of the present invention.
具体实施方式detailed description
为了更好的理解本发明的技术方案,下面结合附图对本发明实施例进行详细描述。For a better understanding of the technical solutions of the present invention, the embodiments of the present invention are described in detail below with reference to the accompanying drawings.
应当明确,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其它实施例,都属于本发明保护的范围。It should be understood that the described embodiments are only a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
在本发明实施例中使用的术语是仅仅出于描述特定实施例的目的,而 非旨在限制本发明。在本发明实施例和所附权利要求书中所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。The terms used in the embodiments of the present invention are for the purpose of describing particular embodiments only. It is not intended to limit the invention. The singular forms "a", "the" and "the"
应当理解,本文中使用的术语“和/或”仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。It should be understood that the term "and/or" as used herein is merely an association describing the associated object, indicating that there may be three relationships, for example, A and/or B, which may indicate that A exists separately, while A and B, there are three cases of B alone. In addition, the character "/" in this article generally indicates that the contextual object is an "or" relationship.
取决于语境,如在此所使用的词语“如果”可以被解释成为“在……时”或“当……时”或“响应于确定”或“响应于检测”。类似地,取决于语境,短语“如果确定”或“如果检测(陈述的条件或事件)”可以被解释成为“当确定时”或“响应于确定”或“当检测(陈述的条件或事件)时”或“响应于检测(陈述的条件或事件)”。Depending on the context, the word "if" as used herein may be interpreted as "when" or "when" or "in response to determining" or "in response to detecting." Similarly, depending on the context, the phrase "if determined" or "if detected (conditions or events stated)" may be interpreted as "when determined" or "in response to determination" or "when detected (stated condition or event) "Time" or "in response to a test (condition or event stated)".
本发明实施例提供了一种候选框尺度长的获取方法,能够适用于包括静态监控视频、车载监控视频等场景中行人检测、车辆检测等目标检测过程中,如图1所示,所述方法包括:The embodiment of the invention provides a method for acquiring a candidate frame length, which can be applied to a target detection process such as pedestrian detection and vehicle detection in a scene including static monitoring video and vehicle monitoring video, as shown in FIG. 1 . include:
101、获取待搜索图像。101. Acquire an image to be searched.
其中,所述待搜索图像指的是目标检测过程中的所有待检测图像。The image to be searched refers to all the images to be detected in the target detection process.
102、获取所述待搜索图像中各个原始分区的图像信息以及各个原始分区各自对应的高斯分布函数。102. Acquire image information of each original partition in the image to be searched and a Gaussian distribution function corresponding to each original partition.
其中,需要说明的是,本发明实施例针对静态监控视频以及车载监控视频等场景监控,图像中目标在某块区域的尺度大小近似服从高斯正态分布。It should be noted that, in the embodiment of the present invention, for scene monitoring such as static monitoring video and vehicle monitoring video, the size of the target in a certain area of the image approximates a Gaussian normal distribution.
其中,所述各个原始分区指的是对检测区域进行分块的区域。Wherein, each of the original partitions refers to an area that blocks the detection area.
其中,所述各个原始分区的图像信息指的是搜索目标在每块区域的尺度信息。Wherein, the image information of each of the original partitions refers to the scale information of the search target in each block area.
其中,所述搜索目标在每块区域的尺度信息指的是搜索目标的尺度大小,也是搜索目标在每帧图像中的面积大小。The scale information of the search target in each block refers to the size of the search target, and is also the size of the search target in each frame of the image.
其中,所述搜索目标指的是目标检测过程中的待检测对象,比如人、车辆和物体等。The search target refers to an object to be detected, such as a person, a vehicle, an object, and the like in the target detection process.
其中,所述高斯分布函数指的是基于数学高斯模型建立,适用于描述 面积大小等符合正太分布的函数。Wherein, the Gaussian distribution function is based on a mathematical Gaussian model and is suitable for description The size of the area, etc., conforms to the function of the positive distribution.
103、根据每个原始分区的图像信息和高斯分布函数,确定每个原始分区中搜索目标的尺度信息。103. Determine, according to image information of each original partition and a Gaussian distribution function, scale information of a search target in each original partition.
104、根据搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。104. Determine candidate frame size information in each original partition according to the scale information of the search target.
其中,所述候选框的尺度信息指的是候选框的面积大小。The size information of the candidate frame refers to the size of the candidate frame.
其中,确定在每个原始分区中的候选框尺度信息是在滑动窗搜索方式的基础上根据目标尺度信息调整候选框面积大小的过程。The determining the candidate frame size information in each of the original partitions is a process of adjusting the size of the candidate frame area according to the target scale information based on the sliding window search mode.
本发明实施例提供的一种候选框尺度的获取方法,通过分区建立高斯模型,可以得出每个分块的检测区域内特定目标的尺度信息,并以此来合理设置特定目标候选框的面积大小,保证候选框对真实目标的覆盖度,能够在设置较少特定目标候选框的情况下实现较高的检测率。A method for acquiring a candidate frame size according to an embodiment of the present invention, by establishing a Gaussian model by partitioning, can obtain the scale information of a specific target in the detection area of each block, and thereby appropriately setting the area of the specific target candidate frame. The size ensures the coverage of the candidate frame to the real target, and can achieve a higher detection rate when less specific target candidate frames are set.
进一步来说,结合前述方法流程,本发明实施例提供了另一种可能的实现方式,如图2所示,在所述获取待搜索图像之前,还包括:Further, in conjunction with the foregoing method flow, the embodiment of the present invention provides another possible implementation manner. As shown in FIG. 2, before the acquiring the image to be searched, the method further includes:
201、获取原始图像内的原始分区。201. Obtain a original partition in the original image.
其中,所述原始图像指的是检测区域内的n帧图像。Wherein, the original image refers to an n-frame image in the detection area.
其中,所述n是大于0的整数。Wherein n is an integer greater than zero.
其中,所述原始分区根据检测区域大小和特征进行分块。The original partition is divided according to the size and characteristics of the detection area.
其中,所述原始分区数目越多时,函数越能准确的反应分区内目标的尺度大小,每个原始分区内部的统计分布可认为基本一致。The more the number of the original partitions, the more accurately the function can reflect the size of the target within the partition, and the statistical distribution inside each original partition can be considered to be basically the same.
其中,所述原始分区的数目越多时,计算过程越复杂,从计算方法的复杂程度和准确程度两方面考虑,所述原始分区的数目根据搜索区域的大小和特征来确定。The more the number of the original partitions is, the more complicated the calculation process is. According to the complexity and accuracy of the calculation method, the number of the original partitions is determined according to the size and characteristics of the search area.
202、采集所述原始分区内搜索目标的尺度信息。202. Collect scale information of the search target in the original partition.
其中,所述采集所述原始分区内搜索目标的尺度信息指的是采集所述n帧图像各个原始分区内搜索目标的尺度大小。The collecting the scale information of the search target in the original partition refers to collecting the scale size of the search target in each original partition of the n-frame image.
203、根据所述搜索目标的尺度信息,确定各个原始分区的高斯分布函数。203. Determine a Gaussian distribution function of each original partition according to the scale information of the search target.
其中,所述高斯分布函数通过采集所述n帧图像搜索目标的尺度信息进行参数估计得到初始值。 The Gaussian distribution function obtains an initial value by performing parameter estimation by collecting scale information of the n-frame image search target.
其中,所述高斯分布函数在对搜索目标进行检测的过程中进行训练,在学习过程中动态获取每块区域搜索目标的尺度信息。The Gaussian distribution function is trained in the process of detecting the search target, and dynamically acquires the scale information of the search target in each region during the learning process.
进一步来说,结合前述方法流程,在本发明实施例的另一种可能的实现方式中,针对步骤103根据搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息的实现提供了以下具体流程,如图3所示,包括:Further, in combination with the foregoing method flow, in another possible implementation manner of the embodiment of the present invention, for step 103, determining, based on the scale information of the search target, determining that the candidate frame size information in each original partition is implemented The following specific process, as shown in Figure 3, includes:
301、获取候选框调整比例。301. Obtain a candidate frame adjustment ratio.
其中,所述候选框调整比例范围为(0.6~2.6)。The candidate frame adjustment ratio ranges from (0.6 to 2.6).
302、根据所述候选框调整比例以及搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。302. Determine candidate frame size information in each original partition according to the candidate frame adjustment ratio and the scale information of the search target.
其中,所述候选框的面积大小是在目标基准尺度的基础上按照所述调整比例范围进行调整确定的,共有3个候选框面积:第一候选框面积,第二候选框面积,第三候选框面积。The area size of the candidate frame is determined according to the adjusted scale range on the basis of the target reference scale, and has three candidate frame areas: a first candidate frame area, a second candidate frame area, and a third candidate. Frame area.
其中,所述目标基准尺度是根据所述高斯分布函数确定的最接近实际目标尺度大小的尺度大小。Wherein the target reference scale is a scale size determined to be closest to an actual target scale size according to the Gaussian distribution function.
其中,第一候选框面积指的是目标基准尺度的面积与所述候选框调整比例0.6的乘积。The first candidate frame area refers to the product of the area of the target reference scale and the candidate frame adjustment ratio of 0.6.
其中,第二候选框面积指的是目标基准尺度的面积与所述候选框调整比例1的乘积。The second candidate frame area refers to the product of the area of the target reference scale and the candidate frame adjustment ratio 1.
其中,第三候选框面积指的是目标基准尺度的面积与所述候选框调整比例2.6的乘积。The third candidate frame area refers to the product of the area of the target reference scale and the candidate frame adjustment ratio of 2.6.
本发明实施例提供的一种候选框尺度的获取方法,通过分区建立高斯模型,可以得出每个分块的检测区域内特定目标的尺度信息,并以此来合理设置特定目标候选框的大小,保证候选框对真实目标的覆盖度,能够在设置较少特定目标候选框的情况下实现较高的检测率。A method for acquiring a candidate frame size according to an embodiment of the present invention, by setting a Gaussian model by partitioning, can obtain a scale information of a specific target in a detection area of each block, and thereby appropriately setting a size of a specific target candidate frame. To ensure the coverage of the candidate frame to the real target, it is possible to achieve a higher detection rate when less specific target candidate frames are set.
本发明实施例提供了一种候选框尺度的获取装置,可用于实现前述各方法流程,其组成如图4所示,所述装置包括:An embodiment of the present invention provides a device for acquiring a candidate frame size, which can be used to implement the foregoing method flows. The composition thereof is as shown in FIG. 4, and the device includes:
第一获取单元41,用于获取待搜索图像。The first obtaining unit 41 is configured to acquire an image to be searched.
第二获取单元42,用于获取所述待搜索图像中各个原始分区的图像信息以及各个原始分区各自对应的高斯分布函数。The second obtaining unit 42 is configured to acquire image information of each original partition in the image to be searched and a Gaussian distribution function corresponding to each of the original partitions.
第一确定单元43,用于根据每个原始分区的图像信息和高斯分布函数, 确定每个原始分区中搜索目标的尺度信息。a first determining unit 43 configured to use image information and a Gaussian distribution function of each original partition, Determine the scale information of the search target in each raw partition.
第二确定单元44,用于根据搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。The second determining unit 44 is configured to determine candidate frame size information in each original partition according to the scale information of the search target.
可选的是,如图5所示,所述装置还包括:Optionally, as shown in FIG. 5, the device further includes:
第三获取单元45,用于获取原始图像内的原始分区。The third obtaining unit 45 is configured to acquire a raw partition within the original image.
采集单元46,用于采集所述原始分区内搜索目标的尺度信息。The collecting unit 46 is configured to collect the scale information of the search target in the original partition.
第三确定单元47,用于根据所述搜索目标的尺度信息,确定各个原始分区的高斯分布函数。The third determining unit 47 is configured to determine a Gaussian distribution function of each original partition according to the scale information of the search target.
可选的是,如图6所示,所述第二确定单元44包括:Optionally, as shown in FIG. 6, the second determining unit 44 includes:
获取模块441,用于获取候选框调整比例。The obtaining module 441 is configured to obtain a candidate frame adjustment ratio.
确定模块442,用于根据所述候选框调整比例以及搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。The determining module 442 is configured to determine candidate frame size information in each original partition according to the candidate frame adjustment ratio and the scale information of the search target.
本发明实施例提供的一种候选框尺度的获取装置,通过分区建立高斯模型,可以得出每个分块的检测区域内特定目标的尺度信息,并以此来合理设置特定目标候选框的大小,保证候选框对真实目标的覆盖度,能够在设置较少特定目标候选框的情况下实现较高的检测率。A candidate frame size acquiring apparatus provided by an embodiment of the present invention can obtain a scale information of a specific target in a detection area of each block by establishing a Gaussian model by partitioning, and thereby appropriately setting a size of a specific target candidate frame. To ensure the coverage of the candidate frame to the real target, it is possible to achieve a higher detection rate when less specific target candidate frames are set.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。A person skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the system, the device and the unit described above can refer to the corresponding process in the foregoing method embodiment, and details are not described herein again.
在本发明所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如,多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided by the present invention, it should be understood that the disclosed system, apparatus, and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner. For example, multiple units or components may be combined. Or it can be integrated into another system, or some features can be ignored or not executed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。 The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
上述以软件功能单元的形式实现的集成的单元,可以存储在一个计算机可读取存储介质中。上述软件功能单元存储在一个存储介质中,包括若干指令用以使得一台计算机装置(可以是个人计算机,服务器,或者网络装置等)或处理器(Processor)执行本发明各个实施例所述方法的部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The above-described integrated unit implemented in the form of a software functional unit can be stored in a computer readable storage medium. The above software functional unit is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor to perform the methods of the various embodiments of the present invention. Part of the steps. The foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes. .
以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本发明保护的范围之内。 The above are only the preferred embodiments of the present invention, and are not intended to limit the present invention. Any modifications, equivalents, improvements, etc., which are made within the spirit and principles of the present invention, should be included in the present invention. Within the scope of protection.

Claims (6)

  1. 一种候选框尺度的获取方法,其特征在于,所述方法包括:A method for obtaining a candidate frame scale, the method comprising:
    获取待搜索图像;Obtain an image to be searched;
    获取所述待搜索图像中各个原始分区的图像信息以及各个原始分区各自对应的高斯分布函数;Obtaining image information of each original partition in the image to be searched and a Gaussian distribution function corresponding to each original partition;
    根据每个原始分区的图像信息和高斯分布函数,确定每个原始分区中搜索目标的尺度信息;Determining the scale information of the search target in each original partition according to the image information of each original partition and the Gaussian distribution function;
    根据搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。The candidate frame size information in each of the original partitions is determined based on the scale information of the search target.
  2. 根据权利要求1所述的方法,其特征在于,在所述获取待搜索图像之前,还包括:The method according to claim 1, wherein before the acquiring the image to be searched, the method further comprises:
    获取原始图像内的原始分区;Obtaining the original partition within the original image;
    采集所述原始分区内搜索目标的尺度信息;Collecting scale information of the search target in the original partition;
    根据所述搜索目标的尺度信息,确定各个原始分区的高斯分布函数。A Gaussian distribution function of each original partition is determined according to the scale information of the search target.
  3. 根据权利要求2所述的方法,其特征在于,所述根据搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息包括:The method according to claim 2, wherein the determining the candidate frame size information in each of the original partitions according to the scale information of the search target comprises:
    获取候选框调整比例;Get the candidate frame adjustment ratio;
    根据所述候选框调整比例以及搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。The candidate frame size information in each of the original partitions is determined according to the candidate frame adjustment ratio and the scale information of the search target.
  4. 一种候选框尺度的获取装置,其特征在于,所述装置包括:A candidate frame size acquiring device, wherein the device comprises:
    第一获取单元,用于获取待搜索图像;a first acquiring unit, configured to acquire an image to be searched;
    第二获取单元,用于获取所述待搜索图像中各个原始分区的图像信息以及各个原始分区各自对应的高斯分布函数;a second acquiring unit, configured to acquire image information of each original partition in the image to be searched and a Gaussian distribution function corresponding to each original partition;
    第一确定单元,用于根据每个原始分区的图像信息和高斯分布函数,确定每个原始分区中搜索目标的尺度信息;a first determining unit, configured to determine, according to image information of each original partition and a Gaussian distribution function, scale information of a search target in each original partition;
    第二确定单元,用于根据搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。And a second determining unit, configured to determine candidate frame size information in each original partition according to the scale information of the search target.
  5. 根据权利要求4所述的装置,其特征在于,所述装置还包括:The device according to claim 4, wherein the device further comprises:
    第三获取单元,用于获取原始图像内的原始分区;a third acquiring unit, configured to acquire a original partition in the original image;
    采集单元,用于采集所述原始分区内搜索目标的尺度信息;An acquisition unit, configured to collect the scale information of the search target in the original partition;
    第三确定单元,用于根据所述搜索目标的尺度信息,确定各个原始分 区的高斯分布函数。a third determining unit, configured to determine each original score according to the scale information of the search target The Gaussian distribution function of the region.
  6. 根据权利要求5所述的装置,其特征在于,所述第二确定单元包括:The apparatus according to claim 5, wherein the second determining unit comprises:
    获取模块,用于获取候选框调整比例;An obtaining module, configured to obtain a candidate frame adjustment ratio;
    确定模块,用于根据所述候选框调整比例以及搜索目标的尺度信息,确定在每个原始分区中的候选框尺度信息。 And a determining module, configured to determine candidate frame size information in each original partition according to the candidate frame adjustment ratio and the scale information of the search target.
PCT/CN2016/096598 2016-08-24 2016-08-24 Method for acquiring dimension of candidate frame and device WO2018035768A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/096598 WO2018035768A1 (en) 2016-08-24 2016-08-24 Method for acquiring dimension of candidate frame and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/096598 WO2018035768A1 (en) 2016-08-24 2016-08-24 Method for acquiring dimension of candidate frame and device

Publications (1)

Publication Number Publication Date
WO2018035768A1 true WO2018035768A1 (en) 2018-03-01

Family

ID=61246026

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/096598 WO2018035768A1 (en) 2016-08-24 2016-08-24 Method for acquiring dimension of candidate frame and device

Country Status (1)

Country Link
WO (1) WO2018035768A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106846329A (en) * 2016-12-21 2017-06-13 天津中科智能识别产业技术研究院有限公司 A kind of pedestrian's localization method for merging foreground detection
CN110162665A (en) * 2018-12-28 2019-08-23 腾讯科技(深圳)有限公司 Video searching method, computer equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104123714A (en) * 2013-04-27 2014-10-29 华中科技大学 Optimal target detection scale generation method in people flow statistics
US20150146927A1 (en) * 2013-11-28 2015-05-28 Robert Bosch Gmbh Accelerated object recognition in an image
CN105740892A (en) * 2016-01-27 2016-07-06 北京工业大学 High-accuracy human body multi-position identification method based on convolutional neural network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104123714A (en) * 2013-04-27 2014-10-29 华中科技大学 Optimal target detection scale generation method in people flow statistics
US20150146927A1 (en) * 2013-11-28 2015-05-28 Robert Bosch Gmbh Accelerated object recognition in an image
CN105740892A (en) * 2016-01-27 2016-07-06 北京工业大学 High-accuracy human body multi-position identification method based on convolutional neural network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
QIN, JIAN ET AL.: "Fast Pedestrian Proposal Generation Algorithm Using Online Gaussian Model", ACTA OPTICA SINICA, vol. 36, no. 11, 8 February 2016 (2016-02-08), XP055467373, ISSN: 0253-2239 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106846329A (en) * 2016-12-21 2017-06-13 天津中科智能识别产业技术研究院有限公司 A kind of pedestrian's localization method for merging foreground detection
CN110162665A (en) * 2018-12-28 2019-08-23 腾讯科技(深圳)有限公司 Video searching method, computer equipment and storage medium
CN110162665B (en) * 2018-12-28 2023-06-16 腾讯科技(深圳)有限公司 Video searching method, computer device and storage medium

Similar Documents

Publication Publication Date Title
CN111126399B (en) Image detection method, device and equipment and readable storage medium
US10074008B2 (en) Facial recognition with biometric pre-filters
KR101854554B1 (en) Method, device and storage medium for calculating building height
CN110222686B (en) Object detection method, object detection device, computer equipment and storage medium
US10255673B2 (en) Apparatus and method for detecting object in image, and apparatus and method for computer-aided diagnosis
WO2021254134A1 (en) Privacy shielding processing method and apparatus, electronic device, and monitoring system
KR102119897B1 (en) Skin condition detection method and electronic device
CN110298318B (en) Human head and human body joint detection method and device and electronic equipment
US20150227806A1 (en) Object information extraction apparatus, object information extraction program, and object information extraction method
WO2019127102A1 (en) Information processing method and apparatus, cloud processing device, and computer program product
WO2023142912A1 (en) Method and apparatus for detecting left behind object, and storage medium
CN111160229A (en) Video target detection method and device based on SSD (solid State disk) network
CN111860253A (en) Multitask attribute identification method, multitask attribute identification device, multitask attribute identification medium and multitask attribute identification equipment for driving scene
WO2018035768A1 (en) Method for acquiring dimension of candidate frame and device
CN111881740A (en) Face recognition method, face recognition device, electronic equipment and medium
CN111814846A (en) Training method and recognition method of attribute recognition model and related equipment
KR20220004606A (en) Signal lamp recognition method, device, apparatus, and storage medium and computer program
WO2020007156A1 (en) Human body recognition method and device, as well as storage medium
Saif et al. Moment features based violence action detection using optical flow
CN114219936A (en) Object detection method, electronic device, storage medium, and computer program product
JP4708835B2 (en) Face detection device, face detection method, and face detection program
US11367311B2 (en) Face recognition method and apparatus, server, and storage medium
US11527091B2 (en) Analyzing apparatus, control method, and program
CN110348353B (en) Image processing method and device
CN112036269A (en) Fall detection method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16913791

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16913791

Country of ref document: EP

Kind code of ref document: A1