WO2019114145A1 - Head count detection method and device in surveillance video - Google Patents

Head count detection method and device in surveillance video Download PDF

Info

Publication number
WO2019114145A1
WO2019114145A1 PCT/CN2018/079856 CN2018079856W WO2019114145A1 WO 2019114145 A1 WO2019114145 A1 WO 2019114145A1 CN 2018079856 W CN2018079856 W CN 2018079856W WO 2019114145 A1 WO2019114145 A1 WO 2019114145A1
Authority
WO
WIPO (PCT)
Prior art keywords
human body
image
equalized
equalized image
cascade classifier
Prior art date
Application number
PCT/CN2018/079856
Other languages
French (fr)
Chinese (zh)
Inventor
刘若鹏
钟凯宇
Original Assignee
深圳光启合众科技有限公司
深圳光启创新技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳光启合众科技有限公司, 深圳光启创新技术有限公司 filed Critical 深圳光启合众科技有限公司
Publication of WO2019114145A1 publication Critical patent/WO2019114145A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast

Definitions

  • the present invention relates to the field of video detection, and in particular to a method and apparatus for detecting a number of people in a surveillance video.
  • the video-based abnormal state detection of the crowd refers to the intelligent analysis of the behavioral state of the mass incidents in the public places of large-scale people, and judges whether there are detection methods such as crowd trampling, fighting, riots and other abnormal events.
  • detection methods such as crowd trampling, fighting, riots and other abnormal events.
  • the research on intelligent monitoring systems at home and abroad is at the initial stage of development, and there are few products that can be truly applied in real life.
  • the scientific research results obtained at this stage are only applicable to the conditions of close-range monitoring equipment such as street and indoor.
  • the target display resolution is high, the area is large, and the recognition difficulty is relatively low, but at a long distance.
  • the scenes shot under such conditions are large, and the target characters are very small and very fuzzy, and the detection is more difficult, so the above scientific research results are not applicable.
  • the embodiments of the present invention provide a method and a device for detecting a number of people in a surveillance video, so as to at least solve the technical problem that the security of a public place cannot be guaranteed due to the failure of the prior art to detect the abnormal state of the crowd under remote monitoring.
  • a method for monitoring a number of people in a video including: performing histogram equalization on each frame image of the collected monitoring video to obtain equalization after histogram equalization. Identifying a human body in the equalized image by a cascade classifier, wherein the cascade classifier is configured to identify a human body according to a human body feature of the equalized image after the histogram equalization; The human body performs statistics.
  • the cascade classifier is formed by at least two weak classifiers, wherein the cascade classifier is equalized according to the histogram equalization by the at least two weak classifiers that are superimposed
  • the human body features of the image identify the human body.
  • the method further includes: Performing grayscale on each frame image in the monitoring video; performing histogram equalization on each frame image of the collected monitoring video includes: performing histogram equalization on each frame image after graying .
  • the method further includes: extracting high frequency components in each frame image by using a Laplacian, And applying the weight to the high frequency component to obtain the enhanced high frequency component; superimposing the enhanced high frequency component on the histogram equalized image to obtain an enhanced equalized image;
  • the classifier, identifying the human body in the equalized image includes: identifying, by the cascade classifier, a human body in the enhanced equalized image.
  • the method further includes: performing edge detection on the histogram equalized equalized image by using a Canny operator, Obtaining a contour included in the histogram equalized equalized image; determining an area of a contour included in the histogram equalized equalized image; and using a non-target contour having an area larger than a predetermined threshold Performing a filling to obtain an equalized image including the non-target contour, wherein the non-target contour is a non-target region identifying a human body; and the human body in the equalized image is identified by the cascade classifier
  • the method includes: identifying, by the cascade classifier, a human body in the equalized image including the non-target contour.
  • the method further includes: obtaining the cascading classifier by training a plurality of sets of data, where the multiple Each set of data in the group data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  • the identification function of the weak classifier under the Haar-like rectangular feature is:
  • g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x
  • f j (x) is a feature value
  • ⁇ j is a threshold value of the weak classifier
  • j is used to identify j weak classifiers
  • ⁇ and ⁇ are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
  • an apparatus for monitoring the number of people in a video including: a first obtaining module, configured to perform histogram equalization on each frame image in the collected monitoring video, Obtaining a histogram equalized equalized image; an identifying module, configured to identify a human body in the equalized image by using a cascade classifier, wherein the cascaded classifier is configured to perform equalization according to a histogram
  • the human body features of the equalized image identify the human body; the statistical module is used to perform statistics on the identified human body.
  • the cascade classifier is formed by at least two weak classifiers, wherein the cascade classifier is equalized according to the histogram equalization by the at least two weak classifiers that are superimposed
  • the human body features of the image identify the human body.
  • the device further includes: a grayscale module, configured to perform histogram equalization on each frame of the collected monitoring video in a case where the collected monitoring video is colored Previously, each frame of the color of the surveillance video is grayscaled; and a module is obtained for performing histogram equalization on each frame of the grayscale image.
  • a grayscale module configured to perform histogram equalization on each frame of the collected monitoring video in a case where the collected monitoring video is colored Previously, each frame of the color of the surveillance video is grayscaled; and a module is obtained for performing histogram equalization on each frame of the grayscale image.
  • the device further includes: a second obtaining module, configured to extract each frame by using a Laplacian before identifying the human body in the equalized image by using the cascade classifier a high-frequency component in the image, and assigning a weight to the high-frequency component to obtain an enhanced high-frequency component; and superimposing the enhanced high-frequency component on the histogram-equalized image to obtain an enhanced equalization
  • an identification module configured to identify a human body in the enhanced equalized image by the cascade classifier.
  • the device further includes: an obtaining module, configured to equalize the histogram equalization by using a Canny operator before identifying the human body in the equalized image by using the cascade classifier Performing edge detection on the image to obtain an outline included in the histogram equalized equalized image; determining an area of the contour included in the histogram equalized equalized image; and determining an area larger than a predetermined threshold
  • the target contour is filled with the water, and the equalized image including the non-target contour is obtained, wherein the non-target contour is a non-target area for identifying the human body; and the identification module is configured to identify by the cascade classifier A human body in the equalized image of the non-target contour is included.
  • the device further includes: a third obtaining module, configured to obtain the cascading classification by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier And each of the plurality of sets of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  • a third obtaining module configured to obtain the cascading classification by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier
  • each of the plurality of sets of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  • the identification function of the weak classifier under the Haar-like rectangular feature is:
  • g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x
  • f j (x) is a feature value
  • ⁇ j is a threshold value of the weak classifier
  • j is used to identify j weak classifiers
  • ⁇ and ⁇ are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
  • a robot comprising the number of persons detecting means in the monitoring video according to any one of the above.
  • a storage medium including a stored program, wherein, when the program is running, controlling a device in which the storage medium is located performs any of the above Monitor the number of people in the video.
  • a processor configured to execute a program, wherein the program is executed to perform a method for detecting a number of people in a monitoring video according to any one of the above.
  • the captured video is captured, and each frame image of the collected monitoring video is subjected to histogram equalization to obtain a histogram equalized equalized image, and is identified by a cascade classifier.
  • the human body in the equalized image by counting the human body identified from each frame image in the surveillance video, by designing a filtering and enhancement algorithm, and by testing several commonly used classifier algorithms,
  • the Haar classifier with the best detection effect is continuously optimized to achieve the purpose of detecting portraits in remote monitoring, and realizes the technical effect of detecting and counting the number of people more accurately in remote monitoring, thereby solving the problem that cannot be detected due to the prior art. Under remote monitoring, the technical problems of public space safety cannot be guaranteed due to the detection of abnormal population status.
  • FIG. 1 is a flow chart of a method for monitoring number of people in a video according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of a region of a Laplacian operator 3*3 according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of a Haar feature template in accordance with an embodiment of the present invention.
  • FIG. 4 is a structural block diagram of a person detecting device in a monitoring video according to an embodiment of the present invention.
  • FIG. 5 is a block diagram 1 of an optimized structure of a number of people detecting device in a surveillance video according to an embodiment of the present invention
  • FIG. 6 is a block diagram 2 of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention
  • FIG. 7 is a block diagram 3 of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention.
  • FIG. 8 is a block diagram 4 of an optimized structure of a number of people detecting device in a surveillance video according to an embodiment of the present invention
  • FIG. 9 is a flowchart of an optimization method for monitoring a number of people in a video according to an embodiment of the present invention.
  • FIG. 10 is a schematic diagram of a process for detecting a number of people based on a cascade classifier according to an embodiment of the present invention.
  • an embodiment of a method for monitoring the number of people in a video is provided. It is noted that the steps illustrated in the flowchart of the figures may be performed in a computer system such as a set of computer executable instructions. Also, although logical sequences are shown in the flowcharts, in some cases the steps shown or described may be performed in a different order than the ones described herein.
  • FIG. 1 is a flowchart of a method for monitoring number of people in a video according to an embodiment of the present invention. As shown in FIG. 1, the method includes the following steps:
  • Step S102 performing histogram equalization on each frame image in the collected monitoring video to obtain a histogram equalized equalized image
  • Step S104 identifying a human body in the equalized image by using a cascade classifier, wherein the cascade classifier is configured to identify the human body according to the human body feature of the equalized image after the histogram equalization;
  • step S106 statistics are performed on the identified human body.
  • a histogram equalization is performed on each frame image of the collected monitoring video by using the captured video, and the equalized image after the histogram equalization is obtained, and the cascaded classifier is used to identify the image.
  • the human body in the image is equalized, and the human body identified from each frame image in the surveillance video is statistically analyzed, and an enhanced image histogram equalization algorithm is designed, and the cascading classification with the best portrait detection effect is designed.
  • the number of people is counted to achieve the purpose of detecting portraits in remote monitoring, and the technical effect of detecting and counting the number of people more accurately in remote monitoring is realized, thereby solving the problem that the existing technology cannot realize the crowd under remote monitoring.
  • the technical problem that the safety of public places cannot be guaranteed due to the detection of abnormal conditions.
  • the cascade classifier may be superposed by at least two weak classifiers, wherein the cascade classifier identifies the human body according to the human body features of the histogram equalized image by the superimposed at least two weak classifiers.
  • the method may further include: displaying each frame image in the color monitoring video Performing grayscale; performing histogram equalization on each frame image of the collected monitoring video includes: performing histogram equalization on each frame image after graying.
  • the byte stores the gray value, and the gray scale ranges from 0 to 255.
  • Image graying is required by the cascade classifier to gray out the original image as an input. For example, it can be implemented by calling OpenCV's cvCvtColor function.
  • OpenCV is a cross-platform computer vision library based on BSD license (open source) distribution, which is lightweight and efficient - consists of a series of C functions and a small number of C++ classes, and provides interfaces for languages such as Python, Ruby, and MATLAB. Achieve many general algorithms in image processing and computer vision.
  • Histogram equalization is to enhance image contrast, improve image quality, and help detect effects.
  • an embodiment of the present invention uses an improved histogram equalization method: using a Laplacian operator to extract high frequency components in each frame image before identifying the human body in the equalized image by a cascade classifier, and The high frequency component is assigned a weight, and the enhanced high frequency component is obtained; the enhanced high frequency component is superimposed on the histogram equalized image to obtain an enhanced equalized image.
  • identifying the human body in the equalized image by the cascade classifier includes: identifying the human body in the enhanced equalized image through the cascade classifier.
  • the "central idea" of the histogram equalization process is to change the gray histogram of the original image from a certain gray interval in the comparison set to a uniform distribution in the entire gray range. Histogram equalization is to nonlinearly stretch the image and redistribute the image pixel values so that the number of pixels in a certain gray range is approximately the same. Histogram equalization is the change of the histogram distribution of a given image to a "uniform" distribution histogram distribution. But there are two disadvantages:
  • the Laplace algorithm can achieve faster edge detection and better detection of high-frequency edges.
  • the Laplacian is a second-order differential operator. In the case of discrete, it is expressed as follows:
  • FIG. 2 is a schematic diagram of a region of a Laplacian operator 3*3, wherein the region of 3*3 is as shown, in accordance with an embodiment of the present invention.
  • the improved histogram equalization method greatly enhances the details of the image and facilitates subsequent detection.
  • the embodiment of the present invention can fill the contour area with a larger area by searching the contour, thereby eliminating the non-target area and improving the detection accuracy.
  • the Canny operator is used to perform edge detection on the histogram equalized equalized image to obtain a contour included in the histogram equalized equalized image; Determining an area of a contour included in the histogram equalized equalized image; filling a non-target contour having an area larger than a predetermined threshold with diffused water to obtain an equalized image including a non-target contour, wherein the non-target contour is Identify non-target areas of the human body.
  • identifying the human body in the equalized image by the cascade classifier includes: identifying the human body in the equalized image including the non-target contour by the cascade classifier.
  • the method further includes: obtaining a cascade classifier by training the plurality of sets of data, wherein each of the plurality of sets of data includes: a sample An image, and a human body recognition result for identifying whether the sample image includes a human body.
  • obtaining a cascade classifier by training the plurality of sets of data, wherein each of the plurality of sets of data includes: a sample An image, and a human body recognition result for identifying whether the sample image includes a human body.
  • the core of the acquisition process of the cascaded classifier is to find a small and critical part of the feature from a large number of Haar-like features by iterative method, and use this feature to generate an effective classifier, using a large number of classification capabilities.
  • the general weak classifiers are superimposed by a certain method to form a classifier with strong classification ability, and then these classifiers are cascaded to obtain the final strong classifier.
  • the Haar-like rectangle feature is a digital image feature for object detection.
  • 3 is a schematic diagram of a Haar feature template according to an embodiment of the present invention. As shown in FIG. 3, such a rectangular feature template is composed of two or more congruent black and white rectangles adjacent to each other, and the rectangular feature value is a white rectangle. The sum of the gray value and the gray value of the black rectangle.
  • the rectangle feature is sensitive to some simple graphic structures such as line segments and edges. If such a rectangle is placed in a non-face area, the calculated feature values should be different from the face feature values, so these rectangles are used to quantify face features to distinguish between faces and non-faces.
  • the feature-based approach was chosen without the pixel-based approach because, in the case of a given finite data sample, feature-based detection can not only encode the state of a particular region, but also be based on a feature-based system.
  • the pixel system is fast.
  • the recognition function of the weak classifier under the Haar-like rectangular feature may be:
  • g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x
  • f j (x) is a feature value
  • ⁇ j is a threshold value of the weak classifier
  • j is used to identify the jth Weak classifier
  • ⁇ and ⁇ are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
  • a weak classifier corresponds to a Haar-like rectangle feature.
  • the weak classifier form of the Haar-like feature is as described above.
  • the classifier trained by OpenCV contains a series of feature thresholds. To determine whether the intercepted image passes the classifier, it is necessary to calculate the Haar-like eigenvalues of the image under all Haar-like feature templates, and compare the thresholds of the corresponding feature templates in the classifier.
  • FIG. 4 is a structural block diagram of a person detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 4, the monitoring is performed.
  • the number of people in the video detecting device includes a first obtaining module 44, an identifying module 46, and a statistic module 48. The number of people detecting devices in the monitoring video will be described in detail below.
  • the first obtaining module 44 is configured to perform histogram equalization on each frame image of the collected monitoring video to obtain a histogram equalized equalized image;
  • the identification module 46 is connected to the first obtaining module 44 for identifying the human body in the equalized image by using a cascade classifier, wherein the cascade classifier is used for human body feature recognition of the equalized image after the histogram equalization Out of the body;
  • the statistics module 48 is connected to the identification module 46 for counting the identified human body.
  • FIG. 5 is a block diagram of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 5, the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4, a graying module. 52. The gradation module 52 will be described in detail below.
  • the graying module 52 is connected to the first obtaining module 44, and is configured to: before the collected monitoring video is colored, in the case of performing histogram equalization on each frame of the collected monitoring video, the coloring is performed. Each frame of the video in the surveillance video is grayed out.
  • FIG. 6 is a block diagram 2 of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention.
  • the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4 , a second obtaining module. 62.
  • the second obtaining module 62 will be described in detail below.
  • the second obtaining module 62 is connected to the first obtaining module 44 and the identifying module 46 for extracting each frame image by using a Laplacian before identifying the human body in the equalized image by the cascade classifier.
  • the high frequency component is assigned to the high frequency component to obtain the enhanced high frequency component; the enhanced high frequency component is superimposed on the histogram equalized image to obtain the enhanced equalized image.
  • FIG. 7 is a block diagram 3 of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention.
  • the number of detecting devices in the monitoring video includes: obtaining module 72 in addition to all the structures in FIG. 4 .
  • the obtaining module 72 will be described in detail below.
  • the obtaining module 72 is connected to the first obtaining module 44 and the identifying module 46, and is configured to perform the histogram equalized equalized image by using the Canny operator before identifying the human body in the equalized image by the cascade classifier.
  • Edge detection obtaining a contour included in the histogram equalized equalized image; determining an area of a contour included in the histogram equalized equalized image; and using a diffused water for the non-target contour having an area larger than a predetermined threshold
  • Filling obtaining an equalized image including a non-target contour, wherein the non-target contour is a non-target area that identifies the human body.
  • FIG. 8 is a block diagram of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention.
  • the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4 , a third obtaining module. 82.
  • the third obtaining module 82 will be described in detail below.
  • the third obtaining module 82 is connected to the foregoing identifying module 46, and is configured to obtain a cascade classifier by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier, wherein the plurality of data sets Each set of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  • FIG. 9 is a flowchart of an optimization method for monitoring a number of people in a video according to an embodiment of the present invention. As shown in FIG. 9, the method includes the following steps:
  • Step S902 cascading classifier training
  • the cascading classifier uses OpenCV to train the cascading classifier based on Haar-like features, it is necessary to provide corresponding positive sample pictures and counter sample pictures of the features to be identified.
  • the positive sample is a sample picture of a human body image; the negative sample is a background image, which requires no one, and the aspect ratio is 1:2.
  • the cascaded classifier is trained by the corresponding program provided by OpenCV to extract features and train classifiers, and the trained classifier model can identify these things.
  • Step S904 pedestrian detection and population statistics.
  • the input data to be detected is a real-time video, and grayscale processing is performed on each frame of the input video;
  • the result detected by the classifier may have a larger area, so by traversing the detection result of the classifier (rectangular frame), the height of the target rectangular frame is greater than the threshold. Eliminate, thereby improving detection accuracy.
  • a real-time video stream is collected, and a plurality of original sample map samples and a velocity sample map sample are obtained by line sampling based on the obtained video stream; space-time correction is performed on the obtained velocity sample map sample; based on the original sample map and The velocity sampling map, the offline training obtains the deep learning model, the deep learning model includes the classification model and the statistical model; and the obtained deep learning model is used for the population state analysis of the real-time video stream.
  • the embodiments of the invention have good adaptability to different environments, light intensity, weather conditions and camera angles; for a crowded environment such as a large flow of people, a high accuracy rate can be ensured; the calculation amount is small, and the real-time video can be satisfied.
  • the requirements for processing can be widely applied to the monitoring and management of public places densely populated by buses, subways and plazas.
  • the population density estimation for the low-density population, the pixel density method is used to estimate the population density.
  • the wavelet packet decomposition is used for the high-density population.
  • the crowd image was analyzed by multi-scale.
  • the population density level was classified by SVM (Support Vector Machine).
  • SVM Small Vector Machine
  • the block matching method based on the full search algorithm with the average absolute error as the matching criterion was used. The speed of the crowd is estimated.
  • the texture analysis method extracts the texture features for analysis, and finally uses the Adaboost classifier to classify the population density level.
  • the current research focus on population density estimation and motion analysis is to solve the problem that when there are a large number of pedestrians, there is a large amount of occlusion in the crowd, and it is difficult to accurately detect, segment and track individual pedestrians in the crowd accurately.
  • an effective statistical learning method is used to establish a reasonable decision rule, directly estimating the number of pedestrians, and determining the motion state of the target, and detecting the abnormal event. occur.
  • the solution in the above related art is only applicable to the conditions of a close-range monitoring device such as a street or an indoor.
  • the target display resolution is high, the area is large, and the recognition difficulty is relatively low, but at a long distance.
  • the recognition difficulty is relatively low, but at a long distance.
  • the conditions of detection it is not applicable.
  • the scenes shot under such conditions are large, the target characters are very small and very fuzzy, and the detection is more difficult.
  • the above methods are not applicable.
  • the monitoring video of the captured video is used to equalize the histogram of each frame in the collected monitoring video to obtain a histogram equalized image, and the histogram is identified by the cascade classifier.
  • the human body in the image after equalization is used to statistically analyze the human body identified from each frame of the video in the surveillance video, by designing filtering and enhancement algorithms, and by testing several commonly used classifier algorithms.
  • the Haar classifier with the best detection effect is obtained and continuously optimized to achieve the purpose of detecting portraits in remote monitoring, and the technical effect of detecting and counting the number of people more accurately in remote monitoring is realized, thereby solving the existing
  • the technology cannot solve the technical problem that the safety of public places cannot be guaranteed due to the detection of abnormal state of the people under remote monitoring.
  • the problem solved by the embodiment of the present invention is that during the running of the cloud number, the monitoring camera of the cloud number will focus on monitoring some important places on the ground, and the number of people is counted to determine the security status of the place, and the warning effect of the crowd is intensive. Fully guarantee the effectiveness and accuracy of the warning.
  • the embodiment of the present invention can be applied to a system for counting the number of people, and can be used in a remote monitoring environment (such as cloud number, drone, etc.) to implement pedestrian detection, and count the number of pedestrians. Some warnings can be made.
  • a remote monitoring environment such as cloud number, drone, etc.
  • FIG. 10 is a schematic diagram of a process for detecting a number of people based on a cascaded classifier according to an embodiment of the present invention.
  • the classifier can include the identification image. The part of the human body is detected, and when a feature containing a person is detected, the number counter is automatically incremented by one.
  • the cascading classifier is used for human body detection, and the corresponding program is designed based on the interface function of the pre-processing and cascading classifier in the OpenCV library.
  • a robot comprising the number of persons detecting means in the monitoring video of any of the above.
  • a storage medium comprising: a stored program, wherein the device in which the storage medium is located controls the number of people in the monitoring video in any one of the above-mentioned items when the program is running.
  • a processor for running a program wherein the program is executed while performing the number of people in the monitoring video of any of the above.
  • the disclosed technical contents may be implemented in other manners.
  • the device embodiments described above are only schematic.
  • the division of the unit may be a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, unit or module, and may be electrical or otherwise.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • the technical solution of the present invention which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • a number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .

Abstract

Disclosed is a head count detection method and device in a surveillance video. The method comprises: histogram equalization is performed on each frame of image captured in the surveillance video to obtain an equalized image after histogram equalization; a human body in the equalized image is recognized by means of a cascade classifier; the cascade classifier is used for recognizing the human body according to human body features in the equalized image after histogram equalization; and the identified human body is counted. The present invention solves the technical problems in the prior art that the safety in public places cannot be guaranteed due to the inability to realize detection of an abnormal state of the crowd under remote monitoring.

Description

监控视频中人数检测方法及装置Method and device for detecting number of people in surveillance video 技术领域Technical field
本发明涉及视频检测领域,具体而言,涉及一种监控视频中人数检测方法及装置。The present invention relates to the field of video detection, and in particular to a method and apparatus for detecting a number of people in a surveillance video.
背景技术Background technique
人们对于公共场所的安全要求日益提升。到目前为止,国内已发生了多起暴力恐怖事件,造成了大量的人员伤亡以及财产损失,建立高效完备的智能视频监控系统已经成为当今社会的迫切需求。现阶段得到广泛应用的视频监控系统虽然提供了大量的视频信息,但是对于突发事件和情况不具有预报警能力,必须人为地参与到监视工作中去。随着机器视觉技术和图像处理技术的不断进步,需要大量人力的传统视频监控系统已然不能满足社会发展的需要,高度自动化、智能化的新一代视频监控系统必将逐步取代传统视频监控系统在安防领域中的地位,在保证系统性能的同时,解放人力,进而降低成本。People's safety requirements for public places are increasing. So far, there have been many violent terrorist incidents in China, causing a large number of casualties and property losses. Establishing an efficient and complete intelligent video surveillance system has become an urgent need in today's society. Although the video surveillance system, which is widely used at this stage, provides a large amount of video information, it does not have the ability to pre-alarm for emergencies and situations, and must be artificially involved in the monitoring work. With the continuous advancement of machine vision technology and image processing technology, traditional video surveillance systems that require a lot of manpower can no longer meet the needs of social development. A highly automated and intelligent next-generation video surveillance system will gradually replace the traditional video surveillance system in security. The position in the field, while ensuring system performance, frees manpower and thus reduces costs.
基于视频的人群异常状态检测是指在大规模人群的公共场所,对群体性事件进行行为状态智能分析,判断其是否存在人群踩踏、打架斗殴、骚乱等异常事件的检测方法。目前,国内外对于智能监控系统的研究均处于初级开发阶段,能够真正应用在实际生活中的产品很少。经过近些年大量专家学者的研究工作,在视频内容分析理解方面,侧重于人群密度估计或少数个体研究已经取得了一定的科研成果,The video-based abnormal state detection of the crowd refers to the intelligent analysis of the behavioral state of the mass incidents in the public places of large-scale people, and judges whether there are detection methods such as crowd trampling, fighting, riots and other abnormal events. At present, the research on intelligent monitoring systems at home and abroad is at the initial stage of development, and there are few products that can be truly applied in real life. After a lot of research work by experts and scholars in recent years, in the aspect of video content analysis and understanding, focusing on population density estimation or a few individual studies has achieved certain scientific research results.
而现阶段取得的所述科研成果只适用于街头、室内等近距离的监控设备的条件,这样的环境下检测的目标显示分辨率高、区域较大,识别难度相对较低,但是在远距离检测的条件下,这样的条件所拍摄的场景面积大,目标人物相对就显得非常小、并且十分模糊,检测难度更大,所以上述科研成果并不适用。The scientific research results obtained at this stage are only applicable to the conditions of close-range monitoring equipment such as street and indoor. In such an environment, the target display resolution is high, the area is large, and the recognition difficulty is relatively low, but at a long distance. Under the conditions of detection, the scenes shot under such conditions are large, and the target characters are very small and very fuzzy, and the detection is more difficult, so the above scientific research results are not applicable.
针对上述的远距离监控下人群异常状态的检测困难的问题,目前尚未提出有效的解决方案。In view of the above-mentioned problem of difficulty in detecting abnormal state of the population under remote monitoring, an effective solution has not yet been proposed.
发明内容Summary of the invention
本发明实施例提供了一种监控视频中人数检测方法及装置,以至少解决由于现有技术无法在远距离监控下实现人群异常状态的检测造成的公共场所安全无法保障的技 术问题。The embodiments of the present invention provide a method and a device for detecting a number of people in a surveillance video, so as to at least solve the technical problem that the security of a public place cannot be guaranteed due to the failure of the prior art to detect the abnormal state of the crowd under remote monitoring.
根据本发明实施例的一个方面,提供了一种监控视频中人数检测的方法,包括:对采集到的监控视频中的每一帧图像进行直方图均衡化,得到直方图均衡化后的均衡化图像;通过级联分类器,识别所述均衡化图像中的人体,其中,所述级联分类器用于根据直方图均衡化后的所述均衡化图像的人体特征识别出人体;对识别出的人体进行统计。According to an aspect of the embodiments of the present invention, a method for monitoring a number of people in a video is provided, including: performing histogram equalization on each frame image of the collected monitoring video to obtain equalization after histogram equalization. Identifying a human body in the equalized image by a cascade classifier, wherein the cascade classifier is configured to identify a human body according to a human body feature of the equalized image after the histogram equalization; The human body performs statistics.
可选的,所述级联分类器至少由两个弱分类器叠加而成,其中,所述级联分类器通过叠加的所述至少两个弱分类器根据所述直方图均衡化后的均衡化图像的人体特征识别出人体。Optionally, the cascade classifier is formed by at least two weak classifiers, wherein the cascade classifier is equalized according to the histogram equalization by the at least two weak classifiers that are superimposed The human body features of the image identify the human body.
可选的,在采集的所述监控视频为彩色的情况下,在对采集到的所述监控视频中的每一帧图像进行直方图均衡化之前,所述方法还包括:将彩色的所述监控视频中的每一帧图像进行灰度化;对采集到的所述监控视频中的每一帧图像进行直方图均衡化包括:对进行灰度化后的每一帧图像进行直方图均衡化。Optionally, in the case that the collected monitoring video is colored, before performing histogram equalization on each frame of the collected monitoring video, the method further includes: Performing grayscale on each frame image in the monitoring video; performing histogram equalization on each frame image of the collected monitoring video includes: performing histogram equalization on each frame image after graying .
可选的,在通过所述级联分类器,识别所述均衡化图像中的人体之前,所述方法还包括:利用拉普拉斯算子提取所述每一帧图像中的高频成份,并对所述高频成份赋于权值,得到增强后的高频成份;将增强后的高频成份叠加于直方图均衡化后的图像中,得到增强后的均衡化图像;通过所述级联分类器,识别所述均衡化图像中的人体包括:通过所述级联分类器,识别所述增强后的均衡化图像中的人体。Optionally, before the human body in the equalized image is identified by the cascade classifier, the method further includes: extracting high frequency components in each frame image by using a Laplacian, And applying the weight to the high frequency component to obtain the enhanced high frequency component; superimposing the enhanced high frequency component on the histogram equalized image to obtain an enhanced equalized image; And the classifier, identifying the human body in the equalized image includes: identifying, by the cascade classifier, a human body in the enhanced equalized image.
可选的,在通过所述级联分类器,识别所述均衡化图像中的人体之前,所述方法还包括:利用Canny算子对所述直方图均衡化后的均衡化图像进行边缘检测,获得所述直方图均衡化后的均衡化图像中所包括的轮廓;确定所述直方图均衡化后的均衡化图像中所包括的轮廓的面积;将面积大于预定阈值的非目标轮廓使用漫水进行填充,获得包括了所述非目标轮廓的均衡化图像,其中,所述非目标轮廓为识别人体的非目标区域;通过所述级联分类器,识别所述均衡化后的图像中的人体包括:通过所述级联分类器,识别包括了所述非目标轮廓的均衡化图像中的人体。Optionally, before the human body in the equalized image is identified by the cascade classifier, the method further includes: performing edge detection on the histogram equalized equalized image by using a Canny operator, Obtaining a contour included in the histogram equalized equalized image; determining an area of a contour included in the histogram equalized equalized image; and using a non-target contour having an area larger than a predetermined threshold Performing a filling to obtain an equalized image including the non-target contour, wherein the non-target contour is a non-target region identifying a human body; and the human body in the equalized image is identified by the cascade classifier The method includes: identifying, by the cascade classifier, a human body in the equalized image including the non-target contour.
可选的,在通过所述级联分类器,识别所述均衡化图像中的人体之前,所述方法还包括:通过对多组数据进行训练得到所述级联分类器,其中,所述多组数据中的每组数据均包括:样本图像,和用于标识所述样本图像是否包括人体的人体识别结果。Optionally, before the human body in the equalized image is identified by the cascading classifier, the method further includes: obtaining the cascading classifier by training a plurality of sets of data, where the multiple Each set of data in the group data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
可选的,所述弱分类器在基于Haar-like矩形特征下的识别函数为:Optionally, the identification function of the weak classifier under the Haar-like rectangular feature is:
Figure PCTCN2018079856-appb-000001
Figure PCTCN2018079856-appb-000001
其中,g haar(x)用于标识基于人体特征x确定均衡化图像是否包括人体的识别结果,f j(x)是特征值;θ j是所述弱分类器的阈值;j用于标识第j个弱分类器;α和β是分类结果的置信度,取值范围为[-1,+1],为负则不是人体,为正则是人体。 Where g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x, f j (x) is a feature value; θ j is a threshold value of the weak classifier; j is used to identify j weak classifiers; α and β are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
根据本发明实施例的另一方面,还提供了一种监控视频中人数检测的装置,包括:第一得到模块,用于对采集到的监控视频中的每一帧图像进行直方图均衡化,得到直方图均衡化后的均衡化图像;识别模块,用于通过级联分类器,识别所述均衡化图像中的人体,其中,所述级联分类器用于根据直方图均衡化后的所述均衡化图像的人体特征识别出人体;统计模块,用于对识别出的人体进行统计。According to another aspect of the present invention, an apparatus for monitoring the number of people in a video is provided, including: a first obtaining module, configured to perform histogram equalization on each frame image in the collected monitoring video, Obtaining a histogram equalized equalized image; an identifying module, configured to identify a human body in the equalized image by using a cascade classifier, wherein the cascaded classifier is configured to perform equalization according to a histogram The human body features of the equalized image identify the human body; the statistical module is used to perform statistics on the identified human body.
可选的,所述级联分类器至少由两个弱分类器叠加而成,其中,所述级联分类器通过叠加的所述至少两个弱分类器根据所述直方图均衡化后的均衡化图像的人体特征识别出人体。Optionally, the cascade classifier is formed by at least two weak classifiers, wherein the cascade classifier is equalized according to the histogram equalization by the at least two weak classifiers that are superimposed The human body features of the image identify the human body.
可选的,所述装置还包括:灰度化模块,用于在采集的所述监控视频为彩色的情况下,在对采集到的所述监控视频中的每一帧图像进行直方图均衡化之前,将彩色的所述监控视频中的每一帧图像进行灰度化;得到模块,用于对进行灰度化后的每一帧图像进行直方图均衡化。Optionally, the device further includes: a grayscale module, configured to perform histogram equalization on each frame of the collected monitoring video in a case where the collected monitoring video is colored Previously, each frame of the color of the surveillance video is grayscaled; and a module is obtained for performing histogram equalization on each frame of the grayscale image.
可选的,所述装置还包括:第二得到模块,用于在通过所述级联分类器,识别所述均衡化图像中的人体之前,利用拉普拉斯算子提取所述每一帧图像中的高频成份,并对所述高频成份赋于权值,得到增强后的高频成份;将增强后的高频成份叠加于直方图均衡化后的图像中,得到增强后的均衡化图像;识别模块,用于通过所述级联分类器,识别所述增强后的均衡化图像中的人体。Optionally, the device further includes: a second obtaining module, configured to extract each frame by using a Laplacian before identifying the human body in the equalized image by using the cascade classifier a high-frequency component in the image, and assigning a weight to the high-frequency component to obtain an enhanced high-frequency component; and superimposing the enhanced high-frequency component on the histogram-equalized image to obtain an enhanced equalization And an identification module, configured to identify a human body in the enhanced equalized image by the cascade classifier.
可选的,所述装置还包括:获得模块,用于在通过所述级联分类器,识别所述均衡化图像中的人体之前,利用Canny算子对所述直方图均衡化后的均衡化图像进行边缘检测,获得所述直方图均衡化后的均衡化图像中所包括的轮廓;确定所述直方图均衡化后的均衡化图像中所包括的轮廓的面积;将面积大于预定阈值的非目标轮廓使用漫水进行填充,获得包括了所述非目标轮廓的均衡化图像,其中,所述非目标轮廓为识别人体的非目标区域;识别模块,用于通过所述级联分类器,识别包括了所述非目标轮廓的均衡化图像中的人体。Optionally, the device further includes: an obtaining module, configured to equalize the histogram equalization by using a Canny operator before identifying the human body in the equalized image by using the cascade classifier Performing edge detection on the image to obtain an outline included in the histogram equalized equalized image; determining an area of the contour included in the histogram equalized equalized image; and determining an area larger than a predetermined threshold The target contour is filled with the water, and the equalized image including the non-target contour is obtained, wherein the non-target contour is a non-target area for identifying the human body; and the identification module is configured to identify by the cascade classifier A human body in the equalized image of the non-target contour is included.
可选的,所述装置还包括:第三得到模块,用于在通过所述级联分类器,识别所述均衡化图像中的人体之前,通过对多组数据进行训练得到所述级联分类器,其中,所述多组数据中的每组数据均包括:样本图像,和用于标识所述样本图像是否包括人体的人体识别结果。Optionally, the device further includes: a third obtaining module, configured to obtain the cascading classification by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier And each of the plurality of sets of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
可选的,所述弱分类器在基于Haar-like矩形特征下的识别函数为:Optionally, the identification function of the weak classifier under the Haar-like rectangular feature is:
Figure PCTCN2018079856-appb-000002
Figure PCTCN2018079856-appb-000002
其中,g haar(x)用于标识基于人体特征x确定均衡化图像是否包括人体的识别结果,f j(x)是特征值;θ j是所述弱分类器的阈值;j用于标识第j个弱分类器;α和β是分类结果的置信度,取值范围为[-1,+1],为负则不是人体,为正则是人体。 Where g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x, f j (x) is a feature value; θ j is a threshold value of the weak classifier; j is used to identify j weak classifiers; α and β are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
根据本发明实施例的另一方面,还提供了一种机器人,所述机器人包含上述任意一项所述的监控视频中人数检测装置。According to another aspect of the embodiments of the present invention, there is also provided a robot comprising the number of persons detecting means in the monitoring video according to any one of the above.
根据本发明实施例的另一方面,还提供了一种存储介质,所述存储介质包括存储的程序,其中,在所述程序运行时控制所述存储介质所在设备执行上述任意一项所述的监控视频中人数检测方法。According to another aspect of an embodiment of the present invention, a storage medium is provided, the storage medium including a stored program, wherein, when the program is running, controlling a device in which the storage medium is located performs any of the above Monitor the number of people in the video.
根据本发明实施例的另一方面,还提供了一种处理器,所述处理器用于运行程序,其中,所述程序运行时执行上述任意一项所述的监控视频中人数检测方法。According to another aspect of the embodiments of the present invention, there is further provided a processor, wherein the processor is configured to execute a program, wherein the program is executed to perform a method for detecting a number of people in a monitoring video according to any one of the above.
在本发明实施例中,采用采集拍摄的监控视频,对采集到的监控视频中的每一帧图像进行直方图均衡化,得到直方图均衡化后的均衡化图像,并通过级联分类器识别该均衡化图像中的人体,再对从监控视频中的每一帧图像中识别出的人体进行统计的方式,通过设计出滤波和增强的算法,及通过测试几种常用的分类器算法,得出检测效果最好的Haar分类器,并不断优化,达到在远距离监控中检测人像的目的,实现了远距离监控中较为准确地检测和统计人数的技术效果,进而解决了由于现有技术无法在远距离监控下实现人群异常状态的检测造成的公共场所安全无法保障的技术问题。In the embodiment of the present invention, the captured video is captured, and each frame image of the collected monitoring video is subjected to histogram equalization to obtain a histogram equalized equalized image, and is identified by a cascade classifier. The human body in the equalized image, by counting the human body identified from each frame image in the surveillance video, by designing a filtering and enhancement algorithm, and by testing several commonly used classifier algorithms, The Haar classifier with the best detection effect is continuously optimized to achieve the purpose of detecting portraits in remote monitoring, and realizes the technical effect of detecting and counting the number of people more accurately in remote monitoring, thereby solving the problem that cannot be detected due to the prior art. Under remote monitoring, the technical problems of public space safety cannot be guaranteed due to the detection of abnormal population status.
附图说明DRAWINGS
此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:The drawings described herein are intended to provide a further understanding of the invention, and are intended to be a part of the invention. In the drawing:
图1是根据本发明实施例的监控视频中人数检测的方法的流程图;1 is a flow chart of a method for monitoring number of people in a video according to an embodiment of the present invention;
图2是根据本发明实施例的拉普拉斯算子3*3的区域的示意图;2 is a schematic diagram of a region of a Laplacian operator 3*3 according to an embodiment of the present invention;
图3是根据本发明实施例的Haar特征模板的示意图;3 is a schematic diagram of a Haar feature template in accordance with an embodiment of the present invention;
图4是根据本发明实施例的监控视频中人数检测装置的结构框图;4 is a structural block diagram of a person detecting device in a monitoring video according to an embodiment of the present invention;
图5是根据本发明实施例的监控视频中人数检测装置的优化结构框图一;5 is a block diagram 1 of an optimized structure of a number of people detecting device in a surveillance video according to an embodiment of the present invention;
图6是根据本发明实施例的监控视频中人数检测装置的优化结构框图二;6 is a block diagram 2 of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention;
图7是根据本发明实施例的监控视频中人数检测装置的优化结构框图三;7 is a block diagram 3 of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention;
图8是根据本发明实施例的监控视频中人数检测装置的优化结构框图四;8 is a block diagram 4 of an optimized structure of a number of people detecting device in a surveillance video according to an embodiment of the present invention;
图9是根据本发明实施例的监控视频中人数检测的优化方法的流程图;9 is a flowchart of an optimization method for monitoring a number of people in a video according to an embodiment of the present invention;
图10是根据本发明实施例的基于级联分类器的人数检测流程示意图。FIG. 10 is a schematic diagram of a process for detecting a number of people based on a cascade classifier according to an embodiment of the present invention.
具体实施方式Detailed ways
为了使本技术领域的人员更好地理解本发明方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is an embodiment of the invention, but not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts shall fall within the scope of the present invention.
需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本发明的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。It is to be understood that the terms "first", "second" and the like in the specification and claims of the present invention are used to distinguish similar objects, and are not necessarily used to describe a particular order or order. It is to be understood that the data so used may be interchanged where appropriate, so that the embodiments of the invention described herein can be implemented in a sequence other than those illustrated or described herein. In addition, the terms "comprises" and "comprises" and "the" and "the" are intended to cover a non-exclusive inclusion, for example, a process, method, system, product, or device that comprises a series of steps or units is not necessarily limited to Those steps or units may include other steps or units not explicitly listed or inherent to such processes, methods, products or devices.
根据本发明实施例,提供了一种监控视频中人数检测的方法实施例,需要说明的是,在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行,并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。In accordance with an embodiment of the present invention, an embodiment of a method for monitoring the number of people in a video is provided. It is noted that the steps illustrated in the flowchart of the figures may be performed in a computer system such as a set of computer executable instructions. Also, although logical sequences are shown in the flowcharts, in some cases the steps shown or described may be performed in a different order than the ones described herein.
图1是根据本发明实施例的监控视频中人数检测的方法的流程图,如图1所示, 该方法包括如下步骤:FIG. 1 is a flowchart of a method for monitoring number of people in a video according to an embodiment of the present invention. As shown in FIG. 1, the method includes the following steps:
步骤S102,对采集到的监控视频中的每一帧图像进行直方图均衡化,得到直方图均衡化后的均衡化图像;Step S102, performing histogram equalization on each frame image in the collected monitoring video to obtain a histogram equalized equalized image;
步骤S104,通过级联分类器,识别均衡化图像中的人体,其中,级联分类器用于根据直方图均衡化后的均衡化图像的人体特征识别出人体;Step S104, identifying a human body in the equalized image by using a cascade classifier, wherein the cascade classifier is configured to identify the human body according to the human body feature of the equalized image after the histogram equalization;
步骤S106,对识别出的人体进行统计。In step S106, statistics are performed on the identified human body.
在本发明实施例中,采用采集拍摄的监控视频对采集到的监控视频中的每一帧图像进行直方图均衡化,得到直方图均衡化后的均衡化图像,并通过级联分类器识别该均衡化图像中的人体,再对从监控视频中的每一帧图像中识别出的人体进行统计的方式,通过设计出增强的图像直方图均衡化算法,及人像检测效果最好的级联分类器,并进行人数统计,达到在远距离监控中检测人像的目的,实现了远距离监控中较为准确地检测和统计人数的技术效果,进而解决了由于现有技术无法在远距离监控下实现人群异常状态的检测造成的公共场所安全无法保障的技术问题。In the embodiment of the present invention, a histogram equalization is performed on each frame image of the collected monitoring video by using the captured video, and the equalized image after the histogram equalization is obtained, and the cascaded classifier is used to identify the image. The human body in the image is equalized, and the human body identified from each frame image in the surveillance video is statistically analyzed, and an enhanced image histogram equalization algorithm is designed, and the cascading classification with the best portrait detection effect is designed. And the number of people is counted to achieve the purpose of detecting portraits in remote monitoring, and the technical effect of detecting and counting the number of people more accurately in remote monitoring is realized, thereby solving the problem that the existing technology cannot realize the crowd under remote monitoring. The technical problem that the safety of public places cannot be guaranteed due to the detection of abnormal conditions.
其中,级联分类器可以至少由两个弱分类器叠加而成,其中,级联分类器通过叠加的至少两个弱分类器根据直方图均衡化后的均衡化图像的人体特征识别出人体。The cascade classifier may be superposed by at least two weak classifiers, wherein the cascade classifier identifies the human body according to the human body features of the histogram equalized image by the superimposed at least two weak classifiers.
优选的,在采集的监控视频为彩色的情况下,在对采集到的监控视频中的每一帧图像进行直方图均衡化之前,方法还可以包括:将彩色的监控视频中的每一帧图像进行灰度化;对采集到的监控视频中的每一帧图像进行直方图均衡化包括:对进行灰度化后的每一帧图像进行直方图均衡化。Preferably, in the case that the collected monitoring video is colored, before the histogram equalization is performed on each frame of the collected monitoring video, the method may further include: displaying each frame image in the color monitoring video Performing grayscale; performing histogram equalization on each frame image of the collected monitoring video includes: performing histogram equalization on each frame image after graying.
灰度化,在RGB模型中,如果R=G=B时,则彩色表示一种灰度颜色,其中R=G=B的值为灰度值,因此,灰度图像每个像素只需一个字节存放灰度值,灰度范围为0-255。图像灰度化是由于级联分类器所要求的,将彩色的原图灰度化后作为输入。例如,可通过调用OpenCV的cvCvtColor函数实现。Grayscale, in the RGB model, if R=G=B, the color represents a grayscale color, where R=G=B is the gray value, so the grayscale image only needs one pixel per pixel. The byte stores the gray value, and the gray scale ranges from 0 to 255. Image graying is required by the cascade classifier to gray out the original image as an input. For example, it can be implemented by calling OpenCV's cvCvtColor function.
其中,OpenCV是一个基于BSD许可(开源)发行的跨平台计算机视觉库,它轻量级而且高效——由一系列C函数和少量C++类构成,同时提供了Python、Ruby、MATLAB等语言的接口,实现了图像处理和计算机视觉方面的很多通用算法。Among them, OpenCV is a cross-platform computer vision library based on BSD license (open source) distribution, which is lightweight and efficient - consists of a series of C functions and a small number of C++ classes, and provides interfaces for languages such as Python, Ruby, and MATLAB. Achieve many general algorithms in image processing and computer vision.
需要说明的是,由于拍摄的距离较远,图像中的人体相对较小,并且模糊,因此需要在检测之前对图像进行增强。直方图均衡化是为了增强图像对比度,改善图像的质量,有助于检测效果。例如,本发明实施例使用了改进的直方图均衡化方法:通过级联分类器,识别均衡化图像中的人体之前,利用拉普拉斯算子提取每一帧图像中的 高频成份,并对高频成份赋于权值,得到增强后的高频成份;将增强后的高频成份叠加于直方图均衡化后的图像中,得到增强后的均衡化图像。同时,通过级联分类器,识别均衡化图像中的人体包括:通过级联分类器,识别增强后的均衡化图像中的人体。It should be noted that since the distance of shooting is relatively long, the human body in the image is relatively small and blurred, so it is necessary to enhance the image before the detection. Histogram equalization is to enhance image contrast, improve image quality, and help detect effects. For example, an embodiment of the present invention uses an improved histogram equalization method: using a Laplacian operator to extract high frequency components in each frame image before identifying the human body in the equalized image by a cascade classifier, and The high frequency component is assigned a weight, and the enhanced high frequency component is obtained; the enhanced high frequency component is superimposed on the histogram equalized image to obtain an enhanced equalized image. Meanwhile, identifying the human body in the equalized image by the cascade classifier includes: identifying the human body in the enhanced equalized image through the cascade classifier.
直方图均衡化处理的“中心思想”是把原始图像的灰度直方图从比较集中的某个灰度区间变成在全部灰度范围内的均匀分布。直方图均衡化就是对图像进行非线性拉伸,重新分配图像像素值,使一定灰度范围内的像素数量大致相同。直方图均衡化就是把给定图像的直方图分布改变成“均匀”分布直方图分布。但是存在两个缺点:The "central idea" of the histogram equalization process is to change the gray histogram of the original image from a certain gray interval in the comparison set to a uniform distribution in the entire gray range. Histogram equalization is to nonlinearly stretch the image and redistribute the image pixel values so that the number of pixels in a certain gray range is approximately the same. Histogram equalization is the change of the histogram distribution of a given image to a "uniform" distribution histogram distribution. But there are two disadvantages:
1)变换后图像的灰度级减少,某些细节消失;1) The gray level of the transformed image is reduced, and some details disappear;
2)某些图像,如直方图有高峰,经处理后对比度不自然的过分增强。2) Some images, such as histograms, have peaks, and the contrast is unnaturally over-enhanced after processing.
考虑到图像中人体往往处于低灰度值部分的细节,为了改进直方图均衡化带来的缺点,增强细节部分的表现,将直方图均衡化引入边缘的信息改进该算法。拉普拉斯算法在能实现较快的边缘检测,并且对高频的边缘检测效果较好,拉普拉斯算子是一个二阶微分算子,在离散的情况下,表示如下:Considering the details of the human body tending to be in the low gray value part of the image, in order to improve the shortcomings caused by histogram equalization, enhance the performance of the detail part, and improve the algorithm by introducing histogram equalization into the edge information. The Laplace algorithm can achieve faster edge detection and better detection of high-frequency edges. The Laplacian is a second-order differential operator. In the case of discrete, it is expressed as follows:
Figure PCTCN2018079856-appb-000003
Figure PCTCN2018079856-appb-000003
其中among them
Figure PCTCN2018079856-appb-000004
Figure PCTCN2018079856-appb-000004
Figure PCTCN2018079856-appb-000005
Figure PCTCN2018079856-appb-000005
可以用多种方式将其表示为数字形式。对于一个3*3的区域,经验上被推荐最多的形式是:It can be represented in digital form in a variety of ways. For a 3*3 area, the most recommended form of experience is:
Figure PCTCN2018079856-appb-000006
Figure PCTCN2018079856-appb-000006
图2是根据本发明实施例的拉普拉斯算子3*3的区域的示意图,其中3*3的区域如所示。2 is a schematic diagram of a region of a Laplacian operator 3*3, wherein the region of 3*3 is as shown, in accordance with an embodiment of the present invention.
综上介绍,改进的直方图均衡化步骤如下:In summary, the improved histogram equalization steps are as follows:
(1)利用拉普拉斯算子提取原图像的高频成分并赋予相应权值λ(本实施例中选取λ=3,得到了增强后的图像高频成分λ|f(x,y)|;(1) Using the Laplacian operator to extract the high-frequency component of the original image and assign the corresponding weight λ (in this embodiment, λ=3 is selected, and the enhanced image high-frequency component λ|f(x, y) is obtained. |;
(2)使用传统的直方图均衡化获得另一幅图像;(2) Obtain another image using conventional histogram equalization;
(3)将(1)、(2)两步得到的图像进行相加,并将得到的结果超出255的像素值取255,得到最终增强后的图像。(3) Add the images obtained in two steps (1) and (2), and take the result of the pixel value exceeding 255 to obtain 255, to obtain the final enhanced image.
改进后的直方图均衡化方法极大地增强了图像细节部分,有助于之后的检测。The improved histogram equalization method greatly enhances the details of the image and facilitates subsequent detection.
需要说明的是,由于在图像中的人显得较小,于是本发明实施例可以通过轮廓的查找,将面积较大的轮廓区域填充,即可排除非目标区域,提高检测准确率。在通过级联分类器,识别均衡化图像中的人体之前,利用Canny算子对直方图均衡化后的均衡化图像进行边缘检测,获得直方图均衡化后的均衡化图像中所包括的轮廓;确定直方图均衡化后的均衡化图像中所包括的轮廓的面积;将面积大于预定阈值的非目标轮廓使用漫水进行填充,获得包括了非目标轮廓的均衡化图像,其中,非目标轮廓为识别人体的非目标区域。同时,通过级联分类器,识别均衡化后的图像中的人体包括:通过级联分类器,识别包括了非目标轮廓的均衡化图像中的人体。It should be noted that, because the person in the image appears to be small, the embodiment of the present invention can fill the contour area with a larger area by searching the contour, thereby eliminating the non-target area and improving the detection accuracy. Before the human body in the equalized image is identified by the cascade classifier, the Canny operator is used to perform edge detection on the histogram equalized equalized image to obtain a contour included in the histogram equalized equalized image; Determining an area of a contour included in the histogram equalized equalized image; filling a non-target contour having an area larger than a predetermined threshold with diffused water to obtain an equalized image including a non-target contour, wherein the non-target contour is Identify non-target areas of the human body. Meanwhile, identifying the human body in the equalized image by the cascade classifier includes: identifying the human body in the equalized image including the non-target contour by the cascade classifier.
可以采用如下具体流程实现:It can be implemented by the following specific processes:
(1)使用Canny算子进行边缘检测;(1) Edge detection using the Canny operator;
(2)调用OpenCV库的findContours(),在二值图像中找到轮廓;(2) call findContours() of the OpenCV library to find the outline in the binary image;
(3)调用OpenCV库的drawContours()将每一个轮廓绘制出来,并使用contourArea()计算每个轮廓面积,将面积大于阈值的轮廓使用漫水填充填充方法进行填充,即调用OpenCV中的cvFloodFill。(3) Call the OpenCV library's drawContours() to draw each outline, and use contourArea() to calculate each contour area. Fill the contour with the area larger than the threshold with the flood fill fill method, that is, call cvFloodFill in OpenCV.
至此,可将图中大部分非目标区域去掉。At this point, most of the non-target areas in the figure can be removed.
优选的,在通过级联分类器,识别均衡化图像中的人体之前,方法还包括:通过对多组数据进行训练得到级联分类器,其中,多组数据中的每组数据均包括:样本图像,和用于标识样本图像是否包括人体的人体识别结果。需要说明的是,在对训练级联分类器的样本图像进行选择时,可以选择一些具体代表性的场景的图像,这样可以使得训练出的级联分类器对图像中的人体进行识别时,得到的识别结果更为准确。Preferably, before the human body in the equalized image is identified by the cascade classifier, the method further includes: obtaining a cascade classifier by training the plurality of sets of data, wherein each of the plurality of sets of data includes: a sample An image, and a human body recognition result for identifying whether the sample image includes a human body. It should be noted that, when selecting a sample image of the training cascade classifier, an image of some specific representative scenes may be selected, so that the trained cascade classifier can identify the human body in the image. The recognition result is more accurate.
需要说明的是,级联分类器的获取过程其核心是通过迭代的方法从大量的Haar-like特征中找到很小很关键的一部分特征,并用该特征产生有效的分类器,利 用大量的分类能力一般的弱分类器通过一定的方法叠加起来构成一个分类能力很强的分类器,再将这些分类器进行级联得到最后的强分类器。It should be noted that the core of the acquisition process of the cascaded classifier is to find a small and critical part of the feature from a large number of Haar-like features by iterative method, and use this feature to generate an effective classifier, using a large number of classification capabilities. The general weak classifiers are superimposed by a certain method to form a classifier with strong classification ability, and then these classifiers are cascaded to obtain the final strong classifier.
其中,Haar-like矩形特征是用于物体检测的数字图像特征。图3是根据本发明实施例的Haar特征模板的示意图,如图3所示,这类矩形特征模板由两个或多个全等的黑白矩形相邻组合而成,而矩形特征值是白色矩形的灰度值的和减去黑色矩形的灰度值的和,矩形特征对一些简单的图形结构,如线段、边缘比较敏感。如果把这样的矩形放在一个非人脸区域,那么计算出的特征值应该和人脸特征值不一样,所以这些矩形就是为了把人脸特征量化,以区分人脸和非人脸。Among them, the Haar-like rectangle feature is a digital image feature for object detection. 3 is a schematic diagram of a Haar feature template according to an embodiment of the present invention. As shown in FIG. 3, such a rectangular feature template is composed of two or more congruent black and white rectangles adjacent to each other, and the rectangular feature value is a white rectangle. The sum of the gray value and the gray value of the black rectangle. The rectangle feature is sensitive to some simple graphic structures such as line segments and edges. If such a rectangle is placed in a non-face area, the calculated feature values should be different from the face feature values, so these rectangles are used to quantify face features to distinguish between faces and non-faces.
之所以选择基于特征的方法而没有选择基于像素的方法是因为,在给定的有限的数据样本情况下,基于特征的检测不但能够编码特定区域的状态,而且通过基于特征设计的系统远比基于像素的系统快。The feature-based approach was chosen without the pixel-based approach because, in the case of a given finite data sample, feature-based detection can not only encode the state of a particular region, but also be based on a feature-based system. The pixel system is fast.
优选的,弱分类器在基于Haar-like矩形特征下的识别函数可以为:Preferably, the recognition function of the weak classifier under the Haar-like rectangular feature may be:
Figure PCTCN2018079856-appb-000007
Figure PCTCN2018079856-appb-000007
其中,g haar(x)用于标识基于人体特征x确定均衡化图像是否包括人体的识别结果,f j(x)是特征值;θ j是弱分类器的阈值;j用于标识第j个弱分类器;α和β是分类结果的置信度,取值范围为[-1,+1],为负则不是人体,为正则是人体。 Where g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x, f j (x) is a feature value; θ j is a threshold value of the weak classifier; j is used to identify the jth Weak classifier; α and β are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
需要说明的是,对于Haar-like弱分类器,一个弱分类器对应一个Haar-like矩形特征。Haar-like特征的弱分类器形式如上描述所示。通过OpenCV训练好的分类器中含有一系列特征阈值。判断截取图像是否通过分类器,需要计算图像在所有Haar-like特征模板下的Haar-like特征值,并对比分类器中相应特征模板的阈值。It should be noted that for the Haar-like weak classifier, a weak classifier corresponds to a Haar-like rectangle feature. The weak classifier form of the Haar-like feature is as described above. The classifier trained by OpenCV contains a series of feature thresholds. To determine whether the intercepted image passes the classifier, it is necessary to calculate the Haar-like eigenvalues of the image under all Haar-like feature templates, and compare the thresholds of the corresponding feature templates in the classifier.
根据本发明实施例的另一方面,还提供了一种监控视频中人数检测的装置,图4是根据本发明实施例的监控视频中人数检测装置的结构框图,如图4所示,该监控视频中人数检测装置包括:第一得到模块44、识别模块46、统计模块48。下面对该监控视频中人数检测装置进行详细说明。According to another aspect of the present invention, there is also provided a device for monitoring the number of people in a video. FIG. 4 is a structural block diagram of a person detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 4, the monitoring is performed. The number of people in the video detecting device includes a first obtaining module 44, an identifying module 46, and a statistic module 48. The number of people detecting devices in the monitoring video will be described in detail below.
第一得到模块44,用于对采集到的监控视频中的每一帧图像进行直方图均衡化,得到直方图均衡化后的均衡化图像;The first obtaining module 44 is configured to perform histogram equalization on each frame image of the collected monitoring video to obtain a histogram equalized equalized image;
识别模块46,与上述第一得到模块44相连,用于通过级联分类器,识别均衡化 图像中的人体,其中,级联分类器用于根据直方图均衡化后的均衡化图像的人体特征识别出人体;The identification module 46 is connected to the first obtaining module 44 for identifying the human body in the equalized image by using a cascade classifier, wherein the cascade classifier is used for human body feature recognition of the equalized image after the histogram equalization Out of the body;
统计模块48,与上述识别模块46相连,用于对识别出的人体进行统计。The statistics module 48 is connected to the identification module 46 for counting the identified human body.
图5是根据本发明实施例的监控视频中人数检测装置的优化结构框图一,如图5所示,该监控视频中人数检测装置除含图4中所有结构外,还包括:灰度化模块52。下面对该灰度化模块52进行详细说明。FIG. 5 is a block diagram of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 5, the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4, a graying module. 52. The gradation module 52 will be described in detail below.
灰度化模块52,与第一得到模块44相连,用于在采集的监控视频为彩色的情况下,在对采集到的监控视频中的每一帧图像进行直方图均衡化之前,将彩色的监控视频中的每一帧图像进行灰度化。The graying module 52 is connected to the first obtaining module 44, and is configured to: before the collected monitoring video is colored, in the case of performing histogram equalization on each frame of the collected monitoring video, the coloring is performed. Each frame of the video in the surveillance video is grayed out.
图6是根据本发明实施例的监控视频中人数检测装置的优化结构框图二,如图6所示,该监控视频中人数检测装置除含图4中所有结构外,还包括:第二得到模块62。下面对该第二得到模块62进行详细说明。FIG. 6 is a block diagram 2 of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 6 , the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4 , a second obtaining module. 62. The second obtaining module 62 will be described in detail below.
第二得到模块62,与上述第一得到模块44和识别模块46相连,用于在通过级联分类器,识别均衡化图像中的人体之前,利用拉普拉斯算子提取每一帧图像中的高频成份,并对高频成份赋于权值,得到增强后的高频成份;将增强后的高频成份叠加于直方图均衡化后的图像中,得到增强后的均衡化图像。The second obtaining module 62 is connected to the first obtaining module 44 and the identifying module 46 for extracting each frame image by using a Laplacian before identifying the human body in the equalized image by the cascade classifier. The high frequency component is assigned to the high frequency component to obtain the enhanced high frequency component; the enhanced high frequency component is superimposed on the histogram equalized image to obtain the enhanced equalized image.
图7是根据本发明实施例的监控视频中人数检测装置的优化结构框图三,如图7所示,该监控视频中人数检测装置除含图4中所有结构外,还包括:获得模块72。下面对该获得模块72进行详细说明。FIG. 7 is a block diagram 3 of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 7 , the number of detecting devices in the monitoring video includes: obtaining module 72 in addition to all the structures in FIG. 4 . The obtaining module 72 will be described in detail below.
获得模块72,与上述第一得到模块44和识别模块46相连,用于在通过级联分类器,识别均衡化图像中的人体之前,利用Canny算子对直方图均衡化后的均衡化图像进行边缘检测,获得直方图均衡化后的均衡化图像中所包括的轮廓;确定直方图均衡化后的均衡化图像中所包括的轮廓的面积;将面积大于预定阈值的非目标轮廓使用漫水进行填充,获得包括了非目标轮廓的均衡化图像,其中,非目标轮廓为识别人体的非目标区域。The obtaining module 72 is connected to the first obtaining module 44 and the identifying module 46, and is configured to perform the histogram equalized equalized image by using the Canny operator before identifying the human body in the equalized image by the cascade classifier. Edge detection, obtaining a contour included in the histogram equalized equalized image; determining an area of a contour included in the histogram equalized equalized image; and using a diffused water for the non-target contour having an area larger than a predetermined threshold Filling, obtaining an equalized image including a non-target contour, wherein the non-target contour is a non-target area that identifies the human body.
图8是根据本发明实施例的监控视频中人数检测装置的优化结构框图四,如图8所示,该监控视频中人数检测装置除含图4中所有结构外,还包括:第三得到模块82。下面对该第三得到模块82进行详细说明。FIG. 8 is a block diagram of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 8 , the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4 , a third obtaining module. 82. The third obtaining module 82 will be described in detail below.
第三得到模块82,与上述识别模块46相连,用于在通过级联分类器,识别均衡化图像中的人体之前,通过对多组数据进行训练得到级联分类器,其中,多组数据中 的每组数据均包括:样本图像,和用于标识样本图像是否包括人体的人体识别结果。The third obtaining module 82 is connected to the foregoing identifying module 46, and is configured to obtain a cascade classifier by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier, wherein the plurality of data sets Each set of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
图9是根据本发明实施例的监控视频中人数检测的优化方法的流程图,如图9所示,该方法包括如下步骤:FIG. 9 is a flowchart of an optimization method for monitoring a number of people in a video according to an embodiment of the present invention. As shown in FIG. 9, the method includes the following steps:
步骤S902,级联分类器训练;Step S902, cascading classifier training;
其中,利用OpenCV训练基于Haar-like特征的级联分类器,需要提供所要识别特征的相应正例样本图片和反例样本图片。正例样本为人的全身像的样本图片;反例样本为背景图片,要求不能有人,长宽比例是1∶2。利用OpenCV提供的相应的程序训练出级联分类器,进行提取特征和训练分类器,训练出的分类器模型就可以对这些事物进行识别。Among them, using OpenCV to train the cascading classifier based on Haar-like features, it is necessary to provide corresponding positive sample pictures and counter sample pictures of the features to be identified. The positive sample is a sample picture of a human body image; the negative sample is a background image, which requires no one, and the aspect ratio is 1:2. The cascaded classifier is trained by the corresponding program provided by OpenCV to extract features and train classifiers, and the trained classifier model can identify these things.
步骤S904,行人检测与人数统计。Step S904, pedestrian detection and population statistics.
其中,该步骤的具体流程为:The specific process of this step is:
(1)输入的待检测数据是实时的视频,对输入视频的每一帧图像进行灰度化处理;(1) The input data to be detected is a real-time video, and grayscale processing is performed on each frame of the input video;
(2)采取改进直方图均衡化方法对图像增强;(2) Adopting an improved histogram equalization method to enhance the image;
(3)使用漫水填充方法填充轮廓较大的区域;(3) filling the area with a large contour by using the flood filling method;
(4)然后对预处理的图像进行特征提取,调用OpenCV中CascadeClassifier类的相关方法,提取Haar-like特征;(4) Then extract the feature of the preprocessed image, call the related method of the CascadeClassifier class in OpenCV, and extract the Haar-like feature;
(5)载入训练好的Haar特征分类器,对输入的每一帧图像进行检测,对检测到的行人用黄色的矩形框标注;(5) Loading the trained Haar feature classifier, detecting each frame of the input image, and marking the detected pedestrian with a yellow rectangular frame;
(6)筛选分类器的检测结果,并进行人数计数,在原图标注出行人位置,最后显示人数的数量。(6) Screening the test results of the classifier, and counting the number of people, marking the pedestrian position in the original icon, and finally displaying the number of people.
其中,在目标筛选过程中,由于目标区域较小,所以分类器检测出来的结果可能看有较大的区域,于是通过遍历分类器的检测结果(矩形框),将目标矩形框高度大于阈值的剔除,进而提高检测准确率。In the target screening process, since the target area is small, the result detected by the classifier may have a larger area, so by traversing the detection result of the classifier (rectangular frame), the height of the target rectangular frame is greater than the threshold. Eliminate, thereby improving detection accuracy.
相对于相关技术中,采用实时采集视频流,基于得到的视频流通过线采样获得多幅原始采样图样本,以及速度采样图样本;对于得到的速度采样图样本进行时空矫正;基于原始采样图和速度采样图,离线训练得到深度学习模型,深度学习模型包括分类模型和统计模型;利用得到的深度学习模型对于实时视频流进行人群状态分析。本发明实施例对于不同环境、光照强度、天气情况以及摄像头角度均具有良好的适应性;对于大流量人群涌出等人群拥挤环境,可以保证较高的准确率;计算量小,可以满足 实时视频处理的要求,能够广泛地应用于对于公交、地铁和广场等滞留人群密集的公共场所的监控和管理。Compared with the related art, a real-time video stream is collected, and a plurality of original sample map samples and a velocity sample map sample are obtained by line sampling based on the obtained video stream; space-time correction is performed on the obtained velocity sample map sample; based on the original sample map and The velocity sampling map, the offline training obtains the deep learning model, the deep learning model includes the classification model and the statistical model; and the obtained deep learning model is used for the population state analysis of the real-time video stream. The embodiments of the invention have good adaptability to different environments, light intensity, weather conditions and camera angles; for a crowded environment such as a large flow of people, a high accuracy rate can be ensured; the calculation amount is small, and the real-time video can be satisfied. The requirements for processing can be widely applied to the monitoring and management of public places densely populated by buses, subways and plazas.
在相关技术中,针对人群的密度估计以及运动分析做了一定的研究,在人群密度估计方面,对于低密度人群,采用像素统计的方法估计人群密度,对于高密度的人群,利用小波包分解对人群图像进行多尺度分析,最后利用支持向量机SVM(Support Vector Machine)对人群密度等级进行分类;在人群的运动分析上,使用基于以平均绝对误差为匹配准则的全搜索算法的块匹配方法对人群运动速度进行估计。In the related art, some research has been done on the population density estimation and motion analysis. In the population density estimation, for the low-density population, the pixel density method is used to estimate the population density. For the high-density population, the wavelet packet decomposition is used. The crowd image was analyzed by multi-scale. Finally, the population density level was classified by SVM (Support Vector Machine). In the motion analysis of the crowd, the block matching method based on the full search algorithm with the average absolute error as the matching criterion was used. The speed of the crowd is estimated.
在另一些相关技术中,通过分析人群图像的频谱图,发现不同人群密度的图像所对应的频谱图像具有明显的不同,并依此将人群的频谱图视为纹理图像,对人群的频谱图采用纹理分析的方法提取纹理特征进行分析,最后利用Adaboost分类器实现人群密度级别的分类。In other related technologies, by analyzing the spectrogram of the crowd image, it is found that the spectrum images corresponding to the images of different population densities are significantly different, and the spectrum map of the crowd is regarded as the texture image, and the spectrum map of the population is adopted. The texture analysis method extracts the texture features for analysis, and finally uses the Adaboost classifier to classify the population density level.
当前在人群密度估计及运动分析方面的研究重点,是解决当行人数量比较大时,人群中存在大量的遮挡,难以准确地对人群中的单个行人进行准确的检测、分割和跟踪的问题,如何在不进行单个目标的检测、跟踪的前提下,利用前景图像整体的特征,通过有效的统计学习方法建立合理的判决规则,直接估计行人的数量,且判定目标的运动状态,并检测异常事件的发生。The current research focus on population density estimation and motion analysis is to solve the problem that when there are a large number of pedestrians, there is a large amount of occlusion in the crowd, and it is difficult to accurately detect, segment and track individual pedestrians in the crowd accurately. Under the premise of not detecting and tracking a single target, using the characteristics of the foreground image as a whole, an effective statistical learning method is used to establish a reasonable decision rule, directly estimating the number of pedestrians, and determining the motion state of the target, and detecting the abnormal event. occur.
但在上述相关技术中的解决方案只适用于街头、室内等近距离的监控设备的条件,这样的环境下检测的目标显示分辨率高、区域较大,识别难度相对较低,但是在远距离检测的条件下并不适用,这样的条件所拍摄的场景面积大,目标人物相对就显得非常小、并且十分模糊,检测难度更大,以上的方法就不适用了。However, the solution in the above related art is only applicable to the conditions of a close-range monitoring device such as a street or an indoor. In such an environment, the target display resolution is high, the area is large, and the recognition difficulty is relatively low, but at a long distance. Under the conditions of detection, it is not applicable. The scenes shot under such conditions are large, the target characters are very small and very fuzzy, and the detection is more difficult. The above methods are not applicable.
而通过上述实施例及优选实施方式,采用采集拍摄的监控视频,对采集的监控视频中的每一帧图像直方图均衡化,得到直方图均衡化后的图像,并通过级联分类器识别直方图均衡化后的图像中的人体,再对从监控视频中的每一帧图像中识别出的人体进行统计的方式,通过设计出滤波和增强的算法,及通过测试几种常用的分类器算法,得出检测效果最好的Haar分类器,并不断优化,达到在远距离监控中检测人像的目的,实现了远距离监控中较为准确地检测和统计人数的技术效果,进而解决了由于现有技术无法在远距离监控下实现人群异常状态的检测造成的公共场所安全无法保障的技术问题。According to the foregoing embodiment and the preferred embodiment, the monitoring video of the captured video is used to equalize the histogram of each frame in the collected monitoring video to obtain a histogram equalized image, and the histogram is identified by the cascade classifier. The human body in the image after equalization is used to statistically analyze the human body identified from each frame of the video in the surveillance video, by designing filtering and enhancement algorithms, and by testing several commonly used classifier algorithms. The Haar classifier with the best detection effect is obtained and continuously optimized to achieve the purpose of detecting portraits in remote monitoring, and the technical effect of detecting and counting the number of people more accurately in remote monitoring is realized, thereby solving the existing The technology cannot solve the technical problem that the safety of public places cannot be guaranteed due to the detection of abnormal state of the people under remote monitoring.
此发明实施例解决的问题是云端号在运行过程中,云端号的监控摄像头会重点监控地面上一些重要场所,通过人数的统计,来判定场所的安全状况,起到人流密集的警示作用,并充分保证预警的实效性和准确性。The problem solved by the embodiment of the present invention is that during the running of the cloud number, the monitoring camera of the cloud number will focus on monitoring some important places on the ground, and the number of people is counted to determine the security status of the place, and the warning effect of the crowd is intensive. Fully guarantee the effectiveness and accuracy of the warning.
本发明实施例可应用于统计人群数量的系统,能在远距离监控的环境下(比如云端号、无人机等设备对地监控),实现行人的检测,统计行人的数量,当人数较多时可作出某些的预警。The embodiment of the present invention can be applied to a system for counting the number of people, and can be used in a remote monitoring environment (such as cloud number, drone, etc.) to implement pedestrian detection, and count the number of pedestrians. Some warnings can be made.
图10是根据本发明实施例的基于级联分类器的人数检测流程示意图,如图所示,实现图像的人数检测,需要使用检测人体特征的特征分类器,通过分类器可将识别图像中含有人体特征的部分检测出来,当检测到一个含有人的特征时,人数计数器自动加1。本发明实施例是使用级联分类器进行人体检测的,相应的程序是基于OpenCV库中关于预处理和级联分类器的接口函数设计的。10 is a schematic diagram of a process for detecting a number of people based on a cascaded classifier according to an embodiment of the present invention. As shown in the figure, to implement the detection of the number of people of an image, it is necessary to use a feature classifier for detecting human body features, and the classifier can include the identification image. The part of the human body is detected, and when a feature containing a person is detected, the number counter is automatically incremented by one. In the embodiment of the present invention, the cascading classifier is used for human body detection, and the corresponding program is designed based on the interface function of the pre-processing and cascading classifier in the OpenCV library.
根据本发明实施例的另一方面,还提供了一种机器人,该机器人包含上述任意一项的监控视频中人数检测装置。According to another aspect of an embodiment of the present invention, there is also provided a robot comprising the number of persons detecting means in the monitoring video of any of the above.
根据本发明实施例的另一方面,还提供了一种存储介质,该存储介质包括存储的程序,其中,在程序运行时控制存储介质所在设备执行上述任意一项的监控视频中人数检测方法。According to another aspect of the embodiments of the present invention, there is further provided a storage medium, comprising: a stored program, wherein the device in which the storage medium is located controls the number of people in the monitoring video in any one of the above-mentioned items when the program is running.
根据本发明实施例的另一方面,还提供了一种处理器,该处理器用于运行程序,其中,程序运行时执行上述任意一项的监控视频中人数检测方法。According to another aspect of an embodiment of the present invention, there is further provided a processor for running a program, wherein the program is executed while performing the number of people in the monitoring video of any of the above.
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments.
在本发明的上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments of the present invention, the descriptions of the various embodiments are different, and the parts that are not detailed in a certain embodiment can be referred to the related descriptions of other embodiments.
在本中请所提供的几个实施例中,应该理解到,所揭露的技术内容,可通过其它的方式实现。其中,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,可以为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,单元或模块的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided herein, it should be understood that the disclosed technical contents may be implemented in other manners. The device embodiments described above are only schematic. For example, the division of the unit may be a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, unit or module, and may be electrical or otherwise.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成 的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium. A number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention. The foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .
以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。The above description is only a preferred embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It should be considered as the scope of protection of the present invention.

Claims (17)

  1. 一种监控视频中人数检测方法,其特征在于,包括:A method for detecting a number of people in a surveillance video, characterized in that it comprises:
    对采集到的监控视频中的每一帧图像进行直方图均衡化,得到直方图均衡化后的均衡化图像;Histogram equalization is performed on each frame image of the collected monitoring video to obtain a histogram equalized equalized image;
    通过级联分类器,识别所述均衡化图像中的人体,其中,所述级联分类器用于根据直方图均衡化后的所述均衡化图像的人体特征识别出人体;Recognizing a human body in the equalized image by a cascade classifier, wherein the cascade classifier is configured to identify a human body according to a human body feature of the equalized image after the histogram equalization;
    对识别出的人体进行统计。Statistics are made on the identified human body.
  2. 根据权利要求1所述的方法,其特征在于,所述级联分类器至少由两个弱分类器叠加而成,其中,所述级联分类器通过叠加的所述至少两个弱分类器根据所述直方图均衡化后的均衡化图像的人体特征识别出人体。The method according to claim 1, wherein the cascade classifier is formed by at least two weak classifiers, wherein the cascade classifier is based on the at least two weak classifiers superimposed The human body features of the histogram equalized image of the equalized image identify the human body.
  3. 根据权利要求1所述的方法,其特征在于,The method of claim 1 wherein
    在采集的所述监控视频为彩色的情况下,在对采集到的所述监控视频中的每一帧图像进行直方图均衡化之前,所述方法还包括:将彩色的所述监控视频中的每一帧图像进行灰度化;In the case that the collected monitoring video is colored, before the histogram equalization is performed on each of the collected monitoring videos, the method further includes: in the color of the monitoring video Each frame of image is grayscaled;
    对采集到的所述监控视频中的每一帧图像进行直方图均衡化包括:对进行灰度化后的每一帧图像进行直方图均衡化。Performing histogram equalization on each of the acquired images of the monitored video includes performing histogram equalization on each frame of the grayscaled image.
  4. 根据权利要求1所述的方法,其特征在于,The method of claim 1 wherein
    在通过所述级联分类器,识别所述均衡化图像中的人体之前,所述方法还包括:利用拉普拉斯算子提取所述每一帧图像中的高频成份,并对所述高频成份赋于权值,得到增强后的高频成份;将增强后的高频成份叠加于直方图均衡化后的图像中,得到增强后的均衡化图像;Before identifying the human body in the equalized image by the cascade classifier, the method further includes: extracting high frequency components in each frame image by using a Laplacian, and The high frequency component is assigned to the weight, and the enhanced high frequency component is obtained; the enhanced high frequency component is superimposed on the histogram equalized image to obtain the enhanced equalized image;
    通过所述级联分类器,识别所述均衡化图像中的人体包括:通过所述级联分类器,识别所述增强后的均衡化图像中的人体。Identifying the human body in the equalized image by the cascade classifier includes: identifying, by the cascade classifier, a human body in the enhanced equalized image.
  5. 根据权利要求1所述的方法,其特征在于,The method of claim 1 wherein
    在通过所述级联分类器,识别所述均衡化图像中的人体之前,所述方法还包括:利用Canny算子对所述直方图均衡化后的均衡化图像进行边缘检测,获得所述直方图均衡化后的均衡化图像中所包括的轮廓;确定所述直方图均衡化后的均衡化图像中所包括的轮廓的面积;将面积大于预定阈值的非目标轮廓使用漫水进 行填充,获得包括了所述非目标轮廓的均衡化图像,其中,所述非目标轮廓为识别人体的非目标区域;Before the human body in the equalized image is identified by the cascade classifier, the method further includes: performing edge detection on the histogram equalized equalized image by using a Canny operator to obtain the histogram a contour included in the equalized image after equalization; determining an area of a contour included in the histogram equalized equalized image; and filling a non-target contour having an area larger than a predetermined threshold with a flood water to obtain An equalized image of the non-target contour is included, wherein the non-target contour is a non-target area that identifies a human body;
    通过所述级联分类器,识别所述均衡化后的图像中的人体包括:通过所述级联分类器,识别包括了所述非目标轮廓的均衡化图像中的人体。Identifying the human body in the equalized image by the cascade classifier includes: identifying, by the cascade classifier, a human body in the equalized image including the non-target contour.
  6. 根据权利要求1所述的方法,其特征在于,在通过所述级联分类器,识别所述均衡化图像中的人体之前,所述方法还包括:The method according to claim 1, wherein before the identifying the human body in the equalized image by the cascade classifier, the method further comprises:
    通过对多组数据进行训练得到所述级联分类器,其中,所述多组数据中的每组数据均包括:样本图像,和用于标识所述样本图像是否包括人体的人体识别结果。The cascade classifier is obtained by training a plurality of sets of data, wherein each of the plurality of sets of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  7. 根据权利要求2至6中任一项所述的方法,其特征在于,所述弱分类器在基于Haar-like矩形特征下的识别函数为:The method according to any one of claims 2 to 6, wherein the identification function of the weak classifier under the Haar-like rectangular feature is:
    Figure PCTCN2018079856-appb-100001
    Figure PCTCN2018079856-appb-100001
    其中,g haar(x)用于标识基于人体特征x确定均衡化图像是否包括人体的识别结果,f j(x)是特征值;θ j是所述弱分类器的阈值;j用于标识第j个弱分类器;α和β是分类结果的置信度,取值范围为[-1,+1],为负则不是人体,为正则是人体。 Where g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x, f j (x) is a feature value; θ j is a threshold value of the weak classifier; j is used to identify j weak classifiers; α and β are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
  8. 一种监控视频中人数检测装置,其特征在于,包括:A monitoring device for monitoring a number of people in a video, comprising:
    第一得到模块,用于对采集到的监控视频中的每一帧图像进行直方图均衡化,得到直方图均衡化后的均衡化图像;a first obtaining module, configured to perform histogram equalization on each frame image of the collected monitoring video to obtain a histogram equalized equalized image;
    识别模块,用于通过级联分类器,识别所述均衡化图像中的人体,其中,所述级联分类器用于根据直方图均衡化后的所述均衡化图像的人体特征识别出人体;An identification module, configured to identify a human body in the equalized image by using a cascade classifier, wherein the cascade classifier is configured to identify a human body according to a human body feature of the equalized image after the histogram equalization;
    统计模块,用于对识别出的人体进行统计。A statistical module for counting the identified human body.
  9. 根据权利要求8所述的装置,其特征在于,所述级联分类器至少由两个弱分类器叠加而成,其中,所述级联分类器通过叠加的所述至少两个弱分类器根据所述直方图均衡化后的均衡化图像的人体特征识别出人体。The apparatus according to claim 8, wherein said cascade classifier is formed by at least two weak classifiers, wherein said cascade classifier is based on said at least two weak classifiers superimposed The human body features of the histogram equalized image of the equalized image identify the human body.
  10. 根据权利要求8所述的装置,其特征在于,The device of claim 8 wherein:
    所述装置还包括:灰度化模块,用于在采集的所述监控视频为彩色的情况下,在对采集到的所述监控视频中的每一帧图像进行直方图均衡化之前,将彩色的所述监控视频中的每一帧图像进行灰度化;The device further includes: a graying module, configured to: before the collected image of the monitored video is histogram equalized, in a case where the collected monitoring video is colored Each frame of the monitoring video is grayscaled;
    得到模块,用于对进行灰度化后的每一帧图像进行直方图均衡化。A module is obtained for performing histogram equalization on each frame of image after gradation.
  11. 根据权利要求8所述的装置,其特征在于,The device of claim 8 wherein:
    所述装置还包括:第二得到模块,用于在通过所述级联分类器,识别所述均衡化图像中的人体之前,利用拉普拉斯算子提取所述每一帧图像中的高频成份,并对所述高频成份赋于权值,得到增强后的高频成份;将增强后的高频成份叠加于直方图均衡化后的图像中,得到增强后的均衡化图像;The apparatus further includes: a second obtaining module, configured to extract, in the image of each frame by using a Laplacian, before identifying the human body in the equalized image by the cascade classifier a frequency component, and assigning a weight to the high frequency component to obtain an enhanced high frequency component; and superimposing the enhanced high frequency component on the histogram equalized image to obtain an enhanced equalized image;
    识别模块,用于通过所述级联分类器,识别所述增强后的均衡化图像中的人体。An identification module, configured to identify a human body in the enhanced equalized image by the cascade classifier.
  12. 根据权利要求8所述的装置,其特征在于,The device of claim 8 wherein:
    所述装置还包括:获得模块,用于在通过所述级联分类器,识别所述均衡化图像中的人体之前,利用Canny算子对所述直方图均衡化后的均衡化图像进行边缘检测,获得所述直方图均衡化后的均衡化图像中所包括的轮廓;确定所述直方图均衡化后的均衡化图像中所包括的轮廓的面积;将面积大于预定阈值的非目标轮廓使用漫水进行填充,获得包括了所述非目标轮廓的均衡化图像,其中,所述非目标轮廓为识别人体的非目标区域;The apparatus further includes: an obtaining module, configured to perform edge detection on the histogram equalized equalized image by using a Canny operator before identifying the human body in the equalized image by the cascade classifier Obtaining a contour included in the histogram equalized equalized image; determining an area of a contour included in the histogram equalized equalized image; and using a non-target contour having an area larger than a predetermined threshold Filling the water to obtain an equalized image including the non-target contour, wherein the non-target contour is a non-target area identifying the human body;
    识别模块,用于通过所述级联分类器,识别包括了所述非目标轮廓的均衡化图像中的人体。And an identification module, configured to identify a human body in the equalized image including the non-target contour by the cascade classifier.
  13. 根据权利要求8所述的装置,其特征在于,所述装置还包括:The device according to claim 8, wherein the device further comprises:
    第三得到模块,用于在通过所述级联分类器,识别所述均衡化图像中的人体之前,通过对多组数据进行训练得到所述级联分类器,其中,所述多组数据中的每组数据均包括:样本图像,和用于标识所述样本图像是否包括人体的人体识别结果。a third obtaining module, configured to obtain the cascade classifier by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier, wherein the plurality of sets of data are Each set of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  14. 根据权利要求9至13中任一项所述的装置,其特征在于,所述弱分类器在基于Haar-like矩形特征下的识别函数为:The apparatus according to any one of claims 9 to 13, wherein the identification function of the weak classifier under the Haar-like rectangular feature is:
    Figure PCTCN2018079856-appb-100002
    Figure PCTCN2018079856-appb-100002
    其中,g haar(x)用于标识基于人体特征x确定均衡化图像是否包括人体的识别结果,f j(x)是特征值;θ j是所述弱分类器的阈值;j用于标识第j个弱分类器;α和β是分类结果的置信度,取值范围为[-1,+1],为负则不是人体,为正则是人体。 Where g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x, f j (x) is a feature value; θ j is a threshold value of the weak classifier; j is used to identify j weak classifiers; α and β are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
  15. 一种机器人,其特征在于,所述机器人包含权利要求8至14中任意一项所述的监控视频中人数检测装置。A robot, characterized in that the robot comprises the number of persons detecting means in the monitoring video according to any one of claims 8 to 14.
  16. 一种存储介质,其特征在于,所述存储介质包括存储的程序,其中,在所述程序运行时控制所述存储介质所在设备执行权利要求1至7中任意一项所述的监控视频中人数检测方法。A storage medium, comprising: a stored program, wherein, when the program is running, controlling a device in which the storage medium is located performs the number of monitoring videos according to any one of claims 1 to 7. Detection method.
  17. 一种处理器,其特征在于,所述处理器用于运行程序,其中,所述程序运行时执行权利要求1至7中任意一项所述的监控视频中人数检测方法。A processor, wherein the processor is configured to execute a program, wherein the program is executed to execute the method for detecting a number of people in the monitoring video according to any one of claims 1 to 7.
PCT/CN2018/079856 2017-12-12 2018-03-21 Head count detection method and device in surveillance video WO2019114145A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711327492.5 2017-12-12
CN201711327492.5A CN109918971B (en) 2017-12-12 2017-12-12 Method and device for detecting number of people in monitoring video

Publications (1)

Publication Number Publication Date
WO2019114145A1 true WO2019114145A1 (en) 2019-06-20

Family

ID=66818939

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/079856 WO2019114145A1 (en) 2017-12-12 2018-03-21 Head count detection method and device in surveillance video

Country Status (2)

Country Link
CN (1) CN109918971B (en)
WO (1) WO2019114145A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717885A (en) * 2019-09-02 2020-01-21 平安科技(深圳)有限公司 Customer number counting method and device, electronic equipment and readable storage medium
CN110807424A (en) * 2019-11-01 2020-02-18 深圳市科卫泰实业发展有限公司 Port ship comparison method based on aerial images
CN112327893A (en) * 2019-08-05 2021-02-05 旭日蓝天(武汉)科技有限公司 AI (Artificial intelligence) technology-based unmanned aerial vehicle flight control front-end people counting system and method
CN114399728A (en) * 2021-12-17 2022-04-26 燕山大学 Method for counting crowds in foggy day scene

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110852814A (en) * 2020-01-14 2020-02-28 深圳惠通天下信息技术有限公司 Advertisement delivery self-service system and method
CN111474914B (en) * 2020-02-24 2020-12-01 安徽省华腾农业科技有限公司经开区分公司 Fault code big data storage system
CN111931567A (en) * 2020-07-01 2020-11-13 珠海大横琴科技发展有限公司 Human body recognition method and device, electronic equipment and storage medium
CN114362988B (en) * 2021-09-29 2023-06-20 中国科学院计算机网络信息中心 Network traffic identification method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130071033A1 (en) * 2011-09-21 2013-03-21 Tandent Vision Science, Inc. Classifier for use in generating a diffuse image
CN104298969A (en) * 2014-09-25 2015-01-21 电子科技大学 Crowd scale statistical method based on color and HAAR feature fusion
CN104463191A (en) * 2014-10-30 2015-03-25 华南理工大学 Robot visual processing method based on attention mechanism
KR20160037424A (en) * 2014-09-29 2016-04-06 동명대학교산학협력단 A Novel Multi-view Face Detection Method Based on Improved Real Adaboost Algorithm
CN105893946A (en) * 2016-03-29 2016-08-24 中国科学院上海高等研究院 Front face image detection method

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101340510B (en) * 2008-08-07 2010-06-23 中兴通讯股份有限公司 Method for video enhancement and apparatus thereof
CN106326839A (en) * 2016-08-11 2017-01-11 中防通用河北电信技术有限公司 People counting method based on drill video stream
CN106446872A (en) * 2016-11-07 2017-02-22 湖南源信光电科技有限公司 Detection and recognition method of human face in video under low-light conditions
CN106778683A (en) * 2017-01-12 2017-05-31 西安电子科技大学 Based on the quick Multi-angle face detection method for improving LBP features

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130071033A1 (en) * 2011-09-21 2013-03-21 Tandent Vision Science, Inc. Classifier for use in generating a diffuse image
CN104298969A (en) * 2014-09-25 2015-01-21 电子科技大学 Crowd scale statistical method based on color and HAAR feature fusion
KR20160037424A (en) * 2014-09-29 2016-04-06 동명대학교산학협력단 A Novel Multi-view Face Detection Method Based on Improved Real Adaboost Algorithm
CN104463191A (en) * 2014-10-30 2015-03-25 华南理工大学 Robot visual processing method based on attention mechanism
CN105893946A (en) * 2016-03-29 2016-08-24 中国科学院上海高等研究院 Front face image detection method

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112327893A (en) * 2019-08-05 2021-02-05 旭日蓝天(武汉)科技有限公司 AI (Artificial intelligence) technology-based unmanned aerial vehicle flight control front-end people counting system and method
CN110717885A (en) * 2019-09-02 2020-01-21 平安科技(深圳)有限公司 Customer number counting method and device, electronic equipment and readable storage medium
CN110807424A (en) * 2019-11-01 2020-02-18 深圳市科卫泰实业发展有限公司 Port ship comparison method based on aerial images
CN110807424B (en) * 2019-11-01 2024-02-02 深圳市科卫泰实业发展有限公司 Port ship comparison method based on aerial image
CN114399728A (en) * 2021-12-17 2022-04-26 燕山大学 Method for counting crowds in foggy day scene
CN114399728B (en) * 2021-12-17 2023-12-05 燕山大学 Foggy scene crowd counting method

Also Published As

Publication number Publication date
CN109918971B (en) 2024-01-05
CN109918971A (en) 2019-06-21

Similar Documents

Publication Publication Date Title
WO2019114145A1 (en) Head count detection method and device in surveillance video
CN107622258B (en) Rapid pedestrian detection method combining static underlying characteristics and motion information
Li et al. 3D parallel fully convolutional networks for real-time video wildfire smoke detection
Subburaman et al. Counting people in the crowd using a generic head detector
US9158985B2 (en) Method and apparatus for processing image of scene of interest
CN103824070B (en) A kind of rapid pedestrian detection method based on computer vision
CN106056079B (en) A kind of occlusion detection method of image capture device and human face five-sense-organ
Bayona et al. Comparative evaluation of stationary foreground object detection algorithms based on background subtraction techniques
CN109359577B (en) System for detecting number of people under complex background based on machine learning
CN109190456B (en) Multi-feature fusion overlook pedestrian detection method based on aggregated channel features and gray level co-occurrence matrix
CN111383244A (en) Target detection tracking method
CN106909883A (en) A kind of modularization hand region detection method and device based on ROS
Su et al. A new local-main-gradient-orientation HOG and contour differences based algorithm for object classification
CN104899559B (en) A kind of rapid pedestrian detection method based on video monitoring
Patel et al. Top-down and bottom-up cues based moving object detection for varied background video sequences
Ryan et al. Scene invariant crowd counting
KR20200060868A (en) multi-view monitoring system using object-oriented auto-tracking function
Ryan et al. Scene invariant crowd counting and crowd occupancy analysis
CN108875572B (en) Pedestrian re-identification method based on background suppression
CN107341456B (en) Weather sunny and cloudy classification method based on single outdoor color image
CN114694090A (en) Campus abnormal behavior detection method based on improved PBAS algorithm and YOLOv5
Chung et al. Face detection and posture recognition in a real time tracking system
Yu et al. A crowd flow estimation method based on dynamic texture and GRNN
De Siqueira et al. Adaptive detection of human skin in color images
Fradi et al. Crowd context-dependent privacy protection filters

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18887712

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18887712

Country of ref document: EP

Kind code of ref document: A1