WO2019114145A1 - Procédé et dispositif de détection du nombre de personnes présentes dans une vidéo de surveillance - Google Patents

Procédé et dispositif de détection du nombre de personnes présentes dans une vidéo de surveillance Download PDF

Info

Publication number
WO2019114145A1
WO2019114145A1 PCT/CN2018/079856 CN2018079856W WO2019114145A1 WO 2019114145 A1 WO2019114145 A1 WO 2019114145A1 CN 2018079856 W CN2018079856 W CN 2018079856W WO 2019114145 A1 WO2019114145 A1 WO 2019114145A1
Authority
WO
WIPO (PCT)
Prior art keywords
human body
image
equalized
equalized image
cascade classifier
Prior art date
Application number
PCT/CN2018/079856
Other languages
English (en)
Chinese (zh)
Inventor
刘若鹏
钟凯宇
Original Assignee
深圳光启合众科技有限公司
深圳光启创新技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳光启合众科技有限公司, 深圳光启创新技术有限公司 filed Critical 深圳光启合众科技有限公司
Publication of WO2019114145A1 publication Critical patent/WO2019114145A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast

Definitions

  • the present invention relates to the field of video detection, and in particular to a method and apparatus for detecting a number of people in a surveillance video.
  • the video-based abnormal state detection of the crowd refers to the intelligent analysis of the behavioral state of the mass incidents in the public places of large-scale people, and judges whether there are detection methods such as crowd trampling, fighting, riots and other abnormal events.
  • detection methods such as crowd trampling, fighting, riots and other abnormal events.
  • the research on intelligent monitoring systems at home and abroad is at the initial stage of development, and there are few products that can be truly applied in real life.
  • the scientific research results obtained at this stage are only applicable to the conditions of close-range monitoring equipment such as street and indoor.
  • the target display resolution is high, the area is large, and the recognition difficulty is relatively low, but at a long distance.
  • the scenes shot under such conditions are large, and the target characters are very small and very fuzzy, and the detection is more difficult, so the above scientific research results are not applicable.
  • the embodiments of the present invention provide a method and a device for detecting a number of people in a surveillance video, so as to at least solve the technical problem that the security of a public place cannot be guaranteed due to the failure of the prior art to detect the abnormal state of the crowd under remote monitoring.
  • a method for monitoring a number of people in a video including: performing histogram equalization on each frame image of the collected monitoring video to obtain equalization after histogram equalization. Identifying a human body in the equalized image by a cascade classifier, wherein the cascade classifier is configured to identify a human body according to a human body feature of the equalized image after the histogram equalization; The human body performs statistics.
  • the cascade classifier is formed by at least two weak classifiers, wherein the cascade classifier is equalized according to the histogram equalization by the at least two weak classifiers that are superimposed
  • the human body features of the image identify the human body.
  • the method further includes: Performing grayscale on each frame image in the monitoring video; performing histogram equalization on each frame image of the collected monitoring video includes: performing histogram equalization on each frame image after graying .
  • the method further includes: extracting high frequency components in each frame image by using a Laplacian, And applying the weight to the high frequency component to obtain the enhanced high frequency component; superimposing the enhanced high frequency component on the histogram equalized image to obtain an enhanced equalized image;
  • the classifier, identifying the human body in the equalized image includes: identifying, by the cascade classifier, a human body in the enhanced equalized image.
  • the method further includes: performing edge detection on the histogram equalized equalized image by using a Canny operator, Obtaining a contour included in the histogram equalized equalized image; determining an area of a contour included in the histogram equalized equalized image; and using a non-target contour having an area larger than a predetermined threshold Performing a filling to obtain an equalized image including the non-target contour, wherein the non-target contour is a non-target region identifying a human body; and the human body in the equalized image is identified by the cascade classifier
  • the method includes: identifying, by the cascade classifier, a human body in the equalized image including the non-target contour.
  • the method further includes: obtaining the cascading classifier by training a plurality of sets of data, where the multiple Each set of data in the group data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  • the identification function of the weak classifier under the Haar-like rectangular feature is:
  • g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x
  • f j (x) is a feature value
  • ⁇ j is a threshold value of the weak classifier
  • j is used to identify j weak classifiers
  • ⁇ and ⁇ are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
  • an apparatus for monitoring the number of people in a video including: a first obtaining module, configured to perform histogram equalization on each frame image in the collected monitoring video, Obtaining a histogram equalized equalized image; an identifying module, configured to identify a human body in the equalized image by using a cascade classifier, wherein the cascaded classifier is configured to perform equalization according to a histogram
  • the human body features of the equalized image identify the human body; the statistical module is used to perform statistics on the identified human body.
  • the cascade classifier is formed by at least two weak classifiers, wherein the cascade classifier is equalized according to the histogram equalization by the at least two weak classifiers that are superimposed
  • the human body features of the image identify the human body.
  • the device further includes: a grayscale module, configured to perform histogram equalization on each frame of the collected monitoring video in a case where the collected monitoring video is colored Previously, each frame of the color of the surveillance video is grayscaled; and a module is obtained for performing histogram equalization on each frame of the grayscale image.
  • a grayscale module configured to perform histogram equalization on each frame of the collected monitoring video in a case where the collected monitoring video is colored Previously, each frame of the color of the surveillance video is grayscaled; and a module is obtained for performing histogram equalization on each frame of the grayscale image.
  • the device further includes: a second obtaining module, configured to extract each frame by using a Laplacian before identifying the human body in the equalized image by using the cascade classifier a high-frequency component in the image, and assigning a weight to the high-frequency component to obtain an enhanced high-frequency component; and superimposing the enhanced high-frequency component on the histogram-equalized image to obtain an enhanced equalization
  • an identification module configured to identify a human body in the enhanced equalized image by the cascade classifier.
  • the device further includes: an obtaining module, configured to equalize the histogram equalization by using a Canny operator before identifying the human body in the equalized image by using the cascade classifier Performing edge detection on the image to obtain an outline included in the histogram equalized equalized image; determining an area of the contour included in the histogram equalized equalized image; and determining an area larger than a predetermined threshold
  • the target contour is filled with the water, and the equalized image including the non-target contour is obtained, wherein the non-target contour is a non-target area for identifying the human body; and the identification module is configured to identify by the cascade classifier A human body in the equalized image of the non-target contour is included.
  • the device further includes: a third obtaining module, configured to obtain the cascading classification by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier And each of the plurality of sets of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  • a third obtaining module configured to obtain the cascading classification by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier
  • each of the plurality of sets of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  • the identification function of the weak classifier under the Haar-like rectangular feature is:
  • g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x
  • f j (x) is a feature value
  • ⁇ j is a threshold value of the weak classifier
  • j is used to identify j weak classifiers
  • ⁇ and ⁇ are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
  • a robot comprising the number of persons detecting means in the monitoring video according to any one of the above.
  • a storage medium including a stored program, wherein, when the program is running, controlling a device in which the storage medium is located performs any of the above Monitor the number of people in the video.
  • a processor configured to execute a program, wherein the program is executed to perform a method for detecting a number of people in a monitoring video according to any one of the above.
  • the captured video is captured, and each frame image of the collected monitoring video is subjected to histogram equalization to obtain a histogram equalized equalized image, and is identified by a cascade classifier.
  • the human body in the equalized image by counting the human body identified from each frame image in the surveillance video, by designing a filtering and enhancement algorithm, and by testing several commonly used classifier algorithms,
  • the Haar classifier with the best detection effect is continuously optimized to achieve the purpose of detecting portraits in remote monitoring, and realizes the technical effect of detecting and counting the number of people more accurately in remote monitoring, thereby solving the problem that cannot be detected due to the prior art. Under remote monitoring, the technical problems of public space safety cannot be guaranteed due to the detection of abnormal population status.
  • FIG. 1 is a flow chart of a method for monitoring number of people in a video according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of a region of a Laplacian operator 3*3 according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of a Haar feature template in accordance with an embodiment of the present invention.
  • FIG. 4 is a structural block diagram of a person detecting device in a monitoring video according to an embodiment of the present invention.
  • FIG. 5 is a block diagram 1 of an optimized structure of a number of people detecting device in a surveillance video according to an embodiment of the present invention
  • FIG. 6 is a block diagram 2 of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention
  • FIG. 7 is a block diagram 3 of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention.
  • FIG. 8 is a block diagram 4 of an optimized structure of a number of people detecting device in a surveillance video according to an embodiment of the present invention
  • FIG. 9 is a flowchart of an optimization method for monitoring a number of people in a video according to an embodiment of the present invention.
  • FIG. 10 is a schematic diagram of a process for detecting a number of people based on a cascade classifier according to an embodiment of the present invention.
  • an embodiment of a method for monitoring the number of people in a video is provided. It is noted that the steps illustrated in the flowchart of the figures may be performed in a computer system such as a set of computer executable instructions. Also, although logical sequences are shown in the flowcharts, in some cases the steps shown or described may be performed in a different order than the ones described herein.
  • FIG. 1 is a flowchart of a method for monitoring number of people in a video according to an embodiment of the present invention. As shown in FIG. 1, the method includes the following steps:
  • Step S102 performing histogram equalization on each frame image in the collected monitoring video to obtain a histogram equalized equalized image
  • Step S104 identifying a human body in the equalized image by using a cascade classifier, wherein the cascade classifier is configured to identify the human body according to the human body feature of the equalized image after the histogram equalization;
  • step S106 statistics are performed on the identified human body.
  • a histogram equalization is performed on each frame image of the collected monitoring video by using the captured video, and the equalized image after the histogram equalization is obtained, and the cascaded classifier is used to identify the image.
  • the human body in the image is equalized, and the human body identified from each frame image in the surveillance video is statistically analyzed, and an enhanced image histogram equalization algorithm is designed, and the cascading classification with the best portrait detection effect is designed.
  • the number of people is counted to achieve the purpose of detecting portraits in remote monitoring, and the technical effect of detecting and counting the number of people more accurately in remote monitoring is realized, thereby solving the problem that the existing technology cannot realize the crowd under remote monitoring.
  • the technical problem that the safety of public places cannot be guaranteed due to the detection of abnormal conditions.
  • the cascade classifier may be superposed by at least two weak classifiers, wherein the cascade classifier identifies the human body according to the human body features of the histogram equalized image by the superimposed at least two weak classifiers.
  • the method may further include: displaying each frame image in the color monitoring video Performing grayscale; performing histogram equalization on each frame image of the collected monitoring video includes: performing histogram equalization on each frame image after graying.
  • the byte stores the gray value, and the gray scale ranges from 0 to 255.
  • Image graying is required by the cascade classifier to gray out the original image as an input. For example, it can be implemented by calling OpenCV's cvCvtColor function.
  • OpenCV is a cross-platform computer vision library based on BSD license (open source) distribution, which is lightweight and efficient - consists of a series of C functions and a small number of C++ classes, and provides interfaces for languages such as Python, Ruby, and MATLAB. Achieve many general algorithms in image processing and computer vision.
  • Histogram equalization is to enhance image contrast, improve image quality, and help detect effects.
  • an embodiment of the present invention uses an improved histogram equalization method: using a Laplacian operator to extract high frequency components in each frame image before identifying the human body in the equalized image by a cascade classifier, and The high frequency component is assigned a weight, and the enhanced high frequency component is obtained; the enhanced high frequency component is superimposed on the histogram equalized image to obtain an enhanced equalized image.
  • identifying the human body in the equalized image by the cascade classifier includes: identifying the human body in the enhanced equalized image through the cascade classifier.
  • the "central idea" of the histogram equalization process is to change the gray histogram of the original image from a certain gray interval in the comparison set to a uniform distribution in the entire gray range. Histogram equalization is to nonlinearly stretch the image and redistribute the image pixel values so that the number of pixels in a certain gray range is approximately the same. Histogram equalization is the change of the histogram distribution of a given image to a "uniform" distribution histogram distribution. But there are two disadvantages:
  • the Laplace algorithm can achieve faster edge detection and better detection of high-frequency edges.
  • the Laplacian is a second-order differential operator. In the case of discrete, it is expressed as follows:
  • FIG. 2 is a schematic diagram of a region of a Laplacian operator 3*3, wherein the region of 3*3 is as shown, in accordance with an embodiment of the present invention.
  • the improved histogram equalization method greatly enhances the details of the image and facilitates subsequent detection.
  • the embodiment of the present invention can fill the contour area with a larger area by searching the contour, thereby eliminating the non-target area and improving the detection accuracy.
  • the Canny operator is used to perform edge detection on the histogram equalized equalized image to obtain a contour included in the histogram equalized equalized image; Determining an area of a contour included in the histogram equalized equalized image; filling a non-target contour having an area larger than a predetermined threshold with diffused water to obtain an equalized image including a non-target contour, wherein the non-target contour is Identify non-target areas of the human body.
  • identifying the human body in the equalized image by the cascade classifier includes: identifying the human body in the equalized image including the non-target contour by the cascade classifier.
  • the method further includes: obtaining a cascade classifier by training the plurality of sets of data, wherein each of the plurality of sets of data includes: a sample An image, and a human body recognition result for identifying whether the sample image includes a human body.
  • obtaining a cascade classifier by training the plurality of sets of data, wherein each of the plurality of sets of data includes: a sample An image, and a human body recognition result for identifying whether the sample image includes a human body.
  • the core of the acquisition process of the cascaded classifier is to find a small and critical part of the feature from a large number of Haar-like features by iterative method, and use this feature to generate an effective classifier, using a large number of classification capabilities.
  • the general weak classifiers are superimposed by a certain method to form a classifier with strong classification ability, and then these classifiers are cascaded to obtain the final strong classifier.
  • the Haar-like rectangle feature is a digital image feature for object detection.
  • 3 is a schematic diagram of a Haar feature template according to an embodiment of the present invention. As shown in FIG. 3, such a rectangular feature template is composed of two or more congruent black and white rectangles adjacent to each other, and the rectangular feature value is a white rectangle. The sum of the gray value and the gray value of the black rectangle.
  • the rectangle feature is sensitive to some simple graphic structures such as line segments and edges. If such a rectangle is placed in a non-face area, the calculated feature values should be different from the face feature values, so these rectangles are used to quantify face features to distinguish between faces and non-faces.
  • the feature-based approach was chosen without the pixel-based approach because, in the case of a given finite data sample, feature-based detection can not only encode the state of a particular region, but also be based on a feature-based system.
  • the pixel system is fast.
  • the recognition function of the weak classifier under the Haar-like rectangular feature may be:
  • g haar (x) is used to identify whether the equalized image includes a recognition result of the human body based on the human body feature x
  • f j (x) is a feature value
  • ⁇ j is a threshold value of the weak classifier
  • j is used to identify the jth Weak classifier
  • ⁇ and ⁇ are the confidence of the classification result, the value range is [-1, +1], the negative is not the human body, and the regular is the human body.
  • a weak classifier corresponds to a Haar-like rectangle feature.
  • the weak classifier form of the Haar-like feature is as described above.
  • the classifier trained by OpenCV contains a series of feature thresholds. To determine whether the intercepted image passes the classifier, it is necessary to calculate the Haar-like eigenvalues of the image under all Haar-like feature templates, and compare the thresholds of the corresponding feature templates in the classifier.
  • FIG. 4 is a structural block diagram of a person detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 4, the monitoring is performed.
  • the number of people in the video detecting device includes a first obtaining module 44, an identifying module 46, and a statistic module 48. The number of people detecting devices in the monitoring video will be described in detail below.
  • the first obtaining module 44 is configured to perform histogram equalization on each frame image of the collected monitoring video to obtain a histogram equalized equalized image;
  • the identification module 46 is connected to the first obtaining module 44 for identifying the human body in the equalized image by using a cascade classifier, wherein the cascade classifier is used for human body feature recognition of the equalized image after the histogram equalization Out of the body;
  • the statistics module 48 is connected to the identification module 46 for counting the identified human body.
  • FIG. 5 is a block diagram of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention. As shown in FIG. 5, the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4, a graying module. 52. The gradation module 52 will be described in detail below.
  • the graying module 52 is connected to the first obtaining module 44, and is configured to: before the collected monitoring video is colored, in the case of performing histogram equalization on each frame of the collected monitoring video, the coloring is performed. Each frame of the video in the surveillance video is grayed out.
  • FIG. 6 is a block diagram 2 of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention.
  • the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4 , a second obtaining module. 62.
  • the second obtaining module 62 will be described in detail below.
  • the second obtaining module 62 is connected to the first obtaining module 44 and the identifying module 46 for extracting each frame image by using a Laplacian before identifying the human body in the equalized image by the cascade classifier.
  • the high frequency component is assigned to the high frequency component to obtain the enhanced high frequency component; the enhanced high frequency component is superimposed on the histogram equalized image to obtain the enhanced equalized image.
  • FIG. 7 is a block diagram 3 of an optimized structure of a person detecting device in a monitoring video according to an embodiment of the present invention.
  • the number of detecting devices in the monitoring video includes: obtaining module 72 in addition to all the structures in FIG. 4 .
  • the obtaining module 72 will be described in detail below.
  • the obtaining module 72 is connected to the first obtaining module 44 and the identifying module 46, and is configured to perform the histogram equalized equalized image by using the Canny operator before identifying the human body in the equalized image by the cascade classifier.
  • Edge detection obtaining a contour included in the histogram equalized equalized image; determining an area of a contour included in the histogram equalized equalized image; and using a diffused water for the non-target contour having an area larger than a predetermined threshold
  • Filling obtaining an equalized image including a non-target contour, wherein the non-target contour is a non-target area that identifies the human body.
  • FIG. 8 is a block diagram of an optimized structure of a number of people detecting device in a monitoring video according to an embodiment of the present invention.
  • the number of detecting devices in the monitoring video includes, in addition to all the structures in FIG. 4 , a third obtaining module. 82.
  • the third obtaining module 82 will be described in detail below.
  • the third obtaining module 82 is connected to the foregoing identifying module 46, and is configured to obtain a cascade classifier by training a plurality of sets of data before identifying the human body in the equalized image by using the cascade classifier, wherein the plurality of data sets Each set of data includes: a sample image, and a human body recognition result for identifying whether the sample image includes a human body.
  • FIG. 9 is a flowchart of an optimization method for monitoring a number of people in a video according to an embodiment of the present invention. As shown in FIG. 9, the method includes the following steps:
  • Step S902 cascading classifier training
  • the cascading classifier uses OpenCV to train the cascading classifier based on Haar-like features, it is necessary to provide corresponding positive sample pictures and counter sample pictures of the features to be identified.
  • the positive sample is a sample picture of a human body image; the negative sample is a background image, which requires no one, and the aspect ratio is 1:2.
  • the cascaded classifier is trained by the corresponding program provided by OpenCV to extract features and train classifiers, and the trained classifier model can identify these things.
  • Step S904 pedestrian detection and population statistics.
  • the input data to be detected is a real-time video, and grayscale processing is performed on each frame of the input video;
  • the result detected by the classifier may have a larger area, so by traversing the detection result of the classifier (rectangular frame), the height of the target rectangular frame is greater than the threshold. Eliminate, thereby improving detection accuracy.
  • a real-time video stream is collected, and a plurality of original sample map samples and a velocity sample map sample are obtained by line sampling based on the obtained video stream; space-time correction is performed on the obtained velocity sample map sample; based on the original sample map and The velocity sampling map, the offline training obtains the deep learning model, the deep learning model includes the classification model and the statistical model; and the obtained deep learning model is used for the population state analysis of the real-time video stream.
  • the embodiments of the invention have good adaptability to different environments, light intensity, weather conditions and camera angles; for a crowded environment such as a large flow of people, a high accuracy rate can be ensured; the calculation amount is small, and the real-time video can be satisfied.
  • the requirements for processing can be widely applied to the monitoring and management of public places densely populated by buses, subways and plazas.
  • the population density estimation for the low-density population, the pixel density method is used to estimate the population density.
  • the wavelet packet decomposition is used for the high-density population.
  • the crowd image was analyzed by multi-scale.
  • the population density level was classified by SVM (Support Vector Machine).
  • SVM Small Vector Machine
  • the block matching method based on the full search algorithm with the average absolute error as the matching criterion was used. The speed of the crowd is estimated.
  • the texture analysis method extracts the texture features for analysis, and finally uses the Adaboost classifier to classify the population density level.
  • the current research focus on population density estimation and motion analysis is to solve the problem that when there are a large number of pedestrians, there is a large amount of occlusion in the crowd, and it is difficult to accurately detect, segment and track individual pedestrians in the crowd accurately.
  • an effective statistical learning method is used to establish a reasonable decision rule, directly estimating the number of pedestrians, and determining the motion state of the target, and detecting the abnormal event. occur.
  • the solution in the above related art is only applicable to the conditions of a close-range monitoring device such as a street or an indoor.
  • the target display resolution is high, the area is large, and the recognition difficulty is relatively low, but at a long distance.
  • the recognition difficulty is relatively low, but at a long distance.
  • the conditions of detection it is not applicable.
  • the scenes shot under such conditions are large, the target characters are very small and very fuzzy, and the detection is more difficult.
  • the above methods are not applicable.
  • the monitoring video of the captured video is used to equalize the histogram of each frame in the collected monitoring video to obtain a histogram equalized image, and the histogram is identified by the cascade classifier.
  • the human body in the image after equalization is used to statistically analyze the human body identified from each frame of the video in the surveillance video, by designing filtering and enhancement algorithms, and by testing several commonly used classifier algorithms.
  • the Haar classifier with the best detection effect is obtained and continuously optimized to achieve the purpose of detecting portraits in remote monitoring, and the technical effect of detecting and counting the number of people more accurately in remote monitoring is realized, thereby solving the existing
  • the technology cannot solve the technical problem that the safety of public places cannot be guaranteed due to the detection of abnormal state of the people under remote monitoring.
  • the problem solved by the embodiment of the present invention is that during the running of the cloud number, the monitoring camera of the cloud number will focus on monitoring some important places on the ground, and the number of people is counted to determine the security status of the place, and the warning effect of the crowd is intensive. Fully guarantee the effectiveness and accuracy of the warning.
  • the embodiment of the present invention can be applied to a system for counting the number of people, and can be used in a remote monitoring environment (such as cloud number, drone, etc.) to implement pedestrian detection, and count the number of pedestrians. Some warnings can be made.
  • a remote monitoring environment such as cloud number, drone, etc.
  • FIG. 10 is a schematic diagram of a process for detecting a number of people based on a cascaded classifier according to an embodiment of the present invention.
  • the classifier can include the identification image. The part of the human body is detected, and when a feature containing a person is detected, the number counter is automatically incremented by one.
  • the cascading classifier is used for human body detection, and the corresponding program is designed based on the interface function of the pre-processing and cascading classifier in the OpenCV library.
  • a robot comprising the number of persons detecting means in the monitoring video of any of the above.
  • a storage medium comprising: a stored program, wherein the device in which the storage medium is located controls the number of people in the monitoring video in any one of the above-mentioned items when the program is running.
  • a processor for running a program wherein the program is executed while performing the number of people in the monitoring video of any of the above.
  • the disclosed technical contents may be implemented in other manners.
  • the device embodiments described above are only schematic.
  • the division of the unit may be a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, unit or module, and may be electrical or otherwise.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • the technical solution of the present invention which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • a number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Image Analysis (AREA)

Abstract

L'invention concerne un procédé et un dispositif de détection du nombre de personnes présentes dans une vidéo de surveillance. Le procédé comprend : une égalisation d'histogramme qui est effectuée sur chaque trame d'image capturée dans la vidéo de surveillance pour obtenir une image égalisée après égalisation d'histogramme ; un corps humain dans l'image égalisée est reconnu au moyen d'un classificateur en cascade ; le classificateur en cascade est destiné à reconnaître le corps humain selon des caractéristiques de corps humain dans l'image égalisée après égalisation d'histogramme ; et le corps humain identifié est comptabilisé. La présente invention résout les problèmes techniques de l'état de la technique selon lesquels la sécurité dans des lieux publics ne peut être garantie en raison de l'incapacité à réaliser la détection d'un état anormal de la foule sous surveillance à distance.
PCT/CN2018/079856 2017-12-12 2018-03-21 Procédé et dispositif de détection du nombre de personnes présentes dans une vidéo de surveillance WO2019114145A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711327492.5 2017-12-12
CN201711327492.5A CN109918971B (zh) 2017-12-12 2017-12-12 监控视频中人数检测方法及装置

Publications (1)

Publication Number Publication Date
WO2019114145A1 true WO2019114145A1 (fr) 2019-06-20

Family

ID=66818939

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/079856 WO2019114145A1 (fr) 2017-12-12 2018-03-21 Procédé et dispositif de détection du nombre de personnes présentes dans une vidéo de surveillance

Country Status (2)

Country Link
CN (1) CN109918971B (fr)
WO (1) WO2019114145A1 (fr)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717885A (zh) * 2019-09-02 2020-01-21 平安科技(深圳)有限公司 顾客数量的统计方法及装置、电子设备及可读存储介质
CN110807424A (zh) * 2019-11-01 2020-02-18 深圳市科卫泰实业发展有限公司 一种基于航拍图像的港口船舶比对方法
CN112327893A (zh) * 2019-08-05 2021-02-05 旭日蓝天(武汉)科技有限公司 一种基于ai技术的无人机飞控前端人数统计系统及方法
CN114399728A (zh) * 2021-12-17 2022-04-26 燕山大学 一种雾天场景人群计数方法

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110852814A (zh) * 2020-01-14 2020-02-28 深圳惠通天下信息技术有限公司 一种广告投放自助服务系统及方法
CN111474914B (zh) * 2020-02-24 2020-12-01 安徽省华腾农业科技有限公司经开区分公司 故障代码大数据存储系统
CN111931567B (zh) * 2020-07-01 2024-05-28 珠海大横琴科技发展有限公司 人体识别的方法及装置、电子设备、存储介质
CN114362988B (zh) * 2021-09-29 2023-06-20 中国科学院计算机网络信息中心 网络流量的识别方法及装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130071033A1 (en) * 2011-09-21 2013-03-21 Tandent Vision Science, Inc. Classifier for use in generating a diffuse image
CN104298969A (zh) * 2014-09-25 2015-01-21 电子科技大学 基于颜色与haar特征融合的人群规模统计方法
CN104463191A (zh) * 2014-10-30 2015-03-25 华南理工大学 一种基于注意机制的机器人视觉处理方法
KR20160037424A (ko) * 2014-09-29 2016-04-06 동명대학교산학협력단 개선된 리얼 아다부스트 알고리즘에 근거한 새로운 멀티 뷰 얼굴감지방식
CN105893946A (zh) * 2016-03-29 2016-08-24 中国科学院上海高等研究院 一种正面人脸图像的检测方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101340510B (zh) * 2008-08-07 2010-06-23 中兴通讯股份有限公司 一种视频增强的方法及其装置
CN106326839A (zh) * 2016-08-11 2017-01-11 中防通用河北电信技术有限公司 一种基于出操视频流的人数统计方法
CN106446872A (zh) * 2016-11-07 2017-02-22 湖南源信光电科技有限公司 一种低照度下的视频人脸检测识别方法
CN106778683A (zh) * 2017-01-12 2017-05-31 西安电子科技大学 基于改进lbp特征的快速多角度人脸检测方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130071033A1 (en) * 2011-09-21 2013-03-21 Tandent Vision Science, Inc. Classifier for use in generating a diffuse image
CN104298969A (zh) * 2014-09-25 2015-01-21 电子科技大学 基于颜色与haar特征融合的人群规模统计方法
KR20160037424A (ko) * 2014-09-29 2016-04-06 동명대학교산학협력단 개선된 리얼 아다부스트 알고리즘에 근거한 새로운 멀티 뷰 얼굴감지방식
CN104463191A (zh) * 2014-10-30 2015-03-25 华南理工大学 一种基于注意机制的机器人视觉处理方法
CN105893946A (zh) * 2016-03-29 2016-08-24 中国科学院上海高等研究院 一种正面人脸图像的检测方法

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112327893A (zh) * 2019-08-05 2021-02-05 旭日蓝天(武汉)科技有限公司 一种基于ai技术的无人机飞控前端人数统计系统及方法
CN110717885A (zh) * 2019-09-02 2020-01-21 平安科技(深圳)有限公司 顾客数量的统计方法及装置、电子设备及可读存储介质
CN110807424A (zh) * 2019-11-01 2020-02-18 深圳市科卫泰实业发展有限公司 一种基于航拍图像的港口船舶比对方法
CN110807424B (zh) * 2019-11-01 2024-02-02 深圳市科卫泰实业发展有限公司 一种基于航拍图像的港口船舶比对方法
CN114399728A (zh) * 2021-12-17 2022-04-26 燕山大学 一种雾天场景人群计数方法
CN114399728B (zh) * 2021-12-17 2023-12-05 燕山大学 一种雾天场景人群计数方法

Also Published As

Publication number Publication date
CN109918971B (zh) 2024-01-05
CN109918971A (zh) 2019-06-21

Similar Documents

Publication Publication Date Title
WO2019114145A1 (fr) Procédé et dispositif de détection du nombre de personnes présentes dans une vidéo de surveillance
CN107622258B (zh) 一种结合静态底层特征和运动信息的快速行人检测方法
Subburaman et al. Counting people in the crowd using a generic head detector
CN103824070B (zh) 一种基于计算机视觉的快速行人检测方法
US9158985B2 (en) Method and apparatus for processing image of scene of interest
CN106056079B (zh) 一种图像采集设备及人脸五官的遮挡检测方法
Bayona et al. Comparative evaluation of stationary foreground object detection algorithms based on background subtraction techniques
CN108596140A (zh) 一种移动终端人脸识别方法及系统
CN109359577B (zh) 一种基于机器学习的复杂背景下人数检测系统
CN109190456B (zh) 基于聚合通道特征和灰度共生矩阵的多特征融合俯视行人检测方法
CN111383244A (zh) 一种目标检测跟踪方法
CN106909883A (zh) 一种基于ros的模块化手部区域检测方法和装置
Su et al. A new local-main-gradient-orientation HOG and contour differences based algorithm for object classification
CN104899559B (zh) 一种基于视频监控的快速行人检测方法
Ryan et al. Scene invariant crowd counting
KR20200060868A (ko) 객체 탐지 및 자동 추적이 가능한 다시점 영상 감시 시스템
Ryan et al. Scene invariant crowd counting and crowd occupancy analysis
CN108875572B (zh) 基于背景抑制的行人重识别方法
CN107341456B (zh) 一种基于单幅户外彩色图像的天气晴阴分类方法
Shreedarshan et al. Crowd recognition system based on optical flow along with SVM classifier
CN114694090A (zh) 一种基于改进PBAS算法与YOLOv5的校园异常行为检测方法
Yu et al. A crowd flow estimation method based on dynamic texture and GRNN
Fradi et al. Crowd context-dependent privacy protection filters
Chung et al. Face detection and posture recognition in a real time tracking system
De Siqueira et al. Adaptive detection of human skin in color images

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18887712

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18887712

Country of ref document: EP

Kind code of ref document: A1