CN106056078B - Crowd density estimation method based on multi-feature regression type ensemble learning - Google Patents
Crowd density estimation method based on multi-feature regression type ensemble learning Download PDFInfo
- Publication number
- CN106056078B CN106056078B CN201610374700.6A CN201610374700A CN106056078B CN 106056078 B CN106056078 B CN 106056078B CN 201610374700 A CN201610374700 A CN 201610374700A CN 106056078 B CN106056078 B CN 106056078B
- Authority
- CN
- China
- Prior art keywords
- image
- scene
- density estimation
- value
- head
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
- G06V20/53—Recognition of crowd images, e.g. recognition of crowd congestion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
Abstract
The invention relates to a crowd density estimation method based on multi-feature regression type ensemble learning, which is characterized in that the width of the head of a person is taken as a reference to carry out multi-level image blocking on a scene frame image, and the blocks are subjected to image scaling and Gamma correction to realize the consistency of the image scale and illumination; the method comprises the steps of constructing a density estimation model by utilizing preprocessed samples, extracting three features of D-SIFT, GLCM and GIST to construct a first-layer support vector machine regression (SVR) coarse prediction model, constructing a second-layer SVR fine prediction model by taking the coarse prediction results as new features, and adding the fine prediction results of all sub-images to perform density estimation according to the people number grades set by a scene. The method solves the problems of scene illumination change, camera height angle change and pedestrian shielding, and is suitable for a plurality of different scenes to realize crowd density estimation by utilizing a plurality of scene samples to adopt a plurality of characteristics and applying a regression mode to integrate and learn a construction model.
Description
Technical Field
The invention relates to the technical field of digital image processing and pattern recognition, in particular to a crowd density estimation method based on multi-feature regression type ensemble learning.
Background
Along with the improvement of living standard of people, the urbanization progress is accelerated continuously, and the collective activities of large-scale public places are frequent day by day, so that accidents caused by dense crowds are frequent in recent years. Therefore, how to use computer vision to carry out real-time intelligent monitoring on the crowd, make crowd density estimation in time and take effective measures has important significance for guaranteeing social stability and crowd safety.
Current methods for estimating population density can be divided into two major categories:
1) the direct method comprises the following steps: direct methods use some classifiers to attempt to segment or detect each individual in a population and then count to obtain population density. These methods can be further divided into two subclasses: model-based methods: detection or segmentation is performed by a model or a shape contour of a person. For example, a method for extracting human head contour features based on Haar wavelet transform and detecting pedestrians by combining a support vector Machine (Lin S F, Chen J Y, Chao H X. estimation of number of people in their crowned scenes using a productive transformation [ J ]. Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on, 2001,31(6):645 (654)), Feizzwald et al propose a method for detecting pedestrians based on components and improved gradient Histogram (HOG) features (Feizzwanwa P F, Girshick R B, occupant D, et al, object estimation of motion with detection of pedestrian position [ 2010, noise ] using a forest detection framework [ 11, 1625 ] and a method for detecting pedestrians using a forest Analysis [ 7, J7 ] and a forest detection framework (IEEE J7, 11), lempitsky V.class-specific hough recommendations for object detection [ M ]. precision requirements for computer vision and medical Image analysis. Springer London,2013: 143-; a track clustering-based method: each individual is detected by clustering points of interest on the pedestrian through long-term tracking. For example, Rabaud and Belongie propose a method for clustering trajectories and inferring scene population using a Kanade-Lucas-Tomasi (KLT) tracker and a series of underlying features (Rabaud V, Belongie S. counting drained moving objects [ C ]. Computer Vision and Pattern registration, 2006IEEE Computer Society reference on. IEEE,2006,1:705 well 711), Rao et al (Rao A S, Gubbi J, Marusic S, et al. The direct method has a good effect under the condition of a small number of people in a scene, but the defect is obvious, the crowd is seriously overlapped under the condition of crowding, and the performance of the direct method is linearly reduced.
2) An indirect method: the indirect method treats the crowd as a whole, and obtains the crowd density by extracting the characteristics of textures and the like for the crowd and combining a regression model. Indirect methods can also be divided into three categories: analysis based on pixels: these methods first remove the scene background and then use some very underlying features to estimate crowd density. Davies et al (Davies A C, Yin J H, vehicle S A. crowd monitoring using image processing [ J ]. Electronics & communication Engineering Journal,1995,7(1):37-47) estimate people by linear relationship by extracting foreground and analyzing crowd foreground and edge pixels and adding perspective correction. Hussasin et al (Hussain N, Yatim H S M, Hussain N L, et al CDES: A pixel-based visibility estimation system for massive-Haram [ J ]. Safety Science,2011,49(6):824-833) extract underlying features on foreground pixels corrected for perspective distortion by scaling and then use a neural network to supervise training, the trained model estimates sparse population accurately, but as the density increases, population occlusion occurs, the wrong estimation rises linearly; texture and gradient based method: texture and gradient features may better represent the number of people in a scene than a pixel-based approach. Texture and Gradient features used in population density estimation include Gray-level co-occlusion matrix (GLCM), ULBP feature (Uniform local Pattern), HOG feature, and Gradient orientation co-occlusion matrix (GOCM), among others; thirdly, the method based on the characteristic points comprises the following steps: feature points are feature pixels of interest, e.g. corner points detected in the image. For example, Conte et al (Conte D, Foggia P, Percannella G, et al. counting moving magnets in raised scenes [ J ]. Motor vision and Applications,2013,24(5): 1029-. Kishor et al (Kishore P V V, Rahul R, Sravya K, et al. crown definition Analysis and tracking [ C ]. Advances in Computing, Communications and information (ICACCI),2015International reference on IEEE,2015: 1209-. Indirect methods usually need to extract foreground or motion information to reduce background interference, and in practical applications, because of illumination changes, continuous crowding of pedestrians, various background factors and the like, extraction of foreground and motion information becomes a difficult task, so that accurate estimation of the methods is difficult to make in practical applications
Disclosure of Invention
The invention aims to provide a crowd density estimation method based on multi-feature regression type ensemble learning.
In order to achieve the purpose, the invention adopts the following technical scheme: a crowd density estimation method based on multi-feature regression ensemble learning comprises the following steps:
image blocking step: acquiring a video monitoring frame image of a scene, performing multi-level image blocking on the scene by taking the width of a human head as a reference, performing scaling processing on the multi-level block image to unify the size, and performing Gamma correction preprocessing to obtain a sub-image sample;
a crowd density estimation step: carrying out coarse prediction on three characteristics of D-SIFT, GLCM and GIST of the sub-image sample by adopting a first layer of support vector regression model; and performing fine prediction by using the coarse prediction result as a new feature and using a second-layer support vector regression model, adding the fine prediction results of all the sub-image samples, and performing density estimation according to the set crowd density grade of the scene.
Preferably, the multi-level image blocking specifically comprises the following steps:
firstly, a scene interest area is defined, then the size of a first layer block image is determined, a reference pedestrian is selected, when the head of the reference pedestrian just enters the bottom boundary of the interest area, the width of the head of the reference pedestrian is measured to be w pixels, then the width of the first layer block image is set to be w pixels 128/42 pixels, then the reference pedestrian continuously moves forwards until the width of the head is w pixels 21/42 w/2 pixels, and the length from the top of the head to the bottom boundary of the interest area is the height of the first layer block image;
determining the size of the second layer block image, selecting a reference pedestrian, and measuring the head width w when the head of the reference pedestrian just crosses the upper edge of the first layer block image1Setting the width of the second layer block as the head width w1128/42 pixels, then moving forward with reference to the pedestrian until the head width is w1*21/42=w1When the pixel is 2, the length from the head to the upper side of the first layer block image is the height of the second layer block image;
and determining the size of the third-layer block image by analogy until the multi-layer block image completes non-overlapping full coverage on the scene interest area.
Preferably, the widths and heights of the multiple-level block images after being subjected to scaling processing and unified in size are all 128 pixels.
Preferably, the step of obtaining the sub-image from the multi-level block image through Gamma correction preprocessing comprises: firstly, dividing a pixel value into three intervals of 0-255, and then converting the pixel value into an angle, wherein the specific expression is as follows:
where x is the pixel value, x0And x1Respectively, a set pixel threshold value, E1=[0,x0],E2=[x0,x1],E3=[x1,255],Then it is the converted angle;
the gamma value γ (x) is then determined using trigonometric relationships, defined as follows:
the fluctuation of the Gamma value is too large by simply adjusting the Gamma value by the weight value a, so the weight value b is introduced and the linear correction function shown in the formula (3) is adopted for correction
The final corrected Gamma value is defined as
The pixel corrected value is
Preferably, the crowd density estimating step comprises:
respectively extracting D-SIFT, GLCM and GIST characteristics from the sub-image samples;
respectively training a coarse prediction model by utilizing a first-layer support vector regression model for the extracted features, and obtaining different number of people coarse prediction values corresponding to the three features of D-SIFT, GLCM and GIST through the coarse prediction model for the test sample set;
the rough people number prediction value is used as a new feature, a second-layer support vector regression model is used for training a fine prediction model, and the result of the rough people number prediction is obtained through the fine prediction model to obtain more accurate sub-image sample people number prediction, namely a fine prediction value;
adding the fine prediction values of all sub-image samples of one frame of image, counting the number of people in the scene interest area,
and obtaining the crowd density estimation value of the current frame according to the density classification standard of the scene interest area.
Compared with the prior art, the invention has the beneficial effects that: the method solves the problems of scene illumination change, camera height angle change and pedestrian shielding, and is suitable for a plurality of different scenes to realize crowd density estimation by utilizing a plurality of scene samples to adopt a plurality of characteristics and applying a regression mode to integrate and learn a construction model.
The invention is further described below with reference to the accompanying drawings and specific embodiments.
Drawings
FIG. 1 is a schematic flow diagram of the present invention;
FIG. 2 is a schematic diagram of block image sizing;
FIG. 3 is a diagram illustrating a correspondence between a multi-level block image and a scene interest area;
fig. 4 is a flowchart of regression ensemble learning.
Detailed Description
In order to more fully understand the technical contents of the present invention, the technical solutions of the present invention will be further described and illustrated with reference to specific embodiments.
As shown in fig. 1, which is a schematic flow chart of the present invention, a population density estimation method based on multi-feature regression ensemble learning includes the following steps:
image blocking step: acquiring a video monitoring frame image of a scene, performing multi-level image blocking on the scene by taking the width of a human head as a reference, performing scaling processing on the multi-level block image to unify the size, and performing Gamma correction preprocessing to obtain a sub-image sample;
a crowd density estimation step: carrying out coarse prediction on three characteristics of D-SIFT, GLCM and GIST of the sub-image sample by adopting a first layer of support vector regression model; and performing fine prediction by using the coarse prediction result as a new feature and using a second-layer support vector regression model, adding the fine prediction results of all the sub-image samples, and performing density estimation according to the set crowd density grade of the scene.
Further, as shown in fig. 2, a schematic diagram is determined for the block image size; as shown in fig. 3, it is a schematic diagram of the correspondence between the multi-level block image and the scene interest area; in the above technical solution, the specific steps of partitioning the multi-level image are as follows:
firstly, a scene interest area is defined, then the size of a first layer block image is determined, a reference pedestrian is selected, when the head of the reference pedestrian just enters the bottom boundary of the interest area, the width of the head of the reference pedestrian is measured to be w pixels, then the width of the first layer block image is set to be w pixels 128/42 pixels, then the reference pedestrian continuously moves forwards until the width of the head is w pixels 21/42 w/2 pixels, and the length from the top of the head to the bottom boundary of the interest area is the height of the first layer block image;
determining the size of the second layer block image, selecting a reference pedestrian, and measuring the head width w when the head of the reference pedestrian just crosses the upper edge of the first layer block image1Setting the width of the second layer block as the head width w1128/42 pixels, then moving forward with reference to the pedestrian until the head width is w1*21/42=w1When the pixel is 2, the length from the head to the upper side of the first layer block image is the height of the second layer block image;
and determining the size of the third-layer block image by analogy until the multi-layer block image completes non-overlapping full coverage on the scene interest area.
The method of image blocking with human head width as reference is adopted, the image of a frame is divided into blocks with different sizes in a plurality of layers from near to far, the blocks are used as basic elements to construct a model and predict the number of people, and the problem of perspective projection effect can be solved.
After image blocking, a plurality of multi-level block images with different distances, sizes, different times and weather are obtained, and before the characteristics are extracted, the multi-level block images need to be preprocessed to reduce environmental interference and training amount,
the multi-level block images are firstly subjected to scaling processing to be uniform in size, the width and the height after the sizes are uniform are all 128 pixels, so that the block images with different distances can be uniformly trained into samples with the same size by normalizing the sizes, the samples at far and near positions do not need to be separately trained, and the training amount is greatly reduced.
Secondly, in order to reduce the influence caused by ambient light, Gamma correction needs to be carried out on the block image, the multi-level block image is preprocessed by the Gamma correction to obtain a sub-image, and the method specifically comprises the following steps: firstly, dividing a pixel value into three intervals of 0-255, and then converting the pixel value into an angle, wherein the specific expression is as follows:
where x is the pixel value, x0And x1Respectively, a set pixel threshold value, E1=[0,x0],E2=[x0,x1],E3=[x1,255],Then it is the converted angle;
the gamma value γ (x) is then determined using trigonometric relationships, defined as follows:
the fluctuation of the Gamma value is too large by simply adjusting the Gamma value by the weight value a, so the weight value b is introduced and the linear correction function shown in the formula (3) is adopted for correction
The final corrected Gamma value is defined as
The pixel corrected value is
Further, as shown in fig. 4, which is a flowchart of the regression-based ensemble learning, the crowd density estimating step includes:
respectively extracting D-SIFT, GLCM and GIST characteristics from the sub-image samples, and setting the characteristics as xD-SIFT、xGLCMAnd xGIST;
Respectively training a rough prediction model by using a first-layer support vector regression model for the extracted features, and obtaining three models f by regression fitting of the first-layer support vector regression model for a test sample set1(xD-SIFT)、f2(xGLCM) And f3(xGIST) And the model outputs the predicted values y of the three characteristics of D-SIFT, GLCM and GISTD-SIFT、yGLCMAnd yGISTAnd combining the three predicted values into new characteristics corresponding to the rough predicted values of different people numbers:
xALL=[yD-SIFT,yGLCM,yGIST] (11)
training the new feature to a fine prediction model f by using a second-layer support vector regression modelFinal(xALL) The result of the rough people number prediction is subjected to a fine prediction model to obtain more accurate sub-image sample people number prediction yFinalI.e. the fine prediction value; the regression ensemble learning includes two parts: a training (learning) part and a prediction (application) part, as shown in fig. 4, wherein the training part is a training regression model, firstly, the characteristics of a plurality of sub-images are extracted, the number of people of each sub-image is counted as the number label of the person to form a sample set of the training part, then the sample set is divided into a training set and a test set, the coarse regression model corresponding to the three characteristics is trained through the training set, and the test set can obtain corresponding prediction output, namely a coarse prediction value, through the coarse regression model. Three are to be arrangedAnd (4) taking the rough predicted value of the model as a new feature and combining the population labels to form a new sample set, and continuously dividing the new sample set into a new training set and a new testing set. And training the fine regression model through the new training set, and judging whether the model is accurate or not by the new testing set through obtaining the fine prediction value through the fine regression model.
The prediction part predicts the number of people through a trained model. Extracting features from a test sample of unknown people, obtaining a rough predicted value by combining a rough regression model trained by a training part, and inputting three rough predicted values serving as new features into a fine regression model to obtain a fine predicted value, namely the final people number prediction.
Considering that the sensitivities of different features to the population density are inconsistent, the two-layer regression can be adopted to make up for the deficiencies of each other, and the prediction accuracy can be improved.
Adding the fine prediction values of all sub-image samples of one frame of image, counting the number of people in the scene interest area,
and obtaining the crowd density estimation value of the current frame according to the density classification standard of the scene interest area. For example: assuming that the current scene can accommodate the maximum number of people nmaxFor the standard, an average classification is adopted, and the classification is divided into five grades: [0, n ]max/5]、[nmax/5,2nmax/5]、[2nmax/5,3nmax/5]、[3nmax/5,4nmax/5]And [4nmaxAnd/5, ∞), respectively denoted as VL (very low), L (low), M (medium), H (high), and VH (very high), and the population density estimation can be done by comparing the above criteria against the population count of the region of interest of the statistical scene.
The technical contents of the present invention are further illustrated by the examples, so as to facilitate the understanding of the reader, but the embodiments of the present invention are not limited thereto, and any technical extension or re-creation based on the present invention is protected by the present invention.
Claims (4)
1. A crowd density estimation method based on multi-feature regression ensemble learning is characterized by comprising the following steps:
image blocking step: acquiring a video monitoring frame image of a scene, performing multi-level image blocking on the scene by taking the width of a human head as a reference, performing scaling processing on the multi-level block image to unify the size, and performing Gamma correction preprocessing to obtain a sub-image sample;
a crowd density estimation step: carrying out coarse prediction on three characteristics of D-SIFT, GLCM and GIST of the sub-image sample by adopting a first layer of support vector regression model; using the coarse prediction result as a new feature to perform fine prediction by using a second-layer support vector regression model, adding the fine prediction results of all sub-image samples, and performing density estimation according to the set crowd density grade of the scene;
the method for partitioning the multi-level image comprises the following specific steps:
firstly, a scene interest area is defined, then the size of a first layer block image is determined, a reference pedestrian is selected, when the head of the reference pedestrian just enters the bottom boundary of the interest area, the width of the head of the reference pedestrian is measured to be w pixels, then the width of the first layer block image is set to be w pixels 128/42 pixels, then the reference pedestrian continuously moves forwards until the width of the head is w pixels 21/42 w/2 pixels, and the length from the top of the head to the bottom boundary of the interest area is the height of the first layer block image;
determining the size of the second layer block image, selecting a reference pedestrian, and measuring the head width w when the head of the reference pedestrian just crosses the upper edge of the first layer block image1Setting the width of the second layer block as the head width w1128/42 pixels, then moving forward with reference to the pedestrian until the head width is w1*21/42=w1When the pixel is 2, the length from the head to the upper side of the first layer block image is the height of the second layer block image;
and determining the size of the third-layer block image by analogy until the multi-layer block image completes non-overlapping full coverage on the scene interest area.
2. The crowd density estimation method based on multi-feature regression ensemble learning of claim 1, wherein the widths and heights of the multi-level block images after scaling processing and size unification are all 128 pixels.
3. The crowd density estimation method based on the multi-feature regression type ensemble learning of claim 2, wherein the step of obtaining the sub-images from the multi-level block images through Gamma correction preprocessing comprises: firstly, dividing a pixel value into three intervals of 0-255, and then converting the pixel value into an angle, wherein the specific expression is as follows:
where x is the pixel value, x0And x1Respectively, a set pixel threshold value, E1=[0,x0],E2=[x0,x1],E3=[x1,255],Then it is the converted angle;
the gamma value γ (x) is then determined using trigonometric relationships, defined as follows:
the fluctuation of the Gamma value is too large by simply adjusting the Gamma value by the weight value a, so the weight value b is introduced and the linear correction function shown in the formula (3) is adopted for correction
The final corrected Gamma value is defined as
The pixel corrected value is
4. The crowd density estimation method based on the multi-feature regression-based ensemble learning according to claim 1, 2 or 3, wherein the crowd density estimation step comprises:
respectively extracting D-SIFT, GLCM and GIST characteristics from the sub-image samples;
respectively training a coarse prediction model by utilizing a first-layer support vector regression model for the extracted features, and obtaining different number of people coarse prediction values corresponding to the three features of D-SIFT, GLCM and GIST through the coarse prediction model for the test sample set;
the rough people number prediction value is used as a new feature, a second-layer support vector regression model is used for training a fine prediction model, and the result of the rough people number prediction is obtained through the fine prediction model to obtain more accurate sub-image sample people number prediction, namely a fine prediction value;
and adding the fine predicted values of all sub-image samples of one frame of image, counting the number of people in the scene interest area, and obtaining the crowd density estimation value of the current frame according to the density classification standard of the scene interest area.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610374700.6A CN106056078B (en) | 2016-05-31 | 2016-05-31 | Crowd density estimation method based on multi-feature regression type ensemble learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610374700.6A CN106056078B (en) | 2016-05-31 | 2016-05-31 | Crowd density estimation method based on multi-feature regression type ensemble learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106056078A CN106056078A (en) | 2016-10-26 |
CN106056078B true CN106056078B (en) | 2021-09-14 |
Family
ID=57172224
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610374700.6A Active CN106056078B (en) | 2016-05-31 | 2016-05-31 | Crowd density estimation method based on multi-feature regression type ensemble learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106056078B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106878952B (en) * | 2017-03-20 | 2020-07-10 | 迪爱斯信息技术股份有限公司 | Regional personnel number prediction method and device |
CN107480786B (en) * | 2017-08-07 | 2021-04-30 | 复旦大学 | Output state limitation-based recurrent neural network track likelihood probability calculation method |
CN108985256A (en) * | 2018-08-01 | 2018-12-11 | 曜科智能科技(上海)有限公司 | Based on the multiple neural network demographic method of scene Density Distribution, system, medium, terminal |
CN110598630A (en) * | 2019-09-12 | 2019-12-20 | 江苏航天大为科技股份有限公司 | Method for detecting passenger crowding degree of urban rail transit based on convolutional neural network |
CN112733624B (en) * | 2020-12-26 | 2023-02-03 | 电子科技大学 | People stream density detection method, system storage medium and terminal for indoor dense scene |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103761723A (en) * | 2014-01-22 | 2014-04-30 | 西安电子科技大学 | Image super-resolution reconstruction method based on multi-layer supporting vectors |
-
2016
- 2016-05-31 CN CN201610374700.6A patent/CN106056078B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103761723A (en) * | 2014-01-22 | 2014-04-30 | 西安电子科技大学 | Image super-resolution reconstruction method based on multi-layer supporting vectors |
Non-Patent Citations (7)
Title |
---|
Crowd Density Estimation Using Texture Analysis and Learning;X. Wu 等;《2006 IEEE International Conference on Robotics and Biomimetics》;20070402;第214-219页 * |
Modeling the shape of the scene: a holistic representation of the spatial envelope;Oliva A 等;《International Journal of Computer Vision》;20010531;第42卷(第3期);第145-175页 * |
Visual categorization with bags of keypoints;Gabriela Csurka 等;《ResearchGate》;20040131;第2-17页 * |
基于Gist特征与PHOG特征融合的多场景分类;肖保良;《中北大学学报(自然科学版)》;20141231;第35卷(第6期);第690-694页 * |
基于GLCM纹理特征分析的人群密度估计方法实现;侯鹏鹏;《中国安防》;20141215(第23期);第88-90页 * |
多种人群密度场景下的人群计数;覃勋辉 等;《中国图象图形学报》;20130430;第18卷(第4期);摘要、第1节 * |
大规模群体人数检测算法研究;郭婷;《中国优秀硕士学位论文全文数据库 (信息科技辑)》;20150315(第3期);第5.1节 * |
Also Published As
Publication number | Publication date |
---|---|
CN106056078A (en) | 2016-10-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230289979A1 (en) | A method for video moving object detection based on relative statistical characteristics of image pixels | |
CN106056078B (en) | Crowd density estimation method based on multi-feature regression type ensemble learning | |
CN108492319B (en) | Moving target detection method based on deep full convolution neural network | |
CN107909081B (en) | Method for quickly acquiring and quickly calibrating image data set in deep learning | |
US9639748B2 (en) | Method for detecting persons using 1D depths and 2D texture | |
CN103310444B (en) | A kind of method of the monitoring people counting based on overhead camera head | |
CN106570486A (en) | Kernel correlation filtering target tracking method based on feature fusion and Bayesian classification | |
CN104166841A (en) | Rapid detection identification method for specified pedestrian or vehicle in video monitoring network | |
CN109918971B (en) | Method and device for detecting number of people in monitoring video | |
CN109840483B (en) | Landslide crack detection and identification method and device | |
CN103218816A (en) | Crowd density estimation method and pedestrian volume statistical method based on video analysis | |
CN108280397A (en) | Human body image hair detection method based on depth convolutional neural networks | |
CN105740945A (en) | People counting method based on video analysis | |
CN103310194A (en) | Method for detecting head and shoulders of pedestrian in video based on overhead pixel gradient direction | |
CN109255326B (en) | Traffic scene smoke intelligent detection method based on multi-dimensional information feature fusion | |
CN106204594A (en) | A kind of direction detection method of dispersivity moving object based on video image | |
CN107154044B (en) | Chinese food image segmentation method | |
CN109685045A (en) | A kind of Moving Targets Based on Video Streams tracking and system | |
CN111553214B (en) | Method and system for detecting smoking behavior of driver | |
CN105405138A (en) | Water surface target tracking method based on saliency detection | |
CN106778637B (en) | Statistical method for man and woman passenger flow | |
Surkutlawar et al. | Shadow suppression using RGB and HSV color space in moving object detection | |
CN107103301B (en) | Method and system for matching discriminant color regions with maximum video target space-time stability | |
Aradhya | Elegant and efficient algorithms for real time object detection, counting and classification for video surveillance applications from single fixed camera | |
Pratomo et al. | Parking detection system using background subtraction and HSV color segmentation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |