CN106056078B - Crowd density estimation method based on multi-feature regression type ensemble learning - Google Patents

Crowd density estimation method based on multi-feature regression type ensemble learning Download PDF

Info

Publication number
CN106056078B
CN106056078B CN201610374700.6A CN201610374700A CN106056078B CN 106056078 B CN106056078 B CN 106056078B CN 201610374700 A CN201610374700 A CN 201610374700A CN 106056078 B CN106056078 B CN 106056078B
Authority
CN
China
Prior art keywords
image
scene
density estimation
value
head
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610374700.6A
Other languages
Chinese (zh)
Other versions
CN106056078A (en
Inventor
郑宏
张洞明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Research Institute of Wuhan University
Original Assignee
Shenzhen Research Institute of Wuhan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Research Institute of Wuhan University filed Critical Shenzhen Research Institute of Wuhan University
Priority to CN201610374700.6A priority Critical patent/CN106056078B/en
Publication of CN106056078A publication Critical patent/CN106056078A/en
Application granted granted Critical
Publication of CN106056078B publication Critical patent/CN106056078B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/52Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • G06V20/53Recognition of crowd images, e.g. recognition of crowd congestion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines

Abstract

The invention relates to a crowd density estimation method based on multi-feature regression type ensemble learning, which is characterized in that the width of the head of a person is taken as a reference to carry out multi-level image blocking on a scene frame image, and the blocks are subjected to image scaling and Gamma correction to realize the consistency of the image scale and illumination; the method comprises the steps of constructing a density estimation model by utilizing preprocessed samples, extracting three features of D-SIFT, GLCM and GIST to construct a first-layer support vector machine regression (SVR) coarse prediction model, constructing a second-layer SVR fine prediction model by taking the coarse prediction results as new features, and adding the fine prediction results of all sub-images to perform density estimation according to the people number grades set by a scene. The method solves the problems of scene illumination change, camera height angle change and pedestrian shielding, and is suitable for a plurality of different scenes to realize crowd density estimation by utilizing a plurality of scene samples to adopt a plurality of characteristics and applying a regression mode to integrate and learn a construction model.

Description

Crowd density estimation method based on multi-feature regression type ensemble learning
Technical Field
The invention relates to the technical field of digital image processing and pattern recognition, in particular to a crowd density estimation method based on multi-feature regression type ensemble learning.
Background
Along with the improvement of living standard of people, the urbanization progress is accelerated continuously, and the collective activities of large-scale public places are frequent day by day, so that accidents caused by dense crowds are frequent in recent years. Therefore, how to use computer vision to carry out real-time intelligent monitoring on the crowd, make crowd density estimation in time and take effective measures has important significance for guaranteeing social stability and crowd safety.
Current methods for estimating population density can be divided into two major categories:
1) the direct method comprises the following steps: direct methods use some classifiers to attempt to segment or detect each individual in a population and then count to obtain population density. These methods can be further divided into two subclasses: model-based methods: detection or segmentation is performed by a model or a shape contour of a person. For example, a method for extracting human head contour features based on Haar wavelet transform and detecting pedestrians by combining a support vector Machine (Lin S F, Chen J Y, Chao H X. estimation of number of people in their crowned scenes using a productive transformation [ J ]. Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on, 2001,31(6):645 (654)), Feizzwald et al propose a method for detecting pedestrians based on components and improved gradient Histogram (HOG) features (Feizzwanwa P F, Girshick R B, occupant D, et al, object estimation of motion with detection of pedestrian position [ 2010, noise ] using a forest detection framework [ 11, 1625 ] and a method for detecting pedestrians using a forest Analysis [ 7, J7 ] and a forest detection framework (IEEE J7, 11), lempitsky V.class-specific hough recommendations for object detection [ M ]. precision requirements for computer vision and medical Image analysis. Springer London,2013: 143-; a track clustering-based method: each individual is detected by clustering points of interest on the pedestrian through long-term tracking. For example, Rabaud and Belongie propose a method for clustering trajectories and inferring scene population using a Kanade-Lucas-Tomasi (KLT) tracker and a series of underlying features (Rabaud V, Belongie S. counting drained moving objects [ C ]. Computer Vision and Pattern registration, 2006IEEE Computer Society reference on. IEEE,2006,1:705 well 711), Rao et al (Rao A S, Gubbi J, Marusic S, et al. The direct method has a good effect under the condition of a small number of people in a scene, but the defect is obvious, the crowd is seriously overlapped under the condition of crowding, and the performance of the direct method is linearly reduced.
2) An indirect method: the indirect method treats the crowd as a whole, and obtains the crowd density by extracting the characteristics of textures and the like for the crowd and combining a regression model. Indirect methods can also be divided into three categories: analysis based on pixels: these methods first remove the scene background and then use some very underlying features to estimate crowd density. Davies et al (Davies A C, Yin J H, vehicle S A. crowd monitoring using image processing [ J ]. Electronics & communication Engineering Journal,1995,7(1):37-47) estimate people by linear relationship by extracting foreground and analyzing crowd foreground and edge pixels and adding perspective correction. Hussasin et al (Hussain N, Yatim H S M, Hussain N L, et al CDES: A pixel-based visibility estimation system for massive-Haram [ J ]. Safety Science,2011,49(6):824-833) extract underlying features on foreground pixels corrected for perspective distortion by scaling and then use a neural network to supervise training, the trained model estimates sparse population accurately, but as the density increases, population occlusion occurs, the wrong estimation rises linearly; texture and gradient based method: texture and gradient features may better represent the number of people in a scene than a pixel-based approach. Texture and Gradient features used in population density estimation include Gray-level co-occlusion matrix (GLCM), ULBP feature (Uniform local Pattern), HOG feature, and Gradient orientation co-occlusion matrix (GOCM), among others; thirdly, the method based on the characteristic points comprises the following steps: feature points are feature pixels of interest, e.g. corner points detected in the image. For example, Conte et al (Conte D, Foggia P, Percannella G, et al. counting moving magnets in raised scenes [ J ]. Motor vision and Applications,2013,24(5): 1029-. Kishor et al (Kishore P V V, Rahul R, Sravya K, et al. crown definition Analysis and tracking [ C ]. Advances in Computing, Communications and information (ICACCI),2015International reference on IEEE,2015: 1209-. Indirect methods usually need to extract foreground or motion information to reduce background interference, and in practical applications, because of illumination changes, continuous crowding of pedestrians, various background factors and the like, extraction of foreground and motion information becomes a difficult task, so that accurate estimation of the methods is difficult to make in practical applications
Disclosure of Invention
The invention aims to provide a crowd density estimation method based on multi-feature regression type ensemble learning.
In order to achieve the purpose, the invention adopts the following technical scheme: a crowd density estimation method based on multi-feature regression ensemble learning comprises the following steps:
image blocking step: acquiring a video monitoring frame image of a scene, performing multi-level image blocking on the scene by taking the width of a human head as a reference, performing scaling processing on the multi-level block image to unify the size, and performing Gamma correction preprocessing to obtain a sub-image sample;
a crowd density estimation step: carrying out coarse prediction on three characteristics of D-SIFT, GLCM and GIST of the sub-image sample by adopting a first layer of support vector regression model; and performing fine prediction by using the coarse prediction result as a new feature and using a second-layer support vector regression model, adding the fine prediction results of all the sub-image samples, and performing density estimation according to the set crowd density grade of the scene.
Preferably, the multi-level image blocking specifically comprises the following steps:
firstly, a scene interest area is defined, then the size of a first layer block image is determined, a reference pedestrian is selected, when the head of the reference pedestrian just enters the bottom boundary of the interest area, the width of the head of the reference pedestrian is measured to be w pixels, then the width of the first layer block image is set to be w pixels 128/42 pixels, then the reference pedestrian continuously moves forwards until the width of the head is w pixels 21/42 w/2 pixels, and the length from the top of the head to the bottom boundary of the interest area is the height of the first layer block image;
determining the size of the second layer block image, selecting a reference pedestrian, and measuring the head width w when the head of the reference pedestrian just crosses the upper edge of the first layer block image1Setting the width of the second layer block as the head width w1128/42 pixels, then moving forward with reference to the pedestrian until the head width is w1*21/42=w1When the pixel is 2, the length from the head to the upper side of the first layer block image is the height of the second layer block image;
and determining the size of the third-layer block image by analogy until the multi-layer block image completes non-overlapping full coverage on the scene interest area.
Preferably, the widths and heights of the multiple-level block images after being subjected to scaling processing and unified in size are all 128 pixels.
Preferably, the step of obtaining the sub-image from the multi-level block image through Gamma correction preprocessing comprises: firstly, dividing a pixel value into three intervals of 0-255, and then converting the pixel value into an angle, wherein the specific expression is as follows:
Figure GDA0003146549090000041
where x is the pixel value, x0And x1Respectively, a set pixel threshold value, E1=[0,x0],E2=[x0,x1],E3=[x1,255],
Figure GDA0003146549090000042
Then it is the converted angle;
the gamma value γ (x) is then determined using trigonometric relationships, defined as follows:
Figure GDA0003146549090000043
the fluctuation of the Gamma value is too large by simply adjusting the Gamma value by the weight value a, so the weight value b is introduced and the linear correction function shown in the formula (3) is adopted for correction
Figure GDA0003146549090000044
The final corrected Gamma value is defined as
Figure GDA0003146549090000045
The pixel corrected value is
Figure GDA0003146549090000046
Preferably, the crowd density estimating step comprises:
respectively extracting D-SIFT, GLCM and GIST characteristics from the sub-image samples;
respectively training a coarse prediction model by utilizing a first-layer support vector regression model for the extracted features, and obtaining different number of people coarse prediction values corresponding to the three features of D-SIFT, GLCM and GIST through the coarse prediction model for the test sample set;
the rough people number prediction value is used as a new feature, a second-layer support vector regression model is used for training a fine prediction model, and the result of the rough people number prediction is obtained through the fine prediction model to obtain more accurate sub-image sample people number prediction, namely a fine prediction value;
adding the fine prediction values of all sub-image samples of one frame of image, counting the number of people in the scene interest area,
and obtaining the crowd density estimation value of the current frame according to the density classification standard of the scene interest area.
Compared with the prior art, the invention has the beneficial effects that: the method solves the problems of scene illumination change, camera height angle change and pedestrian shielding, and is suitable for a plurality of different scenes to realize crowd density estimation by utilizing a plurality of scene samples to adopt a plurality of characteristics and applying a regression mode to integrate and learn a construction model.
The invention is further described below with reference to the accompanying drawings and specific embodiments.
Drawings
FIG. 1 is a schematic flow diagram of the present invention;
FIG. 2 is a schematic diagram of block image sizing;
FIG. 3 is a diagram illustrating a correspondence between a multi-level block image and a scene interest area;
fig. 4 is a flowchart of regression ensemble learning.
Detailed Description
In order to more fully understand the technical contents of the present invention, the technical solutions of the present invention will be further described and illustrated with reference to specific embodiments.
As shown in fig. 1, which is a schematic flow chart of the present invention, a population density estimation method based on multi-feature regression ensemble learning includes the following steps:
image blocking step: acquiring a video monitoring frame image of a scene, performing multi-level image blocking on the scene by taking the width of a human head as a reference, performing scaling processing on the multi-level block image to unify the size, and performing Gamma correction preprocessing to obtain a sub-image sample;
a crowd density estimation step: carrying out coarse prediction on three characteristics of D-SIFT, GLCM and GIST of the sub-image sample by adopting a first layer of support vector regression model; and performing fine prediction by using the coarse prediction result as a new feature and using a second-layer support vector regression model, adding the fine prediction results of all the sub-image samples, and performing density estimation according to the set crowd density grade of the scene.
Further, as shown in fig. 2, a schematic diagram is determined for the block image size; as shown in fig. 3, it is a schematic diagram of the correspondence between the multi-level block image and the scene interest area; in the above technical solution, the specific steps of partitioning the multi-level image are as follows:
firstly, a scene interest area is defined, then the size of a first layer block image is determined, a reference pedestrian is selected, when the head of the reference pedestrian just enters the bottom boundary of the interest area, the width of the head of the reference pedestrian is measured to be w pixels, then the width of the first layer block image is set to be w pixels 128/42 pixels, then the reference pedestrian continuously moves forwards until the width of the head is w pixels 21/42 w/2 pixels, and the length from the top of the head to the bottom boundary of the interest area is the height of the first layer block image;
determining the size of the second layer block image, selecting a reference pedestrian, and measuring the head width w when the head of the reference pedestrian just crosses the upper edge of the first layer block image1Setting the width of the second layer block as the head width w1128/42 pixels, then moving forward with reference to the pedestrian until the head width is w1*21/42=w1When the pixel is 2, the length from the head to the upper side of the first layer block image is the height of the second layer block image;
and determining the size of the third-layer block image by analogy until the multi-layer block image completes non-overlapping full coverage on the scene interest area.
The method of image blocking with human head width as reference is adopted, the image of a frame is divided into blocks with different sizes in a plurality of layers from near to far, the blocks are used as basic elements to construct a model and predict the number of people, and the problem of perspective projection effect can be solved.
After image blocking, a plurality of multi-level block images with different distances, sizes, different times and weather are obtained, and before the characteristics are extracted, the multi-level block images need to be preprocessed to reduce environmental interference and training amount,
the multi-level block images are firstly subjected to scaling processing to be uniform in size, the width and the height after the sizes are uniform are all 128 pixels, so that the block images with different distances can be uniformly trained into samples with the same size by normalizing the sizes, the samples at far and near positions do not need to be separately trained, and the training amount is greatly reduced.
Secondly, in order to reduce the influence caused by ambient light, Gamma correction needs to be carried out on the block image, the multi-level block image is preprocessed by the Gamma correction to obtain a sub-image, and the method specifically comprises the following steps: firstly, dividing a pixel value into three intervals of 0-255, and then converting the pixel value into an angle, wherein the specific expression is as follows:
Figure GDA0003146549090000061
where x is the pixel value, x0And x1Respectively, a set pixel threshold value, E1=[0,x0],E2=[x0,x1],E3=[x1,255],
Figure GDA0003146549090000062
Then it is the converted angle;
the gamma value γ (x) is then determined using trigonometric relationships, defined as follows:
Figure GDA0003146549090000063
the fluctuation of the Gamma value is too large by simply adjusting the Gamma value by the weight value a, so the weight value b is introduced and the linear correction function shown in the formula (3) is adopted for correction
Figure GDA0003146549090000071
The final corrected Gamma value is defined as
Figure GDA0003146549090000072
The pixel corrected value is
Figure GDA0003146549090000073
Further, as shown in fig. 4, which is a flowchart of the regression-based ensemble learning, the crowd density estimating step includes:
respectively extracting D-SIFT, GLCM and GIST characteristics from the sub-image samples, and setting the characteristics as xD-SIFT、xGLCMAnd xGIST
Respectively training a rough prediction model by using a first-layer support vector regression model for the extracted features, and obtaining three models f by regression fitting of the first-layer support vector regression model for a test sample set1(xD-SIFT)、f2(xGLCM) And f3(xGIST) And the model outputs the predicted values y of the three characteristics of D-SIFT, GLCM and GISTD-SIFT、yGLCMAnd yGISTAnd combining the three predicted values into new characteristics corresponding to the rough predicted values of different people numbers:
xALL=[yD-SIFT,yGLCM,yGIST] (11)
training the new feature to a fine prediction model f by using a second-layer support vector regression modelFinal(xALL) The result of the rough people number prediction is subjected to a fine prediction model to obtain more accurate sub-image sample people number prediction yFinalI.e. the fine prediction value; the regression ensemble learning includes two parts: a training (learning) part and a prediction (application) part, as shown in fig. 4, wherein the training part is a training regression model, firstly, the characteristics of a plurality of sub-images are extracted, the number of people of each sub-image is counted as the number label of the person to form a sample set of the training part, then the sample set is divided into a training set and a test set, the coarse regression model corresponding to the three characteristics is trained through the training set, and the test set can obtain corresponding prediction output, namely a coarse prediction value, through the coarse regression model. Three are to be arrangedAnd (4) taking the rough predicted value of the model as a new feature and combining the population labels to form a new sample set, and continuously dividing the new sample set into a new training set and a new testing set. And training the fine regression model through the new training set, and judging whether the model is accurate or not by the new testing set through obtaining the fine prediction value through the fine regression model.
The prediction part predicts the number of people through a trained model. Extracting features from a test sample of unknown people, obtaining a rough predicted value by combining a rough regression model trained by a training part, and inputting three rough predicted values serving as new features into a fine regression model to obtain a fine predicted value, namely the final people number prediction.
Considering that the sensitivities of different features to the population density are inconsistent, the two-layer regression can be adopted to make up for the deficiencies of each other, and the prediction accuracy can be improved.
Adding the fine prediction values of all sub-image samples of one frame of image, counting the number of people in the scene interest area,
and obtaining the crowd density estimation value of the current frame according to the density classification standard of the scene interest area. For example: assuming that the current scene can accommodate the maximum number of people nmaxFor the standard, an average classification is adopted, and the classification is divided into five grades: [0, n ]max/5]、[nmax/5,2nmax/5]、[2nmax/5,3nmax/5]、[3nmax/5,4nmax/5]And [4nmaxAnd/5, ∞), respectively denoted as VL (very low), L (low), M (medium), H (high), and VH (very high), and the population density estimation can be done by comparing the above criteria against the population count of the region of interest of the statistical scene.
The technical contents of the present invention are further illustrated by the examples, so as to facilitate the understanding of the reader, but the embodiments of the present invention are not limited thereto, and any technical extension or re-creation based on the present invention is protected by the present invention.

Claims (4)

1. A crowd density estimation method based on multi-feature regression ensemble learning is characterized by comprising the following steps:
image blocking step: acquiring a video monitoring frame image of a scene, performing multi-level image blocking on the scene by taking the width of a human head as a reference, performing scaling processing on the multi-level block image to unify the size, and performing Gamma correction preprocessing to obtain a sub-image sample;
a crowd density estimation step: carrying out coarse prediction on three characteristics of D-SIFT, GLCM and GIST of the sub-image sample by adopting a first layer of support vector regression model; using the coarse prediction result as a new feature to perform fine prediction by using a second-layer support vector regression model, adding the fine prediction results of all sub-image samples, and performing density estimation according to the set crowd density grade of the scene;
the method for partitioning the multi-level image comprises the following specific steps:
firstly, a scene interest area is defined, then the size of a first layer block image is determined, a reference pedestrian is selected, when the head of the reference pedestrian just enters the bottom boundary of the interest area, the width of the head of the reference pedestrian is measured to be w pixels, then the width of the first layer block image is set to be w pixels 128/42 pixels, then the reference pedestrian continuously moves forwards until the width of the head is w pixels 21/42 w/2 pixels, and the length from the top of the head to the bottom boundary of the interest area is the height of the first layer block image;
determining the size of the second layer block image, selecting a reference pedestrian, and measuring the head width w when the head of the reference pedestrian just crosses the upper edge of the first layer block image1Setting the width of the second layer block as the head width w1128/42 pixels, then moving forward with reference to the pedestrian until the head width is w1*21/42=w1When the pixel is 2, the length from the head to the upper side of the first layer block image is the height of the second layer block image;
and determining the size of the third-layer block image by analogy until the multi-layer block image completes non-overlapping full coverage on the scene interest area.
2. The crowd density estimation method based on multi-feature regression ensemble learning of claim 1, wherein the widths and heights of the multi-level block images after scaling processing and size unification are all 128 pixels.
3. The crowd density estimation method based on the multi-feature regression type ensemble learning of claim 2, wherein the step of obtaining the sub-images from the multi-level block images through Gamma correction preprocessing comprises: firstly, dividing a pixel value into three intervals of 0-255, and then converting the pixel value into an angle, wherein the specific expression is as follows:
Figure FDA0003146549080000021
where x is the pixel value, x0And x1Respectively, a set pixel threshold value, E1=[0,x0],E2=[x0,x1],E3=[x1,255],
Figure FDA0003146549080000022
Then it is the converted angle;
the gamma value γ (x) is then determined using trigonometric relationships, defined as follows:
Figure FDA0003146549080000023
the fluctuation of the Gamma value is too large by simply adjusting the Gamma value by the weight value a, so the weight value b is introduced and the linear correction function shown in the formula (3) is adopted for correction
Figure FDA0003146549080000024
The final corrected Gamma value is defined as
Figure FDA0003146549080000025
The pixel corrected value is
Figure FDA0003146549080000026
4. The crowd density estimation method based on the multi-feature regression-based ensemble learning according to claim 1, 2 or 3, wherein the crowd density estimation step comprises:
respectively extracting D-SIFT, GLCM and GIST characteristics from the sub-image samples;
respectively training a coarse prediction model by utilizing a first-layer support vector regression model for the extracted features, and obtaining different number of people coarse prediction values corresponding to the three features of D-SIFT, GLCM and GIST through the coarse prediction model for the test sample set;
the rough people number prediction value is used as a new feature, a second-layer support vector regression model is used for training a fine prediction model, and the result of the rough people number prediction is obtained through the fine prediction model to obtain more accurate sub-image sample people number prediction, namely a fine prediction value;
and adding the fine predicted values of all sub-image samples of one frame of image, counting the number of people in the scene interest area, and obtaining the crowd density estimation value of the current frame according to the density classification standard of the scene interest area.
CN201610374700.6A 2016-05-31 2016-05-31 Crowd density estimation method based on multi-feature regression type ensemble learning Active CN106056078B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610374700.6A CN106056078B (en) 2016-05-31 2016-05-31 Crowd density estimation method based on multi-feature regression type ensemble learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610374700.6A CN106056078B (en) 2016-05-31 2016-05-31 Crowd density estimation method based on multi-feature regression type ensemble learning

Publications (2)

Publication Number Publication Date
CN106056078A CN106056078A (en) 2016-10-26
CN106056078B true CN106056078B (en) 2021-09-14

Family

ID=57172224

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610374700.6A Active CN106056078B (en) 2016-05-31 2016-05-31 Crowd density estimation method based on multi-feature regression type ensemble learning

Country Status (1)

Country Link
CN (1) CN106056078B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106878952B (en) * 2017-03-20 2020-07-10 迪爱斯信息技术股份有限公司 Regional personnel number prediction method and device
CN107480786B (en) * 2017-08-07 2021-04-30 复旦大学 Output state limitation-based recurrent neural network track likelihood probability calculation method
CN108985256A (en) * 2018-08-01 2018-12-11 曜科智能科技(上海)有限公司 Based on the multiple neural network demographic method of scene Density Distribution, system, medium, terminal
CN110598630A (en) * 2019-09-12 2019-12-20 江苏航天大为科技股份有限公司 Method for detecting passenger crowding degree of urban rail transit based on convolutional neural network
CN112733624B (en) * 2020-12-26 2023-02-03 电子科技大学 People stream density detection method, system storage medium and terminal for indoor dense scene

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103761723A (en) * 2014-01-22 2014-04-30 西安电子科技大学 Image super-resolution reconstruction method based on multi-layer supporting vectors

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103761723A (en) * 2014-01-22 2014-04-30 西安电子科技大学 Image super-resolution reconstruction method based on multi-layer supporting vectors

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
Crowd Density Estimation Using Texture Analysis and Learning;X. Wu 等;《2006 IEEE International Conference on Robotics and Biomimetics》;20070402;第214-219页 *
Modeling the shape of the scene: a holistic representation of the spatial envelope;Oliva A 等;《International Journal of Computer Vision》;20010531;第42卷(第3期);第145-175页 *
Visual categorization with bags of keypoints;Gabriela Csurka 等;《ResearchGate》;20040131;第2-17页 *
基于Gist特征与PHOG特征融合的多场景分类;肖保良;《中北大学学报(自然科学版)》;20141231;第35卷(第6期);第690-694页 *
基于GLCM纹理特征分析的人群密度估计方法实现;侯鹏鹏;《中国安防》;20141215(第23期);第88-90页 *
多种人群密度场景下的人群计数;覃勋辉 等;《中国图象图形学报》;20130430;第18卷(第4期);摘要、第1节 *
大规模群体人数检测算法研究;郭婷;《中国优秀硕士学位论文全文数据库 (信息科技辑)》;20150315(第3期);第5.1节 *

Also Published As

Publication number Publication date
CN106056078A (en) 2016-10-26

Similar Documents

Publication Publication Date Title
US20230289979A1 (en) A method for video moving object detection based on relative statistical characteristics of image pixels
CN106056078B (en) Crowd density estimation method based on multi-feature regression type ensemble learning
CN108492319B (en) Moving target detection method based on deep full convolution neural network
CN107909081B (en) Method for quickly acquiring and quickly calibrating image data set in deep learning
US9639748B2 (en) Method for detecting persons using 1D depths and 2D texture
CN103310444B (en) A kind of method of the monitoring people counting based on overhead camera head
CN106570486A (en) Kernel correlation filtering target tracking method based on feature fusion and Bayesian classification
CN104166841A (en) Rapid detection identification method for specified pedestrian or vehicle in video monitoring network
CN109918971B (en) Method and device for detecting number of people in monitoring video
CN109840483B (en) Landslide crack detection and identification method and device
CN103218816A (en) Crowd density estimation method and pedestrian volume statistical method based on video analysis
CN108280397A (en) Human body image hair detection method based on depth convolutional neural networks
CN105740945A (en) People counting method based on video analysis
CN103310194A (en) Method for detecting head and shoulders of pedestrian in video based on overhead pixel gradient direction
CN109255326B (en) Traffic scene smoke intelligent detection method based on multi-dimensional information feature fusion
CN106204594A (en) A kind of direction detection method of dispersivity moving object based on video image
CN107154044B (en) Chinese food image segmentation method
CN109685045A (en) A kind of Moving Targets Based on Video Streams tracking and system
CN111553214B (en) Method and system for detecting smoking behavior of driver
CN105405138A (en) Water surface target tracking method based on saliency detection
CN106778637B (en) Statistical method for man and woman passenger flow
Surkutlawar et al. Shadow suppression using RGB and HSV color space in moving object detection
CN107103301B (en) Method and system for matching discriminant color regions with maximum video target space-time stability
Aradhya Elegant and efficient algorithms for real time object detection, counting and classification for video surveillance applications from single fixed camera
Pratomo et al. Parking detection system using background subtraction and HSV color segmentation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant