CN111507272A - Method and system for identifying pedestrian attributes in monitoring scene - Google Patents
Method and system for identifying pedestrian attributes in monitoring scene Download PDFInfo
- Publication number
- CN111507272A CN111507272A CN202010310527.XA CN202010310527A CN111507272A CN 111507272 A CN111507272 A CN 111507272A CN 202010310527 A CN202010310527 A CN 202010310527A CN 111507272 A CN111507272 A CN 111507272A
- Authority
- CN
- China
- Prior art keywords
- image
- attribute
- pedestrian
- detected
- determining
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012544 monitoring process Methods 0.000 title claims abstract description 33
- 238000000034 method Methods 0.000 title claims abstract description 31
- 238000013528 artificial neural network Methods 0.000 claims abstract description 27
- 238000007781 pre-processing Methods 0.000 claims abstract description 21
- 238000010606 normalization Methods 0.000 claims description 19
- 229910052582 BN Inorganic materials 0.000 claims description 12
- PZNSFCLAULLKQX-UHFFFAOYSA-N Boron nitride Chemical compound N#B PZNSFCLAULLKQX-UHFFFAOYSA-N 0.000 claims description 12
- 238000012545 processing Methods 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 7
- 230000006870 function Effects 0.000 claims description 6
- 230000004913 activation Effects 0.000 claims description 3
- 229910052732 germanium Inorganic materials 0.000 claims description 3
- GNPVGFCGXDBREM-UHFFFAOYSA-N germanium atom Chemical compound [Ge] GNPVGFCGXDBREM-UHFFFAOYSA-N 0.000 claims description 3
- 230000014509 gene expression Effects 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000005537 brownian motion Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/103—Static body considered as a whole, e.g. static pedestrian or occupant recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
The invention relates to a pedestrian attribute identification method and a system in a monitoring scene, wherein the attribute identification method comprises the following steps: acquiring an image of a pedestrian to be detected in a monitoring scene; preprocessing the pedestrian image to be detected to obtain a processed image; obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network; determining the weight parameters of each attribute classifier according to the characteristics of the full-connection layer and the convolution image; determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters; determining a predicted value of the corresponding attribute based on each network attribute value; and determining the attribute type of the pedestrian image to be detected according to the predicted values. Extracting the convolution image characteristics of the pedestrian image to be detected through a deep neural network, and determining the weight parameters of each attribute classifier; and obtaining network attribute values under different attribute classifiers, and further obtaining a predicted value of the corresponding attribute so as to accurately determine the attribute type of the pedestrian image to be detected.
Description
Technical Field
The invention relates to the technical field of visual scene processing and analysis, in particular to a pedestrian attribute identification method and system in a monitoring scene.
Background
In recent years, the fields of computer vision, artificial intelligence, machine perception and the like are rapidly developed. With the wide deployment of the placement of the cameras, how to perform efficient pedestrian attribute identification in a monitoring scene is widely concerned.
The pedestrian attribute identification in the monitoring scene is to utilize a computer algorithm to process and analyze the pedestrian pictures in the video, and automatically obtain the attribute categories contained by a certain pedestrian, such as age, gender, backpack, clothing and the like. Thereby providing support and assistance for the pedestrian picture retrieval and pedestrian re-identification technology at the downstream.
In the traditional method, the feature expression of the pedestrian picture is obtained by constructing the manually designed picture features, and the performance of the traditional method is not enough to meet the application requirements in the actual scene. With the wide use of deep learning in recent years, many pedestrian attribute algorithms start from two aspects of better feature expression and attribute relationship modeling, the pedestrian attribute identification method in the monitoring scene is continuously improved, and the development of the pedestrian attribute identification field is promoted.
Although a great deal of work is carried out before, the performance of pedestrian attribute identification is obviously improved by learning the relationship between visual feature expressions with higher discriminability and better modeling attributes, the parameter quantity of the model and the complexity of calculation are inevitably increased by each method, and the difficulty of pedestrian attribute identification is increased.
Disclosure of Invention
In order to solve the above problems in the prior art, that is, to improve the identification of the pedestrian attribute, the present invention aims to provide a method and a system for identifying the pedestrian attribute in a monitoring scene.
In order to solve the technical problems, the invention provides the following scheme:
a method for identifying attributes of pedestrians in a monitoring scene comprises the following steps:
acquiring an image of a pedestrian to be detected in a monitoring scene;
preprocessing the pedestrian image to be detected to obtain a processed image;
obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
Optionally, the preprocessing the image of the pedestrian to be detected to obtain a processed image specifically includes:
zooming the to-be-detected pedestrian image to obtain a zoomed image;
randomly and horizontally overturning the zoomed image to obtain an overturned image;
and filling zero in the overturning image to obtain a preprocessed image.
Optionally, obtaining a convolution image feature X of the pedestrian image to be detected according to the following formulaimg:
Ximg=fcnn(Iimg;θcnn);
wherein , a real number space in which the pedestrian features are located, CfeatNumber of layers being convolution features, i.e. dimension of pedestrian feature space, fcnnAs a deep neural network, IimgTo process images, Iimg∈RH×W×CH is the height of the convolution signature,w is the width of the convolution feature map, C is the number of processed image layers input by the deep neural network, and thetacnnIs a learnable parameter of a deep neural network.
Optionally, the determining the weight parameter of each attribute classifier according to the full connection layer and the convolution image feature specifically includes:
establishing an attribute classifier Cls (X) for identifying the attribute of the pedestrian image according to the full connection layer and the characteristics of the convolution imageimg;θcls); wherein ,XimgFor convolution of image features, θclsIs a weight parameter of the attribute classifier, for the real space in which the weight parameters of the attribute classifier are located, CfeatNumber of layers, N, characteristic of convolutionattrMarking the number of attributes pre-stored in a database;
determining a weight parameter of each attribute classifier based on the attribute classifier identified by the pedestrian image attribute wherein ,i is the serial number of the current attribute classifier, the number of the attribute classifiers is consistent with the attribute class, i is 1, 2, …, Nattr。
Optionally, the determining, based on the convolution image features and the weight parameters, a network attribute value of the pedestrian image to be detected under different attribute classifiers includes:
respectively carrying out normalization processing on the convolution image characteristics and the weight parameters to obtain corresponding normalization characteristics and normalization weight parameters;
and determining the network attribute values of the pedestrian images to be detected under different attribute classifiers according to the normalization features and the normalization weight parameters.
Optionally, determining the network attribute values of the pedestrian image to be detected under different attribute classifiers according to the following formula:
wherein ,NattrI is the serial number of the current attribute classifier for the number of labeled attributes pre-stored in the database,the network attribute value of the pedestrian image to be detected under the ith attribute classifier is α which is a scaling factor,in order to normalize the characteristics of the features,normalized weight parameters for the i attribute classifiers.
Optionally, the predicted value of the corresponding attribute is determined according to the following formula:
wherein ,NattrI is the serial number of the current attribute classifier for the number of labeled attributes pre-stored in the database,the predicted value of the ith attribute of the pedestrian image to be detected,and BN (boron nitride) is a batch normalization layer processing function, and Sigmoid (germanium) is a neural network activation function, wherein BN (boron nitride) is a network attribute value of the pedestrian image to be detected under the ith attribute classifier.
In order to solve the technical problems, the invention also provides the following scheme:
a system for identifying attributes of pedestrians in a monitored scene, the system comprising:
the acquiring unit is used for acquiring an image of a pedestrian to be detected in a monitoring scene;
the preprocessing unit is used for preprocessing the pedestrian image to be detected to obtain a processed image;
the characteristic determining unit is used for obtaining the convolution image characteristic of the pedestrian image to be detected according to the processed image through the deep neural network;
the parameter determining unit is used for determining the weight parameters of the attribute classifiers according to the full-connection layer and the convolution image characteristics;
the calculation unit is used for determining the network attribute values of the pedestrian images to be detected under different attribute classifiers based on the convolution image characteristics and the weight parameters;
the prediction unit is used for determining a prediction value of the corresponding attribute based on the network attribute value of each pedestrian image to be detected;
and the attribute determining unit is used for determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
In order to solve the technical problems, the invention also provides the following scheme:
a system for identifying pedestrian attributes in a monitored scene, comprising:
a processor; and
a memory arranged to store computer executable instructions that, when executed, cause the processor to:
acquiring an image of a pedestrian to be detected in a monitoring scene;
preprocessing the pedestrian image to be detected to obtain a processed image;
obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
In order to solve the technical problems, the invention also provides the following scheme:
a computer-readable storage medium storing one or more programs that, when executed by an electronic device including a plurality of application programs, cause the electronic device to:
acquiring an image of a pedestrian to be detected in a monitoring scene;
preprocessing the pedestrian image to be detected to obtain a processed image;
obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
According to the embodiment of the invention, the invention discloses the following technical effects:
extracting the convolution image characteristics of the pedestrian image to be detected from the preprocessed processed image through a deep neural network, and further obtaining the weight parameters of each attribute classifier; and determining the network attribute values of the pedestrian images to be detected under different attribute classifiers based on the convolution image characteristics and the weight parameters, and further obtaining the predicted values of the corresponding attributes, so that the attribute types of the pedestrian images to be detected can be accurately determined.
Drawings
FIG. 1 is a flow chart of a method for pedestrian attribute identification in a surveillance scene according to the present invention;
FIG. 2 is a schematic block diagram of a pedestrian attribute identification system in a monitoring scene according to the present invention.
Description of the symbols:
the system comprises an acquisition unit-1, a preprocessing unit-2, a feature determination unit-3, a parameter determination unit-4, a calculation unit-5, a prediction unit-6 and an attribute determination unit-7.
Detailed Description
Preferred embodiments of the present invention are described below with reference to the accompanying drawings. It should be understood by those skilled in the art that these embodiments are only for explaining the technical principle of the present invention, and are not intended to limit the scope of the present invention.
The invention aims to provide a pedestrian attribute identification method in a monitoring scene, which extracts the convolution image characteristics of a pedestrian image to be detected from a preprocessed processed image through a deep neural network, and further obtains the weight parameters of each attribute classifier; and determining the network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image characteristics and the weight parameters, and further obtaining the predicted values of the corresponding attributes, so that the attribute type of the pedestrian image to be detected can be accurately determined.
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
As shown in fig. 1, the method for identifying the pedestrian attributes in the monitoring scene of the present invention includes:
step 100: acquiring an image of a pedestrian to be detected in a monitoring scene;
step 200: preprocessing the pedestrian image to be detected to obtain a processed image;
step 300: obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
step 400: determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
step 500: determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
step 600: determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
step 700: and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
Further, in step 200, the preprocessing the image of the pedestrian to be detected to obtain a processed image specifically includes:
step 201: and carrying out zooming processing on the pedestrian image to be detected to obtain a zoomed image. For example, by scaling, so that the image I of the pedestrian to be testedpedesIs 0.75, but not limited thereto.
Step 202: and carrying out random horizontal turning on the zoomed image to obtain a turned image.
And filling zero in the overturning image to obtain a preprocessed image.
Optionally, in step 300, obtaining a convolution image feature X of the pedestrian image to be detected according to the following formulaimg:
Ximg=fcnn(Iimg;θcnn);
wherein , a real number space in which the pedestrian features are located, CfeatNumber of layers, f, characteristic of convolutioncnnAs a deep neural network, IimgTo process images, Iimg∈RH×W×CH is the height of the convolution feature map, W is the width of the convolution feature map, and C is the processed image input by the deep neural networkNumber of layers, thetacnnIs a learnable parameter of a deep neural network.
Preferably, in step 400, the determining a weight parameter of each attribute classifier according to the full-connected layer and the feature of the convolution image specifically includes:
step 401: establishing an attribute classifier Cls (X) for identifying the attribute of the pedestrian image according to the full connection layer and the characteristics of the convolution imageimg;θcls)。
wherein ,XimgFor convolution of image features, θclsIs a weight parameter of the attribute classifier, for the real space in which the weight parameters of the attribute classifier are located, CfeatNumber of layers, N, characteristic of convolutionattrThe number of attributes is noted for pre-storage in the database.
Step 402: determining a weight parameter of each attribute classifier based on the attribute classifier identified by the pedestrian image attribute
wherein ,i is the serial number of the current attribute classifier, the number of the attribute classifiers is consistent with the attribute classes, and i is 1, 2attr。
Further, in step 500, the determining, based on the convolution image features and the weight parameters, the network attribute values of the pedestrian image to be detected under different attribute classifiers specifically includes:
step 501: and respectively carrying out normalization processing on the convolution image characteristics and the weight parameters to obtain corresponding normalization characteristics and normalization weight parameters.
wherein ,in order to normalize the characteristics of the features,for the normalized weight parameters of the i attribute classifiers, | | |. | | represents the absolute value calculation.
After normalization, the normalized features and the modulus length of each normalized weight parameter are 1.
Step 502: and determining the network attribute values of the pedestrian images to be detected under different attribute classifiers according to the normalization features and the normalization weight parameters.
Determining the network attribute values of the pedestrian image to be detected under different attribute classifiers according to the following formula:
wherein ,NattrI is the serial number of the current attribute classifier for the number of labeled attributes pre-stored in the database,the network attribute value of the pedestrian image to be detected under the ith attribute classifier is α which is a scaling factor,in order to normalize the characteristics of the features,normalized weight parameters for the i attribute classifiers.
The loss distribution of the positive and negative samples of the pedestrian attribute can be balanced by introducing and introducing the scaling factor, and in the embodiment, α is 20, but the value is not limited to the value and can be adjusted according to actual needs.
Optionally, the predicted value of the corresponding attribute is determined according to the following formula:
wherein ,NattrI is the serial number of the current attribute classifier for the number of labeled attributes pre-stored in the database,the predicted value of the ith attribute of the pedestrian image to be detected,and BN (boron nitride) is a batch normalization layer processing function, and Sigmoid (germanium) is a neural network activation function, wherein BN (boron nitride) is a network attribute value of the pedestrian image to be detected under the ith attribute classifier.
The predicted value of a certain attribute obtained by the invention can effectively improve the performance of the attribute, and the performance conditions of all the attributes are obtained by averaging the predicted values of all the attributes.
Specifically, the prediction threshold is an average value of the prediction values of all the attributes, the prediction values of all the attributes are compared with the prediction threshold, and the attribute corresponding to the prediction value larger than the prediction threshold is selected as the attribute type of the pedestrian image to be detected. Wherein the attribute types are age, gender, backpack, clothing, etc.
The pedestrian attribute identification in the monitoring scene is realized by a simple and efficient method, so that the pedestrian attribute identification is more suitable for being deployed in hardware facilities in the monitoring scene; 2) the method includes the steps that the pedestrian attribute picture features and the weights of different attribute classifiers are normalized and then calculated to solve the problem that the weights of the pedestrian attribute classifiers depend on prior distribution of pedestrian attributes in a scene; 3) according to the invention, the problem of unbalanced distribution of the positive and negative samples is further solved by introducing the scaling factor, so that the network model is easier to optimize, and the performance of the network model is further improved.
In particular, the present invention has several distinct advantages over the prior art:
1) the calculation amount and the model parameter amount of all the algorithms are obviously higher than those of the invention, and the invention realizes the pedestrian attribute identification performance equivalent to that of the current best method under the condition of using only 63.18 percent of parameter amount and 46.18 percent of calculation amount.
2) The method solves the problem of unbalance of positive and negative samples in pedestrian attribute identification, and realizes that the prediction performance of the pedestrian attributes does not depend on the distribution prior of the pedestrian attributes in a scene by normalizing the pedestrian picture characteristics and the weight of each attribute classifier.
3) The more efficient algorithm and the more lightweight model enable the algorithm to be better applied to monitoring hardware facilities in a scene than other algorithms.
In addition, the invention also provides a pedestrian attribute identification system in the monitoring scene, which can improve the identification of the pedestrian attribute.
As shown in fig. 2, the system for identifying the attributes of pedestrians in the monitoring scene of the present invention includes an obtaining unit 1, a preprocessing unit 2, a feature determining unit 3, a parameter determining unit 4, a calculating unit 5, a predicting unit 6, and an attribute determining unit 7.
The acquiring unit 1 is used for acquiring an image of a pedestrian to be detected in a monitoring scene; the preprocessing unit 2 is used for preprocessing the pedestrian image to be detected to obtain a processed image; the characteristic determining unit 3 is used for obtaining the convolution image characteristic of the pedestrian image to be detected through the deep neural network according to the processed image; the parameter determining unit 4 is configured to determine a weight parameter of each attribute classifier according to the full connection layer and the convolution image feature; the calculating unit 5 is configured to determine network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters; the prediction unit 6 is configured to determine a prediction value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image; the attribute determining unit 7 is configured to determine the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
The invention further provides the following scheme:
a system for identifying pedestrian attributes in a monitored scene, comprising:
a processor; and
a memory arranged to store computer executable instructions that, when executed, cause the processor to:
acquiring an image of a pedestrian to be detected in a monitoring scene;
preprocessing the pedestrian image to be detected to obtain a processed image;
obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
The invention further provides the following scheme:
a computer-readable storage medium storing one or more programs that, when executed by an electronic device including a plurality of application programs, cause the electronic device to:
acquiring an image of a pedestrian to be detected in a monitoring scene;
preprocessing the pedestrian image to be detected to obtain a processed image;
obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
Compared with the prior art, the beneficial effects of the computer-readable storage medium, the pedestrian attribute identification system in the monitoring scene and the pedestrian attribute identification method in the monitoring scene are the same, and are not repeated herein.
Compared with the prior art, the image retrieval system and the computer readable storage medium have the same beneficial effects as the image retrieval method, and are not repeated herein.
So far, the technical solutions of the present invention have been described in connection with the preferred embodiments shown in the drawings, but it is easily understood by those skilled in the art that the scope of the present invention is obviously not limited to these specific embodiments. Equivalent changes or substitutions of related technical features can be made by those skilled in the art without departing from the principle of the invention, and the technical scheme after the changes or substitutions can fall into the protection scope of the invention.
Claims (10)
1. A method for identifying attributes of pedestrians in a monitoring scene is characterized by comprising the following steps:
acquiring an image of a pedestrian to be detected in a monitoring scene;
preprocessing the pedestrian image to be detected to obtain a processed image;
obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
2. The method for identifying the attributes of the pedestrians in the monitored scene according to claim 1, wherein the preprocessing the image of the pedestrian to be detected to obtain a processed image specifically comprises:
zooming the to-be-detected pedestrian image to obtain a zoomed image;
randomly and horizontally overturning the zoomed image to obtain an overturned image;
and filling zero in the overturning image to obtain a preprocessed image.
3. The method for identifying the attributes of pedestrians in the monitored scene according to claim 1, wherein the convolution image feature X of the image of the pedestrian to be detected is obtained according to the following formulaimg:
Ximg=fcnn(Iimg;θcnn);
wherein , a real number space in which the pedestrian features are located, CfeatNumber of layers, f, characteristic of convolutioncnnAs a deep neural network, IimgTo process images, Iimg∈RH×W×CH is the height of the convolution feature map, W is the width of the convolution feature map, C is the number of processed image layers input by the deep neural network, and thetacnnIs a learnable parameter of a deep neural network.
4. The method for identifying the attributes of the pedestrian in the monitored scene according to claim 1, wherein the determining the weight parameters of each attribute classifier according to the full-connected layer and the convolution image features specifically comprises:
establishing an attribute classifier Cls (X) for identifying the attribute of the pedestrian image according to the full connection layer and the characteristics of the convolution imageimg;θcls); wherein ,XimgFor convolution of image features, θclsIs a weight parameter of the attribute classifier,for the real space in which the weight parameters of the attribute classifier are located, CfeatNumber of layers, N, characteristic of convolutionattrMarking the number of attributes pre-stored in a database;
determining a weight parameter of each attribute classifier based on the attribute classifier identified by the pedestrian image attribute wherein ,i is the serial number of the current attribute classifier, the number of the attribute classifiers is consistent with the attribute classes, and i is 1, 2attr。
5. The method for identifying the attributes of the pedestrians in the monitored scene according to claim 1, wherein the determining the network attribute values of the images of the pedestrians to be detected under different attribute classifiers based on the features of the convolution images and the weight parameters specifically comprises:
respectively carrying out normalization processing on the convolution image characteristics and the weight parameters to obtain corresponding normalization characteristics and normalization weight parameters;
and determining the network attribute values of the pedestrian images to be detected under different attribute classifiers according to the normalization features and the normalization weight parameters.
6. The method for identifying the attributes of the pedestrians in the monitored scene according to the claim 5 is characterized in that the network attribute values of the images of the pedestrians to be detected under different attribute classifiers are determined according to the following formula:
wherein ,NattrI is the serial number of the current attribute classifier for the number of labeled attributes pre-stored in the database,the network attribute value of the pedestrian image to be detected under the ith attribute classifier is α which is a scaling factor,in order to normalize the characteristics of the features,normalized weight parameters for the i attribute classifiers.
7. The method of claim 1, wherein the predicted value of the corresponding attribute is determined according to the following formula:
wherein ,NattrI is the serial number of the current attribute classifier for the number of labeled attributes pre-stored in the database,the predicted value of the ith attribute of the pedestrian image to be detected,and BN (boron nitride) is a batch normalization layer processing function, and Sigmoid (germanium) is a neural network activation function, wherein BN (boron nitride) is a network attribute value of the pedestrian image to be detected under the ith attribute classifier.
8. A system for identifying attributes of pedestrians in a monitored scene, the system comprising:
the acquiring unit is used for acquiring an image of a pedestrian to be detected in a monitoring scene;
the preprocessing unit is used for preprocessing the pedestrian image to be detected to obtain a processed image;
the characteristic determining unit is used for obtaining the convolution image characteristic of the pedestrian image to be detected according to the processed image through the deep neural network;
the parameter determining unit is used for determining the weight parameters of the attribute classifiers according to the full-connection layer and the convolution image characteristics;
the calculation unit is used for determining the network attribute values of the pedestrian images to be detected under different attribute classifiers based on the convolution image characteristics and the weight parameters;
the prediction unit is used for determining a prediction value of the corresponding attribute based on the network attribute value of each pedestrian image to be detected;
and the attribute determining unit is used for determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
9. A system for identifying pedestrian attributes in a monitored scene, comprising:
a processor; and
a memory arranged to store computer executable instructions that, when executed, cause the processor to:
acquiring an image of a pedestrian to be detected in a monitoring scene;
preprocessing the pedestrian image to be detected to obtain a processed image;
obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
10. A computer-readable storage medium storing one or more programs that, when executed by an electronic device including a plurality of application programs, cause the electronic device to:
acquiring an image of a pedestrian to be detected in a monitoring scene;
preprocessing the pedestrian image to be detected to obtain a processed image;
obtaining the convolution image characteristics of the pedestrian image to be detected through a deep neural network according to the processed image;
determining the weight parameters of each attribute classifier according to the full connection layer and the characteristics of the convolution image;
determining network attribute values of the pedestrian image to be detected under different attribute classifiers based on the convolution image features and the weight parameters;
determining a predicted value of a corresponding attribute based on the network attribute value of each to-be-detected pedestrian image;
and determining the attribute type of the pedestrian image to be detected according to the predicted value of each attribute.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010310527.XA CN111507272B (en) | 2020-04-20 | 2020-04-20 | Pedestrian attribute identification method and system in monitoring scene |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010310527.XA CN111507272B (en) | 2020-04-20 | 2020-04-20 | Pedestrian attribute identification method and system in monitoring scene |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111507272A true CN111507272A (en) | 2020-08-07 |
CN111507272B CN111507272B (en) | 2023-09-26 |
Family
ID=71874486
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010310527.XA Active CN111507272B (en) | 2020-04-20 | 2020-04-20 | Pedestrian attribute identification method and system in monitoring scene |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111507272B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106529442A (en) * | 2016-10-26 | 2017-03-22 | 清华大学 | Pedestrian identification method and apparatus |
KR101941994B1 (en) * | 2018-08-24 | 2019-01-24 | 전북대학교산학협력단 | System for pedestrian detection and attribute extraction based on a joint deep network |
-
2020
- 2020-04-20 CN CN202010310527.XA patent/CN111507272B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106529442A (en) * | 2016-10-26 | 2017-03-22 | 清华大学 | Pedestrian identification method and apparatus |
KR101941994B1 (en) * | 2018-08-24 | 2019-01-24 | 전북대학교산학협력단 | System for pedestrian detection and attribute extraction based on a joint deep network |
Non-Patent Citations (2)
Title |
---|
许方洁;成科扬;张建明;: "基于深度学习与属性学习相结合的行人再识别" * |
陈超;: "混合Boost算法实现的行人检测技术" * |
Also Published As
Publication number | Publication date |
---|---|
CN111507272B (en) | 2023-09-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107123131B (en) | Moving target detection method based on deep learning | |
CN111784633B (en) | Insulator defect automatic detection algorithm for electric power inspection video | |
US20180018503A1 (en) | Method, terminal, and storage medium for tracking facial critical area | |
CN111783576B (en) | Pedestrian re-identification method based on improved YOLOv3 network and feature fusion | |
CN107657249A (en) | Method, apparatus, storage medium and the processor that Analysis On Multi-scale Features pedestrian identifies again | |
CN107220603A (en) | Vehicle checking method and device based on deep learning | |
Ahamed et al. | HOG-CNN based real time face recognition | |
CN108241821A (en) | Image processing equipment and method | |
CN110555420A (en) | fusion model network and method based on pedestrian regional feature extraction and re-identification | |
CN115294117A (en) | Defect detection method and related device for LED lamp beads | |
CN109961425A (en) | A kind of water quality recognition methods of Dynamic Water | |
CN114782892A (en) | Illegal behavior target detection method, device, equipment and storage medium | |
CN107633527B (en) | Target tracking method and device based on full convolution neural network | |
CN111160100A (en) | Lightweight depth model aerial photography vehicle detection method based on sample generation | |
CN117036342B (en) | Chip defect identification method and system | |
CN114078127A (en) | Object defect detection and counting method, device, equipment and storage medium | |
CN115880765A (en) | Method and device for detecting abnormal behavior of regional intrusion and computer equipment | |
CN113870254A (en) | Target object detection method and device, electronic equipment and storage medium | |
CN113192017A (en) | Package defect identification method, device, equipment and storage medium | |
CN112329550A (en) | Weak supervision learning-based disaster-stricken building rapid positioning evaluation method and device | |
CN111507272A (en) | Method and system for identifying pedestrian attributes in monitoring scene | |
CN110795995A (en) | Data processing method, device and computer readable storage medium | |
CN114445788A (en) | Vehicle parking detection method and device, terminal equipment and readable storage medium | |
CN111640071A (en) | Method for obtaining panoramic foreground target based on convolutional neural network frame difference repairing method | |
CN115050105B (en) | Method, device and equipment for judging doubtful shadow and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |