CN106485235B - A kind of convolutional neural networks generation method, age recognition methods and relevant apparatus - Google Patents
A kind of convolutional neural networks generation method, age recognition methods and relevant apparatus Download PDFInfo
- Publication number
- CN106485235B CN106485235B CN201610925676.0A CN201610925676A CN106485235B CN 106485235 B CN106485235 B CN 106485235B CN 201610925676 A CN201610925676 A CN 201610925676A CN 106485235 B CN106485235 B CN 106485235B
- Authority
- CN
- China
- Prior art keywords
- age
- face
- convolutional neural
- neural networks
- classifier
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/172—Classification, e.g. identification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/178—Human faces, e.g. facial parts, sketches or expressions estimating age from face image; using age information for improving recognition
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Image Processing (AREA)
- Image Analysis (AREA)
Abstract
Generation method, age recognition methods, relevant apparatus and the calculating equipment that the invention discloses a kind of for carrying out the convolutional neural networks of age identification to the face in image, the generation method of the convolutional neural networks includes: to be trained to the first convolutional neural networks, includes multiple convolution groups, multiple full articulamentums and the first classifier being sequentially connected in the first convolutional neural networks;By in trained first convolutional neural networks the full articulamentum in part and the first classifier replaced accordingly, generate the second convolutional neural networks simultaneously it is trained;The full articulamentum and classifier new to the addition of trained second convolutional neural networks, to generate third convolutional neural networks and be trained;The full articulamentum and classifier new to the addition of trained third convolutional neural networks, to generate Volume Four product neural network and be trained.Wherein, it before being trained to above-mentioned each convolutional neural networks, can be pre-processed to for trained human face image information.
Description
Technical field
It is the present invention relates to technical field of image processing, in particular to a kind of for carrying out age identification to the face in image
Convolutional neural networks generation method, age recognition methods, relevant apparatus and calculate equipment.
Background technique
Face has contained bulk information as important one of biological characteristic on facial image, such as age, gender, ethnic group
Deng.It is further deep with studying in image processing techniques facial image, especially in terms of the identification of face age, with volume
Face age recognition methods based on product neural network (CNN:Convolutional Neural Network) is also gradually sent out
Exhibition is got up, and is all played an important role in numerous reality scenes.On " Computer Science " in 2014,
Karen Simonyan and Andrew Zisserman have delivered an entitled " Very Deep Convolutional
The paper of Networks for Large-Scale Image Recognition " proposes a kind of deeper convolution mind of level
Through network model, i.e. VGG (Visual Geometry Group) model, the accuracy for carrying out age identification based on this model has
Certain promotion.
However, gap is larger between the amplitude of variation, especially 0~5 years old baby for the facial image of different age group
Child, variation is very huge, but 25~30 years old or so adult, then varies less, this phenomenon is brought to age identification
Large effect.In existing face age recognition methods, age true value is replaced using age distribution value, is alleviated to a certain extent
The above problem, but the result got is in a certain age range, and not accurate enough, once the critical value in section malfunctions,
Then age estimated value can fall into adjacent interval, bring biggish error.And since the numerical value span at age is larger, even if taking
Above-mentioned VGG model carries out age identification, it is also desirable to and take multiple models to realize according to the distribution of age bracket, it is complex, and
Versatility is lower.
Summary of the invention
For this purpose, the present invention provides a kind of convolutional neural networks generation side for the face progress age identification in image
Case, and the age identifying schemes based on the convolutional neural networks are proposed, it is existing above to try hard to solve or at least alleviate
Problem.
According to an aspect of the present invention, it provides a kind of for carrying out the convolutional Neural of age identification to the face in image
Network generation method, suitable for executing in calculating equipment, this method comprises the following steps: firstly, according to the face obtained in advance
Sets of image data is trained so that the first convolutional neural networks are suitable for identification face, people the first convolutional neural networks
Face image data acquisition system includes multiple human face image informations, and each facial image information includes people in facial image and correspondence image
Age information, the first convolutional neural networks include the multiple convolution groups being sequentially connected, the first full articulamentum, the second full connection
Layer, the full articulamentum of third and the first classifier;According to preset age threshold to each of face image data set face
Image information is handled, to add new age threshold attribute, the instruction of age threshold attribute for each human face image information
The age of corresponding people is greater than preset age threshold and is also no more than preset age threshold;By trained first convolution
The full articulamentum of third and the first classifier in neural network replace with the 4th full articulamentum and the second classifier respectively, to generate
Second convolutional neural networks, and according to being added to the face image data set of age threshold attribute to the second convolutional neural networks
It is trained, the age of people corresponding to the output instruction face so as to the second classifier of the second convolutional neural networks is above year
Age threshold value is also no greater than age threshold;After the first full articulamentum in trained second convolutional neural networks, addition
The 5th full articulamentum, the 6th full articulamentum and the third classifier being sequentially connected, to generate third convolutional neural networks, and select
In face image data set, the age no more than default age threshold human face image information to third convolutional neural networks
It is trained, the age of people corresponding to the output instruction face so as to the third classifier of third convolutional neural networks is zero to institute
Which of default age threshold;After the first full articulamentum in trained third convolutional neural networks, addition according to
Secondary connected the 7th full articulamentum, eight convergent points articulamentum and the 4th classifier, to generate Volume Four product neural network, and according to people
Face image data acquisition system is trained Volume Four product neural network, people corresponding to the output instruction face so as to the 4th classifier
Age.
Optionally, the convolutional neural networks according to the present invention for carrying out age identification to the face in image generate
In method, the facial image of each human face image information keeps horizontal front and meets default ruler in face image data set
It is very little, facial image correspond to people age be 0~100 between integer.
Optionally, the convolutional neural networks according to the present invention for carrying out age identification to the face in image generate
It include at least one convolutional layer in each convolution group of the first convolutional neural networks in method.
Optionally, the convolutional neural networks according to the present invention for carrying out age identification to the face in image generate
In method, after the first full articulamentum in trained second convolutional neural networks, it is complete to add the 5th be sequentially connected
Articulamentum, the 6th full articulamentum and third classifier, the step of to generate third convolutional neural networks before, further comprise the steps of:
To in face image data set, the age no more than preset age threshold human face image information age carry out 0/1 coding,
0/1 coding includes adding the sum of 1 with preset age threshold for number of encoding bits, each is any one of digital 0 and digital 1,
Since first place, the difference that the number that number 1 occurs subtracts 1 is the age.
Optionally, the convolutional neural networks according to the present invention for carrying out age identification to the face in image generate
In method, the 6th full articulamentum includes multiple full articulamentums of son in parallel, and the number of the full articulamentum of son is preset age threshold
Add the sum of 1.
Optionally, the convolutional neural networks according to the present invention for carrying out age identification to the face in image generate
In method, after the first full articulamentum in trained third convolutional neural networks, it is complete to add the 7th be sequentially connected
Articulamentum, eight convergent points articulamentum and the 4th classifier are further comprised the steps of: before generating the step of Volume Four accumulates neural network
To in face image data set human face image information age carry out distributed code, distributed code include according to Gaussian Profile into
Row age coding.
Optionally, the convolutional neural networks according to the present invention for carrying out age identification to the face in image generate
In method, preset age threshold is 12.
According to a further aspect of the invention, a kind of age recognition methods is provided, suitable for being executed in calculating equipment, the party
Method based on for in image face carry out age identification convolutional neural networks generation method in, trained Volume Four product
Neural network carries out age identification to the face in image, comprising steps of facial image to be identified is input to trained the
Age identification is carried out in four convolutional neural networks;The output of the second classifier is in the good Volume Four product neural network of training of judgement
It is no to be greater than preset age threshold;If the output of the second classifier is not more than preset age threshold, trained the is obtained
The output of third classifier is the age of people corresponding to face in four convolutional neural networks;If the output of the second classifier is greater than pre-
If age threshold, then the output for obtaining the 4th classifier in trained Volume Four product neural network is people's corresponding to face
Age.
It optionally, further include being pre-processed to images to be recognized to obtain in age recognition methods according to the present invention
Take facial image to be identified.
Optionally, in age recognition methods according to the present invention, images to be recognized is pre-processed to obtain wait know
Others' face image includes: to carry out Face datection to images to be recognized, obtains face location information;It, will by face location information
Face in images to be recognized is converted after cutting to pre-set dimension;Face, which is calculated, according to face key point information carries out Plane Rotation
Transformation matrix;The facial image under pre-set dimension is rotated into horizontal front to obtain face figure to be identified using transformation matrix
Picture.
According to a further aspect of the invention, it provides a kind of for carrying out the convolution mind of age identification to the face in image
It through network generating means, is calculated in equipment suitable for residing in, which includes the first training module, attribute adding module, first
Generation module, the second training module, the second generation module, third training module, third generation module and the 4th training module.Its
In, the first training module is suitable for being trained the first convolutional neural networks according to the face image data set obtained in advance
So that the first convolutional neural networks are suitable for identification face, face image data set includes multiple human face image informations, each
Human face image information includes the age information of people in facial image and correspondence image, and the first convolutional neural networks include being sequentially connected
Multiple convolution groups, the first full articulamentum, the second full articulamentum, the full articulamentum of third and the first classifier;Attribute adding module
Suitable for being handled according to preset age threshold each of face image data set face image information, to be every
A human face image information adds new age threshold attribute, and it is preset that age threshold attribute indicates that the age of corresponding people is greater than
Age threshold is also no more than preset age threshold;First generation module is suitable for will be in trained first convolutional neural networks
The full articulamentum of third and the first classifier replace with the 4th full articulamentum and the second classifier respectively, with generate the second convolution mind
Through network;Second training module is suitable for basis and is added to the face image data set of age threshold attribute to the second convolutional Neural
Network is trained, and the age of people corresponding to the output instruction face so as to the second classifier of the second convolutional neural networks is high
Age threshold is also no greater than in age threshold;Second generation module is suitable for the in trained second convolutional neural networks
After one full articulamentum, the 5th full articulamentum, the 6th full articulamentum and the third classifier being sequentially connected are added, to generate third
Convolutional neural networks;Third training module is suitably selected in face image data set, the age is no more than default age threshold
The human face image information of value is trained third convolutional neural networks, so as to the third classifier of third convolutional neural networks
The age of people corresponding to output instruction face is which of zero to default age threshold;Third generation module is suitable for instructing
After the first full articulamentum in the third convolutional neural networks perfected, the 7th full articulamentum, the eight convergent points being sequentially connected are added
Articulamentum and the 4th classifier, to generate Volume Four product neural network;4th training module is suitable for according to face image data collection
It closes and Volume Four product neural network is trained, the age of people corresponding to the output instruction face so as to the 4th classifier.
According to a further aspect of the invention, a kind of age identification device is provided, is calculated in equipment suitable for residing in, the dress
Set based on for in image face carry out age identification convolutional neural networks generating means in, trained Volume Four product
Neural network carries out age identification, including identification module, judgment module and acquisition module to the face in image.Wherein, it identifies
Module, which is suitable for for facial image to be identified being input in trained Volume Four product neural network, carries out age identification;Judgment module
Suitable for judging the output of the second classifier in the trained Volume Four product neural network after identification module carries out age identification;It obtains
Modulus block is suitable for obtaining identification module when judgment module judges the output of the second classifier no more than preset age threshold
The output of third classifier is the year of people corresponding to face in trained Volume Four product neural network after carrying out age identification
Age obtains identification module and carries out the age when judgment module judges the output of the second classifier greater than preset age threshold
The output of the 4th classifier is the age of people corresponding to face in trained Volume Four product neural network after identification.
According to a further aspect of the invention, a kind of calculating equipment is also provided, including according to the present invention for image
In face carry out age identification convolutional neural networks generating means and age identification device.
The technical side that convolutional neural networks according to the present invention for carrying out age identification to the face in image generate
Case is first trained the first convolutional neural networks, in the first convolutional neural networks include be sequentially connected multiple convolution groups,
Multiple full articulamentums and the first classifier classify the full articulamentum in part and first in trained first convolutional neural networks
Device is replaced accordingly, is generated the second convolutional neural networks and is simultaneously trained to it, then to trained second convolutional Neural
Network addition new full articulamentum and classifier, to generate third convolutional neural networks and be trained, finally to trained
The addition of third convolutional neural networks new full articulamentum and classifier, to generate Volume Four product neural network and be trained.?
In above-mentioned technical proposal, using convolutional neural networks as basic frame, and before generating each convolutional neural networks in case training,
Carry out such as age threshold segmentation, 0/1 coding and age distribution coding processing to for trained face image data, assist by
The model of step modification convolutional neural networks, ultimately forms the convolutional neural networks model that can accurately identify the age, without dividing
It cuts, succinct intuitive, versatility is stronger.In turn, age recognition methods according to the present invention, facial image to be identified is input to
In trained Volume Four product neural network, the range of age and its specific value are judged according to the output of different classifications device, is tied
Fruit accuracy has huge promotion.
Detailed description of the invention
To the accomplishment of the foregoing and related purposes, certain illustrative sides are described herein in conjunction with following description and drawings
Face, these aspects indicate the various modes that can practice principles disclosed herein, and all aspects and its equivalent aspect
It is intended to fall in the range of theme claimed.Read following detailed description in conjunction with the accompanying drawings, the disclosure it is above-mentioned
And other purposes, feature and advantage will be apparent.Throughout the disclosure, identical appended drawing reference generally refers to identical
Component or element.
Fig. 1 shows the schematic diagram according to an embodiment of the invention for calculating equipment 100;
Fig. 2 shows according to an embodiment of the invention for carrying out the convolution mind of age identification to the face in image
Flow chart through network generation method 200;
Fig. 3 shows the convolution for being used to carry out the face in image age identification of another embodiment according to the present invention
The flow chart of neural network generation method 300;
Fig. 4 shows the structural schematic diagram of the first convolutional neural networks according to an embodiment of the invention;
Fig. 5 shows the structural schematic diagram of the second convolutional neural networks according to an embodiment of the invention;
Fig. 6 shows the structural schematic diagram of third convolutional neural networks according to an embodiment of the invention;
Fig. 7 shows the structural schematic diagram of Volume Four product neural network according to an embodiment of the invention;
Fig. 8 shows the flow chart of age recognition methods 400 according to an embodiment of the invention;
Fig. 9 shows the flow chart of the age recognition methods 500 of another embodiment according to the present invention;
Figure 10 shows according to an embodiment of the invention for carrying out the volume of age identification to the face in image
The schematic diagram of product neural network generating means 600;
Figure 11 show according to still another embodiment of the invention for carrying out age identification to the face in image
The schematic diagram of convolutional neural networks generating means 700;
Figure 12 shows the schematic diagram of age identification device 800 according to an embodiment of the invention;And
Figure 13 shows the schematic diagram of age identification device 900 according to still another embodiment of the invention.
Specific embodiment
Exemplary embodiments of the present disclosure are described in more detail below with reference to accompanying drawings.Although showing the disclosure in attached drawing
Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here
It is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure
It is fully disclosed to those skilled in the art.
Fig. 1 is the block diagram of Example Computing Device 100.In basic configuration 102, calculating equipment 100, which typically comprises, is
System memory 106 and one or more processor 104.Memory bus 108 can be used for storing in processor 104 and system
Communication between device 106.
Depending on desired configuration, processor 104 can be any kind of processing, including but not limited to: microprocessor
(μ P), microcontroller (μ C), digital information processor (DSP) or any combination of them.Processor 104 may include such as
The cache of one or more rank of on-chip cache 110 and second level cache 112 etc, processor core
114 and register 116.Exemplary processor core 114 may include arithmetic and logical unit (ALU), floating-point unit (FPU),
Digital signal processing core (DSP core) or any combination of them.Exemplary Memory Controller 118 can be with processor
104 are used together, or in some implementations, and Memory Controller 118 can be an interior section of processor 104.
Depending on desired configuration, system storage 106 can be any type of memory, including but not limited to: easily
The property lost memory (RAM), nonvolatile memory (ROM, flash memory etc.) or any combination of them.System storage
Device 106 may include operating system 120, one or more is using 122 and program data 124.In some embodiments,
It may be arranged to be operated using program data 124 on an operating system using 122.
Calculating equipment 100 can also include facilitating from various interface equipments (for example, output equipment 142, Peripheral Interface
144 and communication equipment 146) to basic configuration 102 via the communication of bus/interface controller 130 interface bus 140.Example
Output equipment 142 include graphics processing unit 148 and audio treatment unit 150.They can be configured as facilitate via
One or more port A/V 152 is communicated with the various external equipments of such as display or loudspeaker etc.Outside example
If interface 144 may include serial interface controller 154 and parallel interface controller 156, they, which can be configured as, facilitates
Via one or more port I/O 158 and such as input equipment (for example, keyboard, mouse, pen, voice-input device, touch
Input equipment) or the external equipment of other peripheral hardwares (such as printer, scanner etc.) etc communicated.Exemplary communication is set
Standby 146 may include network controller 160, can be arranged to convenient for via one or more communication port 164 and one
A or multiple other calculate communication of the equipment 162 by network communication link.
Network communication link can be an example of communication media.Communication media can be usually presented as in such as carrier wave
Or computer readable instructions, data structure, program module in the modulated data signal of other transmission mechanisms etc, and can
To include any information delivery media." modulated data signal " can such signal, one in its data set or more
It is a or it change can the mode of encoded information in the signal carry out.As unrestricted example, communication media can be with
Wired medium including such as cable network or private line network etc, and it is such as sound, radio frequency (RF), microwave, infrared
(IR) the various wireless mediums or including other wireless mediums.Term computer-readable medium used herein may include depositing
Both storage media and communication media.
Calculating equipment 100 can be implemented as server, these servers can be such as file server, database service
The servers such as device, apps server and WEB server.Calculate equipment 100 can be implemented as small size it is portable (or move
It is dynamic) a part of electronic equipment, these electronic equipments can be such as cellular phone, personal digital assistant (PDA), individual media
Player device, wireless network browsing apparatus, personal helmet, application specific equipment or may include any of the above function
The mixing apparatus of energy.It calculates equipment 100 and is also implemented as including that desktop computer and the personal of notebook computer configuration are counted
Calculation machine.In some embodiments, equipment 100 is calculated to be configured as executing according to the present invention be used for the face progress in image
The generation method and age recognition methods of the convolutional neural networks of age identification.Using 122 include it is according to the present invention for pair
Face in image carries out the convolutional neural networks generating means 600 and age identification device 700 of age identification.
Fig. 2 shows according to an embodiment of the invention for carrying out the convolution mind of age identification to the face in image
Flow chart through network generation method 200.For carrying out the convolutional neural networks generation side of age identification to the face in image
Method 200 is suitable for executing in calculating equipment (such as calculating equipment 100 shown in FIG. 1).
As shown in Fig. 2, method 200 starts from step S210.In step S210, according to the face image data obtained in advance
Set is trained so that the first convolutional neural networks are suitable for identification face, facial image number the first convolutional neural networks
It include multiple human face image informations according to set, each facial image information includes the age letter of people in facial image and correspondence image
Breath, the first convolutional neural networks include that the multiple convolution groups being sequentially connected, the first full articulamentum, the second full articulamentum, third are complete
Articulamentum and the first classifier.Wherein, the facial image of each human face image information keeps water in face image data set
Straight and even face and meet pre-set dimension, the age that the facial image corresponds to people is integer between 0~100, the first convolutional Neural
It include at least one convolutional layer in each convolution group of network.
In the present embodiment, face image data collection is combined into the human face data collection of MsCeleb, in the face data set
For human face image information comprising facial image be image after rotation processing, i.e., the facial image is revolved in advance
Switch to horizontal front, and facial image is Three Channel Color image, pre-set dimension is 224px × 224px.First convolution nerve net
Network is that one kind, Fig. 4 show the structure of the first convolutional neural networks according to an embodiment of the invention in the VGG model used
Schematic diagram.As shown in figure 4, in the first convolutional neural networks, including five convolution groups, using the first convolution group as input terminal, after
Face is sequentially connected the second convolution group, third convolution group, Volume Four product group, the 5th convolution group, the first full articulamentum, the second full connection
Layer, the full articulamentum of third and the first classifier, wherein the first classifier is output end, it is the defeated of the first convolution group with facial image
Enter, the age of people is that the output of the first classifier carries out the training of the first convolutional neural networks in correspondence image.In fact, adjacent
Two convolution groups between there is a down-sampling layer, and there is also under one between the 5th convolution group and the first full articulamentum
Each down-sampling layer is successively become the first down-sampling layer, the second down-sampling according to the sequence from input terminal to output end by sample level
Layer, third down-sampling layer, the 4th down-sampling layer and the 5th down-sampling layer.It include two in first convolution group and the second convolution group
Convolutional layer, it includes three convolutional layers in group and the 5th convolution group that third convolution group, Volume Four, which are accumulated,.With face image data set
In human face image information A for be illustrated, human face image information A includes the age of people in facial image A1 and correspondence image
Information A2, A2 are 7 years old.
In the first convolutional neural networks, A1 is input to the first convolution group first.A1 is Three Channel Color image, size
For 224px × 224px.Two convolutional layers in first convolution group have 64 convolution kernels, each convolution in first convolutional layer
The number of parameters of core is 3 × 3 × 3, and the convolution kernel for being equivalent to 33 × 3 sizes carries out convolution, step-length 1 in each channel.Through
After crossing the convolution of first convolutional layer, according to (2243)/1+1=222 it is found that the size of the image obtained at this time be 222px ×
222px obtains the characteristic pattern of 64 222px × 222px sizes.Since triple channel being closed in first convolutional layer
Convolution is carried out together, therefore the input of second convolutional layer is the single channel image of 64 222px × 222px.Second convolution
The number of parameters of each convolution kernel is 3 × 3 in layer, and the convolution kernel for being equivalent to 13 × 3 size carries out convolution, step-length 1.Then pass through
After crossing the convolution of second convolutional layer, according to (222-3)/1+1=220 it is found that obtain 64 220px × 220px at this time big
Small characteristic pattern.Then, into the first down-sampling layer, down-sampling is also known as pond, is the principle using image local correlation,
Sub-sample is carried out to image, to reduce under data processing and retain useful information.Herein, pondization uses Maximum overlap pond
Change, i.e., piecemeal is carried out to the characteristic pattern of 220px × 220x, each piece of size is 2 × 2, step-length 2, and counts each piece
Maximum value, the pixel value as Chi Huahou image.According to (220-2)/2+1=27 it is found that the characteristic pattern of Chi Huahou having a size of
110px × 110px, then by obtaining the characteristic pattern of 64 110px × 110px after the first down-sampling layer.
Two convolutional layers in second convolution group have 128 convolution kernels, and three convolutional layers in third convolution group have
256 convolution kernels, three convolutional layers that Volume Four is accumulated in group have 512 convolution kernels, three convolutional layers in the 5th convolution group
Also have 512 convolution kernels, wherein the number of parameters of each convolution kernel is 3 × 3, be equivalent to the convolution kernel of 13 × 3 size into
Row convolution, step-length 1.Second down-sampling layer, third down-sampling layer, the 4th down-sampling layer and the 5th down-sampling layer are all made of maximum
Overlapping pool, the size for carrying out the block of piecemeal processing to characteristic pattern is 2 × 2, step-length 2, according under the first convolution group and first
The treatment process of sample level, the output for finally obtaining the 5th down-sampling layer is the characteristic pattern of 512 2px × 2px.Wherein, into
When row Maximum overlap pond, since the side length of characteristic pattern is not necessarily 2 multiple, the processing side for retaining edge is taken
Method, i.e., by the side length of characteristic pattern with 0 be filled with 2 multiple, then carry out pond again.In fact, completing to roll up in each convolutional layer
After product processing, it is also necessary to which the output for adjusting the convolutional layer by activation primitive, avoiding next layer of output is upper one layer of line
Property combination and arbitrary function can not be approached.Using ReLU (Rectified Linear Unit), function is as activation primitive, into one
Step alleviates overfitting problem, and expression formula is f (x)=max (0, wx+b), and wherein wx+b is conventional linear function.Often
A convolutional layer completes the characteristic pattern obtained after process of convolution to image, may be applicable to the adjustment of above-mentioned activation primitive, below will
It repeats no more.
Then, into the first full articulamentum, the neuron number of the first full articulamentum selects 4096, then the first full connection
The output of layer is the characteristic pattern of 4096 1px × 1px sizes.In actual treatment, it will usually first lead to above-mentioned 4096 characteristic patterns
The activation of ReLU function is crossed, then carries out dropout processing, dropout can be understood as model and be averaged, i.e., in forward direction in training process
When conduction, the activation value of some neuron is allowed to stop working with certain Probability p, i.e. the activation value of the neuron is become with Probability p
It is 0.For example, the neuron of the first full articulamentum is 4096, if dropout ratio selection 0.4, then this layer of neuron warp
It crosses after dropout, wherein there are about the values of 1638 neurons to be set to 0, is equivalent to by preventing the collaboration of certain features from making
For alleviating over-fitting, the phenomenon that appearance of a neuron is dependent on another neuron is avoided.By dropout, treated
Characteristic pattern is input in the second full articulamentum, and the neuron number of the second full articulamentum is still 4096, then its output is 4096
The characteristic pattern for opening 1px × 1px size can equally first pass through the activation of ReLU function to it, then carry out dropout processing, and will place
Result after reason is input in the full articulamentum of third.It is classification problem more than one due to being identified to the age, and in this reality
Applying age in example is the integer between 0~100, therefore the neuron number of the full articulamentum of third is chosen as 101, then final the
Three full articulamentum outputs are also 101, respectively correspond the probability at 0~100 this 101 ages.First classifier selects softmax
Classifier, output are the maximum probability corresponding age, which should be the age A2 of facial image A1.About softmax points
The content of class device is not repeated herein for mature technological means.In order to train the first convolutional neural networks, according to input
The corresponding age A2 of facial image A1 be 7 years old this foreseen outcome, the output of the first classifier is adjusted, by minimization
The method backpropagation of error is to adjust each parameter in the first convolutional neural networks.By a large amount of in face image data set
Human face image information be trained after, obtain trained first convolutional neural networks.
In step S220, according to preset age threshold to each of face image data set face image information
It is handled, to add new age threshold attribute for each human face image information, age threshold attribute indicates corresponding people
Age be greater than preset age threshold and be also no more than preset age threshold.In the present embodiment, preset age threshold
Value is 12, then age threshold attribute added by each human face image information in face image data set, can indicate corresponding people
Age be greater than 12 years old and be also no more than 12 years old.
Then, S230 is entered step, the third in the first convolutional neural networks trained in step S210 is connected entirely
Layer and the first classifier replace with the 4th full articulamentum and the second classifier respectively, to generate the second convolutional neural networks, and root
The second convolutional neural networks are trained according to the face image data set for being added to age threshold attribute, so as to the second convolution
The age of people corresponding to the output instruction face of second classifier of neural network is above age threshold and is also no greater than the age
Threshold value.In the present embodiment, the neuron of the 4th full articulamentum should be 2, and the result of output is to be greater than 12 years old and not at the age
Probability greater than 12 years old then corresponds to the softmax classifier of the second classifier for the corresponding the range of age of output probability the larger value.
By taking human face image information A as an example, the age A2 in human face image information A is 7 years old, after adding age threshold attribute, it is known that year
Age A2 is not higher than age threshold, that is, is not more than 12 years old.Using facial image A1 as the first convolution group in the second convolutional neural networks
Input, the age be not more than the output as the second classifier in the second convolutional neural networks in 12 years old, to the second convolution nerve net
Network carries out fine-tune training.Fig. 5 shows the structural representation of the second convolutional neural networks according to an embodiment of the invention
Figure.As shown in figure 5, the second convolutional neural networks and the first convolutional neural networks are completely the same in hierarchical structure.
Next, in step S240, trained second convolutional neural networks in obtaining step S230, trained
After the first full articulamentum in second convolutional neural networks, the 5th full articulamentum, the 6th full articulamentum being sequentially connected are added
With third classifier, to generate third convolutional neural networks, and select in face image data set, the age is no more than presetting
The human face image information of age threshold third convolutional neural networks are trained, so as to the third of third convolutional neural networks
The age of people corresponding to the output instruction face of classifier is which of zero to default age threshold.6th full articulamentum
Including multiple full articulamentums of son in parallel, the number of the full articulamentum of son is that preset age threshold adds the sum of 1.Fig. 6 shows root
According to the structural schematic diagram of the third convolutional neural networks of one embodiment of the invention.As shown in fig. 6, the first full articulamentum it
Afterwards, the 5th full articulamentum, the 6th full articulamentum and the third classifier being sequentially connected are added to, respectively with the second full articulamentum,
4th full articulamentum and the second classifier are in same level, so as to form respectively with the first classifier be output first point
Branch and with the second classifier be export the second branch.The number of neuron is 4096 in 5th full articulamentum, the 6th Quan Lian
It connects layer to be made of 13 full articulamentums of son, therefore this full articulamentum of 13 sons and the 4th full articulamentum are same level, are selected
Softmax classifier is as third classifier.It should be noted that in the 6th full articulamentum 13 full articulamentums of son neuron
Number is 2, and the output result of each full articulamentum of son is number 1 and digital 0 corresponding probability, then classifies as third
The softmax classifier of device exports according to the size of probability value greater probability and is worth corresponding number.
In the present embodiment, human face image information third convolutional neural networks being trained, using face figure
As in data acquisition system, the age be not more than 12 years old human face image information, that is, the age of the human face image information used should be 0~12
Integer in year.In hands-on, third convolutional neural networks are trained using Ordinal loss as loss function.
About Ordinal loss, i.e. order loss is generally handled by ordered logistic regress model, is directed to for one kind
The homing method of ordinal scale dependent variable data can utilize predictive variable, such as classifying type variable and numeric type variable, to orderly
The modeling that the response variable of classificatory scale type is returned, relevant technology contents repeat no more here.
Finally, enter step S250, step S240 the first full connection in trained third convolutional neural networks
After layer, the 7th full articulamentum, eight convergent points articulamentum and the 4th classifier being sequentially connected are added, to generate Volume Four product nerve
Network, and Volume Four product neural network is trained according to face image data set, so that the output of the 4th classifier refers to
It lets others have a look at age of people corresponding to face.The structure that Fig. 7 shows Volume Four product neural network according to an embodiment of the invention is shown
It is intended to.As shown in fig. 7, after the first full articulamentum, be added to the 7th full articulamentum being sequentially connected, eight convergent points articulamentum and
4th classifier, so as to form respectively with the first branch that the second classifier is output, with third classifier is output the
Two branches and with the 4th classifier be output third branch.7th full articulamentum and the second full articulamentum, the 5th full articulamentum
For same level, neuron number is 4096, and eight convergent points articulamentum and the 4th full articulamentum, the 6th full articulamentum are same layer
Grade, neuron number are 101, and the 4th classifier and the second classifier, third classifier are same level, same to select
Softmax classifier is as the 4th classifier.In hands-on, using Euclidean loss as loss function to Volume Four
Product neural network is trained.About Euclidean loss, i.e. Euclidean distance is lost, and function expression is as follows:
Wherein, predicted valueLabel value y ∈ [- ∞ ,+∞], N are number of samples.About Euclidean
The realization of loss has mature technical method, is not repeated herein.
Fig. 3 shows the convolution for being used to carry out the face in image age identification of another embodiment according to the present invention
The flow chart of neural network generation method 300.As shown in figure 3, step S310, S320, S330, S340 and S350 of method 300
It corresponds, is consistent respectively with step S210, S220, S230, S240 and S250 of method 200 in Fig. 2, and in step
Step S360 is increased before S340, step S370 is increased before step S350.
In step S360, in face image data set, the age be not more than preset age threshold facial image
The age of information carries out 0/1 coding, and 0/1 coding includes adding the sum of 1 with preset age threshold for number of encoding bits, each is number
Any one of word 0 and number 1, since first place, the difference that the number that number 1 occurs subtracts 1 is the age.In the present embodiment
In, to the human face image information that third convolutional neural networks are trained, using in face image data set, the age not
Human face image information greater than 12 years old, that is, the age of the human face image information used should be the integer in 0~12 years old.With face figure
As for information A comprising age A2 be 7 years old, can be used for training third convolutional neural networks.Firstly, being carried out to age A2
0/1 coding, the digit of 0/1 coding is 13, and since first place, the difference that the number that number 1 occurs subtracts 1 is 7, then explanation is opened from first place
Beginning sequentially to amount to has 8 continuous numbers 1, therefore 7 years old corresponding 0/1 is encoded to 1111111100000.To in step
In S340, the age 1111111100000 after 0/1 coding is carried out in obtaining step S360 to age A2, by human face image information A
In facial image A1 as the input of the first convolution group, age in the third convolutional neural networks generated in step S330
1111111100000 output as third classifier in the third convolutional neural networks generated in step S330, with Ordinal
Loss is trained third convolutional neural networks as loss function.
And in step S370, distributed code is carried out to the age of human face image information in face image data set, point
Cloth coding includes carrying out age coding according to Gaussian Profile.In the present embodiment, distributed code is carried out to the age to refer to one
The numerical value at age is indicated with a Gaussian Profile.And in face image data set human face image information age be 0~
Integer in 100, then to 0,1,2 ..., 99,100 this 101 integers there is the general of a corresponding Gaussian distributed
The expression formula of rate density function, the function is as follows:
X is the integer in 0~100
In formula, px(i) indicating that age i obeys a mean value is x, variance δ2Gaussian Profile.
For example, the human face image information B in face image data set comprising in facial image B1 and correspondence image
The age information B2 of people, wherein age B2 is 28 years old.At this point, the value of x is 28, corresponding probability density function are as follows:
As can be seen from the above equation, the probability density function p obtained after distributed code is carried out to age B228(i) for, work as i
When=28, p28(i) maximum value p is obtained28(28)。
Since the output result of eight convergent points articulamentum is 0~100 years old this corresponding probability of 101 age numerical value, and conduct
The softmax classifier of 4th classifier exports the most probable value corresponding age, therefore in step for according to the size of probability value
It is using facial image B1 as step S340 Volume Four generated when being trained in rapid S350 to Volume Four product neural network
The input of first convolution group in product neural network, age B2 carry out the result after distributed code as step in step S370
The output of eight convergent points articulamentum in S340 Volume Four product neural network generated, using Euclidean loss as loss function
Volume Four product neural network is trained.
Fig. 8 shows the flow chart of age recognition methods 400 according to an embodiment of the invention.Age recognition methods
400 are suitable for executing in calculating equipment (such as calculating equipment 100 shown in FIG. 1), based on for carrying out to the face in image
Age identification convolutional neural networks generation method in, trained Volume Four product neural network carry out age identification.
As shown in figure 4, method 400 starts from step S410.In step S410, facial image to be identified is input to training
Age identification is carried out in good Volume Four product neural network.In the present embodiment, are carried out to two facial images to be identified the age
Identification, the corresponding age of image one are 9 years old, and the corresponding age of image two is 43 years old, and image one and image two are input to training
Age identification is carried out in good Volume Four product neural network.
Then, it enters step S420, after carrying out age identification to facial image to be identified in judgment step S410, trains
Volume Four product neural network in the output of the second classifier whether be greater than preset age threshold.For image one, judgement
The output of second classifier is not more than 12 years old, for image two, judges that the output of the second classifier is greater than 12 years old.
Next, in step S430, if the output of the second classifier is to obtain step no more than preset age threshold
After carrying out age identification to facial image to be identified in rapid S410, third classifier in trained Volume Four product neural network
Output is the age of people corresponding to face.Since the output of corresponding second classifier of image one is no more than preset age threshold
Value is not more than 12 years old, then the age for obtaining the output of third classifier is final result, 1111111111000 is expressed as, from head
Position start it is total have 10 continuous numbers 1, thus identify that age be 9 years old, it is consistent with true value.
And in step S440, if the output of the second classifier is greater than preset age threshold, obtaining step S410
In age identification carried out to facial image to be identified after, the output of the 4th classifier is in trained Volume Four product neural network
The age of people corresponding to face.Because greater than preset age threshold, i.e., greatly the output of corresponding second classifier of image two is
In 12 years old, then the age for obtaining the output of the 4th classifier was final result, and 101 due to eight convergent points articulamentum output at this time are general
The 44th numerical value is maximum in rate value, therefore the age of the 4th classifier output is 43 years old, consistent with true value.
Fig. 8 shows the flow chart of the age recognition methods 500 of another embodiment according to the present invention.Age recognition methods
500 are suitable for executing in calculating equipment (such as calculating equipment 100 shown in FIG. 1), based on for carrying out to the face in image
Age identification convolutional neural networks generation method in, trained Volume Four product neural network carry out age identification.Such as Fig. 8 institute
Show, step S510, S520, S530 and S540 of method 500 and step S410, S420, S430 and S440 in method 400 distinguish
It corresponds, is consistent, and increase step S550 before step S510.
In step S550, images to be recognized is pre-processed to obtain facial image to be identified.This is because right
Before face in images to be recognized carries out age identification, need first to extract the facial image in images to be recognized.
Firstly, carrying out Face datection to images to be recognized, face location information is obtained;By face location information, by images to be recognized
In face cut after conversion to pre-set dimension;The transformation square that face carries out Plane Rotation is calculated according to face key point information
Battle array;The facial image under pre-set dimension is rotated into horizontal front to obtain facial image to be identified using transformation matrix.At this
In embodiment, to the advanced row Face datection of images to be recognized, i.e., first determine that scan image is carried out in a region, to each sector scanning
The position arrived carries out feature extraction, then classification processing to judge whether the position includes face.For there are human face regions
Images to be recognized is converted after cutting face to 224px × 224px size.Due to above-mentioned face location generally i.e. refer to face and
Outer profile, then face rotational value needs the line of two fixing points determination to obtain, and selects pupil as fixed point, passes through two
The line and facial image horizontal line of pupil calculate an angle, use affine transformation by the angle, obtain spin matrix,
After using spin matrix to the image, face can be rotated to be to interpupillary line and image level line is in parallel relation, thus
It obtains and keeps horizontal positive facial image to be identified.According to the line of pupil two o'clock in face, this line and horizontal line are calculated
Angle to obtain the angle A ngleValue of rotation, and can be into using the get RotationMatrix2D function in OpenCV
The relevant calculation of the row spin matrix carries out face rotation using warpAffine, and specific function is as follows:
RotateMatrix=cv2.getRotationMatrix2D (center=(Img.shape [1]/2,
Img.shape [0]/2), angle=AngleValue, scale=1)
RotImg=cv2.warpAffine (Img, RotateMatrix, (Img.shape [0] * 2, Img.shape [1] *
2))
Figure 10 shows according to an embodiment of the invention for carrying out the convolution of age identification to the face in image
The schematic diagram of neural network generating means 600.The device includes: the first training module 610, the life of attribute adding module 620, first
At module 630, the second training module 640, the second generation module 650, third training module 660, third generation module 670 and
Four training modules 680.
First training mould 610 is suitable for according to the face image data set that obtains in advance, to the first convolutional neural networks into
Row training is suitable for identification face so as to the first convolutional neural networks, and face image data set is believed comprising multiple facial images
Breath, each facial image information includes the age information of people in facial image and correspondence image, and the first convolutional neural networks include
Multiple convolution groups, the first full articulamentum, the second full articulamentum, the full articulamentum of third and the first classifier being sequentially connected.Wherein,
The holding of the facial image of each human face image information is horizontal positive in face image data set and meets pre-set dimension, face
The age that image corresponds to people is integer between 0~100, includes at least one in each convolution group of the first convolutional neural networks
Convolutional layer.
Attribute adding module 620 is suitable for according to preset age threshold to each of face image data set face figure
As information is handled, to add new age threshold attribute, the instruction pair of age threshold attribute for each human face image information
The age of the people answered is greater than preset age threshold and is also no more than preset age threshold.Wherein, preset age threshold
It is 12.
First generation module 630 is connected with the first training module 610, and being suitable for will be trained in the first training module 610
The full articulamentum of third and the first classifier in first convolutional neural networks replace with the 4th full articulamentum and the second classification respectively
Device, to generate the second convolutional neural networks.
Second training module 640 is connected with attribute adding module 620 and the first generation module 630 respectively, is suitable for according to category
The face image data set of age threshold attribute is added in property adding module 620, it is generated to the first generation module 630
Second convolutional neural networks are trained, corresponding to the output instruction face so as to the second classifier of the second convolutional neural networks
The age of people is above age threshold and is also no greater than age threshold.
Second generation module 650 is connected with the second training module 640, is suitable for obtaining the second training module 640 trained
Second convolutional neural networks, after the first full articulamentum in trained second convolutional neural networks, addition is sequentially connected
The 5th full articulamentum, the 6th full articulamentum and third classifier, to generate third convolutional neural networks.Wherein, the 6th Quan Lian
Connecing layer includes multiple full articulamentums of son in parallel, and the number of the full articulamentum of son is that preset age threshold adds the sum of 1.
Third training module 660 is connected with the second generation module 650, is suitably selected in face image data set, the age
No more than default age threshold human face image information to the third convolutional neural networks generated in the second generation module 650
It is trained, the age of people corresponding to the output instruction face so as to the third classifier of third convolutional neural networks is zero to institute
Which of default age threshold.
Third generation module 670 is connected with third training module 660, is suitable for obtaining third training module 670 trained
Third convolutional neural networks, after the first full articulamentum in trained third convolutional neural networks, addition is sequentially connected
The 7th full articulamentum, eight convergent points articulamentum and the 4th classifier, with generate Volume Four product neural network.
4th training module 680 is connected with third generation module 670, is suitable for raw to third according to face image data set
It is trained at the Volume Four generated of module 670 product neural network, corresponding to the output instruction face so as to the 4th classifier
The age of people.
Figure 11 shows the schematic diagram of the age identification device 700 according to this method another embodiment.As shown in figure 11,
The first training module 710, attribute adding module 720, the first generation module 730, the second training module 740, second of device 700
Generation module 750, third training module 760, third generation module 770 and the 4th training module 780, respectively with device in Figure 10
600 the first training module 610, attribute adding module 620, the first generation module 630, the second training module 640, second generate
Module 650, third training module 660, third generation module 670 and the 4th training module 680 correspond, and are consistent, and
It has increased the first coding module 790 being connected with third training module 760 newly and second to be connected with the 4th training module 780 encodes
Module 792.
First coding module 791 is connected with third training module 760, be suitable for in face image data set, the age not
Age greater than the human face image information of preset age threshold carries out 0/1 coding, and 0/1 coding includes with preset age threshold
Add the sum of 1 for number of encoding bits, each is any one of number 0 and number 1, and since first place, the number that number 1 occurs subtracts
1 difference is the age.In turn, third training module 760 is suitably selected in face image data set, the age is no more than default
The human face image information of age threshold is trained the third convolutional neural networks generated in the second generation module 750, wherein
For training the age of the human face image information of third convolutional neural networks to carry out coded treatment by the first coding module 791.
Second coding module 792 is connected with the 4th training module 780, is suitable for facial image in face image data set
The age of information carries out distributed code, and distributed code includes carrying out age coding according to Gaussian Profile.Then the 4th training module 780
Suitable for being trained according to face image data set to the Volume Four generated of third generation module 770 product neural network,
In for train Volume Four product neural network human face image information by the second coding module 792 carry out coded treatment.
The specific steps and reality generated about the convolutional neural networks for carrying out age identification to the face in image
Example is applied, has been disclosed in detail in the description based on Fig. 2-7, details are not described herein again.
Figure 12 shows the schematic diagram of age identification device 800 according to an embodiment of the invention.Age identification device
800 based on for in image face carry out age identification convolutional neural networks generating means in, trained Volume Four
Product neural network carries out age identification, which includes: identification module 810, judgment module 820 and acquisition module 830.Wherein,
Identification module 810 is connected with judgment module 810, suitable for facial image to be identified is input to trained Volume Four product nerve net
Age identification is carried out in network.Judgment module 820 is suitable for the trained Volume Four for judging that identification module 810 carries out after age identification
Whether the output of the second classifier is greater than preset age threshold in product neural network.Obtain module 830 respectively with identification module
810 are connected with judgment module 820, suitable for judging the output of the second classifier no more than the preset age when judgment module 820
When threshold value, obtains identification module 810 and carry out third classifier in the trained Volume Four product neural network after age identification
Output is the age of people corresponding to face, when the output of the second classifier is greater than preset age threshold, obtains identification module
810 outputs for carrying out the 4th classifier in the trained Volume Four product neural network after age identification are people's corresponding to face
Age.
Figure 13 shows the schematic diagram of the age identification device 900 of another embodiment according to the present invention.Age identification dress
Set 900 based on for in image face carry out age identification convolutional neural networks generating means in, the trained 4th
Convolutional neural networks carry out age identification.As shown in figure 13, the identification module 910 of device 900, judgment module 920 and acquisition mould
Block 930 corresponds with the identification module 810 of device 800, judgment module 820 and acquisition module 830 in Figure 12 respectively, is one
It causes, and has increased preprocessing module 940 newly, be connected with identification module 910, suitable for being pre-processed images to be recognized to obtain
Facial image to be identified, so that the facial image to be identified obtained from preprocessing module 940 is input to training by identification module 910
Age identification is carried out in good Volume Four product neural network.Preprocessing module 940 is further adapted for carrying out people to images to be recognized
Face detection, obtains face location information;By the face location information, turn after the face in the images to be recognized is cut
Shift to pre-set dimension;The transformation matrix that face carries out Plane Rotation is calculated according to face key point information;Utilize the transformation square
Facial image under pre-set dimension is rotated into horizontal front to obtain facial image to be identified by battle array.
About the specific steps and embodiment of age identification, it has been disclosed in detail in the description based on Fig. 8-9, herein
It repeats no more.
In existing face age recognition methods, age true value is replaced using age distribution value, the result got is in
A certain age range, it is not accurate enough, even if VGG model is taken to carry out age identification, it is also desirable to be taken according to the distribution of age bracket
Multiple models are realized, complex, and versatility is lower.It is according to the present invention to be used to carry out age knowledge to the face in image
The technical solution that other convolutional neural networks generate, is first trained the first convolutional neural networks, the first convolution nerve net
It include multiple convolution groups, multiple full articulamentums and the first classifier being sequentially connected in network, by trained first convolutional Neural
The full articulamentum in part and the first classifier in network are replaced accordingly, are generated the second convolutional neural networks and are carried out to it
Training, then the full articulamentum and classifier new to the addition of trained second convolutional neural networks, to generate third convolutional Neural
Network is simultaneously trained, finally new to the addition of trained third convolutional neural networks full articulamentum and classifier, to generate
Volume Four product neural network is simultaneously trained.In the above-mentioned technical solutions, using convolutional neural networks as basic frame, and in life
At each convolutional neural networks in case before training, compiled to such as age threshold segmentation, 0/1 is carried out for trained face image data
The processing such as code and age distribution coding, auxiliary gradually modify the model of convolutional neural networks, and ultimately forming one can accurately identify
The convolutional neural networks model at age, without segmentation, succinct intuitive, versatility is stronger.In turn, age identification according to the present invention
Facial image to be identified is input in trained Volume Four product neural network, according to the output of different classifications device by method
Judge the range of age and its specific value, as a result accuracy has huge promotion.
Claims (21)
1. it is a kind of for carrying out the convolutional neural networks generation method of age identification to the face in image, it is suitable for calculating equipment
Middle execution, the method includes the steps:
According to the face image data set obtained in advance, the first convolutional neural networks are trained so as to first convolution
Neural network is suitable for identification face, and the face image data set includes multiple human face image informations, each facial image
Information includes the age information of people in facial image and correspondence image, first convolutional neural networks include be sequentially connected it is more
A convolution group, the first full articulamentum, the second full articulamentum, the full articulamentum of third and the first classifier;
Each of face image data set face image information is handled according to preset age threshold, so as to
New age threshold attribute is added for each human face image information, the age threshold attribute indicates that the age of corresponding people is big
Preset age threshold is also no more than in preset age threshold;
By in trained first convolutional neural networks the full articulamentum of third and the first classifier replace with the 4th Quan Lian respectively
Layer and the second classifier are connect, to generate the second convolutional neural networks, and according to the face figure for being added to age threshold attribute
As data acquisition system is trained the second convolutional neural networks, so as to second convolutional neural networks the second classifier it is defeated
Indicate that the age of people corresponding to face is above the age threshold and is also no greater than the age threshold out;
After the first full articulamentum in trained second convolutional neural networks, the full connection of the 5th be sequentially connected is added
Layer, the 6th full articulamentum and third classifier, to generate third convolutional neural networks, and select the face image data set
In, the age third convolutional neural networks are trained no more than the human face image information of default age threshold, with toilet
The age of people corresponding to the output instruction face of the third classifier of third convolutional neural networks is stated by zero to default age threshold
Which of value;
After the first full articulamentum in trained third convolutional neural networks, the full connection of the 7th be sequentially connected is added
Layer, eight convergent points articulamentum and the 4th classifier, to generate Volume Four product neural network, and according to the face image data set
Volume Four product neural network is trained, the year of people corresponding to the output instruction face so as to the 4th classifier
Age.
2. the method as described in claim 1, the facial image of each human face image information in the face image data set
The horizontal front of holding and meet pre-set dimension, the facial image correspond to the age of people as the integer between 0~100.
3. it is method according to claim 1 or 2, it include at least one in each convolution group of first convolutional neural networks
Convolutional layer.
4. the method as described in claim 1, in the first full articulamentum in trained second convolutional neural networks
Later, the 5th full articulamentum, the 6th full articulamentum and the third classifier being sequentially connected are added, to generate third convolutional Neural net
Before the step of network, further comprise the steps of:
To in the face image data set, the age no more than preset age threshold human face image information age carry out
0/1 coding, 0/1 coding include adding the sum of 1 with preset age threshold for number of encoding bits, each is number 0 and number 1
Any one of, since first place, the difference that the number that number 1 occurs subtracts 1 is the age.
5. the method as described in claim 1, the 6th full articulamentum includes multiple full articulamentums of son in parallel, the full connection of son
The number of layer is that preset age threshold adds the sum of 1.
6. the method as described in claim 1, in the first full articulamentum in trained third convolutional neural networks
Later, the 7th full articulamentum, eight convergent points articulamentum and the 4th classifier being sequentially connected are added, to generate Volume Four product nerve net
Before the step of network, further comprise the steps of:
Distributed code is carried out to the age of human face image information in face image data set, the distributed code includes according to height
This distribution carries out age coding.
7. the method as described in claim 1, the preset age threshold is 12.
8. a kind of age recognition methods, suitable for executing in calculating equipment, the method is based on any one of claim 1-7 institute
The trained Volume Four product neural network stated carries out age identification to the face in image, comprising steps of
Facial image to be identified is input in trained Volume Four product neural network and carries out age identification;
Judge whether the output of the second classifier in the trained Volume Four product neural network is greater than preset age threshold;
If the output of second classifier is to obtain the trained Volume Four product mind no more than preset age threshold
Output through third classifier in network is the age of people corresponding to face;
If the output of second classifier is to obtain the trained Volume Four product nerve greater than preset age threshold
The output of the 4th classifier is the age of people corresponding to face in network.
9. method according to claim 8 further includes being pre-processed to images to be recognized to obtain facial image to be identified.
10. method as claimed in claim 9, described to be pre-processed to images to be recognized to obtain facial image packet to be identified
It includes:
Face datection is carried out to images to be recognized, obtains face location information;
By the face location information, convert after the face in the images to be recognized is cut to pre-set dimension;
The transformation matrix that face carries out Plane Rotation is calculated according to face key point information;
The facial image under pre-set dimension is rotated into horizontal front to obtain facial image to be identified using the transformation matrix.
11. it is a kind of for carrying out the convolutional neural networks generating means of age identification to the face in image, suitable for residing in meter
It calculates in equipment, described device includes:
First training module, suitable for being instructed to the first convolutional neural networks according to the face image data set obtained in advance
Practice so that first convolutional neural networks are suitable for identification face, the face image data set includes multiple facial images
Information, each facial image information include the age information of people in facial image and correspondence image, the first convolution nerve net
Network includes the multiple convolution groups being sequentially connected, the first full articulamentum, the second full articulamentum, the full articulamentum of third and the first classification
Device;
Attribute adding module is suitable for according to preset age threshold to each of face image data set face image
Information is handled, to add new age threshold attribute, the age threshold attribute instruction for each human face image information
The age of corresponding people is greater than preset age threshold and is also no more than preset age threshold;
First generation module, suitable for by the full articulamentum of third and the first classifier point in trained first convolutional neural networks
The 4th full articulamentum and the second classifier are not replaced with, to generate the second convolutional neural networks;
Second training module, suitable for being added to the face image data set of age threshold attribute according to the second convolution mind
It is trained through network, the year of people corresponding to the output instruction face so as to the second classifier of second convolutional neural networks
Age is above the age threshold and is also no greater than the age threshold;
Second generation module, after the first full articulamentum in trained second convolutional neural networks, addition is successively
Connected the 5th full articulamentum, the 6th full articulamentum and third classifier, to generate third convolutional neural networks;
Third training module is suitably selected in the face image data set, the age is no more than default age threshold
Human face image information is trained third convolutional neural networks, so as to the third classifier of the third convolutional neural networks
The age of people corresponding to output instruction face is which of zero to default age threshold;
Third generation module, after the first full articulamentum in trained third convolutional neural networks, addition is successively
Connected the 7th full articulamentum, eight convergent points articulamentum and the 4th classifier, to generate Volume Four product neural network;
4th training module, suitable for being trained according to the face image data set to Volume Four product neural network,
The age of people corresponding to output instruction face so as to the 4th classifier.
12. device as claimed in claim 11, the face figure of each human face image information in the face image data set
As the horizontal front of equal holdings and meet pre-set dimension, the facial image correspond to the age of people as the integer between 0~100.
It include at least 13. the device as described in claim 11 or 12, in each convolution group of first convolutional neural networks
One convolutional layer.
14. device as claimed in claim 11 further includes the first coding module, is suitable for:
To in the face image data set, the age no more than preset age threshold human face image information age carry out
0/1 coding, 0/1 coding include adding the sum of 1 with preset age threshold for number of encoding bits, each is number 0 and number 1
Any one of, since first place, the difference that the number that number 1 occurs subtracts 1 is the age.
15. device as claimed in claim 11, the 6th full articulamentum includes multiple full articulamentums of son in parallel, sub- Quan Lian
The number for connecing layer is that preset age threshold adds the sum of 1.
16. device as claimed in claim 11 further includes the second coding module, is suitable for:
Distributed code is carried out to the age of human face image information in face image data set, the distributed code includes according to height
This distribution carries out age coding.
17. device as claimed in claim 11, the preset age threshold is 12.
18. a kind of age identification device calculates in equipment suitable for residing in, described device is based on any in claim 11-17
Trained Volume Four product neural network described in carries out age identification to the face in image, comprising:
Identification module carries out age knowledge suitable for facial image to be identified to be input in trained Volume Four product neural network
Not;
Judgment module, suitable for judging in the trained Volume Four product neural network after the identification module carries out age identification the
Whether the output of two classifiers is greater than preset age threshold;
Module is obtained, when suitable for judging the output of the second classifier when the judgment module no more than preset age threshold,
It obtains the identification module and carries out the output of third classifier in the trained Volume Four product neural network after age identification and be
The age of people corresponding to face, when the judgment module judges the output of the second classifier greater than preset age threshold,
It obtains the identification module and carries out the output of the 4th classifier in the trained Volume Four product neural network after age identification and be
The age of people corresponding to face.
19. device as claimed in claim 18 further includes preprocessing module, suitable for being pre-processed images to be recognized to obtain
Take facial image to be identified.
20. device as claimed in claim 19, the preprocessing module is further adapted for:
Face datection is carried out to images to be recognized, obtains face location information;
By the face location information, convert after the face in the images to be recognized is cut to pre-set dimension;
The transformation matrix that face carries out Plane Rotation is calculated according to face key point information;
The facial image under pre-set dimension is rotated into horizontal front to obtain facial image to be identified using the transformation matrix.
21. a kind of calculating equipment, comprising:
The convolutional neural networks for being used to carry out the face in image age identification as described in any one of claim 11-17
Generating means;And
Age identification device as described in any one of claim 18-20.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610925676.0A CN106485235B (en) | 2016-10-24 | 2016-10-24 | A kind of convolutional neural networks generation method, age recognition methods and relevant apparatus |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610925676.0A CN106485235B (en) | 2016-10-24 | 2016-10-24 | A kind of convolutional neural networks generation method, age recognition methods and relevant apparatus |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106485235A CN106485235A (en) | 2017-03-08 |
CN106485235B true CN106485235B (en) | 2019-05-03 |
Family
ID=58270488
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610925676.0A Active CN106485235B (en) | 2016-10-24 | 2016-10-24 | A kind of convolutional neural networks generation method, age recognition methods and relevant apparatus |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106485235B (en) |
Families Citing this family (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106980830A (en) * | 2017-03-17 | 2017-07-25 | 中国人民解放军国防科学技术大学 | One kind is based on depth convolutional network from affiliation recognition methods and device |
CN106874964B (en) * | 2017-03-30 | 2023-11-03 | 李文谦 | Foot-type image size automatic prediction method and prediction device based on improved convolutional neural network |
CN107122798A (en) * | 2017-04-17 | 2017-09-01 | 深圳市淘米科技有限公司 | Chin-up count detection method and device based on depth convolutional network |
CN107194407B (en) * | 2017-05-18 | 2020-04-07 | 网易(杭州)网络有限公司 | Image understanding method and device |
JP6729516B2 (en) * | 2017-07-27 | 2020-07-22 | トヨタ自動車株式会社 | Identification device |
CN107527024A (en) * | 2017-08-08 | 2017-12-29 | 北京小米移动软件有限公司 | Face face value appraisal procedure and device |
CN108021863B (en) * | 2017-11-01 | 2022-05-06 | 平安科技(深圳)有限公司 | Electronic device, age classification method based on image and storage medium |
CN107622261A (en) * | 2017-11-03 | 2018-01-23 | 北方工业大学 | Face age estimation method and device based on deep learning |
CN108052861A (en) * | 2017-11-08 | 2018-05-18 | 北京卓视智通科技有限责任公司 | A kind of nerve network system and the model recognizing method based on the nerve network system |
CN108073910B (en) * | 2017-12-29 | 2021-05-07 | 百度在线网络技术(北京)有限公司 | Method and device for generating human face features |
CN108197602B (en) * | 2018-01-30 | 2020-05-19 | 厦门美图之家科技有限公司 | Convolutional neural network generation method and expression recognition method |
CN108596042A (en) * | 2018-03-29 | 2018-09-28 | 青岛海尔智能技术研发有限公司 | Enabling control method and system |
CN109002884A (en) * | 2018-07-20 | 2018-12-14 | 郑州云海信息技术有限公司 | A kind of pond processing unit and pond processing method |
CN109101938B (en) * | 2018-08-23 | 2021-01-15 | 武汉嫦娥医学抗衰机器人股份有限公司 | Multi-label age estimation method based on convolutional neural network |
CN110971901B (en) * | 2018-09-29 | 2022-08-02 | 杭州海康威视数字技术股份有限公司 | Processing method, device and equipment of convolutional neural network and storage medium |
CN111096736A (en) * | 2018-10-26 | 2020-05-05 | 深圳市理邦精密仪器股份有限公司 | Electrocardiogram classification method, device and system based on active learning |
CN109522872A (en) * | 2018-12-04 | 2019-03-26 | 西安电子科技大学 | A kind of face identification method, device, computer equipment and storage medium |
CN109711546B (en) * | 2018-12-21 | 2021-04-06 | 深圳市商汤科技有限公司 | Neural network training method and device, electronic equipment and storage medium |
CN110185939B (en) * | 2019-05-16 | 2021-04-02 | 西北工业大学 | Gas pipeline leakage identification method based on convolutional neural network |
CN112052710A (en) * | 2019-06-06 | 2020-12-08 | 搜狗(杭州)智能科技有限公司 | Face age identification method and device |
CN110399813A (en) * | 2019-07-10 | 2019-11-01 | 深兰科技(上海)有限公司 | A kind of age recognition methods, device, electronic equipment and storage medium |
CN112307796B (en) * | 2019-07-24 | 2022-06-24 | 魔门塔(苏州)科技有限公司 | Age prediction method and device for infrared image |
CN111159450A (en) * | 2019-12-30 | 2020-05-15 | 上海眼控科技股份有限公司 | Picture classification method and device, computer equipment and storage medium |
CN111325256A (en) * | 2020-02-13 | 2020-06-23 | 上海眼控科技股份有限公司 | Vehicle appearance detection method and device, computer equipment and storage medium |
CN112927801B (en) * | 2021-01-24 | 2024-05-14 | 武汉东湖大数据科技股份有限公司 | Construction method and device of health prediction model based on dressing meteorological index |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103824054A (en) * | 2014-02-17 | 2014-05-28 | 北京旷视科技有限公司 | Cascaded depth neural network-based face attribute recognition method |
CN104504376A (en) * | 2014-12-22 | 2015-04-08 | 厦门美图之家科技有限公司 | Age classification method and system for face images |
CN104866829A (en) * | 2015-05-25 | 2015-08-26 | 苏州大学 | Cross-age face verify method based on characteristic learning |
CN105095833A (en) * | 2014-05-08 | 2015-11-25 | 中国科学院声学研究所 | Network constructing method for human face identification, identification method and system |
CN105279499A (en) * | 2015-10-30 | 2016-01-27 | 小米科技有限责任公司 | Age recognition method and device |
CN105426963A (en) * | 2015-12-01 | 2016-03-23 | 北京天诚盛业科技有限公司 | Convolutional neural network Training method and apparatus for human face identification and application |
CN105825191A (en) * | 2016-03-23 | 2016-08-03 | 厦门美图之家科技有限公司 | Face multi-attribute information-based gender recognition method and system and shooting terminal |
CN105975916A (en) * | 2016-04-28 | 2016-09-28 | 西安电子科技大学 | Age estimation method based on multi-output convolution neural network and ordered regression |
-
2016
- 2016-10-24 CN CN201610925676.0A patent/CN106485235B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103824054A (en) * | 2014-02-17 | 2014-05-28 | 北京旷视科技有限公司 | Cascaded depth neural network-based face attribute recognition method |
CN105095833A (en) * | 2014-05-08 | 2015-11-25 | 中国科学院声学研究所 | Network constructing method for human face identification, identification method and system |
CN104504376A (en) * | 2014-12-22 | 2015-04-08 | 厦门美图之家科技有限公司 | Age classification method and system for face images |
CN104866829A (en) * | 2015-05-25 | 2015-08-26 | 苏州大学 | Cross-age face verify method based on characteristic learning |
CN105279499A (en) * | 2015-10-30 | 2016-01-27 | 小米科技有限责任公司 | Age recognition method and device |
CN105426963A (en) * | 2015-12-01 | 2016-03-23 | 北京天诚盛业科技有限公司 | Convolutional neural network Training method and apparatus for human face identification and application |
CN105825191A (en) * | 2016-03-23 | 2016-08-03 | 厦门美图之家科技有限公司 | Face multi-attribute information-based gender recognition method and system and shooting terminal |
CN105975916A (en) * | 2016-04-28 | 2016-09-28 | 西安电子科技大学 | Age estimation method based on multi-output convolution neural network and ordered regression |
Non-Patent Citations (3)
Title |
---|
《Age and Gender Classification using Convolutional Neural Networks》;Gil Levi等;《2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)》;20150612;第34-42页 |
《基于人脸特征的性别判断和年龄估计方法研究》;方晟;《中国优秀硕士论文全文数据库》;20151215;第I138-433页 |
《神经网络集成》;周志华等;《计算机学报》;20020131;第25卷(第1期);第1-8页 |
Also Published As
Publication number | Publication date |
---|---|
CN106485235A (en) | 2017-03-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106485235B (en) | A kind of convolutional neural networks generation method, age recognition methods and relevant apparatus | |
Zhang et al. | Human action recognition using convolutional LSTM and fully-connected LSTM with different attentions | |
CN106650806B (en) | A kind of cooperating type depth net model methodology for pedestrian detection | |
CN104217214B (en) | RGB D personage's Activity recognition methods based on configurable convolutional neural networks | |
CN105512624B (en) | A kind of smiling face's recognition methods of facial image and its device | |
CN104281853B (en) | A kind of Activity recognition method based on 3D convolutional neural networks | |
CN110096950A (en) | A kind of multiple features fusion Activity recognition method based on key frame | |
CN108509978A (en) | The multi-class targets detection method and model of multi-stage characteristics fusion based on CNN | |
CN106778854A (en) | Activity recognition method based on track and convolutional neural networks feature extraction | |
CN109389055A (en) | Video classification methods based on mixing convolution sum attention mechanism | |
CN107808132A (en) | A kind of scene image classification method for merging topic model | |
CN107330362A (en) | A kind of video classification methods based on space-time notice | |
CN107748858A (en) | A kind of multi-pose eye locating method based on concatenated convolutional neutral net | |
CN107785061A (en) | Autism-spectrum disorder with children mood ability interfering system | |
CN101477619B (en) | Movement data gesture classification process based on DTW curve | |
CN104103033B (en) | View synthesis method | |
CN107729872A (en) | Facial expression recognition method and device based on deep learning | |
CN107657233A (en) | Static sign language real-time identification method based on modified single multi-target detection device | |
CN109034224A (en) | Hyperspectral classification method based on double branching networks | |
CN109558902A (en) | A kind of fast target detection method | |
CN110399788A (en) | AU detection method, device, electronic equipment and the storage medium of image | |
CN109934158A (en) | Video feeling recognition methods based on local strengthening motion history figure and recursive convolution neural network | |
CN108122003A (en) | A kind of Weak target recognition methods based on deep neural network | |
CN107609563A (en) | Picture semantic describes method and device | |
CN106845329A (en) | A kind of action identification method based on depth convolution feature multichannel pyramid pond |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |