CN110443132A - A kind of Face datection and the more attribute convergence analysis methods of face based on deep learning - Google Patents

A kind of Face datection and the more attribute convergence analysis methods of face based on deep learning Download PDF

Info

Publication number
CN110443132A
CN110443132A CN201910590960.0A CN201910590960A CN110443132A CN 110443132 A CN110443132 A CN 110443132A CN 201910590960 A CN201910590960 A CN 201910590960A CN 110443132 A CN110443132 A CN 110443132A
Authority
CN
China
Prior art keywords
face
picture
size
region
facial
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201910590960.0A
Other languages
Chinese (zh)
Inventor
张赛男
李千目
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Tech University
Original Assignee
Nanjing Tech University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Tech University filed Critical Nanjing Tech University
Priority to CN201910590960.0A priority Critical patent/CN110443132A/en
Publication of CN110443132A publication Critical patent/CN110443132A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161Detection; Localisation; Normalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a kind of Face datection based on deep learning and the multiattribute convergence analysis methods of face.This method are as follows: input picture first generates the different scale set of the picture using image pyramid, is handled using image of the full convolutional neural networks to input, and the first hierarchical network generates preliminary face candidate region;Then face detection module filters out face candidate frame from thick to thin, will repeat area using non-maxima suppression algorithm and simplifies beyond the candidate window of given threshold, determines human face region;Then human face region is amplified, is sent to facial attributive analysis network and is analyzed, obtains age, gender prediction's result;Finally Face datection result and attributive analysis result are labeled on picture and are saved, obtains visual prediction result.Present invention reduces the complexities of Face datection and the more attribute convergence analysis of face, have the advantages that structure is simple, parameter is few, practical.

Description

A kind of Face datection and the more attribute convergence analysis methods of face based on deep learning
Technical field
The present invention relates to machine learning, depth learning technology field, especially a kind of Face datection based on deep learning And the more attribute convergence analysis methods of face.
Background technique
Currently, deep learning has become the very active project of research, especially in terms of human face analysis.Such as Modern big data era, as the computing capability of hardware greatly enhances, deep learning can more play the advantage of its own, analyze Significant increase is obtained in precision.The experiment that many carries out in large-scale data is it has been shown that by obtained by deep learning Character representation can show good performance in fields such as natural language processing, computer visions.
Face character refer in facial image imply some attribute informations, as the age of people, gender, hair style, adornment, Head pose etc..Face character analysis, which refers to, to be analyzed and is identified to face character according to facial image.It is limited to current Shooting condition, that there are resolution ratio is low for some pictures under road traffic scene, facial size is small, face obscures, blocks face etc. Problem, how to carry out face character analysis to such picture is still the problem in computer vision field.
Summary of the invention
That the purpose of the present invention is to provide a kind of image resolution requirements is low, analysis accuracy rate is high, parameter is few, practical Face datection based on deep learning and the more attribute convergence analysis methods of face.
The technical solution for realizing the aim of the invention is as follows: a kind of Face datection and the more attributes of face based on deep learning Convergence analysis method, comprising the following steps:
Step 1, input picture generate the different scale set of the picture using image pyramid, using full convolutional Neural Network handles the image of input, and the first hierarchical network generates preliminary face candidate region;
Step 2, face detection module filter out face candidate frame according to preliminary face candidate region from thick to thin, benefit It will repeat area with non-maxima suppression algorithm to simplify beyond the candidate window of given threshold, deletion overlapping area, which is greater than, to be set The candidate window for determining threshold value, determines human face region;
Step 3 amplifies human face region, is sent to facial attributive analysis network and is analyzed, obtains age, gender prediction As a result;
Face datection result and attributive analysis result are labeled on picture and are saved by step 4, are obtained visual Prediction result.
Further, input picture described in step 1 generates the different scale set of the picture using image pyramid, It is handled using image of the full convolutional neural networks to input, the first hierarchical network generates preliminary face candidate region, tool Body is as follows:
Step 1.1, input picture, having a size of (h, w);
Step 1.2, the different size set that the picture is generated using image pyramid, according to the minimum facial size of setting Zoom ratio factor between facial size net_face_size that min_face_size, the network layer can detect, pyramidal layer, Input picture, which is zoomed to the network layer, to detect suitable size:
(h_n, w_n)=(h*net_face_size/min_face_size* (factor^n), h*net_face_size/ Min_face_size* (factor^n),
Wherein, n is the pyramid number of plies, is started counting from 0, until being less than net_face_ compared with decimal in (h_n.w_n) Size is handled using image of the full convolutional neural networks to input, and the first hierarchical network generates preliminary face candidate area Domain.
Further, face detection module described in step 2 filters out people according to preliminary face candidate region from thick to thin Face candidate frame will repeat area using non-maxima suppression algorithm and simplify beyond the candidate frame of given threshold, deletes overlapping The candidate window that area is greater than given threshold determines human face region, specific as follows:
Step 2.1 is rolled up using human face region differentiation, the recurrence of face bounding box and Face detection three cascade multitasks Product neural network, training face detection module;
Step 2.2, using trained face detection module Stepwise Screening face candidate frame, obtain final result.
Further, human face region is amplified described in step 3, is sent to facial attributive analysis network and is analyzed, is obtained To age, gender prediction as a result, specific as follows:
The face that step 3.1, input face detection module are predicted;
Step 3.2, the picture region that will be delivered to facial attributive analysis network expand, if a left side for protoplast's face zone boundary frame Upper, lower right coordinate is (x1,y1), (x2,y2), then in former bounding box picture height h, width w are as follows:
Coordinate according to former bounding box improves, the practical upper left for intercepting picture region, lower right coordinate (x '1,y′1)、 (x′2,y′2) are as follows:
Increase effective information with this, includes the incoming facial attributive analysis network of information of ear, hair style by face periphery;
Step 3.3 carries out human face analysis using convolutional layer, and the convolution kernel size of three convolutional layers is 7*7,5*5,3* respectively 3, the excitation function used is PReLU, and by primary maximum pond after each convolutional layer, convolutional layer generates multiple features Two layers of the feature in front and back is locally tied, reduces the size of characteristic pattern by face;
Step 3.4, the face characteristic extracted according to two full articulamentums, determine that the face belongs to the general of every age class Rate and belong to male, women probability;
Step 3.5, lower probability more of all categories obtain face age, gender prediction's result.
Further, facial attributive analysis network described in step 3 selects the shooting picture under real scene as face The data set of portion's attributive analysis.
Further, facial attributive analysis network described in step 3 is trained using online difficult example mining algorithm, According to the difficult example ratio and this batch of total number of samples set, crucial sample number, carries out to losing in this lot sample sheet needed for calculating Descending arrangement, extracts the crucial sample of respective numbers, forms small quantities of sample, be trained.
Compared with prior art, the present invention its remarkable advantage are as follows: (1) model prediction accuracy and network structure between the two It is balanced, reduces image resolution requirement;(2) complexity of Face datection and the more attribute convergence analysis of face is reduced Degree, has the advantages that structure is simple, parameter is few, practical.
Detailed description of the invention
Fig. 1 is the process signal the present invention is based on the Face datection of deep learning and the more attribute convergence analysis methods of face Figure.
Fig. 2 is the flow diagram of face detection module in the present invention.
Fig. 3 is the flow diagram of septum reset attributive analysis of the present invention.
Fig. 4 is the prediction result schematic diagram of picture under actual scene in the embodiment of the present invention.
Specific embodiment
The present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments.
In conjunction with Fig. 1, a kind of Face datection and the more attribute convergence analysis methods of face based on deep learning of the present invention is special Sign is, comprising the following steps:
Step 1, input picture generate the different scale set of the picture using image pyramid, using full convolutional Neural Network handles the image of input, and the first hierarchical network generates preliminary face candidate region, specific as follows in conjunction with Fig. 2:
Step 1.1, input picture, having a size of (h, w);
Step 1.2, the different size set that the picture is generated using image pyramid, according to the minimum facial size of setting Zoom ratio factor between facial size net_face_size that min_face_size, the network layer can detect, pyramidal layer, Input picture, which is zoomed to the network layer, to detect suitable size:
(h_n, w_n)=(h*net_face_size/min_face_size* (factor^n), h*net_face_size/ Min_face_size* (factor^n),
Wherein, n is the pyramid number of plies, is started counting from 0, until being less than net_face_ compared with decimal in (h_n.w_n) Size is handled using image of the full convolutional neural networks to input, and the first hierarchical network generates preliminary face candidate area Domain.
Step 2, face detection module filter out face candidate frame according to preliminary face candidate region from thick to thin, benefit It will repeat area with non-maxima suppression algorithm to simplify beyond the candidate window of given threshold, deletion overlapping area, which is greater than, to be set The candidate window for determining threshold value, determines human face region, specific as follows in conjunction with Fig. 2:
Step 2.1 is rolled up using human face region differentiation, the recurrence of face bounding box and Face detection three cascade multitasks Product neural network, training face detection module;
Step 2.2, using trained face detection module Stepwise Screening face candidate frame, obtain final result.
Step 3 amplifies human face region, is sent to facial attributive analysis network and is analyzed, obtains age, gender prediction As a result, in conjunction with Fig. 3, it is specific as follows:
The face that step 3.1, input face detection module are predicted;
Step 3.2, the picture region that will be delivered to facial attributive analysis network expand, if a left side for protoplast's face zone boundary frame Upper, lower right coordinate is (x1,y1), (x2,y2), then in former bounding box picture height h, width w are as follows:
Coordinate according to former bounding box improves, the practical upper left for intercepting picture region, lower right coordinate (x '1,y′1)、 (x′2,y′2) are as follows:
Increase effective information with this, includes the incoming facial attributive analysis network of information of ear, hair style by face periphery;
Step 3.3 carries out human face analysis using convolutional layer, and the convolution kernel size of three convolutional layers is 7*7,5*5,3* respectively 3, the excitation function used is PReLU, and by primary maximum pond after each convolutional layer, convolutional layer generates multiple features Two layers of the feature in front and back is locally tied, reduces the size of characteristic pattern by face;
Step 3.4, the face characteristic extracted according to two full articulamentums, determine that the face belongs to the general of every age class Rate and belong to male, women probability;
Step 3.5, lower probability more of all categories obtain face age, gender prediction's result.
Face datection result and attributive analysis result are labeled on picture and are saved by step 4, are obtained visual Prediction result.
Further, facial attributive analysis network described in step 3 selects the shooting picture under real scene as face The data set of portion's attributive analysis.
Further, facial attributive analysis network described in step 3 is trained using online difficult example mining algorithm, According to the difficult example ratio and this batch of total number of samples set, crucial sample number, carries out to losing in this lot sample sheet needed for calculating Descending arrangement, extracts the crucial sample of respective numbers, forms small quantities of sample, be trained, to increase the specific gravity of difficult example sample, by force Change the specific aim of model.
Embodiment 1
In conjunction with Fig. 4, a specific embodiment of the invention, comprising the following steps:
Step 1, input picture generate the different scale set of the picture using image pyramid, using full convolutional Neural Network handles the image of input, and the first hierarchical network generates preliminary face candidate region;
Step 2, face detection module filter out face candidate frame from thick to thin, will be repeated using non-maxima suppression algorithm Area is simplified beyond the candidate window of given threshold, determines human face region;
Step 3 amplifies human face region, comprising will be enlarged by model into effective pictorial information such as hair lengths, ear etc. Face picture after enclosing is sent to facial attributive analysis network and is analyzed, and obtains age, gender prediction's result;
Face datection result and attributive analysis result are labeled on picture and are saved by step 4, are obtained visual Prediction result is convenient for practical application.
Specific testing result (to protect privacy, blurs face in test data) as shown in Figure 4, in test chart In piece, rectangle frame outlines human face region, and the Gender on rectangle frame represents prediction of the present invention to the face gender, and M is male, F is women, and the Age on rectangle frame represents prediction of the present invention to the face age class, behind institute with data represent the corresponding age Section shares 8 kinds of possibility: 0-2,4-6,8-13,15-20,25-32,38-43,48-53,60-.Although testing the picture light used Lines part is poor, and resolution ratio is low, and the result of model prediction is still more bonded true.As can be seen that the present invention is based on depth The Face datection and the more attribute convergence analysis methods of face of habit, are put down between the two in model prediction accuracy and network structure Weighing apparatus, reduces image resolution requirement, and reduces the complexity of Face datection and the more attribute convergence analysis of face, has knot The advantage that structure is simple, parameter is few, practical.

Claims (6)

1. a kind of Face datection and the more attribute convergence analysis methods of face based on deep learning, which is characterized in that including following Step:
Step 1, input picture generate the different scale set of the picture using image pyramid, using full convolutional neural networks The image of input is handled, the first hierarchical network generates preliminary face candidate region;
Step 2, face detection module filter out face candidate frame, utilization is non-according to preliminary face candidate region from thick to thin Maximum restrainable algorithms will repeat area and simplify beyond the candidate window of given threshold, delete overlapping area and are greater than setting threshold The candidate window of value, determines human face region;
Step 3 amplifies human face region, is sent to facial attributive analysis network and is analyzed, and obtains age, gender prediction's knot Fruit;
Face datection result and attributive analysis result are labeled on picture and are saved by step 4, are visually predicted As a result.
2. the Face datection and the multiattribute convergence analysis method of face according to claim 1 based on deep learning, It is characterized in that, input picture described in step 1, the different scale set of the picture is generated using image pyramid, using full volume Product neural network handles the image of input, and the first hierarchical network generates preliminary face candidate region, specific as follows:
Step 1.1, input picture, having a size of (h, w);
Step 1.2, the different size set that the picture is generated using image pyramid, according to the minimum facial size min_ of setting Zoom ratio factor between facial size net_face_size that face_size, the network layer can detect, pyramidal layer, will be defeated Suitable size can be detected to the network layer by entering image scaling:
(h_n, w_n)=(h*net_face_size/min_face_size* (factor^n), h*net_face_size/min_ Face_size* (factor^n),
Wherein, n is the pyramid number of plies, is started counting from 0, until being less than net_face_size compared with decimal in (h_n.w_n), is adopted It is handled with image of the full convolutional neural networks to input, the first hierarchical network generates preliminary face candidate region.
3. the Face datection and the multiattribute convergence analysis method of face according to claim 1 based on deep learning, It is characterized in that, face detection module described in step 2 filters out face candidate according to preliminary face candidate region from thick to thin Frame will repeat area using non-maxima suppression algorithm and simplify beyond the candidate frame of given threshold, it is big to delete overlapping area Human face region is determined in the candidate window of given threshold, specific as follows:
Step 2.1 utilizes human face region differentiation, the recurrence of face bounding box and Face detection three cascade multitask convolution minds Through network, training face detection module;
Step 2.2, using trained face detection module Stepwise Screening face candidate frame, obtain final result.
4. the Face datection and the multiattribute convergence analysis method of face according to claim 1 based on deep learning, It is characterized in that, amplifies human face region described in step 3, be sent to facial attributive analysis network and analyzed, obtain age, property Other prediction result, specific as follows:
The face that step 3.1, input face detection module are predicted;
Step 3.2, the picture region that will be delivered to facial attributive analysis network expand, if the upper left of protoplast's face zone boundary frame, Lower right coordinate is (x1,y1), (x2,y2), then in former bounding box picture height h, width w are as follows:
Coordinate according to former bounding box improves, the practical upper left for intercepting picture region, lower right coordinate (x '1,y′1)、(x′2, y′2) are as follows:
Increase effective information with this, includes the incoming facial attributive analysis network of information of ear, hair style by face periphery;
Step 3.3 carries out human face analysis using convolutional layer, and the convolution kernel size of three convolutional layers is 7*7,5*5,3*3 respectively, is adopted Excitation function is PReLU, and by primary maximum pond after each convolutional layer, convolutional layer generates multiple characteristic faces, will The feature that two layers of front and back is locally tied, and reduces the size of characteristic pattern;
Step 3.4, the face characteristic extracted according to two full articulamentums, determine the face belong to every age class probability and Belong to male, women probability;
Step 3.5, lower probability more of all categories obtain face age, gender prediction's result.
5. the Face datection and the multiattribute convergence analysis method of face according to claim 1 based on deep learning, It is characterized in that, facial attributive analysis network described in step 3, selects the shooting picture under real scene as facial attribute point The data set of analysis.
6. the Face datection and the multiattribute convergence analysis method of face according to claim 1 based on deep learning, It is characterized in that, facial attributive analysis network described in step 3, is trained using online difficult example mining algorithm, according to setting Good difficult example ratio and this batch of total number of samples, crucial sample number needed for calculating carry out descending arrangement to loss in this lot sample sheet, The crucial sample for extracting respective numbers, forms small quantities of sample, is trained.
CN201910590960.0A 2019-07-02 2019-07-02 A kind of Face datection and the more attribute convergence analysis methods of face based on deep learning Withdrawn CN110443132A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910590960.0A CN110443132A (en) 2019-07-02 2019-07-02 A kind of Face datection and the more attribute convergence analysis methods of face based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910590960.0A CN110443132A (en) 2019-07-02 2019-07-02 A kind of Face datection and the more attribute convergence analysis methods of face based on deep learning

Publications (1)

Publication Number Publication Date
CN110443132A true CN110443132A (en) 2019-11-12

Family

ID=68428543

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910590960.0A Withdrawn CN110443132A (en) 2019-07-02 2019-07-02 A kind of Face datection and the more attribute convergence analysis methods of face based on deep learning

Country Status (1)

Country Link
CN (1) CN110443132A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111062426A (en) * 2019-12-11 2020-04-24 北京金山云网络技术有限公司 Method, device, electronic equipment and medium for establishing training set
CN111091109A (en) * 2019-12-24 2020-05-01 厦门瑞为信息技术有限公司 Method, system and equipment for predicting age and gender based on face image
CN111626193A (en) * 2020-05-26 2020-09-04 北京嘀嘀无限科技发展有限公司 Face recognition method, face recognition device and readable storage medium
CN111914628A (en) * 2020-06-19 2020-11-10 北京百度网讯科技有限公司 Training method and device of face recognition model
CN114782400A (en) * 2022-05-17 2022-07-22 东风本田发动机有限公司 Method, device, equipment, medium and program product for detecting slag point of metal material

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111062426A (en) * 2019-12-11 2020-04-24 北京金山云网络技术有限公司 Method, device, electronic equipment and medium for establishing training set
CN111091109A (en) * 2019-12-24 2020-05-01 厦门瑞为信息技术有限公司 Method, system and equipment for predicting age and gender based on face image
CN111091109B (en) * 2019-12-24 2023-04-07 厦门瑞为信息技术有限公司 Method, system and equipment for predicting age and gender based on face image
CN111626193A (en) * 2020-05-26 2020-09-04 北京嘀嘀无限科技发展有限公司 Face recognition method, face recognition device and readable storage medium
CN111914628A (en) * 2020-06-19 2020-11-10 北京百度网讯科技有限公司 Training method and device of face recognition model
CN111914628B (en) * 2020-06-19 2023-06-20 北京百度网讯科技有限公司 Training method and device of face recognition model
CN114782400A (en) * 2022-05-17 2022-07-22 东风本田发动机有限公司 Method, device, equipment, medium and program product for detecting slag point of metal material

Similar Documents

Publication Publication Date Title
CN110443132A (en) A kind of Face datection and the more attribute convergence analysis methods of face based on deep learning
CN109101930B (en) Crowd counting method and system
CN106874894B (en) Human body target detection method based on regional full convolution neural network
CN107292256B (en) Auxiliary task-based deep convolution wavelet neural network expression recognition method
CN107358262B (en) High-resolution image classification method and classification device
WO2020119661A1 (en) Target detection method and device and pedestrian detection method and system
WO2022134655A1 (en) End-to-end video action detection and positioning system
CN109919013A (en) Method for detecting human face and device in video image based on deep learning
CN110020606A (en) A kind of crowd density estimation method based on multiple dimensioned convolutional neural networks
CN108898145A (en) A kind of image well-marked target detection method of combination deep learning
CN108416307A (en) A kind of Aerial Images road surface crack detection method, device and equipment
CN110852941B (en) Neural network-based two-dimensional virtual fitting method
US10540568B2 (en) System and method for coarse-to-fine video object segmentation and re-composition
CN111401293B (en) Gesture recognition method based on Head lightweight Mask scanning R-CNN
CN108416314B (en) Picture important face detection method
CN110348357A (en) A kind of fast target detection method based on depth convolutional neural networks
CN109635812B (en) The example dividing method and device of image
CN111696080B (en) Face fraud detection method, system and storage medium based on static texture
CN112528913A (en) Grit particulate matter particle size detection analytic system based on image
CN104484890A (en) Video target tracking method based on compound sparse model
CN109635811A (en) The image analysis method of spatial plant
CN110969171A (en) Image classification model, method and application based on improved convolutional neural network
CN111027497A (en) Weak and small target rapid detection method based on high-resolution optical remote sensing image
CN113888501B (en) Attention positioning network-based reference-free image quality evaluation method
CN107506792A (en) A kind of semi-supervised notable method for checking object

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20191112

WW01 Invention patent application withdrawn after publication