CN109902641B - Semantic alignment-based face key point detection method, system and device - Google Patents

Semantic alignment-based face key point detection method, system and device Download PDF

Info

Publication number
CN109902641B
CN109902641B CN201910168643.XA CN201910168643A CN109902641B CN 109902641 B CN109902641 B CN 109902641B CN 201910168643 A CN201910168643 A CN 201910168643A CN 109902641 B CN109902641 B CN 109902641B
Authority
CN
China
Prior art keywords
key point
face
network
point detection
response
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910168643.XA
Other languages
Chinese (zh)
Other versions
CN109902641A (en
Inventor
朱翔昱
雷震
王金桥
刘智威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Automation of Chinese Academy of Science
Original Assignee
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Automation of Chinese Academy of Science filed Critical Institute of Automation of Chinese Academy of Science
Priority to CN201910168643.XA priority Critical patent/CN109902641B/en
Publication of CN109902641A publication Critical patent/CN109902641A/en
Application granted granted Critical
Publication of CN109902641B publication Critical patent/CN109902641B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Image Analysis (AREA)

Abstract

The invention belongs to the field of face recognition, in particular to a face key point detection method, a system and a device based on semantic alignment, aiming at improving the accuracy of face key point detection, the method adopts a training sample which is constructed by a basic convergent face key point detection network and comprises a face image sample marked with key points and a response image of standard Gaussian with each key point as the center, and optimizes the face key point detection network by using a probability model containing hidden variables as a target of maximum likelihood estimation; and predicting the coordinates of the key points of the human face through the finally optimized human face key point detection network. The invention effectively solves the problem of training oscillation caused by labeling randomness in the network training process and improves the accuracy of face key point detection.

Description

Semantic alignment-based face key point detection method, system and device
Technical Field
The invention belongs to the field of face recognition, and particularly relates to a face key point detection method, system and device based on semantic alignment.
Background
The face key points play an important role in computer vision and pattern recognition applications based on faces, such as video monitoring and identity recognition systems. For most face applications, accurate detection of key points of the face is required first.
The mainstream face key point detection methods in recent years are mainly divided into two categories, one category is a traditional method. One type is a convolutional neural network based approach. The traditional method directly regresses model parameters through manual image features. A representative method is cascade regression, and the fitting process can be summarized as the following formula:
pk+1=pk+Regk(Fea(I,pk))
at the k-th iteration, go through RegkRegressing the shape index feature Fea to update the shape parameter pk. Wherein the shape index feature Fea depends on the input image I and the current shape parameter pk. The regressor updates the model parameters according to the shape index features and calculates new features for the next iteration. By utilizing the characteristic, a plurality of weak regressors can be connected in series to form a strong regressor so as to gradually reduce the error.
The method is mainly divided into two categories, one is a coordinate regression-based method, the method regards key point positioning as a regression process of mapping image pixels to key point coordinates, and a face picture is input into the convolutional neural network to directly predict a vector formed by the coordinates of each key point. And the other type is a response graph-based method, a response graph of each key point is predicted by a convolutional neural network, and the peak position of the response is taken as the position of the predicted key point.
In the face key point detection method, the positions of the manually marked key points are taken as regression targets during training, so that the positions of the manually marked points are estimated as much as possible by the model. However, there are a large number of weak semantic points in the face key points, and these points are usually only required to be uniformly distributed on the designated edges, such as the face contour, the eye socket, the nose bridge, and other areas, and there is no strict semantic position. Because the identification degree of the texture information around the weak semantic points is low, random errors inevitably exist in manual labeling results, and thus the phenomenon of inconsistent semantics exists in the labeling of different samples. Therefore, the direct training of the model using the manual calibration points results in a large number of invalid errors during the training process, so that the network fitting capability cannot be concentrated on the real required places. So far, the influence of the labeling randomness of key points on model training is not considered.
Disclosure of Invention
In order to solve the above-mentioned problems in the prior art, that is, in order to improve the accuracy of face key point detection, a first aspect of the present invention provides a face key point detection method based on semantic alignment, including:
step S10, acquiring a response image of each key point of the face image to be detected based on the face key point detection network;
step S20, selecting the coordinate of the response peak value as the predicted coordinate of each key point for the obtained response graph of each key point;
the face key point detection network is constructed based on a convolutional neural network, and a probability model containing hidden variables is used as a target of maximum likelihood estimation to perform network optimization and is used for outputting a response graph of key points in a face image.
In some preferred embodiments, the training samples of the face keypoint detection network include face image samples marked with keypoints and response maps of standard gaussians with the keypoints as centers.
In some preferred embodiments, the face key point detection network adopts an objective function in the optimization process of
Figure BDA0001987196480000031
Figure BDA0001987196480000032
Figure BDA0001987196480000033
Wherein x is an input face imageAn image;
Figure BDA0001987196480000034
a set of coordinates for k keypoints;
Figure BDA0001987196480000035
predicting a real position with consistent semantics of the kth key point in x by the network; k is the number of key points; w is the network weight; okRepresenting the kth labeled key point of x; sigma1、σ2Respectively is a preset first weight and a preset second weight;
Figure BDA0001987196480000036
the distribution distance between the response graph representing the network prediction and the standard Gaussian response graph; n (o)k) Is to okIs the neighborhood of the center.
In some preferred embodiments, in the iterative optimization process of the face key point detection network, the network weight W is fixed in the t-th iteration process, and the real position is calculated by the following formula
Figure BDA0001987196480000037
Figure BDA0001987196480000038
Wherein o ist kThe value of (A) is the real position obtained by optimization in the last iteration
Figure BDA0001987196480000039
And then based on the true position
Figure BDA00019871964800000310
The optimized network weights W are obtained by the following formula,
Figure BDA00019871964800000311
in some preferred embodiments, the end condition of the iterative optimization process of the face keypoint detection network is as follows:
objective function
Figure BDA00019871964800000312
Reach a preset convergence condition, or
The iteration times reach the preset times.
The invention provides a human face key point detection system based on semantic alignment, which comprises a response image acquisition module and a key point prediction coordinate acquisition module;
the response image acquisition module is configured to acquire a response image of each key point of the face image to be detected based on the face key point detection network;
the key point prediction coordinate acquisition module is configured to select the coordinate of the response peak value as the prediction coordinate of each key point for the acquired response graph of each key point;
the face key point detection network is constructed based on a convolutional neural network and is used for outputting a response image of key points in a face image.
In some preferred embodiments, the training samples of the face keypoint detection network include face image samples marked with keypoints and response maps of standard gaussians with the keypoints as centers.
In some preferred embodiments, the face key point detection network adopts an objective function in the optimization process of
Figure BDA0001987196480000041
Figure BDA0001987196480000042
Figure BDA0001987196480000043
Wherein, x is an input face image;
Figure BDA0001987196480000044
a set of coordinates for k keypoints;
Figure BDA0001987196480000045
predicting a real position with consistent semantics of the kth key point in x by the network; k is the number of key points; w is the network weight; okRepresenting the kth labeled key point of x; sigma1、σ2Respectively is a preset first weight and a preset second weight;
Figure BDA0001987196480000046
the distribution distance between the response graph representing the network prediction and the standard Gaussian response graph; n (o)k) Is to okIs the neighborhood of the center.
In a third aspect of the present invention, a storage device is provided, in which a plurality of programs are stored, the programs being suitable for being loaded and executed by a processor to implement the above-mentioned face key point detection method based on semantic alignment.
In a fifth aspect of the present invention, a processing apparatus is provided, which includes a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; the program is suitable to be loaded and executed by a processor to realize the above-mentioned face key point detection method based on semantic alignment.
The invention has the beneficial effects that:
the semantic ambiguity of weak semantic key points distributed on the edge can cause random errors of manual labeling. The manual labeling is directly used as an optimization target of network training, so that a large amount of invalid errors exist in the training, and the predicted position continuously vibrates. Resulting in the network fitting capability not being concentrated where it is really needed. The human face key point detection method based on semantic alignment can calculate the true positions of key points without errors and with consistent semantics, effectively solves the problem of training oscillation caused by labeling randomness, and simultaneously enables the fitting of weak semantic points to be more flexible. Due to the fact that the network fitting capacity is distributed more reasonably, the performance can be improved remarkably on the premise that the operation amount is not changed.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
fig. 1 is a schematic diagram of training and detection processes in a face key point detection method based on semantic alignment according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.
The invention discloses a face key point detection method based on semantic alignment, which comprises the following steps:
step S10, acquiring a response image of each key point of the face image to be detected based on the face key point detection network;
step S20, selecting the coordinate of the response peak value as the predicted coordinate of each key point for the obtained response graph of each key point;
the face key point detection network is constructed based on a convolutional neural network, and a probability model containing hidden variables is used as a target of maximum likelihood estimation to perform network optimization, so as to output a response graph of key points in a face image
In order to more clearly explain the human face key point detection method based on semantic alignment, the method of the invention is expanded and detailed from three aspects of training data, network optimization and human face key point detection by combining with the attached figure 1.
1. Training data
Manually calibrating the key points of the human face to serve as training samples, presetting a calibrated human face frame, and accurately calibrating the two-dimensional coordinates of each key point in the frame for each human face frame.
For each labeled keypoint in the training sample, a response graph of a standard Gaussian centered on the labeled position is generated to be used as network supervision information.
In the constructed training data, each training sample comprises a face image sample marked with key points and a response graph of standard gaussians with the key points as centers.
2. Network optimization
The face key point detection network adopted in the embodiment may be an hourglass network or other full convolution network structures. When the network is trained by using the original artificially labeled training set until the network is basically converged, the network is in a shaking state due to continuous fitting of key points with labeling randomness, so that the network cannot be converged to a better local optimal value.
2.1 construction of the objective function
Because the manual labeling has semantic inconsistency, the problem of difficult convergence can be caused by directly taking the manual labeling as an optimization target of network training. Therefore, the invention introduces real key points which are free of errors and have consistent semantics among different samples as the target of network optimization, and the manually marked positions can be regarded as an observed value of the real positions of the key points, thereby more reasonably modeling the detection problem of the key points of the human face. As the training process of the detection network can be regarded as maximum likelihood estimation, the invention designs a probability model containing hidden variables as the target of the maximum likelihood estimation. The probability model is shown in formula (1)
Figure BDA0001987196480000071
Wherein,
Figure BDA0001987196480000072
representing semantically consistent true positions (true values) of key points, i.e. hidden variables, o represents
Figure BDA0001987196480000073
Such as manually labeled key points. x represents the input face picture and W represents the network weight.
Figure BDA0001987196480000074
For a given true value
Figure BDA0001987196480000075
Obtaining the probability of the current observed value o, estimating according to experience that the observed value should be close to the true value, so that the prior probability model
Figure BDA0001987196480000076
Is as defined in formula (2):
Figure BDA0001987196480000077
wherein K is the number of key points, sigma1The weight representing the prior probability model, denoted as the first weight, okRepresents the kth labeled key point of x. Likelihood function
Figure BDA0001987196480000078
Defining the confidence degree that the current network predicted value is a true hidden variable as shown in formulas (3) and (4):
Figure BDA0001987196480000081
Figure BDA0001987196480000082
wherein,
Figure BDA0001987196480000083
the invention measures the distribution distance between the response graph predicted by the network and the standard Gaussian response graph by using a chi-square test. i denotes the pixel index, E denotes the standard gaussian response map, and Φ (y | x; W) denotes the response map predicted for the input picture x. Sigma2The weight representing the likelihood function is denoted as the second weight. By integrating the prior probability model and the likelihood function, the human face key point detection network training can be converted into a constraint optimization problem, as shown in formula (5):
Figure BDA0001987196480000084
Figure BDA0001987196480000085
Figure BDA0001987196480000086
the actual value may not be too far from the observed value
Figure BDA0001987196480000087
Is limited to the current observed value okA central neighborhood N (o)k) In (1).
2.2 iterative optimization
In the training process, the optimization of the network weight in the face key point detection network is carried out in an iterative optimization mode, the network weight W is fixed in the t-th iteration process, and the real position is calculated through the formula (6)
Figure BDA0001987196480000088
Figure BDA0001987196480000089
Wherein o ist kThe value of (A) is the real position obtained by optimization in the last iteration
Figure BDA00019871964800000810
The actual value obtained by optimization at this time
Figure BDA00019871964800000811
The location of (a) is integrated into consideration of the predicted location and observed location of the network. The knowledge prior of the whole training set sample is learned by the network, and the optimized knowledge is obtained
Figure BDA0001987196480000091
Labeling randomness in a single sample can be smoothed to some extent.
And then based on the true position
Figure BDA0001987196480000092
The optimized network weight W is obtained by equation (7),
Figure BDA0001987196480000093
considering the objective function as a loss function, the optimization weights W can be considered as a standard network training process. The whole training process of the key point detection model can be regarded as that the real position of the key point is obtained by optimizing in each iteration process
Figure BDA0001987196480000094
Reuse of true position
Figure BDA0001987196480000095
And training the network. Compared with the existing algorithm, the method provides a more flexible optimization target, so that unreasonable errors in the training process are effectively reduced, the fitting performance of the convolutional neural network can be more reasonably distributed, and the accuracy of key point positioning is improved.
The preferred steps in the training process are: (1) training a face key point detection network directly based on the artificially labeled face image training sample in the step 1 until basic convergence; (2) and (3) further optimizing the face key point detection network by adopting a 2.2 iterative optimization method based on the training sample constructed in the step 1. The end condition of the iterative optimization process of the face key point detection network is an objective function
Figure BDA0001987196480000096
And reaching a preset convergence condition, or enabling the iteration times to reach preset times.
3. Face keypoint detection
Step S10, acquiring a response image of each key point of the face image to be detected based on the face key point detection network trained by the method;
step S20, for each obtained response map of the key point, selecting the coordinate of the response peak as the predicted coordinate of the key point.
The invention discloses a human face key point detection system based on semantic alignment, which comprises a response image acquisition module and a key point prediction coordinate acquisition module, wherein the response image acquisition module is used for acquiring a key point prediction coordinate;
the response image acquisition module is configured to acquire a response image of each key point of the face image to be detected based on the face key point detection network;
the key point prediction coordinate acquisition module is configured to select the coordinate of the response peak value as the prediction coordinate of each key point for the acquired response graph of each key point;
the face key point detection network is constructed based on a convolutional neural network, and a probability model containing hidden variables is used as a target of maximum likelihood estimation to perform network optimization and is used for outputting a response graph of key points in a face image.
The specific training method of the face key point detection network has been described in detail above, and is not repeated here.
It can be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working process and related description of the system described above may refer to the corresponding process in the foregoing method embodiments, and will not be described herein again.
It should be noted that, the face keypoint detection system based on semantic alignment provided in the foregoing embodiment is only illustrated by the division of the foregoing functional modules, and in practical applications, the above functions may be allocated to different functional modules according to needs, that is, the modules or steps in the embodiment of the present invention are further decomposed or combined, for example, the modules in the foregoing embodiment may be combined into one module, or may be further split into multiple sub-modules, so as to complete all or part of the above described functions. The names of the modules and steps involved in the embodiments of the present invention are only for distinguishing the modules or steps, and are not to be construed as unduly limiting the present invention.
A storage device according to a third embodiment of the present invention stores a plurality of programs, and the programs are suitable for being loaded and executed by a processor to implement the above-mentioned face key point detection method based on semantic alignment.
A processing apparatus according to a fourth embodiment of the present invention includes a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; the program is suitable to be loaded and executed by a processor to realize the above-mentioned face key point detection method based on semantic alignment.
It can be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working processes and related descriptions of the storage device and the processing device described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
Those of skill in the art would appreciate that the various illustrative modules, method steps, and modules described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that programs corresponding to the software modules, method steps may be located in Random Access Memory (RAM), memory, Read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. To clearly illustrate this interchangeability of electronic hardware and software, various illustrative components and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as electronic hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
The terms "first," "second," and the like are used for distinguishing between similar elements and not necessarily for describing or implying a particular order or sequence.
The terms "comprises," "comprising," or any other similar term are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
So far, the technical solutions of the present invention have been described in connection with the preferred embodiments shown in the drawings, but it is easily understood by those skilled in the art that the scope of the present invention is obviously not limited to these specific embodiments. Equivalent changes or substitutions of related technical features can be made by those skilled in the art without departing from the principle of the invention, and the technical scheme after the changes or substitutions can fall into the protection scope of the invention.

Claims (6)

1. A face key point detection method based on semantic alignment is characterized by comprising the following steps:
step S10, acquiring a response image of each key point of the face image to be detected based on the face key point detection network;
in the face key point detection network, an objective function adopted in the optimization process is as follows:
Figure FDA0002801813790000011
Figure FDA0002801813790000012
Figure FDA0002801813790000013
wherein e is an input face image;
Figure FDA0002801813790000014
a set of coordinates for k keypoints;
Figure FDA0002801813790000015
predicting a real position with consistent semantics of the kth key point in x by the network; k is the number of key points; w is the network weight; e represents the kth labeled key point of the x; sigma1、σ2Respectively is a preset first weight and a preset second weight;
Figure FDA0002801813790000016
the distribution distance between the response graph representing the network prediction and the standard Gaussian response graph; n (o)k) Is to okA neighborhood that is the center;
in the iterative optimization process of the face key point detection network, the network weight W is fixed in the t-th iteration process, and the real position is calculated through the following formula
Figure FDA0002801813790000017
Figure FDA0002801813790000018
Wherein o ist kThe value of (A) is the real position obtained by optimization in the last iteration
Figure FDA0002801813790000019
And then based on the true position
Figure FDA00028018137900000110
The optimized network weights W are obtained by the following formula,
Figure FDA00028018137900000111
the end condition of the iterative optimization process of the face key point detection network is as follows: objective function
Figure FDA0002801813790000027
Reaching a preset convergence condition, or enabling the iteration times to reach preset times;
step S20, selecting the coordinate of the response peak value as the predicted coordinate of each key point for the obtained response graph of each key point;
the face key point detection network is constructed based on a convolutional neural network, and a probability model containing hidden variables is used as a target of maximum likelihood estimation to perform network optimization and is used for outputting a response graph of key points in a face image.
2. The method for detecting the key points of the human face based on the semantic alignment as claimed in claim 1, wherein the training samples of the human face key point detection network comprise human face image samples marked with key points and response graphs of standard gaussians with the key points as centers.
3. A face key point detection system based on semantic alignment is characterized by comprising: a response map acquisition module and a key point prediction coordinate acquisition module;
the response image acquisition module is configured to acquire a response image of each key point of the face image to be detected based on the face key point detection network;
the human face key point detection network adopts an objective function in the optimization process of
Figure FDA0002801813790000021
Figure FDA0002801813790000022
Figure FDA0002801813790000023
Wherein, x is an input face image;
Figure FDA0002801813790000024
a set of coordinates for k keypoints;
Figure FDA0002801813790000025
predicting a real position with consistent semantics of the kth key point in x by the network; k is the number of key points; w is the network weight; okRepresenting the kth labeled key point of x; sigma1、σ2Respectively is a preset first weight and a preset second weight;
Figure FDA0002801813790000026
the distribution distance between the response graph representing the network prediction and the standard Gaussian response graph; n (o)k) Is to okA neighborhood that is the center;
the key point prediction coordinate acquisition module is configured to select the coordinate of the response peak value as the prediction coordinate of each key point for the acquired response graph of each key point;
the face key point detection network is constructed based on a convolutional neural network and is used for outputting a response image of key points in a face image.
4. The semantic alignment-based face keypoint detection system of claim 3, wherein the training samples of the face keypoint detection network comprise face image samples marked with keypoints and response graphs of standard gaussians with the keypoints as centers.
5. A storage device having stored therein a plurality of programs adapted to be loaded and executed by a processor to implement the semantic alignment based face keypoint detection method of any of claims 1-2.
6. A processing device comprising a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; characterized in that the program is adapted to be loaded and executed by a processor to implement the semantic alignment based face keypoint detection method of any of claims 1-2.
CN201910168643.XA 2019-03-06 2019-03-06 Semantic alignment-based face key point detection method, system and device Active CN109902641B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910168643.XA CN109902641B (en) 2019-03-06 2019-03-06 Semantic alignment-based face key point detection method, system and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910168643.XA CN109902641B (en) 2019-03-06 2019-03-06 Semantic alignment-based face key point detection method, system and device

Publications (2)

Publication Number Publication Date
CN109902641A CN109902641A (en) 2019-06-18
CN109902641B true CN109902641B (en) 2021-03-02

Family

ID=66946462

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910168643.XA Active CN109902641B (en) 2019-03-06 2019-03-06 Semantic alignment-based face key point detection method, system and device

Country Status (1)

Country Link
CN (1) CN109902641B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111144422A (en) * 2019-12-19 2020-05-12 华中科技大学 Positioning identification method and system for aircraft component
CN113963389A (en) * 2020-07-01 2022-01-21 复旦大学 Face key point detection method based on deep neural network
CN112580515B (en) * 2020-12-21 2022-05-10 浙江大学 Lightweight face key point detection method based on Gaussian heat map regression
CN113610799B (en) * 2021-08-04 2022-07-08 沭阳九鼎钢铁有限公司 Artificial intelligence-based photovoltaic cell panel rainbow line detection method, device and equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106575367A (en) * 2014-08-21 2017-04-19 北京市商汤科技开发有限公司 A method and a system for facial landmark detection based on multi-task
CN107492116A (en) * 2017-09-01 2017-12-19 深圳市唯特视科技有限公司 A kind of method that face tracking is carried out based on more display models
CN108596090A (en) * 2018-04-24 2018-09-28 北京达佳互联信息技术有限公司 Facial image critical point detection method, apparatus, computer equipment and storage medium
CN109214343A (en) * 2018-09-14 2019-01-15 北京字节跳动网络技术有限公司 Method and apparatus for generating face critical point detection model

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140185924A1 (en) * 2012-12-27 2014-07-03 Microsoft Corporation Face Alignment by Explicit Shape Regression
US10860837B2 (en) * 2015-07-20 2020-12-08 University Of Maryland, College Park Deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition
CN108304765B (en) * 2017-12-11 2020-08-11 中国科学院自动化研究所 Multi-task detection device for face key point positioning and semantic segmentation

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106575367A (en) * 2014-08-21 2017-04-19 北京市商汤科技开发有限公司 A method and a system for facial landmark detection based on multi-task
CN107492116A (en) * 2017-09-01 2017-12-19 深圳市唯特视科技有限公司 A kind of method that face tracking is carried out based on more display models
CN108596090A (en) * 2018-04-24 2018-09-28 北京达佳互联信息技术有限公司 Facial image critical point detection method, apparatus, computer equipment and storage medium
CN109214343A (en) * 2018-09-14 2019-01-15 北京字节跳动网络技术有限公司 Method and apparatus for generating face critical point detection model

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Combining Data-Driven and Model-Driven Methods for Robust Facial Landmark Detection;Hongwen Zhang et.al;《IEEE Transactions on Information Forensics and Security》;20180201;第13卷(第10期);2409-2422 *
Deep and Deformable Convolutional Mixtures of Deformable Part-Based Models;Kritaphat Songsri-in et.al;《2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018)》;20180607;218-225 *
偏标记损失函数研究;唐才智;《中国优秀硕士学位论文全文数据库 信息科技辑》;20180415;1-6 *
基于深度学习的人脸关键点检测和人脸属性分析;曾家建;《中国优秀硕士学位论文全文数据库 信息科技辑》;20181215;全文 *

Also Published As

Publication number Publication date
CN109902641A (en) 2019-06-18

Similar Documents

Publication Publication Date Title
CN109902641B (en) Semantic alignment-based face key point detection method, system and device
JP2023506803A (en) Cooperative positioning method, device, equipment and storage medium
CN104867137B (en) A kind of method for registering images based on improvement RANSAC algorithms
US11283991B2 (en) Method and system for tuning a camera image signal processor for computer vision tasks
CN110796135B (en) Target positioning method and device, computer equipment and computer storage medium
CN108984785A (en) A kind of update method and device of the fingerprint base based on historical data and increment
CN112802071B (en) Three-dimensional reconstruction effect evaluation method and system
CN113065593A (en) Model training method and device, computer equipment and storage medium
CN112949519B (en) Target detection method, device, equipment and storage medium
CN112802108B (en) Target object positioning method, target object positioning device, electronic equipment and readable storage medium
CN110866872A (en) Pavement crack image preprocessing intelligent selection method and device and electronic equipment
CN112884820A (en) Method, device and equipment for training initial image registration and neural network
CN115527050A (en) Image feature matching method, computer device and readable storage medium
CN111652168B (en) Group detection method, device, equipment and storage medium based on artificial intelligence
CN111985439B (en) Face detection method, device, equipment and storage medium
CN116563096B (en) Method and device for determining deformation field for image registration and electronic equipment
CN110070120B (en) Depth measurement learning method and system based on discrimination sampling strategy
CN116597246A (en) Model training method, target detection method, electronic device and storage medium
CN113890833B (en) Network coverage prediction method, device, equipment and storage medium
CN105894029B (en) A kind of adaptive motion track data denoising method solved based on Fermat point
CN112446428B (en) Image data processing method and device
CN114374931A (en) Fingerprint positioning method based on measurement learning of neighbor component analysis
CN114449435A (en) Fingerprint positioning method based on metric learning of large-interval neighbors
CN113837228B (en) Fine granularity object retrieval method based on punishment perception center loss function
CN117739809B (en) Bottleneck size measuring method, computer equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant