CN105469041A - Facial point detection system based on multi-task regularization and layer-by-layer supervision neural networ - Google Patents
Facial point detection system based on multi-task regularization and layer-by-layer supervision neural networ Download PDFInfo
- Publication number
- CN105469041A CN105469041A CN201510807796.6A CN201510807796A CN105469041A CN 105469041 A CN105469041 A CN 105469041A CN 201510807796 A CN201510807796 A CN 201510807796A CN 105469041 A CN105469041 A CN 105469041A
- Authority
- CN
- China
- Prior art keywords
- regularization
- multitask
- successively
- task
- layer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/172—Classification, e.g. identification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/174—Facial expression recognition
Abstract
The invention discloses a facial point detection system based on multi-task regularization and a layer-by-layer supervision neural network. The system comprises a multi-task regularization module and a layer-by-layer supervision network module. The multi-task regularization module includes a main task and a related task; and the main task and the related task study jointly to obtain a common feature space and then an additional regular term is provided by using an auxiliary tag of the related task to enhance a generalization ability of a network. The layer-by-layer supervision network module, different from the traditional convolution neural network only optimizing an objective function of an output layer, introduces a supervision objective function into each interlayer, thereby enhancing the saliency of features obtained by studying of the interlayers. Therefore, problems that overfitting occurs and the feature robustness is uncertain according to the traditional convolution neural network can be solved effectively.
Description
Technical field
The present invention relates to a kind of face point detecting method of computer vision field, specifically a kind of based on multitask regularization and the face point detection system of successively supervising neural network.
Background technology
At computer vision field, face point, the detection as eyes, nose, face etc. is a very basic and important problem, is the basis of follow-up recognition of face, tracking and the modeling of 3D face.Even if there is large quantifier elimination to drop into wherein, due to head pose change and the partial occlusion problem of people in image, when face point detects limited at ambient, remain a challenging problem.
Existing face point detecting method is mainly divided into two classes: template adaptive with based on the method returned.First carry out feature extraction to input picture based on the method returned, the Feature Mapping then study arrived is to the space of human face characteristic point.Original image as input, utilizes multiple linear filter automatically to calculate high-level character representation by convolutional neural networks, extracts in application obtain remarkable achievement at actual characteristic.
" Deepconvolutionalnetworkcascadeforfacialpointdetection " that the people such as Y.Sun deliver in " IEEEComputerVisionandPatternRecognition " (IEEECVPR) meeting of 2013 one proposes a kind of face point detecting method of multiple convolutional neural networks cascade in literary composition, face is divided into several part by advance, convolutional neural networks is used alone to each part and carries out feature point detection from coarse to fine, but the method for this cascade makes network parameter be multiplied causes training difficulty, and very large computing cost can be brought.
" Faciallandmarkdetectionbydeepmulti-tasklearning " that the people such as Z.Zhang deliver in " EuropeanConferenceonComputerVision " meeting in 2014 one proposes a kind of method of multi-task learning in literary composition.This method utilizes the correlativity of other characteristics of face and unique point to carry out the foundation of convolutional neural networks model, to promote the detection to main task and face point.This method reduces model complexity, but do not consider the physical relationship of main task and inter-related task.
Summary of the invention
The present invention is directed to defect of the prior art, provide a kind of based on multitask regularization and the face point detection system of successively supervising neural network, effectively can solve over-fitting and the uncertain problem of feature robustness that conventional roll amasss neural network.
The present invention is achieved by the following technical solutions:
Of the present invention a kind of based on multitask regularization and the face point detection system of successively to supervise neural network, comprise two parts: multitask regularization module and successively supervise mixed-media network modules mixed-media, wherein:
Describedly successively supervise mixed-media network modules mixed-media, according to its pixel value, feature extraction is carried out to input picture, be different from traditional convolutional neural networks to be only optimized output layer objective function, this module all introduces supervision objective function to each middle layer, thus strengthen the conspicuousness of the feature that middle layer learns, again output characteristic is inputed to the backpropagation that multitask regularization module carries out signal, repeat with this until network convergence;
Described multitask regularization module, comprise main task and inter-related task, the parameter that main task and inter-related task learn successively to supervise mixed-media network modules mixed-media jointly obtains the total feature space of all tasks, the assisted tag of recycling inter-related task provides additional regular terms with the generalization ability of Strengthens network, finally exports the prediction coordinate figure of main task.
Preferably, described multitask regularization module, comprises main task submodule and inter-related task submodule, wherein:
Described main task submodule, to the detection of input facial image 5 unique points, respectively: the detection of left eye, right eye, nose, the left corners of the mouth and the right corners of the mouth, predicts that the coordinate figure of each point is as final output.
Described inter-related task submodule carries out Attitude estimation, smile's detection, Glasses detection and gender prediction to input facial image respectively, predicts that the label value of each classification task is to promote the predictablity rate of main task.
More preferably, the fundamental purpose of described multitask regularization module produces objective function to be optimized, the i.e. difference of predicted value and actual value, carries out minimization problem solve to make predicted value approaching to reality value as far as possible to this objective function.
More preferably, the optimization object function of described multitask regularization module is the linear combination of main task loss function and inter-related task loss function.
More preferably, described main task loss function and inter-related task loss function use difference of two squares regression function and cross entropy function representation respectively.
Preferably, describedly successively supervise mixed-media network modules mixed-media, after each convolutional layer in centre, add returning Monitor function, carry out the backpropagation of signal together with the objective function to be optimized in multitask regularization module.
Preferably, describedly successively supervise mixed-media network modules mixed-media, wherein return the difference of two squares function that Monitor function is this convolutional layer output coordinate value and true coordinate value.
Preferably, describedly successively supervise mixed-media network modules mixed-media, only main task exercised supervision, and not to inter-related task supervision with the priority ensureing main task.
Preferably, describedly successively supervise mixed-media network modules mixed-media, wherein successively supervise the backpropagation of neural network, alleviate the gradient disperse problem of traditional convolutional neural networks.
Compared with prior art, the present invention has following beneficial effect:
Technique scheme of the present invention, for traditional convolutional neural networks Problems existing, proposes the method for improvement.The present invention adds supervision item to every one deck of traditional convolution nerve net, with strengthen feature the transparency and alleviate the problem of gradient disperse.The detection sharing feature space of face point, to strengthen the accuracy rate of main task, also strengthens the overall generalization ability of network to 4 inter-related tasks of the present invention---attitude detection, smile's detection, Glasses detection and gender prediction and main task---.
Accompanying drawing explanation
By reading the detailed description done non-limiting example with reference to the following drawings, other features, objects and advantages of the present invention will become more obvious:
Fig. 1 is the structured flowchart of present system one embodiment;
Fig. 2 successively supervises network diagram in the inventive method.
Embodiment
Below in conjunction with specific embodiment, the present invention is described in detail.Following examples will contribute to those skilled in the art and understand the present invention further, but not limit the present invention in any form.It should be pointed out that to those skilled in the art, without departing from the inventive concept of the premise, some distortion and improvement can also be made.These all belong to protection scope of the present invention.
The present invention is directed to the problem existing for traditional convolutional neural networks, propose a kind of based on multitask regularization and the face point detection system of successively supervising neural network.Native system, in multitask canonical part, for the over-fitting problem of traditional convolutional neural networks, utilizes the advantage of inter-related task label to represent with the common characteristic of study to high-level identification mission.In successively supervised learning part, native system for conventional roll and the disperse of neural network gradient learn the inadequate problem of feature significance, add monitor layer in each middle layer of neural network, to promote the gradient signal of returning from output layer backpropagation.This system to be used for face point and to detect by the present invention, valid certificates multitask canonical and the validity of successively supervising neural network.
As shown in Figure 1, be the structured flowchart of present system one embodiment, comprise: multitask regularization module and successively supervise mixed-media network modules mixed-media.
In the present embodiment, the main task in described multitask regularization module and several inter-related task learn the feature space obtaining having jointly, and the assisted tag of recycling inter-related task provides additional regular terms with the generalization ability of Strengthens network.
In the present embodiment, optimization object function is represented by the linear combination of main task loss function and inter-related task loss function and forms:
Wherein λ
athe weight of a inter-related task,
main task loss function,
be the loss function of a inter-related task, T is total number of all tasks, and w is each layer parameter to be asked of neural network.
Main task in the present embodiment is the detection of face 5 coordinate points, inter-related task respectively: attitude detection, smile detect, Glasses detection and gender prediction.
For one group of training sample
i=1 ..., N, t=1 ..., T, N and T are respectively total sample number and task number, wherein sample
represent the original input of t task,
represent corresponding true tag data.The detection of left eye, right eye, nose, the left corners of the mouth and these 5 points of the right corners of the mouth is recurrence tasks, and therefore desired value is the coordinate figure of respective point.The loss function of main task adopts squared error function:
wherein f (x; W) be the prediction coordinate figure of 5 points, || .||
2it is difference of two squares function; The loss function of 4 inter-related tasks adopts cross entropy function:
wherein
softmax function, in order to the modeling to posterior probability.
Therefore, corresponding with formula (1), final optimization pass objective function is:
In this example, described successively supervision mixed-media network modules mixed-media is different from traditional convolutional neural networks and is only optimized output layer objective function, but all introduces supervision objective function as Fig. 2 to each middle layer, thus strengthens the conspicuousness of the feature that middle layer learns.Convolutional neural networks alternately forms the feature to extract stratification by K layer convolutional layer and pooling layer, can be represented by following recursive formula:
Z
k=pool(Z
k-1*W
k+b
k)(3)
Wherein Z
kthe characteristic pattern of a kth convolutional layer, Z
k-1the characteristic pattern of kth-1 convolutional layer, W
kthe filter weight needing study, b
kit is bias term.
The method that the present invention adopts the degree of depth to supervise, adds and returns supervision after each convolutional layer response in centre, to make solution formula (2) more accurately,
Wherein
the objective function of last output layer, and
it is the adjoint supervision objective function that kth layer exports.Therefore:
Wherein w and w
krepresent the filter parameter in final layer and middle layer respectively, K is the total number of plies of convolution, α
kbe a kth convolutional layer recurrence Monitor function shared by weight.Noticing the priority in order to ensure main task, only supervision item being applied to main task.
For final problem formulations to be optimized (4), the i.e. output function of multitask regularization module, stochastic gradient descent method is adopted to solve, namely the character representation that first forward direction study is total, again signals reverse is propagated and go back to represent so that refinement is this, repeat above-mentioned two steps until network convergence.
Implementation result
According to above-mentioned steps, the step in summary of the invention is adopted to implement, test the 10000 width pictures altogether that training data used derives from data set LFW and network, each width picture has all marked 5 points, respectively: left eye, right eye, nose, the left corners of the mouth and the right corners of the mouth.All mark values are all normalized to [0,1] according to picture size.Experiment used test is data from data set AFLW, AFW and LFPW.The present invention adopts three layers of wave filter size to be the convolutional layer of 5x5, connect pooling layer after each convolutional layer respectively and return monitor layer, 4th layer is containing 64 neuronic full articulamentums, is finally the multitask network layer containing the detection of main task face point and 4 face's association attributeses.This instance system compares traditional convolutional Neural respectively and returns net, successively supervises network, multitask regularization network, and the vision response test of measured 5 points is respectively: 2.14%, 5.18%, 2.80% and 2.71%.Experiment shows, has good effect in the problem detected at face point based on multi-task learning and the system of successively to supervise neural network that the present invention proposes.
Above specific embodiments of the invention are described.It is to be appreciated that the present invention is not limited to above-mentioned particular implementation, those skilled in the art can make various distortion or amendment within the scope of the claims, and this does not affect flesh and blood of the present invention.
Claims (9)
1. based on multitask regularization and a face point detection system of successively to supervise neural network, it is characterized in that, comprising: multitask regularization module and successively supervise mixed-media network modules mixed-media, wherein:
Describedly successively supervise mixed-media network modules mixed-media, according to its pixel value, feature extraction is carried out to input picture, be different from traditional convolutional neural networks to be only optimized output layer objective function, this module all introduces supervision objective function to each middle layer, thus strengthen the conspicuousness of the feature that middle layer learns, again output characteristic is inputed to the backpropagation that multitask regularization module carries out signal, repeat with this until network convergence;
Described multitask regularization module, comprise main task and inter-related task, the parameter that main task and inter-related task learn successively to supervise mixed-media network modules mixed-media jointly obtains the total feature space of all tasks, the assisted tag of recycling inter-related task provides additional regular terms with the generalization ability of Strengthens network, finally exports the prediction coordinate figure of main task.
It is 2. according to claim 1 that based on multitask regularization and the face point detection system of successively supervising neural network, it is characterized in that, described multitask regularization module, comprises main task submodule and inter-related task submodule, wherein:
Described main task submodule, to the detection of input facial image 5 unique points, respectively: the detection of left eye, right eye, nose, the left corners of the mouth and the right corners of the mouth, predicts that the coordinate figure of each point is as final output;
Described inter-related task submodule carries out Attitude estimation, smile's detection, Glasses detection and gender prediction to input facial image respectively, predicts that the label value of each classification task is to promote the predictablity rate of main task.
3. according to claim 2 based on multitask regularization and the face point detection system of successively supervising neural network, it is characterized in that, the fundamental purpose of described multitask regularization module produces objective function to be optimized, the i.e. difference of predicted value and actual value, carries out minimization problem to this objective function and solves to make predicted value approaching to reality value as far as possible.
4. according to claim 3 based on multitask regularization and the face point detection system of successively supervising neural network, it is characterized in that, the optimization object function of described multitask regularization module is the linear combination of main task loss function and inter-related task loss function.
5. according to claim 4 based on multitask regularization and the face point detection system of successively supervising neural network, it is characterized in that, described main task loss function and inter-related task loss function use difference of two squares regression function and cross entropy function representation respectively.
6. according to any one of claim 1-5 based on multitask regularization and the face point detection system of successively to supervise neural network, it is characterized in that, describedly successively supervise mixed-media network modules mixed-media, add after each convolutional layer in centre and return Monitor function, carry out the backpropagation of signal together with the objective function to be optimized in multitask regularization module.
7. according to claim 6 based on multitask regularization and the face point detection system of successively supervising neural network, it is characterized in that, describedly successively supervise mixed-media network modules mixed-media, wherein return the difference of two squares function that Monitor function is convolutional layer output coordinate value and true coordinate value.
8. according to claim 6 based on multitask regularization and the face point detection system of successively supervising neural network, it is characterized in that, described describedly successively supervise mixed-media network modules mixed-media, wherein return the backpropagation of Monitor function, alleviate the gradient disperse problem of traditional convolutional neural networks.
9. according to any one of claim 1-5 based on multitask regularization and the face point detection system of successively to supervise neural network, it is characterized in that, described successively supervision mixed-media network modules mixed-media, only exercises supervision to main task, and does not supervise inter-related task with the priority ensureing main task.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510807796.6A CN105469041B (en) | 2015-11-19 | 2015-11-19 | Face point detection system based on multitask regularization and layer-by-layer supervision neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510807796.6A CN105469041B (en) | 2015-11-19 | 2015-11-19 | Face point detection system based on multitask regularization and layer-by-layer supervision neural network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105469041A true CN105469041A (en) | 2016-04-06 |
CN105469041B CN105469041B (en) | 2019-05-24 |
Family
ID=55606712
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510807796.6A Active CN105469041B (en) | 2015-11-19 | 2015-11-19 | Face point detection system based on multitask regularization and layer-by-layer supervision neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105469041B (en) |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106227836A (en) * | 2016-07-26 | 2016-12-14 | 上海交通大学 | Associating visual concept learning system and method is supervised with the nothing of word based on image |
CN106529402A (en) * | 2016-09-27 | 2017-03-22 | 中国科学院自动化研究所 | Multi-task learning convolutional neural network-based face attribute analysis method |
CN106778590A (en) * | 2016-12-09 | 2017-05-31 | 厦门大学 | It is a kind of that video detecting method is feared based on convolutional neural networks model cruelly |
CN106951840A (en) * | 2017-03-09 | 2017-07-14 | 北京工业大学 | A kind of facial feature points detection method |
CN107229968A (en) * | 2017-05-24 | 2017-10-03 | 北京小米移动软件有限公司 | Gradient parameter determines method, device and computer-readable recording medium |
CN107463903A (en) * | 2017-08-08 | 2017-12-12 | 北京小米移动软件有限公司 | Face key independent positioning method and device |
CN107784647A (en) * | 2017-09-29 | 2018-03-09 | 华侨大学 | Liver and its lesion segmentation approach and system based on multitask depth convolutional network |
CN107886062A (en) * | 2017-11-03 | 2018-04-06 | 北京达佳互联信息技术有限公司 | Image processing method, system and server |
CN108268822A (en) * | 2016-12-30 | 2018-07-10 | 深圳光启合众科技有限公司 | Face identification method, device and robot |
CN108399452A (en) * | 2017-02-08 | 2018-08-14 | 西门子保健有限责任公司 | The Layered Learning of the weight of neural network for executing multiple analysis |
CN108805259A (en) * | 2018-05-23 | 2018-11-13 | 北京达佳互联信息技术有限公司 | neural network model training method, device, storage medium and terminal device |
CN109101869A (en) * | 2018-06-14 | 2018-12-28 | 深圳市博威创盛科技有限公司 | Test method, equipment and the storage medium of multi-task learning depth network |
CN109948633A (en) * | 2017-12-20 | 2019-06-28 | 广东欧珀移动通信有限公司 | User gender prediction method, apparatus, storage medium and electronic equipment |
CN110119750A (en) * | 2018-02-05 | 2019-08-13 | 浙江宇视科技有限公司 | Data processing method, device and electronic equipment |
CN112446499A (en) * | 2019-08-30 | 2021-03-05 | 西门子医疗有限公司 | Improving performance of machine learning models for automated quantification of coronary artery disease |
US11675876B2 (en) | 2020-10-28 | 2023-06-13 | International Business Machines Corporation | Training robust machine learning models |
CN111401456B (en) * | 2020-03-20 | 2023-08-22 | 杭州涂鸦信息技术有限公司 | Training method, system and device for face gesture recognition model |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103778414A (en) * | 2014-01-17 | 2014-05-07 | 杭州电子科技大学 | Real-time face recognition method based on deep neural network |
CN103793718A (en) * | 2013-12-11 | 2014-05-14 | 台州学院 | Deep study-based facial expression recognition method |
CN104866810A (en) * | 2015-04-10 | 2015-08-26 | 北京工业大学 | Face recognition method of deep convolutional neural network |
-
2015
- 2015-11-19 CN CN201510807796.6A patent/CN105469041B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103793718A (en) * | 2013-12-11 | 2014-05-14 | 台州学院 | Deep study-based facial expression recognition method |
CN103778414A (en) * | 2014-01-17 | 2014-05-07 | 杭州电子科技大学 | Real-time face recognition method based on deep neural network |
CN104866810A (en) * | 2015-04-10 | 2015-08-26 | 北京工业大学 | Face recognition method of deep convolutional neural network |
Non-Patent Citations (1)
Title |
---|
ZHANPENG ZHANG等: "Facial Landmark Detection by Deep Multi-task Learning", 《EUROPEAN CONFERENCE ON COMPUTER VISION》 * |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106227836B (en) * | 2016-07-26 | 2020-07-14 | 上海交通大学 | Unsupervised joint visual concept learning system and unsupervised joint visual concept learning method based on images and characters |
CN106227836A (en) * | 2016-07-26 | 2016-12-14 | 上海交通大学 | Associating visual concept learning system and method is supervised with the nothing of word based on image |
CN106529402A (en) * | 2016-09-27 | 2017-03-22 | 中国科学院自动化研究所 | Multi-task learning convolutional neural network-based face attribute analysis method |
CN106529402B (en) * | 2016-09-27 | 2019-05-28 | 中国科学院自动化研究所 | The face character analysis method of convolutional neural networks based on multi-task learning |
CN106778590A (en) * | 2016-12-09 | 2017-05-31 | 厦门大学 | It is a kind of that video detecting method is feared based on convolutional neural networks model cruelly |
CN106778590B (en) * | 2016-12-09 | 2020-07-17 | 厦门大学 | Violence and terrorism video detection method based on convolutional neural network model |
CN108268822A (en) * | 2016-12-30 | 2018-07-10 | 深圳光启合众科技有限公司 | Face identification method, device and robot |
CN108399452A (en) * | 2017-02-08 | 2018-08-14 | 西门子保健有限责任公司 | The Layered Learning of the weight of neural network for executing multiple analysis |
CN108399452B (en) * | 2017-02-08 | 2022-11-29 | 西门子保健有限责任公司 | Hierarchical learning of weights for neural networks performing multiple analyses |
US11328412B2 (en) | 2017-02-08 | 2022-05-10 | Siemens Healthcare Gmbh | Hierarchical learning of weights of a neural network for performing multiple analyses |
CN106951840A (en) * | 2017-03-09 | 2017-07-14 | 北京工业大学 | A kind of facial feature points detection method |
CN107229968B (en) * | 2017-05-24 | 2021-06-29 | 北京小米移动软件有限公司 | Gradient parameter determination method, gradient parameter determination device and computer-readable storage medium |
CN107229968A (en) * | 2017-05-24 | 2017-10-03 | 北京小米移动软件有限公司 | Gradient parameter determines method, device and computer-readable recording medium |
CN107463903A (en) * | 2017-08-08 | 2017-12-12 | 北京小米移动软件有限公司 | Face key independent positioning method and device |
CN107463903B (en) * | 2017-08-08 | 2020-09-04 | 北京小米移动软件有限公司 | Face key point positioning method and device |
CN107784647A (en) * | 2017-09-29 | 2018-03-09 | 华侨大学 | Liver and its lesion segmentation approach and system based on multitask depth convolutional network |
CN107784647B (en) * | 2017-09-29 | 2021-03-09 | 华侨大学 | Liver and tumor segmentation method and system based on multitask deep convolutional network |
CN107886062A (en) * | 2017-11-03 | 2018-04-06 | 北京达佳互联信息技术有限公司 | Image processing method, system and server |
CN109948633A (en) * | 2017-12-20 | 2019-06-28 | 广东欧珀移动通信有限公司 | User gender prediction method, apparatus, storage medium and electronic equipment |
CN110119750A (en) * | 2018-02-05 | 2019-08-13 | 浙江宇视科技有限公司 | Data processing method, device and electronic equipment |
CN108805259A (en) * | 2018-05-23 | 2018-11-13 | 北京达佳互联信息技术有限公司 | neural network model training method, device, storage medium and terminal device |
CN109101869A (en) * | 2018-06-14 | 2018-12-28 | 深圳市博威创盛科技有限公司 | Test method, equipment and the storage medium of multi-task learning depth network |
CN112446499A (en) * | 2019-08-30 | 2021-03-05 | 西门子医疗有限公司 | Improving performance of machine learning models for automated quantification of coronary artery disease |
CN111401456B (en) * | 2020-03-20 | 2023-08-22 | 杭州涂鸦信息技术有限公司 | Training method, system and device for face gesture recognition model |
US11675876B2 (en) | 2020-10-28 | 2023-06-13 | International Business Machines Corporation | Training robust machine learning models |
Also Published As
Publication number | Publication date |
---|---|
CN105469041B (en) | 2019-05-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105469041A (en) | Facial point detection system based on multi-task regularization and layer-by-layer supervision neural networ | |
Ma et al. | Dimension reduction of image deep feature using PCA | |
EP3074918B1 (en) | Method and system for face image recognition | |
JP6159489B2 (en) | Face authentication method and system | |
CN111814661B (en) | Human body behavior recognition method based on residual error-circulating neural network | |
CN109002845A (en) | Fine granularity image classification method based on depth convolutional neural networks | |
CN109325430B (en) | Real-time behavior identification method and system | |
CN105447473A (en) | PCANet-CNN-based arbitrary attitude facial expression recognition method | |
CN106469298A (en) | Age recognition methodss based on facial image and device | |
CN105574534A (en) | Significant object detection method based on sparse subspace clustering and low-order expression | |
CN104992167A (en) | Convolution neural network based face detection method and apparatus | |
CN104281853A (en) | Behavior identification method based on 3D convolution neural network | |
Deng et al. | Amae: Adaptive motion-agnostic encoder for event-based object classification | |
CN108446676B (en) | Face image age discrimination method based on ordered coding and multilayer random projection | |
CN109919059A (en) | Conspicuousness object detecting method based on depth network layerization and multitask training | |
CN116012950B (en) | Skeleton action recognition method based on multi-heart space-time attention pattern convolution network | |
Zhou et al. | Classroom learning status assessment based on deep learning | |
Iosifidis et al. | Neural representation and learning for multi-view human action recognition | |
CN110111365B (en) | Training method and device based on deep learning and target tracking method and device | |
CN114492634A (en) | Fine-grained equipment image classification and identification method and system | |
CN108009512A (en) | A kind of recognition methods again of the personage based on convolutional neural networks feature learning | |
Xia et al. | Face recognition and application of film and television actors based on Dlib | |
CN110490165B (en) | Dynamic gesture tracking method based on convolutional neural network | |
WO2021038840A1 (en) | Object number estimation device, control method, and program | |
Nemec et al. | Unmanned aerial vehicle control using hand gestures and neural networks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |