CN109886072A - Face character categorizing system based on two-way Ladder structure - Google Patents

Face character categorizing system based on two-way Ladder structure Download PDF

Info

Publication number
CN109886072A
CN109886072A CN201811595193.4A CN201811595193A CN109886072A CN 109886072 A CN109886072 A CN 109886072A CN 201811595193 A CN201811595193 A CN 201811595193A CN 109886072 A CN109886072 A CN 109886072A
Authority
CN
China
Prior art keywords
scoring
module
face character
submodule
indicate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811595193.4A
Other languages
Chinese (zh)
Other versions
CN109886072B (en
Inventor
赫然
郑欣
黄怀波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Automation of Chinese Academy of Science
Original Assignee
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Automation of Chinese Academy of Science filed Critical Institute of Automation of Chinese Academy of Science
Priority to CN201811595193.4A priority Critical patent/CN109886072B/en
Publication of CN109886072A publication Critical patent/CN109886072A/en
Application granted granted Critical
Publication of CN109886072B publication Critical patent/CN109886072B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The present invention relates to technical field of computer vision, more particularly to a kind of face character categorizing system based on two-way Ladder structure, the feature for how making full use of different levels in depth network aimed to solve the problem that, and the corresponding relationship between different levels feature and different faces attribute, to improve the accuracy of face character classification.For this purpose, the face character categorizing system provided by the invention based on two-way Ladder structure includes two-way Ladder self-encoding encoder module, adaptive attention power module and the Fusion Module that adaptively scores;Two-way Ladder self-encoding encoder module includes coder module and decoder module;It is adaptive to notice that power module includes multiple attention submodules;Adaptive scoring Fusion Module is configured to obtain the face character classification results of facial image to be measured according to the result of output result and attention the submodule output of coder module.The coding characteristic and decoding feature that different levels can be made full use of based on above structure, improve the accuracy of face character classification.

Description

Face character categorizing system based on two-way Ladder structure
Technical field
The present invention relates to technical field of computer vision, and in particular to a kind of face category based on two-way Ladder structure Property categorizing system.
Background technique
Face character classification method be it is a kind of judge in given facial image whether include certain face character method, It is widely used in the fields such as recognition of face, face retrieval, face verification.Currently, face character classification is carried out using depth network, Classification performance can be significantly promoted, but depth network is primarily upon effect of the further feature to attributive classification, shallow-layer is special Sign is not sufficiently excavated and is utilized.In addition, pair between the feature and different faces attribute of depth network different levels It should be related to, also not be utilized sufficiently, so significantly limit the accuracy of face character classification.
Correspondingly, this field needs a kind of new face character categorizing system to solve the above problems.
Summary of the invention
In order to solve the above problem in the prior art, in order to solve how to make full use of different levels in depth network Feature and different levels feature and different faces attribute between corresponding relationship, with improve face character classification it is accurate Degree, the present invention provides a kind of face character categorizing system based on two-way Ladder structure, the face character categorizing systems Include:
The face character categorizing system includes two-way Ladder self-encoding encoder module, adaptive pays attention to power module and adaptive Should score Fusion Module;
The two-way Ladder self-encoding encoder module includes coder module and decoder module, the coder module packet Multiple sequentially connected coding characteristic extract layers are included, the decoder module includes multiple sequentially connected decoding feature extractions Layer, the outlet side of the coder module are connect with the input side of the decoder module;Wherein, the coding characteristic extract layer It is equal with the decoding quantity of feature extraction layer;
The adaptive attention power module includes multiple attention submodules, the attention submodule respectively with it is described double Into Ladder self-encoding encoder module, the corresponding coding characteristic extract layer of same data characteristics layer and decoding feature extraction layer connect It connects;The attention submodule is configured to be mentioned according to the coding characteristic that the coding characteristic extract layer extracts with the decoding feature The decoding feature for taking layer to extract carries out Fusion Features;
The adaptive scoring Fusion Module is configured to output result and each note according to the coder module The Fusion Features result of meaning power submodule output obtains the face character classification results of the facial image to be measured.
Further, a preferred embodiment provided by the invention are as follows:
The adaptive scoring Fusion Module includes scoring fusion submodule and attributive classification device submodule, the attribute point Class device submodule includes the first attributive classification device and multiple second attributive classification devices, the quantity of the second attributive classification device and institute The quantity for stating attention submodule is equal;
The input side of the first attributive classification device is connect with the outlet side of the coder module, first attribute point The outlet side of class device merges submodule connection with the scoring;The first attributive classification device is configured to according to the encoder mould The coding characteristic of block output obtains corresponding first scoring of preset multiple face characters;
The input side of the multiple second attributive classification device is connect with the multiple attention submodule respectively, the multiple The outlet side of second attributive classification device merges submodule connection with the scoring;The second attributive classification device is configured to basis The fusion feature of the attention submodule output obtains corresponding second scoring of the multiple face character;
Scoring fusion submodule be configured to score to first scoring with described second be weighted merge and The face character classification results of the facial image to be measured are obtained according to Weighted Fusion result.
Further, a preferred embodiment provided by the invention are as follows:
The attention submodule be further configured to according to the following formula shown in method according to the coding characteristic extract layer The decoding feature that the coding characteristic of extraction and the decoding feature extraction layer extract carries out Fusion Features:
Wherein, the OutiIndicate the fusion feature of i-th of attention submodule output, the A (i) indicate described i-th The corresponding mapping function of attention submodule andIt is describedDescribed in expressionWith institute It statesMatrix parallel operation, it is describedIndicate that the corresponding coding characteristic extract layer of i-th of attention submodule mentions The coding characteristic taken, it is describedIndicate the decoding that the corresponding decoding feature extraction layer of i-th of attention submodule extracts Feature.
Further, a preferred embodiment provided by the invention are as follows:
The scoring fusion submodule is further configured to execute following operation:
Method shown according to the following formula scores to be weighted and merge and according to adding to first scoring with described second Power fusion results obtain the face character classification results of the facial image to be measured:
Wherein, the P indicate Weighted Fusion result andAny one groupIt is all satisfiedIt is described M indicates preset face character number, describedIndicate that the facial image to be measured belongs to the general of default kth kind face character Rate, it is describedIndicate that the facial image to be measured is not belonging to the probability of default kth kind face character, the WsIndicate preset power Weight matrix, the sjIndicate the face character classification scoring of j-th of particular community classifier output, the particular community classifier For the first attributive classification device or the second attributive classification device, the people when the particular community classifier is the first attributive classification device The scoring of face attributive classification is the first scoring, the face character point when the particular community classifier is the second attributive classification device Class scoring is the second scoring, the ∑ sjIndicate the sum of the face character classification scoring of all particular community classifier outputs, institute State " (sj,∑sj) " indicate the sjWith the ∑ sjMatrix parallel operation, it is describedRepresenting matrix multiplication operations;
Described in selectionWithClassification results of the corresponding face character of middle the larger value as kth kind face character.
Further, a preferred embodiment provided by the invention are as follows:
The two-way Ladder self-encoding encoder module, adaptive attention power module and adaptive scoring Fusion Module are bases In neural network building module, the system also includes training module, the training module be configured to according to the following formula shown in Loss function pays attention to power module and adaptive scoring Fusion Module to the two-way Ladder self-encoding encoder module, adaptively simultaneously It is trained:
Wherein, describedIndicate the corresponding loss function of j-th of particular community classifier, the particular community point Class device is the first attributive classification device or the second attributive classification device, and the n indicates the quantity of the particular community classifier, described lFusionIndicate the corresponding loss function of the scoring fusion submodule, the lReconstructIndicate the corresponding loss letter of the decoder module Number, the α, β respectively indicate preset hyper parameter.
Further, a preferred embodiment provided by the invention are as follows:
The corresponding loss function of the j particular community classifierIt is shown below:
Wherein, the M indicates preset face character number, pkIndicate the kth of the j particular community classifier output The corresponding face character classification scoring of kind face character, the ykIndicate true kth kind face character label.
Further, a preferred embodiment provided by the invention are as follows:
The corresponding loss function l of submodule is merged in the scoringFusionIt is shown below:
Wherein, the tkIndicate that first scoring corresponding to kth kind face character, all second scorings and scoring are total With the scoring vector being operated in parallel, the scoring summation be kth kind face character it is corresponding it is described first scoring with The sum of all second scorings.
Further, a preferred embodiment provided by the invention are as follows:
The corresponding loss function l of the decoder moduleReconstructIt is shown below:
Wherein, describedIndicate L2Square of norm, it is describedIndicate the two-way Ladder self-encoding encoder module The facial image synthesized according to facial image x to be measured.
Compared with the nearest prior art, above-mentioned technical method is at least had the following beneficial effects:
Face character categorizing system provided by the invention based on two-way Ladder structure mainly comprises the following structure: two-way Ladder self-encoding encoder module, adaptive attention power module and the Fusion Module that adaptively scores;Two-way Ladder self-encoding encoder mould Block includes coder module and decoder module, and coder module includes multiple sequentially connected coding characteristic extract layers, decoding Device module includes multiple sequentially connected decoding feature extraction layers, and the outlet side of coder module is defeated with the decoder module Enter side connection;Wherein, coding characteristic extract layer is equal with the decoding quantity of feature extraction layer;It is adaptive to notice that power module includes more A attention submodule, attention submodule are right with the same data characteristics layer in two-way Ladder self-encoding encoder module respectively The coding characteristic extract layer answered is connect with decoding feature extraction layer;Attention submodule is configured to be mentioned according to coding characteristic extract layer The decoding feature that the coding characteristic and decoding feature extraction layer taken extracts carries out Fusion Features;Adaptive scoring Fusion Module configuration To obtain face figure to be measured according to the Fusion Features result of the output result of coder module and the output of each attention submodule The face character classification results of picture.Based on above structure can make full use of different levels coding characteristic and decoding feature and Corresponding relationship between coding characteristic and decoding feature, improves the precision of face character classification.
Further, adaptively scoring Fusion Module includes scoring fusion submodule and attributive classification device submodule, attribute Classifier submodule includes the first attributive classification device and multiple second attributive classification devices, the quantity and attention of the second attributive classification device The quantity of power submodule is equal;The input side of first attributive classification device and the outlet side of coder module connect, the first attribute point The outlet side of class device merges submodule connection with scoring;First attributive classification device is configured to the coding exported according to coder module Feature obtains corresponding first scoring of preset multiple face characters;The input side of second attributive classification device is sub with attention respectively Module connection, the outlet side of the second attributive classification device merge submodule connection with scoring;Second attributive classification device is configured to root Corresponding second scoring of multiple face characters is obtained according to the fusion feature of attention submodule output;Scoring fusion submodule configuration To be weighted the face for merging and obtaining facial image to be measured according to Weighted Fusion result with the second scoring to the first scoring Attributive classification result.Based on above structure, precise score can be carried out to different levels feature is come from and be merged, further Improve the accuracy of face character classification in ground.
Detailed description of the invention
Fig. 1 is the main knot of face character categorizing system of one of the embodiment of the present invention based on two-way Ladder structure Structure schematic diagram;
Fig. 2 is a kind of the main of face character categorizing system based on two-way Ladder structure of another embodiment of the present invention Structural schematic diagram.
Specific embodiment
The preferred embodiment of the present invention described with reference to the accompanying drawings.It will be apparent to a skilled person that this A little embodiments are used only for explaining technical principle of the invention, it is not intended that limit the scope of the invention.
Face has contained a large amount of attribute information as important biological characteristic, such as sex, race, age, expression, hair Individual event or multinomial face may be implemented in color etc., the face character categorizing system provided by the invention based on two-way Ladder structure The prediction of attributive classification task.Specifically, the face character categorizing system based on two-way Ladder structure, which can be extracted, to be had not The facial characteristics of same level improves the accuracy of facial image attributive classification.The system uses modularized design, between each module Can be independent mutually, facilitate the building and update of model.Face character categorizing system based on two-way Ladder structure can adopt It is constructed with depth convolutional neural networks, the local feature and global characteristics of different levels is extracted using convolutional neural networks, thus Prediction part and global face character respectively.By adaptively paying attention to power module Weighted Fusion coding characteristic and decoding feature, It is indicated with the level characteristics for learning richer.Adaptive scoring Fusion Module predicts different layers using multiple attributive classification devices respectively Secondary facial attribute, and carry out the accuracy that scoring fusion further promotes face character classification.With reference to the accompanying drawing to base It is described in detail in the face character categorizing system of two-way Ladder structure.
Refering to attached drawing 1, Fig. 1 illustrates the main knot of the face character categorizing system based on two-way Ladder structure Structure schematic diagram, the face character categorizing system as shown in Figure 1 based on two-way Ladder structure may include such as flowering structure: two-way Ladder self-encoding encoder module, adaptive attention power module and the Fusion Module that adaptively scores.
Two-way Ladder self-encoding encoder module includes coder module and decoder module, and coder module includes multiple suitable The coding characteristic extract layer of secondary connection, decoder module include multiple sequentially connected decoding feature extraction layers, coder module Outlet side connect with the input side of the decoder module;Wherein, the number of coding characteristic extract layer and decoding feature extraction layer It measures equal.In some embodiments, two-way Ladder self-encoding encoder module is constructed based on depth convolutional neural networks, encoder mould Block includes sequentially connected coding characteristic extract layer to extract the coding characteristic of different layers, and decoder module includes multiple sequentially connecting The decoding feature extraction layer connect, to extract the decoding feature of different levels.Two-way Ladder self-encoding encoder module passes through encoder Facial image to be measured is mapped to feature space by module, decoder module based on the coding characteristic that coder module exports restore to Survey the facial image that facial image is synthesized.During synthesizing facial image, decoder can be in the decoding of different levels Feature extraction layer learns the face character information to different levels, so two-way Ladder self-encoding encoder module is in order to obtain more The character representation for adding different levels abundant when extracting the coding characteristic of different levels, while extracting the decoding of different levels Feature.In addition, with incremental, the data characteristics tool that shallow-layer network extracts of the network number of plies in two-way Ladder self-encoding encoder module There is locality, it includes the information of the details such as hair color, face contour, eye color in facial image, can be right Answer the prediction of local facial attribute;Deep layer network extract data characteristics have it is of overall importance, it includes the property in facial image Not, beautiful contour level semantic information, can correspond to the prediction of Global Face attribute.
Adaptive to notice that power module includes multiple attention submodules, attention submodule is self-editing with two-way Ladder respectively The corresponding coding characteristic extract layer of same data characteristics layer is connect with feature extraction layer is decoded in code device module;Attention submodule It is special that block is configured to the decoding feature progress that the coding characteristic extracted according to coding characteristic extract layer and decoding feature extraction layer extract Sign fusion.Specifically, the quantity of attention submodule is less than or equal to the quantity of coding characteristic extract layer, the connection of attention submodule Coding characteristic extract layer and decoding feature extraction layer, the corresponding coding characteristic of the two and decode the feature hierarchy of feature it is identical. Each attention submodule can adaptive mode learn merge coding characteristic and decode feature.In some embodiments, Attention submodule is further configured to the coding characteristic extracted according to method shown in formula (1) according to coding characteristic extract layer The decoding feature extracted with decoding feature extraction layer carries out Fusion Features:
Wherein, OutiIndicate the fusion feature of i-th of attention submodule output, A(i)Indicate i-th of attention submodule Corresponding mapping function and It indicatesWithMatrix parallel operation,Indicate the coding characteristic that the corresponding coding characteristic extract layer of i-th of attention submodule extracts,Indicate i-th of attention The decoding feature that the corresponding decoding feature extraction layer of power submodule extracts.It should be noted that the adaptive attention mould of the present invention It the quantity of block and can be adjusted according to the number of plies of network in the demand and two-way Ladder self-encoding encoder of user, it is right here The adaptive quantity for paying attention to power module is not construed as limiting.
Adaptive scoring Fusion Module is configured to defeated according to the output result of coder module and each attention submodule Fusion Features result out obtains the face character classification results of facial image to be measured.
Specifically, adaptively scoring Fusion Module includes scoring fusion submodule and attributive classification device submodule, attribute point Class device submodule includes the first attributive classification device and multiple second attributive classification devices, the quantity and attention of the second attributive classification device The quantity of submodule is equal.The input side of first attributive classification device and the outlet side of coder module connect, the first attributive classification The outlet side of device merges submodule connection with scoring;It is special that first attributive classification device is configured to the coding exported according to coder module Sign obtains corresponding first scoring of preset multiple face characters;The input side of multiple second attributive classification devices respectively with multiple notes Power submodule of anticipating connects, and the outlet side of multiple second attributive classification devices merges submodule connection with scoring;Second attributive classification Device is configured to obtain corresponding second scoring of multiple face characters according to the fusion feature that attention submodule exports;
Scoring fusion submodule is configured to be weighted the first scoring with the second scoring and merge and according to Weighted Fusion As a result the face character classification results of facial image to be measured are obtained, it specifically, can be according to method shown in formula (2) to first It scores to be weighted with the second scoring and merges and classified according to the face character that Weighted Fusion result obtains facial image to be measured As a result:
Wherein, P indicate Weighted Fusion result andAppoint It anticipates one groupIt is all satisfiedM indicates preset face character number, wherein default face character is root According to the face character that face character classification task defines, such as gender, age, hair color be one such or multiple combinations,Indicate that facial image to be measured belongs to the probability of default kth kind face character,Indicate that facial image to be measured is not belonging to default the The probability of k kind face character, WsIndicate preset weight matrix, in the present embodiment, weight matrix WsIt can be scoring fusion submodule Study obtains after the completion of block training, sjIndicate the face character classification scoring of j-th of particular community classifier output, particular community Classifier is the first attributive classification device or the second attributive classification device, the face when particular community classifier is the first attributive classification device Attributive classification scoring is the first scoring, and when particular community classifier is the second attributive classification device, face character classification scoring is the Two scorings, ∑ sjIndicate the sum of the face character classification scoring of all particular community classifier outputs, " (sj,∑sj) " indicate sj With ∑ sjMatrix parallel operation,Representing matrix multiplication operations;
It choosesWithClassification results of the corresponding face character of middle the larger value as kth kind face character.It needs to illustrate , Weighted Fusion result P is two classification to each face character, (for example, " 1 " indicates to belong to preset face character, " 0 " indicates to be not belonging to preset face character), two probability values corresponding for each face characterWithAccording toWithThe result that middle the larger value is classified as this kind of face character.
In some embodiments, two-way Ladder self-encoding encoder module, adaptive attention power module and adaptive scoring are melted Molding block is all based on the module of neural network building, and system further includes training module, and training module is configured to according to formula (3) Shown in loss function to two-way Ladder self-encoding encoder module, adaptive pay attention to power module and the Fusion Module that adaptively scores It is trained simultaneously:
Wherein,Indicate that the corresponding loss function of j-th of particular community classifier, particular community classifier are the first category Property classifier or the second attributive classification device, n indicates the quantity of particular community classifier, lFusionIndicate that scoring fusion submodule is corresponding Loss function, lReconstructIndicate the corresponding loss function of decoder module, α, β respectively indicate preset hyper parameter.
Wherein, the corresponding loss function of above-mentioned j-th of particular community classifierAs shown in formula (4):
Wherein, M indicates preset face character number, pkIndicate the kth kind face of j-th of particular community classifier output The corresponding face character classification scoring of attribute, yk indicate true kth kind face character label.
The corresponding loss function l of scoring fusion submoduleFusionAs shown in formula (5):
Wherein, tkIndicate that the first scoring corresponding to kth kind face character, all second scorings and scoring summation carry out simultaneously The scoring vector that connection operation obtains, scoring summation are the sum of corresponding first scoring of kth kind face character and all second scorings.
The corresponding loss function l of decoder moduleReconstructAs shown in formula (6):
Wherein,Indicate L2Square of norm,Indicate two-way Ladder self-encoding encoder module according to people to be measured The facial image of face image x synthesis,The output of decoder module in as two-way Ladder self-encoding encoder module.Decoder The corresponding loss function l of moduleReconstructDecoder module can be made to retain face character information as much as possible, so that decoder mould The facial image to be measured that the facial image of block synthesis is corresponding as far as possible is approximate.
In conclusion the face character categorizing system provided by the invention based on two-way Ladder structure can be excavated sufficiently The local feature and global characteristics of facial image to be measured, using the corresponding relationship between local feature and local facial attribute, with And the corresponding relationship between global characteristics and Global Face attribute, promote the accuracy of face character classification.
The face character based on two-way Ladder structure point provided below with reference to another embodiment the present invention is described in detail Class system.
Refering to attached drawing 2, Fig. 2 illustrates a kind of face category based on two-way Ladder structure of another embodiment The primary structure of property categorizing system, as shown in Fig. 2, the face character categorizing system based on two-way Ladder structure includes two-way Ladder self-encoding encoder module, adaptive attention power module and the Fusion Module that adaptively scores.Two-way Ladder self-encoding encoder mould Block includes coder module and decoder module, and adaptive coder module includes 7 sequentially connected coding characteristic extract layers, Decoder module includes 7 sequentially connected decoding feature extraction layers.It is adaptive to notice that power module includes 4 attention submodules Block is to pay attention to power module respectively1, pay attention to power module2, pay attention to power module3, pay attention to power module4, attention submodule respectively with it is double Into Ladder self-encoding encoder module, the corresponding coding characteristic extract layer of same data characteristics layer and decoding feature extraction layer connect It connects, the adaptive Fusion Module that scores includes scoring fusion submodule and attributive classification device submodule, attributive classification device submodule packet Include the first attributive classification device (i.e. classifier5) and 4 the second attributive classification device (i.e. classifiers1, classifier2, classifier3And classification Device4).The input side of first attributive classification device and the outlet side of coder module connect, the outlet side of the first attributive classification device with Scoring fusion submodule connection.That is, classifier5Input side and coder module outlet side connect, classifier5Outlet side Submodule connection is merged with scoring.The input side of second attributive classification device is connect with attention submodule, i.e. classifier4Input Side and attention power module4Connection, classifier3Input side and pay attention to power module3Connection, classifier2Input side and attention mould Block2Connection, classifier1Input side and pay attention to power module1Connection, classifier1, classifier2, classifier3And classifier4Output Submodule connection is merged with scoring in side.Two-way Ladder self-encoding encoder module can be according to original facial image (people i.e. to be measured Face image) carry out the facial image that coding-decoding operation is synthesized.
With continued reference to attached drawing 2, the face character categorizing system based on two-way Ladder structure carries out original facial image The step of face character is classified is as follows:
S1, original facial image is inputted into two-way Ladder self-encoding encoder module, extracts coding characteristic respectively and decoding is special Sign.
Two-way Ladder self-encoding encoder module is encoded using coder module original facial image being mapped to feature space, And original facial image is restored by decoder module and obtains synthesis facial image.In the mistake of decoder module synthesis facial image Cheng Zhong, while extracting coding characteristic and decoding feature.As the number of plies of two-way Ladder self-encoding encoder is deepened, it is located at two-way The feature of different levels has different characteristics in Ladder self-encoding encoder.Specifically, in two-way Ladder self-encoding encoder module The feature extracted of shallow-layer there is locality, local face character prediction can be corresponded to;Have in the feature that deep layer is extracted complete Office's property can correspond to global face character prediction.Therefore, by extracting the part of different levels and the character representation of the overall situation, It may be implemented respectively to the classification with part and the face character of global property.
S2, the coding characteristic of two-way Ladder self-encoding encoder module output and decoding feature are sent into adaptive attention mould Block carries out Fusion Features.
It is adaptive to notice that power module is that the output feature of coder module obtained in step S1 and decoder module is same Shi Zuowei input carries out Fusion Features.The present embodiment is the data using four levels of two-way Ladder self-encoding encoder module Feature, and construct four attention submodules that structure is identical but parameter is different.Each attention submodule is with certainly The mode of adaptation learns to merge two kinds of features of coder module and decoder module output.To any one attention submodule I (i=1,2,3,4) is enabledIndicate the coding characteristic that the corresponding coding characteristic extract layer of i-th of attention submodule extracts,Indicate the decoding feature that the corresponding decoding feature extraction layer of i-th of attention submodule extracts, attention submodule is first Adaptively learn a mapping function.The mapping function can be followed by a Sigmoid letter by one 1 × 1 convolutional layer Number obtains.Therefore, corresponding attention mapping can be expressed asFinally, attention submodule i Output such as formula (1) shown in.
S3, it will adaptively notice that the fused feature of power module is sent into adaptive scoring Fusion Module, and carry out attributive classification Face character classification results are obtained with scoring fusion.
By the volume of four attention submodule fusion features exported and two-way Ladder self-encoding encoder in step S2 The coding characteristic of code device end of module output, is respectively fed to five different attributive classification device submodules, to realize two-way Five feature hierarchies while multiple face characters of classifying in Ladder self-encoding encoder.Specifically, each attributive classification device submodule is enabled The face character classification scoring of block j output is sj(j=1,2,3,4,5) is obtaining the basis of five attributive classification device submodules After scoring, the scoring of five attributive classification device submodules is summed to obtain another basic score ∑ s firstj, and by its with The scoring of five attributive classification device submodules is in parallel, common to be sent into scoring fusion submodule.Scoring fusion submodule with it is a kind of from The mode of adaptation learns a weight matrix Ws, each base categories appraisal result is weighted, final face is obtained Shown in attributive classification result such as formula (2).
Those skilled in the art should be able to recognize that, mould described in conjunction with the examples disclosed in the embodiments of the present disclosure Block, system, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate electronic hardware With the interchangeability of software, each exemplary composition and step are generally described according to function in the above description.This A little functions are executed actually with electronic hardware or software mode, the specific application and design constraint item depending on technical solution Part.Those skilled in the art can use different methods to achieve the described function each specific application, but this Kind is realized and be should not be considered as beyond the scope of the present invention.
Term " first ", " second " etc. are to be used to distinguish similar objects, rather than be used to describe or indicate specific suitable Sequence or precedence.
Term " includes " or any other like term are intended to cover non-exclusive inclusion, so that including a system Process, method, article or equipment/device of column element not only includes those elements, but also including being not explicitly listed Other elements, or further include the intrinsic element of these process, method, article or equipment/devices.
So far, it has been combined preferred embodiment shown in the drawings and describes technical solution of the present invention, still, this field Technical staff is it is easily understood that protection scope of the present invention is expressly not limited to these specific embodiments.Without departing from this Under the premise of the principle of invention, those skilled in the art can make equivalent change or replacement to the relevant technologies feature, these Technical solution after change or replacement will fall within the scope of protection of the present invention.

Claims (8)

1. a kind of face character categorizing system based on two-way Ladder structure, which is characterized in that face character classification system System includes two-way Ladder self-encoding encoder module, adaptive attention power module and the Fusion Module that adaptively scores;
The two-way Ladder self-encoding encoder module includes coder module and decoder module, and the coder module includes more A sequentially connected coding characteristic extract layer, the decoder module include multiple sequentially connected decoding feature extraction layers, institute The outlet side for stating coder module is connect with the input side of the decoder module;Wherein, the coding characteristic extract layer reconciliation The quantity of code feature extraction layer is equal;
The adaptive attention power module includes multiple attention submodules, the attention submodule respectively with it is described two-way The corresponding coding characteristic extract layer of same data characteristics layer is connect with decoding feature extraction layer in Ladder self-encoding encoder module; The attention submodule is configured to the coding characteristic extracted according to the coding characteristic extract layer and the decoding feature extraction The decoding feature that layer extracts carries out Fusion Features;
The adaptive scoring Fusion Module is configured to output result and each attention according to the coder module The Fusion Features result of submodule output obtains the face character classification results of the facial image to be measured.
2. the face character categorizing system according to claim 1 based on two-way Ladder structure, which is characterized in that described Adaptive scoring Fusion Module includes scoring fusion submodule and attributive classification device submodule, the attributive classification device submodule packet Include the first attributive classification device and multiple second attributive classification devices, the quantity of the second attributive classification device and the attention submodule The quantity of block is equal;
The input side of the first attributive classification device is connect with the outlet side of the coder module, the first attributive classification device Outlet side with it is described scoring merge submodule connect;The first attributive classification device is configured to defeated according to the coder module Coding characteristic out obtains corresponding first scoring of preset multiple face characters;
The input side of the multiple second attributive classification device is connect with the multiple attention submodule respectively, and the multiple second The outlet side of attributive classification device merges submodule connection with the scoring;The second attributive classification device is configured to according to The fusion feature of attention submodule output obtains corresponding second scoring of the multiple face character;
Scoring fusion submodule be configured to score to first scoring with described second be weighted merge and according to Weighted Fusion result obtains the face character classification results of the facial image to be measured.
3. the face character categorizing system according to claim 2 based on two-way Ladder structure, which is characterized in that described Attention submodule be further configured to according to the following formula shown in the coding that is extracted according to the coding characteristic extract layer of method it is special The decoding feature that sign is extracted with the decoding feature extraction layer carries out Fusion Features:
Wherein, the OutiIndicate the fusion feature of i-th of attention submodule output, the A(i)Indicate i-th of attention The corresponding mapping function of power submodule andIt is describedDescribed in expressionWith it is describedMatrix parallel operation, it is describedIndicate what the corresponding coding characteristic extract layer of i-th of attention submodule extracted Coding characteristic, it is describedIndicate the decoding feature that the corresponding decoding feature extraction layer of i-th of attention submodule extracts.
4. the face character categorizing system according to claim 3 based on two-way Ladder structure, which is characterized in that described Scoring fusion submodule is further configured to execute following operation:
Method shown according to the following formula is weighted first scoring with second scoring and merges and melted according to weighting Close the face character classification results that result obtains the facial image to be measured:
Wherein, the P indicate Weighted Fusion result andAppoint It anticipates one groupIt is all satisfiedThe M indicates preset face character number, describedIndicate it is described to The probability that facial image belongs to default kth kind face character is surveyed, it is describedIndicate that the facial image to be measured is not belonging to default kth The probability of kind face character, the WsIndicate preset weight matrix, the sjIndicate j-th of particular community classifier output Face character classification scoring, the particular community classifier is the first attributive classification device or the second attributive classification device, as the spy Determining face character classification scoring when attributive classification device is the first attributive classification device is the first scoring, when the particular community point Class device when being the second attributive classification device face character classification scoring be the second scoring, the ∑ sjIndicate all particular communities The sum of face character classification scoring of classifier output, the " (sj,∑sj) " indicate the sjWith the ∑ sjMatrix it is in parallel Operation, it is describedRepresenting matrix multiplication operations;
Described in selectionWithClassification results of the corresponding face character of middle the larger value as kth kind face character.
5. the face character categorizing system according to claim 4 based on two-way Ladder structure, which is characterized in that described Two-way Ladder self-encoding encoder module, adaptive attention power module and adaptive scoring Fusion Module are all based on neural network structure The module built, the system also includes training module, the training module be configured to according to the following formula shown in loss function to institute It states two-way Ladder self-encoding encoder module, adaptive attention power module and adaptive scoring Fusion Module while being trained:
Wherein, describedIndicate that the corresponding loss function of the j particular community classifier, the particular community classifier are First attributive classification device or the second attributive classification device, the n indicate the quantity of the particular community classifier, the lFusionIt indicates The corresponding loss function of submodule, the l are merged in the scoringReconstructIndicate the corresponding loss function of the decoder module, it is described α, β respectively indicate preset hyper parameter.
6. the face character categorizing system according to claim 5 based on two-way Ladder structure, which is characterized in that described The corresponding loss function of j-th of particular community classifierIt is shown below:
Wherein, the M indicates preset face character number, pkIndicate the kth kind of j-th of particular community classifier output The corresponding face character classification scoring of face character, the ykIndicate true kth kind face character label.
7. the face character categorizing system according to claim 6 based on two-way Ladder structure, which is characterized in that described The corresponding loss function l of scoring fusion submoduleFusionIt is shown below:
Wherein, the tkIndicate it is corresponding to kth kind face character it is described first scoring, it is all second scoring and scoring summation into The scoring vector that row parallel operation obtains, the scoring summation are corresponding first scorings of kth kind face character and own The sum of second scoring.
8. the face character categorizing system according to claim 7 based on two-way Ladder structure, which is characterized in that described The corresponding loss function l of decoder moduleReconstructIt is shown below:
Wherein, describedIndicate L2Square of norm, it is describedIndicate the two-way Ladder self-encoding encoder module according to The facial image of facial image x synthesis to be measured.
CN201811595193.4A 2018-12-25 2018-12-25 Face attribute classification system based on bidirectional Ladder structure Active CN109886072B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811595193.4A CN109886072B (en) 2018-12-25 2018-12-25 Face attribute classification system based on bidirectional Ladder structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811595193.4A CN109886072B (en) 2018-12-25 2018-12-25 Face attribute classification system based on bidirectional Ladder structure

Publications (2)

Publication Number Publication Date
CN109886072A true CN109886072A (en) 2019-06-14
CN109886072B CN109886072B (en) 2021-02-26

Family

ID=66925169

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811595193.4A Active CN109886072B (en) 2018-12-25 2018-12-25 Face attribute classification system based on bidirectional Ladder structure

Country Status (1)

Country Link
CN (1) CN109886072B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110232672A (en) * 2019-06-20 2019-09-13 合肥工业大学 A kind of denoising method and system of exercise data
CN110826056A (en) * 2019-11-11 2020-02-21 南京工业大学 Recommendation system attack detection method based on attention convolution self-encoder
CN111104874A (en) * 2019-12-03 2020-05-05 北京金山云网络技术有限公司 Face age prediction method, training method and device of model and electronic equipment
CN111209916A (en) * 2019-12-31 2020-05-29 中国科学技术大学 Focus identification method and system and identification equipment
CN111652158A (en) * 2020-06-04 2020-09-11 浙江大华技术股份有限公司 Target object detection method and device, storage medium and electronic device
CN112434702A (en) * 2019-08-26 2021-03-02 阿里巴巴集团控股有限公司 Image processing method, image processing device, computer equipment and storage medium
WO2021057426A1 (en) * 2019-09-29 2021-04-01 腾讯科技(深圳)有限公司 Method and apparatus for training image fusion processing model, device, and storage medium
CN116168040A (en) * 2023-04-26 2023-05-26 四川元智谷科技有限公司 Component direction detection method and device, electronic equipment and readable storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104143079A (en) * 2013-05-10 2014-11-12 腾讯科技(深圳)有限公司 Method and system for face attribute recognition
CN105404877A (en) * 2015-12-08 2016-03-16 商汤集团有限公司 Human face attribute prediction method and apparatus based on deep study and multi-task study
US20160328606A1 (en) * 2013-11-15 2016-11-10 Facebook, Inc. Pose-aligned networks for deep attribute modeling
CN106682628A (en) * 2016-12-30 2017-05-17 佳都新太科技股份有限公司 Face attribute classification method based on multilayer depth feature information
CN106951825A (en) * 2017-02-13 2017-07-14 北京飞搜科技有限公司 A kind of quality of human face image assessment system and implementation method
CN107729909A (en) * 2016-08-11 2018-02-23 中国移动通信集团湖南有限公司 A kind of application process and device of attributive classification device
CN108229267A (en) * 2016-12-29 2018-06-29 北京市商汤科技开发有限公司 Object properties detection, neural metwork training, method for detecting area and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104143079A (en) * 2013-05-10 2014-11-12 腾讯科技(深圳)有限公司 Method and system for face attribute recognition
US20160328606A1 (en) * 2013-11-15 2016-11-10 Facebook, Inc. Pose-aligned networks for deep attribute modeling
CN105404877A (en) * 2015-12-08 2016-03-16 商汤集团有限公司 Human face attribute prediction method and apparatus based on deep study and multi-task study
CN107729909A (en) * 2016-08-11 2018-02-23 中国移动通信集团湖南有限公司 A kind of application process and device of attributive classification device
CN108229267A (en) * 2016-12-29 2018-06-29 北京市商汤科技开发有限公司 Object properties detection, neural metwork training, method for detecting area and device
CN106682628A (en) * 2016-12-30 2017-05-17 佳都新太科技股份有限公司 Face attribute classification method based on multilayer depth feature information
CN106951825A (en) * 2017-02-13 2017-07-14 北京飞搜科技有限公司 A kind of quality of human face image assessment system and implementation method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
XIN ZHEN ET AL.: "A Survey of Deep Facial Attribute Analysis", 《ARXIV》 *
邵蔚元 等: "多任务学习及卷积神经网络在人脸识别中的应用", 《计算机工程与应用》 *
黄怀波: "基于生成模型的人脸图像合成与分析", 《中国博士学位论文全文数据库》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110232672A (en) * 2019-06-20 2019-09-13 合肥工业大学 A kind of denoising method and system of exercise data
CN112434702A (en) * 2019-08-26 2021-03-02 阿里巴巴集团控股有限公司 Image processing method, image processing device, computer equipment and storage medium
WO2021057426A1 (en) * 2019-09-29 2021-04-01 腾讯科技(深圳)有限公司 Method and apparatus for training image fusion processing model, device, and storage medium
US11526712B2 (en) 2019-09-29 2022-12-13 Tencent Technology (Shenzhen) Company Limited Training method and apparatus for image fusion processing model, device, and storage medium
CN110826056A (en) * 2019-11-11 2020-02-21 南京工业大学 Recommendation system attack detection method based on attention convolution self-encoder
CN110826056B (en) * 2019-11-11 2024-01-30 南京工业大学 Recommended system attack detection method based on attention convolution self-encoder
CN111104874A (en) * 2019-12-03 2020-05-05 北京金山云网络技术有限公司 Face age prediction method, training method and device of model and electronic equipment
CN111104874B (en) * 2019-12-03 2024-02-20 北京金山云网络技术有限公司 Face age prediction method, training method and training device for model, and electronic equipment
CN111209916A (en) * 2019-12-31 2020-05-29 中国科学技术大学 Focus identification method and system and identification equipment
CN111209916B (en) * 2019-12-31 2024-01-23 中国科学技术大学 Focus identification method and system and identification equipment
CN111652158A (en) * 2020-06-04 2020-09-11 浙江大华技术股份有限公司 Target object detection method and device, storage medium and electronic device
CN116168040A (en) * 2023-04-26 2023-05-26 四川元智谷科技有限公司 Component direction detection method and device, electronic equipment and readable storage medium

Also Published As

Publication number Publication date
CN109886072B (en) 2021-02-26

Similar Documents

Publication Publication Date Title
CN109886072A (en) Face character categorizing system based on two-way Ladder structure
CN110298037B (en) Convolutional neural network matching text recognition method based on enhanced attention mechanism
CN107330109B (en) A kind of trade mark inquiry result degree of approximation evaluation and sort method, device
CN106021364B (en) Foundation, image searching method and the device of picture searching dependency prediction model
CN110442684A (en) A kind of class case recommended method based on content of text
CN108920544A (en) A kind of personalized position recommended method of knowledge based map
CN109460737A (en) A kind of multi-modal speech-emotion recognition method based on enhanced residual error neural network
CN109902293A (en) A kind of file classification method based on part with global mutually attention mechanism
CN110134771A (en) A kind of implementation method based on more attention mechanism converged network question answering systems
CN109523463A (en) A kind of face aging method generating confrontation network based on condition
CN108280233A (en) A kind of VideoGIS data retrieval method based on deep learning
CN108920720A (en) The large-scale image search method accelerated based on depth Hash and GPU
CN108804677A (en) In conjunction with the deep learning question classification method and system of multi-layer attention mechanism
CN105718960A (en) Image ordering model based on convolutional neural network and spatial pyramid matching
CN110209789A (en) A kind of multi-modal dialog system and method for user's attention guidance
CN111159367B (en) Information processing method and related equipment
CN111127146A (en) Information recommendation method and system based on convolutional neural network and noise reduction self-encoder
CN109086405A (en) Remote sensing image retrieval method and system based on conspicuousness and convolutional neural networks
CN107491729B (en) Handwritten digit recognition method based on cosine similarity activated convolutional neural network
CN110175248A (en) A kind of Research on face image retrieval and device encoded based on deep learning and Hash
CN111444367A (en) Image title generation method based on global and local attention mechanism
CN110516145B (en) Information searching method based on sentence vector coding
CN109710768A (en) A kind of taxpayer's industry two rank classification method based on MIMO recurrent neural network
CN108256968A (en) A kind of electric business platform commodity comment of experts generation method
CN114693397A (en) Multi-view multi-modal commodity recommendation method based on attention neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant