CN109934197A

CN109934197A - Training method, device and the computer readable storage medium of human face recognition model

Info

Publication number: CN109934197A
Application number: CN201910219576.XA
Authority: CN
Inventors: 邵新庆; 宋咏君; 刘强
Original assignee: Shenzhen Liwei Zhilian Technology Co Ltd
Current assignee: Shenzhen Liwei Zhilian Technology Co Ltd
Priority date: 2019-03-21
Filing date: 2019-03-21
Publication date: 2019-06-25
Anticipated expiration: 2039-03-21
Also published as: CN109934197B

Abstract

The invention discloses a kind of training methods of human face recognition model, the training method of the human face recognition model is the following steps are included: carry out feature extraction to each target facial image using each target network-layer in depth convolutional neural networks, to obtain multiple global characteristics；The global characteristics are split according to the global characteristics corresponding target network-layer to obtain each local feature, and the corresponding each global characteristics of the target facial image are merged to obtain fusion feature；Loss supervised learning is carried out to the global characteristics, the local feature and fusion feature, to be trained to human face recognition model.Invention additionally discloses a kind of training device of human face recognition model and computer readable storage mediums.The present invention improves the precision of recognition of face.

Description

Training method, device and the computer readable storage medium of human face recognition model

Technical field

The present invention relates to technical field of face recognition more particularly to a kind of training method of human face recognition model, device and Computer readable storage medium.

Background technique

By means of power in the maturation and the high release for calculating power equipment of depth convolutional neural networks technology, face recognition algorithms are either The huge revolution all occurred in progress and recognition speed.Face recognition application also becomes most mature widest in AI technology It represents.Either smart city construction or field of video monitoring, recognition of face become most basic most crucial technological means.

Face recognition technology is to be not required to very important person as a kind of maximum advantage of biological identification technology of contactless unaware Forced mating can capture the identity information of people.Face identification system is can take the front face of cleaning as base Plinth, and under actual scene condition, when illumination, shooting angle, mask of wearing glasses, cause the precision of recognition of face compared with It is low.

Summary of the invention

The main purpose of the present invention is to provide a kind of training method of human face recognition model, device and computer-readable deposit Storage media, it is intended to solve the problems, such as that the precision of recognition of face is lower.

To achieve the above object, the training method of a kind of human face recognition model provided by the invention, the recognition of face mould The training method of type the following steps are included:

Feature extraction is carried out to each target facial image using each target network-layer in depth convolutional neural networks, with Obtain multiple global characteristics；

The global characteristics are split according to the global characteristics corresponding target network-layer to obtain each part spy Sign, and the corresponding each global characteristics of the target facial image are merged to obtain fusion feature；

Loss supervised learning is carried out to the global characteristics, the local feature and fusion feature, to recognition of face Model is trained.

In one embodiment, described that loss supervision is carried out to the global characteristics, the local feature and fusion feature The step of study includes:

Determine the global characteristics, the local feature and the corresponding target loss function of the fusion feature and institute State the corresponding weighted value of target loss function；

According to the target loss function and the corresponding weighted value of the target loss function, to the objective function pair The feature answered carries out loss supervised learning.

In one embodiment, the determination global characteristics, the local feature and the fusion feature are corresponding The step of target loss function and the target loss function corresponding weighted value includes:

Normalization figure penalties function is determined as the corresponding first object loss function of the local feature, and will be described It normalizes figure penalties function and triple loss function is determined as the global characteristics and the fusion feature is corresponding Second target loss function；

The first weighted value and the corresponding second target damage of the fusion feature are configured for the first object loss function It loses normalization exponential function in function and configures the second weighted value, and according to the corresponding target network-layer of the global characteristics to described Exponential function is normalized in the corresponding second target loss function of global characteristics configures third weighted value.

In one embodiment, using each target network-layer in depth convolutional neural networks to each target facial image into Row feature extraction, to obtain multiple global characteristics the step of include:

Multiple target network-layers are determined in each network layer of the depth convolutional neural networks, wherein the target Distance is less than other network layers except the target network-layer between network layer and the output layer of the depth convolutional neural networks The distance between described output layer；

Target facial image is inputted into each target network-layer, it is corresponding complete to obtain each target network-layer Office's characteristic pattern；

Pond is carried out to each global characteristics figure, obtains multiple global characteristics.

In one embodiment, described the step of carrying out loss supervised learning to the fusion feature, includes:

Reduce the dimension of the fusion feature；

Loss supervised learning is carried out to the fusion feature for reducing dimension.

In one embodiment, it is described using each target network-layer in depth convolutional neural networks to each target face figure Before the step of carrying out feature extraction, further includes:

Each test facial image is obtained, and each test facial image is pre-processed, to obtain each target person Face image, wherein the pretreatment mass image normalization processes pixel and image data enhancing processing.

In one embodiment, described that loss prison is being carried out to the global characteristics, the local feature and fusion feature After the step of educational inspector practises, further includes:

The corresponding loss function of the depth convolutional neural networks is obtained in real time；

When the corresponding loss function of the depth convolutional neural networks converges to set interval, the recognition of face is completed The training of model, and save the human face recognition model for completing training.

In one embodiment, after described the step of saving the human face recognition model for completing training, further includes:

The fusion feature for the human face recognition model output for completing training is prestored as the facial image is corresponding Fusion feature, and fusion feature is prestored described in preservation.

To achieve the above object, the present invention also provides a kind of training device of human face recognition model, the recognition of face moulds The training device of type includes processor, memory and is stored in the face that can be run on the memory and on the processor The training program of identification model is realized as described above when the training program of the human face recognition model is executed by the processor Each step of the training method of human face recognition model.

To achieve the above object, the present invention also provides a kind of computer readable storage medium, the computer-readable storages Media storage has the training program of human face recognition model, when the training program of the human face recognition model is executed by the processor Realize each step of the training method of human face recognition model as described above.

Training method, device and the computer readable storage medium of human face recognition model provided by the invention, recognition of face The training device of model carries out feature to each target facial image using each target network-layer in depth convolutional neural networks Extraction obtains multiple global characteristics, is split further according to the corresponding target network-layer of global characteristics to the overall situation, to obtain more A local feature, and then multiple local features of a corresponding facial image and multiple global characteristics are merged and melted Feature is closed, loss supervised learning finally is carried out to carry out recognition of face to each global characteristics, local feature and fusion feature The training of model；Due to human face recognition model training device by heterogeneous networks layer export global characteristics, fusion feature with And the local feature of global characteristics segmentation carries out loss supervised learning, so that device has excavated the subtle spy in facial image Sign, improves the precision of recognition of face, further, since global characteristics are divided into multiple offices by the training device of human face recognition model Portion's feature carries out loss supervised learning, without manually blocking the occlusion area of face or to the characteristic processing of face blocked, Simplify the training process of facial image.

Detailed description of the invention

Fig. 1 be the present embodiments relate to human face recognition model training device hardware structural diagram；

Fig. 2 is the flow diagram of the training method first embodiment of human face recognition model of the present invention；

Fig. 3 is the refinement flow diagram of step S10 in Fig. 2；

Fig. 4 is the refinement flow diagram of step S30 in Fig. 2；

Fig. 5 is the flow diagram of the training method second embodiment of human face recognition model of the present invention.

The embodiments will be further described with reference to the accompanying drawings for the realization, the function and the advantages of the object of the present invention.

Specific embodiment

It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not intended to limit the present invention.

The primary solutions of the embodiment of the present invention are: using each target network-layer in depth convolutional neural networks to each A target facial image carries out feature extraction, to obtain multiple global characteristics；According to the corresponding target network of the global characteristics Layer is split the global characteristics to obtain each local feature, and the target facial image is corresponding each described complete Office's feature is merged to obtain fusion feature；Loss prison is carried out to the global characteristics, the local feature and fusion feature Educational inspector practises, to be trained to human face recognition model.

Since the training device of human face recognition model is by exporting global characteristics, fusion feature and complete to heterogeneous networks layer The local feature of office's Image Segmentation Methods Based on Features carries out loss supervised learning, so that device has excavated the fine feature in facial image, mentions The high precision of recognition of face, further, since that global characteristics are divided into multiple parts is special for the training device of human face recognition model Sign carries out loss supervised learning, without manually blocking the occlusion area of face or to the characteristic processing of face blocked, simplifies The training process of facial image.

As a kind of implementation, the training device of human face recognition model can be as shown in Figure 1.

What is involved is the training device of human face recognition model, the training devices of human face recognition model for the embodiment of the present invention It include: processor 101, such as CPU, memory 102, communication bus 103.Wherein, communication bus 103 is for realizing these components Between connection communication.In addition, the training device of human face recognition model includes CPU, it is responsible for executing the training of human face recognition model Program and the storage for carrying out reading data and model；In addition, the training device of human face recognition model includes GPU (graphics process Device), GPU is used for the training of model；After the training program that CPU calls the human face recognition model stored in memory, CPU is from originally A collection of training image file decoding is read in local disk and copies GPU video memory at training data format, and in GPU before progress To calculating and backward reasoning, and model parameter is updated, reads next batch data, constantly repeatedly.

Memory 102 can be high speed RAM memory, be also possible to stable memory (non- ), such as magnetic disk storage volatilememory.As shown in Figure 1, as in a kind of memory 102 of computer storage medium It may include the training program of human face recognition model；And processor 101 can be used for that the face stored in memory 102 is called to know The training program of other model, and execute following operation:

In one embodiment, processor 101 can be used for calling the training of the human face recognition model stored in memory 102 Program, and execute following operation:

Reduce the dimension of the fusion feature；

The present embodiment is according to above scheme, and the training device of human face recognition model is using each in depth convolutional neural networks Target network-layer carries out feature extraction to each target facial image and obtains multiple global characteristics, corresponding further according to global characteristics Target network-layer is split the overall situation, thus the multiple offices for obtaining multiple local features, and then corresponding to a facial image Portion's feature and multiple global characteristics are merged to obtain fusion feature, finally to each global characteristics, local feature and are melted It closes feature and carries out loss supervised learning to carry out the training of human face recognition model；Since the training device of human face recognition model passes through Loss supervised learning is carried out to heterogeneous networks layer output global characteristics, fusion feature and the local feature of global characteristics segmentation, So that device has excavated the fine feature in facial image, the precision of recognition of face is improved, further, since recognition of face mould Global characteristics are divided into multiple local features and carry out loss supervised learning by the training device of type, without manually blocking face Occlusion area or the characteristic processing blocked to face, simplify the training process of facial image.

The hardware architecture of training device based on above-mentioned human face recognition model proposes the training of human face recognition model of the present invention The embodiment of method.

It is the first embodiment of the training method of human face recognition model of the present invention, the recognition of face mould referring to Fig. 2, Fig. 2 The training method of type the following steps are included:

Step S10 carries out each target facial image using each target network-layer in depth convolutional neural networks special Sign is extracted, to obtain multiple global characteristics；

In the present invention, executing subject is the training device of human face recognition model, the training device packet of human face recognition model CPU, memory and GPU have been included, memory is stored with the training program of human face recognition model, in addition, human face recognition model Training device is additionally provided with the biggish memory of amount of storage (for example, local disk), which is stored with a large amount of facial image, For example, memory can store the 30G even facial image for training of larger capacity；CPU is responsible for executing recognition of face mould The training program of type, and GPU is then the training for model.

When being trained to model, it is multiple to carry out that the face image data in local disk is divided into multiple batches Model training, the process per a batch of model training is consistent, specifically, CPU calls the recognition of face mould stored in memory The training program of type, CPU read a certain amount of face image data for training from local disk again, and CPU is again to face Image data decoding is at training data format, then is copied in GPU video memory, and in GPU carry out forward calculation and after to Reasoning, and update and preservation model.It is understood that the face that the training device of human face recognition model passes through different batches Image constantly model is trained repeatedly.

The training device of human face recognition model be equipped with optimizer, device by depth convolutional neural networks to facial image into Row processing, specifically, referring to figure 3., i.e. step S10 includes:

Step S11 determines multiple target network-layers in each network layer of the depth convolutional neural networks, wherein Distance is less than its for removing the target network-layer between the target network-layer and the output layer of the depth convolutional neural networks The distance between its network layer and the output layer；

Target facial image is inputted each target network-layer, to obtain each target network-layer by step S12 Corresponding global characteristics figure；

Step S13 carries out pond to each global characteristics figure, obtains multiple global characteristics.

Depth convolutional neural networks contain multiple network layers, each network layer can carry out feature extraction to facial image, But the feature that the network layer closer with inputting in depth convolutional neural networks is extracted is more shallow, and therefore, in the present invention, depth is rolled up Closer several network layers are exported in product neural network as target network-layer namely device only needs to extract facial image each The feature of target network-layer, it is to be understood that (output layer is deep to output layer in target network-layer and depth convolutional neural networks The network layer that is output to the outside of degree convolutional neural networks) the distance between be less than other network layers and output except target network-layer ResNet50 can be used as backbone network in the distance between layer, depth convolutional neural networks, ResNet50 can main decomposition at 5 Network layer, according to network layer and output layer distance sequence from big to small be followed successively by layer0, layer1, layer2, layer3, Layer4, wherein can be using layer2, layer3, layer4 as target network-layer, and layer0 can be to the facial image of input The operation such as convolution, normalization, pond is carried out, and layer1 can be used as the extractor of the characteristic pattern of facial image, need to illustrate Be layer1 to layer4 be four stage, separately include 3,4,6,3 residual error structures, in addition, ResNet50 layer4 it After further include pond layer and full articulamentum, it is to be understood that layer2, layer3, layer4 can be considered as three levels Different network structures, and then extractable facial image, in the corresponding feature of these three target network-layers, this category feature is the overall situation Feature, the face information of different global characteristics characterization different levels.It should be noted that inputting target network when facial image After network layers, target network-layer exports global characteristics figure, and device carries out pond to global characteristics figure again, and then obtains global characteristics, The size of the characteristic pattern exported from layer2, layer3, layer4 is respectively 512*28*28,1024*14*14 and 2048*7* 7.The dimension of three global characteristics is respectively 512,1024 and 2048.

It should be noted that needing to pre-process facial image, specifically, obtaining before to facial image training The data set in model training is taken, multiple test facial images are contained in data set, facial image can be from face image data Downloading obtains in source；Will in data set test facial image be trained when, processes pixel is normalized to facial image, Facial image pixel is normalized -1 to 1, so that model training process is easy convergence；Normalization pixel is completed in facial image After processing, image data enhancing is carried out to facial image using operations such as Random Level overturning, random interception and colour dithers, from And expand the diversity of training data, and prevent over-fitting；Processes pixel and image is normalized in test facial image Data enhancing processing after to get arrive target facial image；It is understood that pretreatment mass normalization processes pixel and figure As data enhancing is handled.

Step S20 is split to obtain each according to the corresponding target network-layer of the global characteristics to the global characteristics A local feature, and the corresponding each global characteristics of the target facial image are merged to obtain fusion feature；

After extracting global characteristics, global characteristics can be split to obtain local feature, specifically, passing through manually Mode is repeatedly split the global characteristics figure of same target network-layer output, so that partitioning scheme the most reasonable is obtained, Then the partitioning scheme and target network-layer are associated with and are stored into device, therefore extracting the complete of target network-layer when device After office's characteristic pattern, partitioning scheme is obtained according to the binding information of the target network-layer, and then according to partitioning scheme to global special Sign figure is split, and obtains multiple characteristic blocks, these characteristic blocks are by Chi Huahou, as local feature, it should be noted that feature The size of block is smaller, and the granularity of face characteristic corresponding to characteristic block is smaller, the description of face more horn of plenty, easier to represent The fine feature of face, so that device can be easy to excavate the fine feature of face.For example, layer2 can be exported Characteristic pattern carries out the multi-scale segmentation of 2*2, the local feature block of available 2 28*14,2 14*28 and 4 14*14, simultaneously The multi-scale segmentation that 2*2 is carried out to layer3 and layer4, respectively obtain 2 14*7,2 7*14 and 4 7*7 characteristic block and The characteristic block of 2 7*4,2 4*7 and 4 4*4；8 local feature dimensions that layer2 is obtained all are 8 of 512, layer3 Local feature dimension is all that 8 local feature dimensions of 1024, layer4 are all 2048.

In addition, each global characteristics are merged after obtaining multiple global characteristics of same facial image, thus To fusion feature, the dimension of fusion feature is 512+1024+2048=3588.

Step S30 carries out loss supervised learning to the global characteristics, the local feature and fusion feature, with right Human face recognition model is trained.

After obtaining the corresponding local feature of facial image, global characteristics and fusion feature, by optimizer to part Feature, global characteristics and fusion feature carry out loss supervised learning, specifically, referring to Fig. 4, i.e. step S30 includes:

Step S31 determines the global characteristics, the local feature and the corresponding target loss letter of the fusion feature The corresponding weighted value of several and described target loss function；

Step S32, according to the target loss function and the corresponding weighted value of the target loss function, to the mesh The corresponding feature of scalar functions carries out loss supervised learning.

In the present embodiment, first object loss function is configured to each local feature, first object loss function is to return One changes figure penalties function namely Softmax loss, and configures the second target damage to each global characteristics and fusion feature Function is lost, the second target loss function includes Softmax loss and Triplet loss (triple loss function), can be with Understand, local feature, fusion feature and global characteristics are provided with Softmax loss, it is therefore desirable to each mesh be arranged It marks and normalizes figure penalties function respective weights value in loss function, and figure penalties function is normalized in global characteristics needs basis The target network-layer of output global characteristics is configured, for example, can set the Softmax loss weight of local feature to 0.4, the weight of the corresponding Softmax loss of the local feature of different levels is set as 0.6,0.8 and 1, and fusion feature pair The weight of the Softmax loss answered is then set as 1.2.In the present embodiment, optimizer can be Adam, will when being trained The learning rate of optimizer is set as 0.02, epoch and is set as 60, so by the setting parameter of optimizer, loss function and The corresponding weighted value of loss function carries out loss supervised learning to the corresponding feature of the loss function.

The Softmax loss study of feature is to carry out at Softmax layers, and Triplet loss study is in Triplet Layer carries out, but each Softmax layers first inputs full articulamentum equipped with corresponding full articulamentum namely feature, then inputs Softmax Layer.

It should be noted that in the training process, the corresponding loss function of depth convolutional neural networks need to be obtained in real time, When loss function loss converges to pre-set interval (pre-set interval can be any appropriate range, such as about 0.6), can assert people The training of face identification model finishes, certainly, can also further progress training, at this point, SGD optimizer is updated, and simultaneously by learning rate It is set as 0.005, meanwhile, the other parameters of optimizer are constant, in people after all training of each target facial image, obtained Face identification model is optimal human face recognition model, at this point, the fusion feature of human face recognition model output is corresponding for facial image Verifying fusion feature, can will verify fusion feature as identify face template to be stored in device or database, this Outside, since the dimensional comparison of fusion feature is big, dimensionality reduction can be carried out to fusion feature, for example, fusion feature is reduced to 512 dimensions.

When human face recognition model carries out subsequent recognition of face, the facial image of acquisition is subjected to target network-layer Global characteristics extract, and then each global characteristics are merged, and fusion feature are obtained, finally by the fusion feature and database In each fusion feature that prestores be compared, and then identify face.

In the present invention, global characteristics are extracted by the heterogeneous networks layer to depth convolutional neural networks, then passes through the overall situation Image Segmentation Methods Based on Features obtains local feature, and then to local feature, global characteristics and the fusion feature merged by each global characteristics The supervised training that loss function carries out parameter is set, promotes the local feature finally obtained and global characteristics to final fusion Feature is all effective, and then the feature for the face that can preferably indicate.

In technical solution provided in this embodiment, the training device of human face recognition model uses depth convolutional neural networks In each target network-layer to each target facial image carry out feature extraction obtain multiple global characteristics, further according to global characteristics Corresponding target network-layer is split the overall situation, to obtain multiple local features, and then will correspond to a facial image Multiple local features and multiple global characteristics are merged to obtain fusion feature, finally to each global characteristics, local feature And fusion feature carries out loss supervised learning to carry out the training of human face recognition model；Due to the training cartridge of human face recognition model It sets and loss supervision is carried out by the local feature for exporting global characteristics, fusion feature and global characteristics segmentation to heterogeneous networks layer Study, so that device has excavated the fine feature in facial image, improves the precision of recognition of face, further, since face Global characteristics are divided into multiple local features and carry out loss supervised learning by the training device of identification model, without manually blocking The occlusion area of face or the characteristic processing blocked to face, simplify the training process of facial image.

It is the second embodiment of the training method of human face recognition model of the present invention referring to Fig. 5, Fig. 5, is based on first embodiment, The step of carrying out loss supervised learning to the fusion feature in the step S30 include:

Step S33 reduces the dimension of the fusion feature；

Step S34 carries out loss supervised learning to the fusion feature for reducing dimension.

In the present embodiment, due to fusion feature be merge to obtain by the corresponding multiple global characteristics of facial image so that The dimensional comparison of fusion feature is big, and in one embodiment, the dimension of fusion feature is 3588, this meeting is so that the fusion finally exported Feature also can be relatively high, and the fusion feature finally exported can make people as the feature templates of facial image in database The speed of face identification increases.In this regard, in the present embodiment, after merging to obtain fusion feature by multiple global characteristics, using volume Fusion feature is carried out channel reduction by long-pending mode, and fusion feature may make to reduce dimension and can be used in the present embodiment in this way Fusion feature is reduced to 512 dimensions by the convolution of 1*1.

Convolution is mainly main herein, and there are two the effects of aspect:

1, it realizes the information selection across channel, multiple features is subjected to information integration；

2, dimensionality reduction is carried out to the port number of feature, improves nonlinear characteristic, the high dimensional feature finally merged passes through study One convolution nuclear parameter realizes de-redundancy, improves recognition time, reduces the storage of feature.

In the technical solution that embodiment provides, device carries out the reduction of dimension by way of convolution to fusion feature, So that reducing the identification duration of recognition of face when facial image is verified, and then improve the rate of recognition of face, improving User experience.

To achieve the above object, the present invention also provides a kind of training device of human face recognition model, the recognition of face moulds The training device of type includes processor, memory and is stored in the face that can be run on the memory and on the processor The training program of identification model, the training program of the human face recognition model realize embodiment as above when being executed by the processor Each step of the training method of the human face recognition model.

The present invention also provides a kind of computer readable storage medium, the computer-readable recording medium storage has face knowledge The training program of other model, the training program of the human face recognition model realize embodiment institute as above when being executed by the processor Each step of the training method for the human face recognition model stated.

The serial number of the above embodiments of the invention is only for description, does not represent the advantages or disadvantages of the embodiments.

It should be noted that, in this document, the terms "include", "comprise" or its any other variant are intended to non-row His property includes, so that the process, method, article or the device that include a series of elements not only include those elements, and And further include other elements that are not explicitly listed, or further include for this process, method, article or device institute it is intrinsic Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including being somebody's turn to do There is also other identical elements in the process, method of element, article or device.

Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can be realized by means of software and necessary general hardware platform, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on this understanding, technical solution of the present invention substantially in other words does the prior art The part contributed out can be embodied in the form of software products, which is stored in one as described above In storage medium (such as ROM/RAM, magnetic disk, CD), including some instructions are used so that terminal device (it can be mobile phone, Computer, server, air conditioner or network equipment etc.) execute method described in each embodiment of the present invention.

The above is only a preferred embodiment of the present invention, is not intended to limit the scope of the invention, all to utilize this hair Equivalent structure or equivalent flow shift made by bright specification and accompanying drawing content is applied directly or indirectly in other relevant skills Art field, is included within the scope of the present invention.

Claims

1. a kind of training method of human face recognition model, which is characterized in that the training method of the human face recognition model include with Lower step:

Feature extraction is carried out to each target facial image using each target network-layer in depth convolutional neural networks, to obtain Multiple global characteristics；

The global characteristics are split according to the global characteristics corresponding target network-layer to obtain each local feature, and The corresponding each global characteristics of the target facial image are merged to obtain fusion feature；

Loss supervised learning is carried out to the global characteristics, the local feature and fusion feature, to human face recognition model It is trained.

2. the training method of human face recognition model as described in claim 1, which is characterized in that it is described to the global characteristics, The step of local feature and fusion feature carry out loss supervised learning include:

Determine the global characteristics, the local feature and the corresponding target loss function of the fusion feature and the mesh Mark the corresponding weighted value of loss function；

It is corresponding to the objective function according to the target loss function and the corresponding weighted value of the target loss function Feature carries out loss supervised learning.

3. the training method of human face recognition model as claimed in claim 2, which is characterized in that the determination is described global special Sign, the local feature and the corresponding target loss function of the fusion feature and the corresponding power of the target loss function The step of weight values includes:

Normalization figure penalties function is determined as the corresponding first object loss function of the local feature, and by the normalizing Change figure penalties function and triple loss function is determined as the global characteristics and the fusion feature corresponding second Target loss function；

The first weighted value and the corresponding second target loss letter of the fusion feature are configured for the first object loss function Exponential function is normalized in number and configures the second weighted value, and according to the corresponding target network-layer of the global characteristics to the overall situation Exponential function is normalized in the corresponding second target loss function of feature configures third weighted value.

4. the training method of human face recognition model as described in claim 1, which is characterized in that use depth convolutional neural networks In each target network-layer feature extraction is carried out to each target facial image, to obtain multiple global characteristics the step of includes:

Multiple target network-layers are determined in each network layer of the depth convolutional neural networks, wherein the target network Distance is less than other network layers except the target network-layer and institute between layer and the output layer of the depth convolutional neural networks State the distance between output layer；

Target facial image is inputted into each target network-layer, it is corresponding global special to obtain each target network-layer Sign figure；

5. the training method of human face recognition model as described in claim 1, which is characterized in that it is described to the fusion feature into Row loss supervised learning the step of include:

Reduce the dimension of the fusion feature；

6. the training method of human face recognition model as described in claim 1, which is characterized in that described to use depth convolutional Neural Before the step of each target network-layer carries out feature extraction to each target facial image in network, further includes:

Each test facial image is obtained, and each test facial image is pre-processed, to obtain each target face figure Picture, wherein the pretreatment mass image normalization processes pixel and image data enhancing processing.

7. the training method of human face recognition model as claimed in any one of claims 1 to 6, which is characterized in that described to described After the step of global characteristics, the local feature and fusion feature carry out loss supervised learning, further includes:

When the corresponding loss function of the depth convolutional neural networks converges to set interval, the human face recognition model is completed Training, and save complete training the human face recognition model.

8. the training method of human face recognition model as claimed in claim 7, which is characterized in that the people that training will be completed After the step of face identification model saves, further includes:

The fusion feature for the human face recognition model output for completing training is prestored into fusion as the facial image is corresponding Feature, and fusion feature is prestored described in preservation.

9. a kind of training device of human face recognition model, which is characterized in that the training device of the human face recognition model includes place Manage device, memory and the training journey for being stored in the human face recognition model that can be run on the memory and on the processor Sequence is realized when the training program of the human face recognition model is executed by the processor as claim 1-8 is described in any item Each step of the training method of human face recognition model.

10. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage has recognition of face The training program of model realizes such as claim 1-8 when the training program of the human face recognition model is executed by the processor Each step of the training method of described in any item human face recognition models.