CN111768354A - Face image restoration system based on multi-scale face part feature dictionary - Google Patents
- Publication number
- CN111768354A (application number CN202010779169.7A)
- Authority
- CN
- China
- Prior art keywords
- face
- scale
- dictionary
- features
- feature
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/02—Affine transformations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
Abstract
A face image restoration system based on a multi-scale face part feature dictionary, belonging to the technical field of face image restoration. The invention addresses a limitation of existing face image restoration techniques: recovering a high-quality face image from a real low-quality face image requires guidance from a high-definition face image of the same person, which restricts practical application. The system comprises a face part feature dictionary offline generation module, used for extracting high-definition face part features from each sample image in a high-definition face image data set and obtaining a face part feature dictionary by applying k-means clustering to the extraction results; and a face image restoration module, used for extracting features from the degraded face image to be restored, fusing the feature extraction results with the face part feature dictionary to obtain part-enhanced face features to be restored, and reconstructing those features to obtain a guided restoration result image. The invention is used for restoring low-quality images.
Description
Technical Field
The invention relates to a face image restoration system based on a multi-scale face part feature dictionary, and belongs to the technical field of face image restoration.
Background
Face image restoration technology recovers a low-quality face image (degraded by blur, heavy noise, compression artifacts, long-distance shooting, low-quality capture equipment, network transmission, and the like) into a high-quality face image. With the development of technology and equipment, people increasingly pursue high-definition images and videos, and mobile phone manufacturers likewise pursue high quality in captured face images. When unavoidable factors produce a low-quality face image, the visual experience is often unacceptable, and the real degradation of an image cannot be accurately simulated; how to recover a high-quality image from a real low-quality image has therefore become a research focus for enterprises and researchers.
In recent years, deep learning has made breakthroughs in image quality improvement and can significantly raise the visual quality of images. However, most current methods are limited to single-image restoration, learning a mapping from low-quality face images to high-quality images through a convolutional neural network. Because the degradation of real images cannot be faithfully simulated, such methods fail on most real images and therefore do not achieve ideal robustness or restoration quality.
To address these problems, some methods adopt one or more high-definition images of the same person as a guide to assist the network's recovery process. Although this yields a certain performance improvement, it requires that the identity of the degraded image be known in advance and that one or more high-definition guide images be available, which greatly limits the range of application.
Disclosure of Invention
The invention provides a face image restoration system based on a multi-scale face part feature dictionary, addressing the problem that, in existing face image restoration technology, obtaining a high-quality face image from a real low-quality face image requires guidance from a high-definition face image of the same user, which limits its application.
The invention discloses a face image restoration system based on a multi-scale face part feature dictionary, which comprises:
the face part feature dictionary offline generation module: used for extracting high-definition face part features from each sample image in a high-definition face image data set, and obtaining a face part feature dictionary by applying k-means clustering to the extraction results;
the face image restoration module: used for extracting features from the degraded face image to be restored, and fusing the feature extraction results with the face part feature dictionary to obtain part-enhanced face features to be restored; the face features to be restored are then reconstructed to obtain a guided restoration result image.
According to the face image restoration system based on the multi-scale face part feature dictionary, the face part feature dictionary obtained by the face feature dictionary offline generation module comprises M scales, where M is an integer greater than or equal to 1.
According to the face image restoration system based on the multi-scale face part feature dictionary, when M = 4, processing each sample image with the VGGFace model comprises the following steps:
sequentially performing convolution, activation, pooling, convolution, activation, convolution and activation operations on each sample image to obtain high-definition face part features with the scale of 1;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 1 to obtain high-definition face part features with the scale of 2;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 2 to obtain high-definition face part features with the scale of 3;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 3 to obtain high-definition face part features with the scale of 4;
the activation operation adopts a ReLU activation function, and the pooling operation adopts maximum pooling operation;
the first and second convolution operations are both 64 convolution operations of 3 x 3 with a step size of 1;
the third and fourth convolution operations are both 128 convolution operations of 3 x 3 with a step size of 1;
the fifth convolution operation to the ninth convolution operation are all 256 convolution operations with 3 × 3 and step size of 1;
the tenth to sixteenth convolution operations are all 512 convolution operations of 3 × 3 with a step size of 1;
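The multi-scale extraction described above can be sketched in miniature. The sketch below uses random stand-in weights and reduced channel widths (8/16/32/64 rather than the 64–512 of VGGFace), so only the structure — one pooling per stage, halving the spatial size at each scale — follows the claims; all function names and parameter sizes here are illustrative, not part of the claimed system.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv3x3(x, out_ch):
    # 'same' 3x3 convolution with random stand-in weights (the real system uses VGGFace weights)
    in_ch, h, w = x.shape
    wgt = rng.standard_normal((out_ch, in_ch, 3, 3)) * 0.01
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros((out_ch, h, w))
    for dy in range(3):
        for dx in range(3):
            out += np.einsum('oi,ihw->ohw', wgt[:, :, dy, dx], xp[:, dy:dy + h, dx:dx + w])
    return out

def relu(x):
    return np.maximum(x, 0.0)

def maxpool2(x):
    c, h, w = x.shape
    return x.reshape(c, h // 2, 2, w // 2, 2).max(axis=(2, 4))

def vgg_pyramid(img):
    # scale 1: conv - relu - pool - conv - relu - conv - relu
    f = relu(conv3x3(img, 8))
    f = maxpool2(f)
    f = relu(conv3x3(f, 16))
    s1 = relu(conv3x3(f, 16))
    feats = [s1]
    # scales 2-4: (relu) - pool - conv - relu - conv - relu - conv
    for ch in (32, 64, 64):
        f = maxpool2(relu(feats[-1]))
        f = relu(conv3x3(f, ch))
        f = relu(conv3x3(f, ch))
        feats.append(conv3x3(f, ch))
    return feats

feats = vgg_pyramid(rng.standard_normal((3, 64, 64)))
print([f.shape for f in feats])  # spatial size halves at each successive scale
```

Running the sketch on a 64 × 64 input yields feature maps of 32, 16, 8 and 4 pixels per side for scales 1 through 4.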
the high-definition face part feature with the scale of 1, the high-definition face part feature with the scale of 2, the high-definition face part feature with the scale of 3 and the high-definition face part feature with the scale of 4 are respectively processed through a dictionary generation module to obtain a face part feature dictionary with the corresponding scale;
the process of processing the input data by the dictionary generation module comprises the following steps:
acquiring high-definition face part features with the scale of 1, high-definition face part features with the scale of 2, high-definition face part features with the scale of 3 or high-definition face part features with the scale of 4;
then, a region alignment operation is carried out: the positions of the left eye, right eye, nose and mouth are obtained from the high-definition face part features by a face key point detection algorithm; according to the obtained part positions, the left eye, right eye, nose and mouth are cropped from the corresponding high-definition face part features by RoIAlign, and the resulting part features of the left eye, right eye, nose and mouth form the extraction result;
by k-means clustering over all part features in the extraction result, K1 clustering centers are obtained for the left eye, K2 for the right eye, K3 for the nose and K4 for the mouth; the K1 clustering centers constitute the left-eye dictionary, the K2 clustering centers the right-eye dictionary, the K3 clustering centers the nose dictionary, and the K4 clustering centers the mouth dictionary; K1, K2, K3 and K4 are all greater than or equal to 1;
the face part feature dictionary with the scale of 1 is obtained corresponding to the high-definition face part feature with the scale of 1, the face part feature dictionary with the scale of 2 is obtained corresponding to the high-definition face part feature with the scale of 2, the face part feature dictionary with the scale of 3 is obtained corresponding to the high-definition face part feature with the scale of 3, and the face part feature dictionary with the scale of 4 is obtained corresponding to the high-definition face part feature with the scale of 4.
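The offline dictionary construction — RoIAlign-cropped part features clustered with k-means into K1–K4 centers per part — can be sketched as follows. The feature arrays are random stand-ins, and `kmeans` / `build_part_dictionary` are illustrative helpers, not the claimed implementation.

```python
import numpy as np

def kmeans(features, k, iters=20, seed=0):
    """Plain k-means; the k cluster centers serve as the dictionary atoms for one part."""
    rng = np.random.default_rng(seed)
    centers = features[rng.choice(len(features), k, replace=False)]
    for _ in range(iters):
        # assign each part feature to its nearest center
        d = ((features[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = d.argmin(1)
        for j in range(k):
            if (labels == j).any():
                centers[j] = features[labels == j].mean(0)
    return centers

def build_part_dictionary(part_features, k_per_part):
    # part_features: {part_name: (N, D) array of RoIAlign-cropped features at one scale}
    return {part: kmeans(feats, k_per_part[part])
            for part, feats in part_features.items()}

rng = np.random.default_rng(1)
parts = {p: rng.standard_normal((200, 32))
         for p in ("left_eye", "right_eye", "nose", "mouth")}
dic = build_part_dictionary(parts, {"left_eye": 4, "right_eye": 4, "nose": 3, "mouth": 5})
print({p: c.shape for p, c in dic.items()})
```

Each part then contributes a (K, D) block of cluster centers to the dictionary at that scale; the same procedure is repeated per scale.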
According to the facial image restoration system based on the multi-scale facial feature dictionary, the facial image restoration module comprises:
a primary face feature extraction module: sequentially performing convolution, activation, pooling, convolution, activation and convolution operations on a degraded face image to be restored to obtain degraded face features with the scale of 1;
a secondary face feature extraction module: sequentially performing pooling, activation, convolution, activation and convolution operations on the degraded human face features with the scale of 1 to obtain degraded human face features with the scale of 2;
the three-level face feature extraction module: sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the degraded human face features with the scale of 2 to obtain degraded human face features with the scale of 3;
four-level face feature extraction module: sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the degraded human face features with the scale of 3 to obtain degraded human face features with the scale of 4;
the activation operation adopts a ReLU activation function, and the pooling operation adopts maximum pooling operation;
the first and second convolution operations are both 64 convolution operations of 3 x 3 with a step size of 1;
the third and fourth convolution operations are both 128 convolution operations of 3 x 3 with a step size of 1;
the fifth to ninth convolution operations are all 256 convolution operations of 3 × 3 with a step size of 1;
the tenth to sixteenth convolution operations are all 512 convolution operations of 3 × 3 with a step size of 1;
the dictionary feature guidance enhancement module with the scale of 1: the face feature dictionary fusion method is used for fusing the degraded face features with the scale of 1 and the face part feature dictionary with the scale of 1 to obtain first-level face features to be restored after part enhancement;
scale 2 dictionary feature guidance enhancement module: the face feature dictionary fusion method is used for fusing the degraded face features with the scale of 2 and the face part feature dictionary with the scale of 2 to obtain second-level face features to be restored after part enhancement;
scale 3 dictionary feature guidance enhancement module: the face feature dictionary fusion method is used for fusing the degraded face features with the scale of 3 and the face part feature dictionary with the scale of 3 to obtain three-level face features to be restored after part enhancement;
the dictionary feature guidance enhancement module with the scale of 4: the face feature dictionary fusion method is used for fusing the degraded face features with the scale of 4 and the face part feature dictionary with the scale of 4 to obtain four-level face features to be restored after part enhancement;
a four-level reconstruction module: used for performing affine transformation on the degraded face features with the scale of 4 and the four-level face features to be restored, and decoding the transformation result through the network to obtain four-level reconstruction result features;
a three-level reconstruction module: used for performing affine transformation on the four-level reconstruction result features and the three-level face features to be restored, and decoding the transformation result through the model network to obtain three-level reconstruction result features;
a secondary reconstruction module: used for performing affine transformation on the three-level reconstruction result features and the secondary face features to be restored, and decoding the transformation result through the model network to obtain secondary reconstruction result features;
a primary reconstruction module: used for performing affine transformation on the secondary reconstruction result features and the primary face features to be restored, and decoding the transformation result through the model network to obtain the primary reconstruction result image;
an output module: and the primary reconstruction result image is output as a guide restoration result image.
According to the face image restoration system based on the multi-scale face part feature dictionary, the processing process of the dictionary feature guidance enhancement module with the scale of 1, the dictionary feature guidance enhancement module with the scale of 2, the dictionary feature guidance enhancement module with the scale of 3 and the dictionary feature guidance enhancement module with the scale of 4 on input data is the same; taking the dictionary feature guidance enhancement module with the scale of 1 as an example for explanation:
the scale-1 dictionary feature guidance enhancement module comprises:
a face part feature extraction module: used for obtaining the part features of the left eye, right eye, nose and mouth from the degraded face features with the scale of 1 according to the face key points;
a dictionary feature adaptive normalization module: used for performing an adaptive normalization operation on the left-eye dictionary, right-eye dictionary, nose dictionary and mouth dictionary in the face part feature dictionary with the scale of 1, combined with the part features of the left eye, right eye, nose and mouth, to obtain normalized dictionary features;
a traversing dictionary module: used for traversing the normalized dictionary features to obtain, for each of the left-eye, right-eye, nose and mouth part features, the closest corresponding feature as the matching dictionary feature;
a confidence prediction module: used for predicting a confidence from the residual between the part features of the left eye, right eye, nose and mouth and the corresponding matching dictionary features, to obtain adaptive fusion features of the parts;
a restoration module: used for substituting the adaptive fusion features of the parts back into the degraded face features to be restored according to the face key points, to obtain the primary face features to be restored.
According to the face image restoration system based on the multi-scale face part feature dictionary, the method by which the dictionary feature adaptive normalization module acquires the normalized dictionary features comprises the following steps:
performing an adaptive normalization operation on the dictionary features of the left eye, right eye, nose and mouth:

$\hat{Dic}^{k}_{s,c} = \sigma(F_{s,c})\,\dfrac{Dic^{k}_{s,c} - \mu(Dic^{k}_{s,c})}{\sigma(Dic^{k}_{s,c})} + \mu(F_{s,c})$

wherein: $\hat{Dic}^{k}_{s,c}$ is the normalized dictionary feature; $Dic^{k}_{s,c}$ is the $k$-th clustering center of the $c$-th part feature with scale $s$ in the offline-constructed dictionary; $\sigma$ is the variance operation and $\mu$ is the mean operation; $F_{s,c}$ is the $c$-th part feature of the degraded face image to be restored with scale $s$; $c \in$ {left eye, right eye, nose, mouth}, and here $s = 1$;
the process by which the traversing dictionary module obtains the matching dictionary features comprises the following steps:
calculating, for each of the part features of the left eye, right eye, nose and mouth, the closest normalized dictionary feature:

$S^{k}_{s,c} = \left\langle \dfrac{F_{s,c}}{\|F_{s,c}\|},\ \dfrac{\hat{Dic}^{k}_{s,c}}{\|\hat{Dic}^{k}_{s,c}\|} \right\rangle, \qquad Dic^{*}_{s,c} = \hat{Dic}^{\,\hat{k}}_{s,c},\quad \hat{k} = \arg\max_{k} S^{k}_{s,c}$

in the formula, $S^{k}_{s,c}$ is the confidence of the matched dictionary feature, and $\langle\cdot,\cdot\rangle$ is the inner product operation.
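The adaptive normalization and inner-product matching steps can be sketched as follows, assuming global (per-feature) mean and standard deviation statistics; the toy feature sizes and the small epsilon guards are illustrative choices, not stated in the text.

```python
import numpy as np

def adain(dic_atom, f):
    """Normalize a dictionary atom to the mean/std statistics of the degraded part feature f."""
    return np.std(f) * (dic_atom - dic_atom.mean()) / (np.std(dic_atom) + 1e-8) + f.mean()

def match_dictionary(f, dictionary):
    """Return the normalized atom whose normalized inner product with f is largest."""
    normed = [adain(d, f) for d in dictionary]
    fu = f / (np.linalg.norm(f) + 1e-8)
    scores = [float(np.dot(fu.ravel(), (d / (np.linalg.norm(d) + 1e-8)).ravel()))
              for d in normed]
    best = int(np.argmax(scores))
    return normed[best], scores[best]

rng = np.random.default_rng(2)
f = rng.standard_normal((8, 4, 4))              # degraded left-eye feature (toy size)
dictionary = rng.standard_normal((6, 8, 4, 4))  # K1 = 6 left-eye atoms at this scale
atom, score = match_dictionary(f, dictionary)
print(atom.shape, round(score, 3))
```

Because both vectors are unit-normalized before the inner product, the matching score is a cosine similarity in [-1, 1].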
According to the face image restoration system based on the multi-scale face part feature dictionary, the process by which the confidence prediction module obtains the adaptive fusion features of the parts comprises the following steps:

$\tilde{F}_{s,c} = F_{s,c} + Dic^{*}_{s,c} \otimes \mathcal{C}\!\left(F_{s,c} - Dic^{*}_{s,c};\ \Theta_{C}\right)$

in the formula, $\tilde{F}_{s,c}$ is the adaptive fusion feature, $Dic^{*}_{s,c}$ is the matched closest corresponding feature, $\mathcal{C}$ is the confidence prediction network, and $\Theta_{C}$ are the learnable parameters of the confidence prediction network; the confidence prediction network comprises two layers of 3 × 3 convolution with a step size of 1;
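A minimal sketch of the residual-driven confidence fusion follows, assuming the two-layer 3 × 3 confidence prediction network ends in a sigmoid so the predicted confidence lies in (0, 1) — the sigmoid is an assumption not stated in the text, and the weights are random stand-ins.

```python
import numpy as np

rng = np.random.default_rng(3)

def conv3x3(x, out_ch, wgt):
    # 'same' 3x3 convolution with explicitly supplied weights
    in_ch, h, w = x.shape
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros((out_ch, h, w))
    for dy in range(3):
        for dx in range(3):
            out += np.einsum('oi,ihw->ohw', wgt[:, :, dy, dx], xp[:, dy:dy + h, dx:dx + w])
    return out

def confidence_fuse(f, matched, w1, w2):
    """F_tilde = F + Dic* (x) C(F - Dic*): a residual-driven gate on the matched atom."""
    resid = f - matched
    h = np.maximum(conv3x3(resid, f.shape[0], w1), 0.0)        # first 3x3 conv + ReLU
    conf = 1.0 / (1.0 + np.exp(-conv3x3(h, f.shape[0], w2)))   # second conv -> confidence in (0, 1)
    return f + matched * conf

c = 8
f = rng.standard_normal((c, 4, 4))
matched = rng.standard_normal((c, 4, 4))
w1 = rng.standard_normal((c, c, 3, 3)) * 0.1
w2 = rng.standard_normal((c, c, 3, 3)) * 0.1
fused = confidence_fuse(f, matched, w1, w2)
print(fused.shape)
```

The gate lets the network trust the dictionary atom more where the residual indicates heavy degradation, and less where the degraded feature is already reliable.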
the primary reconstruction module obtains a scale change parameter alpha and a displacement change parameter beta through convolution operation of two layers of 3 x 3 with the step length of 1 for a secondary reconstruction result image and primary human face features to be restored, and obtains a primary reconstruction result image SFTs through calculation:
According to the face image restoration system based on the multi-scale face part feature dictionary, the restoration model further comprises a training constraint: the training network constrains the learning of the whole network through a reconstruction loss, which comprises the pixel-space and feature-space losses between the primary reconstruction result image and its corresponding undegraded high-definition image:

$L_{rec} = \lambda_{l2}\,\big\|\hat{I} - I^{h}\big\|_{2}^{2} + \lambda_{pm}\sum_{m}\dfrac{1}{C_{m}H_{m}W_{m}}\big\|\Psi_{m}(\hat{I}) - \Psi_{m}(I^{h})\big\|_{2}^{2}$

in the formula, $\lambda_{l2}$ represents the pixel-space loss weight, $\lambda_{pm}$ represents the feature-space loss weight, $\hat{I}$ is the primary reconstruction result image, $I^{h}$ is the corresponding undegraded high-definition image, $C_{m}$, $H_{m}$, $W_{m}$ sequentially represent the number of channels, height and width of the $m$-th layer features of the primary reconstruction result image, and $\Psi_{m}$ extracts the $m$-th layer convolution features from a pre-trained face recognition network;
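The reconstruction loss can be sketched as follows; `toy_feat_fn` is a stand-in for the pre-trained face recognition network Ψ, using two average-pooling stages as two feature "layers", and the weights are illustrative.

```python
import numpy as np

def reconstruction_loss(restored, target, feat_fn, lam_l2=1.0, lam_pm=0.01):
    """Pixel-space MSE plus feature-space MSE, each feature layer normalized by C*H*W."""
    loss = lam_l2 * np.mean((restored - target) ** 2)
    for fm_r, fm_t in zip(feat_fn(restored), feat_fn(target)):
        c, h, w = fm_r.shape
        loss += lam_pm * np.sum((fm_r - fm_t) ** 2) / (c * h * w)
    return float(loss)

def toy_feat_fn(img):
    # stand-in for the pre-trained face recognition network Psi_m:
    # two 2x average-pooling stages serve as two "layers" of features
    f1 = img.reshape(img.shape[0], 16, 2, 16, 2).mean((2, 4))
    f2 = f1.reshape(img.shape[0], 8, 2, 8, 2).mean((2, 4))
    return [f1, f2]

rng = np.random.default_rng(5)
a = rng.standard_normal((3, 32, 32))
b = rng.standard_normal((3, 32, 32))
print(round(reconstruction_loss(a, b, toy_feat_fn), 4))
```

The loss is zero only when the restored image matches the target in both pixel and feature space.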
the training network further comprises a multi-scale discrimination loss function:
the guided restoration result image is down-sampled by factors r ∈ {1, 2, 4, 8} to obtain 4 groups of images with different resolutions, and the loss is calculated through four discrimination networks using a hinge loss; for learning of the discrimination network at scale r, the loss $L_{d,r}$ is defined as:

$L_{d,r} = -\,\mathbb{E}_{I^{h}\downarrow r \sim P(I^{h}\downarrow r)}\big[\min\big(0,\ D_{r}(I^{h}\!\downarrow\! r) - 1\big)\big] - \mathbb{E}_{\hat{I}\downarrow r}\big[\min\big(0,\ -D_{r}(\hat{I}\!\downarrow\! r) - 1\big)\big]$

wherein $r \le R$, $R$ is the upper limit of the scale, $D_{r}$ is the discriminator at scale $r$, $I^{h}\!\downarrow\! r$ is the undegraded high-definition image down-sampled by a factor of $r$, $\downarrow r$ denotes down-sampling by a factor of $r$, $\mathbb{E}$ is the expectation, and $P(I^{h}\!\downarrow\! r)$ is the distribution of $I^{h}\!\downarrow\! r$;
for learning of the generative network under the discrimination network constraints, the loss $L_{g}$ is defined as:

$L_{g} = -\sum_{r=1}^{R} \lambda_{a,r}\,\mathbb{E}_{I^{d}\sim P(I^{d})}\Big[D_{r}\big(\Phi(I^{d}, L_{d}, Dic;\ \theta)\!\downarrow\! r\big)\Big]$

in the formula, $\lambda_{a,r}$ is the weight of the discrimination network at scale $r$; $L_{d}$ are the face key points, $Dic$ is the constructed face dictionary, $\theta$ are the learnable parameters of the model, $I^{d}$ is the degraded face image to be restored, $P(I^{d})$ is the distribution of $I^{d}$, and $\Phi$ is the restoration network;
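Assuming the multi-scale discrimination loss is the standard hinge formulation, the per-scale discriminator term and the scale-weighted generator term can be sketched as follows; the score arrays stand in for discriminator outputs and are purely illustrative.

```python
import numpy as np

def d_hinge_loss(d_real, d_fake):
    """Hinge loss for one scale-r discriminator: push real scores above +1, fake below -1."""
    return float(np.mean(np.maximum(0.0, 1.0 - d_real))
                 + np.mean(np.maximum(0.0, 1.0 + d_fake)))

def g_adv_loss(d_fake_per_scale, weights):
    """Generator adversarial loss summed over scales r with weights lambda_{a,r}."""
    return float(-sum(w * np.mean(s) for w, s in zip(weights, d_fake_per_scale)))

d_real = np.array([1.5, 0.2])   # discriminator outputs on real (downsampled) HD images
d_fake = np.array([-1.2, 0.5])  # discriminator outputs on restored images
print(d_hinge_loss(d_real, d_fake))

# one fake-score batch per scale r in {1, 2, 4, 8}
fakes = [np.array([0.3]), np.array([-0.1]), np.array([0.2]), np.array([0.4])]
print(g_adv_loss(fakes, [1.0, 0.5, 0.25, 0.125]))
```

Real samples already scoring above +1 (and fakes below -1) contribute nothing, which is what keeps hinge training stable compared with an unclipped loss.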
according to the face image restoration system based on the multi-scale face part feature dictionary, the training network performs end-to-end training on other network structures except the first-level face feature extraction module, the second-level face feature extraction module, the third-level face feature extraction module and the fourth-level face feature extraction module by adopting an Adam optimization algorithm.
According to the face image restoration system based on the multi-scale face part feature dictionary, the degraded face image to be restored is obtained by sequentially applying blurring, down-sampling, noise addition and JPEG compression to a high-definition face image. The blurring uses Gaussian blur and motion blur, with the standard deviation of the Gaussian blur kernel drawn from a range parameterized by P; down-sampling is performed by bicubic down-sampling with a sampling scale S ∈ {1:0.1:S}; noise addition uses Gaussian white noise with a noise level N ∈ {0, 1:0.1:N}; the JPEG compression quality parameter is Q ∈ {0, 10:0.1:Q}; where P ≥ 5, S ≥ 8, N ≥ 15 and Q ≥ 80;
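The degradation pipeline can be sketched as follows; the separable Gaussian blur is real, but strided slicing stands in for bicubic down-sampling, motion blur is omitted, and the JPEG compression step is left out, so the sketch only illustrates the order of operations, not the claimed parameter ranges.

```python
import numpy as np

rng = np.random.default_rng(6)

def gaussian_kernel1d(sigma, radius=4):
    x = np.arange(-radius, radius + 1)
    k = np.exp(-(x ** 2) / (2 * sigma ** 2))
    return k / k.sum()

def blur(img, sigma):
    """Separable Gaussian blur applied per channel ('same' boundary handling)."""
    k = gaussian_kernel1d(sigma)
    rows = np.stack([np.apply_along_axis(lambda r: np.convolve(r, k, mode='same'), 1, ch)
                     for ch in img])
    return np.stack([np.apply_along_axis(lambda c: np.convolve(c, k, mode='same'), 0, ch)
                     for ch in rows])

def degrade(img, sigma=1.5, scale=4, noise_level=0.02):
    x = blur(img, sigma)                           # blur first
    x = x[:, ::scale, ::scale]                     # strided stand-in for bicubic down-sampling
    x = x + rng.normal(0.0, noise_level, x.shape)  # additive Gaussian white noise
    return np.clip(x, 0.0, 1.0)                    # JPEG compression step omitted in this sketch

hd = rng.random((3, 64, 64))  # stand-in high-definition face image in [0, 1]
lq = degrade(hd)
print(lq.shape)
```

Pairs of (lq, hd) produced this way would form the training data for the end-to-end optimization described above.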
the whole network is trained through the constructed low-quality degraded human face image to be restored and the corresponding high-definition human face image, and the obtained trained network is used for restoring the low-quality image.
The invention has the following beneficial effects: by constructing a high-definition face part dictionary to replace the guide image for enhancement, image restoration is no longer limited in its application range and can be applied to most face enhancement scenarios; compared with one or several guide images, the constructed face part dictionary allows high-quality features with higher similarity to be selected as guidance, greatly improving the quality of guided enhancement.
Aiming at the problem that real low-quality images cannot be effectively restored by the prior art, and the problem that guide-image-based enhancement methods require one or more high-definition images of the same identity, the invention provides a face part dictionary algorithm to assist face image enhancement.
Drawings
FIG. 1 is a flow chart of a face image restoration system based on a multi-scale face part feature dictionary according to the present invention;
FIG. 2 is a block flow diagram of a face image restoration module;
FIG. 3 is a schematic diagram of a network structure for generating a face region feature dictionary;
fig. 4 is a schematic diagram of a network structure in which a face part feature dictionary is migrated to a face image restoration module to realize image restoration.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that the embodiments and features of the embodiments may be combined with each other without conflict.
The invention is further described with reference to the following drawings and specific examples, which are not intended to be limiting.
In a first embodiment, as shown in fig. 1 to 4, the present invention provides a face image restoration system based on a multi-scale face part feature dictionary, including:
the face part feature dictionary offline generation module 100: used for extracting high-definition face part features from each sample image in a high-definition face image data set, and obtaining a face part feature dictionary by applying k-means clustering to the extraction results;
the face image restoration module 200: used for extracting features from the degraded face image to be restored, and fusing the feature extraction results with the face part feature dictionary to obtain part-enhanced face features to be restored; the face features to be restored are then reconstructed to obtain a guided restoration result image.
In the embodiment, the sample images forming the high-definition face image data set have different postures, expressions, illumination conditions and the like, so that the diversity of the sample images is ensured.
In the embodiment, the face part feature dictionary is generated in an off-line mode, which is beneficial to greatly improving the efficiency of low-quality image restoration.
Further, the face feature dictionary obtained by the face feature dictionary offline generation module 100 includes M scales, where M is an integer greater than or equal to 1.
Still further, with reference to fig. 1 to 4, when M is 4, the processing of the sample image by using the VggFace model includes:
sequentially performing convolution, activation, pooling, convolution, activation, convolution and activation operations on each sample image to obtain high-definition face part features with the scale of 1;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 1 to obtain high-definition face part features with the scale of 2;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 2 to obtain high-definition face part features with the scale of 3;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 3 to obtain high-definition face part features with the scale of 4;
the activation operation adopts a ReLU activation function, and the pooling operation adopts maximum pooling operation;
the first and second convolution operations each use 64 convolution kernels of 3 × 3 with a stride of 1;
the third and fourth convolution operations each use 128 convolution kernels of 3 × 3 with a stride of 1;
the fifth to ninth convolution operations each use 256 convolution kernels of 3 × 3 with a stride of 1;
the tenth to sixteenth convolution operations each use 512 convolution kernels of 3 × 3 with a stride of 1;
the high-definition face part feature with the scale of 1, the high-definition face part feature with the scale of 2, the high-definition face part feature with the scale of 3 and the high-definition face part feature with the scale of 4 are respectively processed through a dictionary generation module to obtain a face part feature dictionary with the corresponding scale;
the process of processing the input data by the dictionary generation module comprises the following steps:
acquiring high-definition face part features with the scale of 1, high-definition face part features with the scale of 2, high-definition face part features with the scale of 3 or high-definition face part features with the scale of 4;
then, carrying out a region alignment operation: the positions of the left eye, right eye, nose and mouth are located on the high-definition face part features by a face key point detection algorithm, with the parameters of the VggFace model kept fixed; according to the obtained part positions, the left eye, right eye, nose and mouth are cropped from the corresponding high-definition face part features by RoIAlign, obtaining the part features of the left eye, right eye, nose and mouth as the extraction result; in this way, a large number of high-quality features of the individual parts can be obtained;
respectively obtaining K1 clustering centers of the left eye, K2 clustering centers of the right eye, K3 clustering centers of the nose and K4 clustering centers of the mouth of all the part characteristics of all the parts in the extraction result in a K-means clustering mode; wherein K1 cluster centers correspond to the left-eye dictionary, K2 cluster centers correspond to the right-eye dictionary, K3 cluster centers correspond to the nose dictionary, and K4 cluster centers correspond to the mouth dictionary; k1, K2, K3 and K4 are all greater than or equal to 1;
the face part feature dictionary with the scale of 1 is obtained corresponding to the high-definition face part feature with the scale of 1, the face part feature dictionary with the scale of 2 is obtained corresponding to the high-definition face part feature with the scale of 2, the face part feature dictionary with the scale of 3 is obtained corresponding to the high-definition face part feature with the scale of 3, and the face part feature dictionary with the scale of 4 is obtained corresponding to the high-definition face part feature with the scale of 4.
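The offline per-part clustering step above can be sketched as follows. This is a minimal NumPy illustration, not the patent's implementation: the feature dimension, the number of samples, the part names, and K = 8 clusters per part are hypothetical choices, and a library such as scikit-learn would normally replace the hand-rolled k-means loop.

```python
# Hypothetical sketch: part features cropped from many high-definition images
# are grouped with k-means, and the cluster centres become the dictionary atoms.
import numpy as np

def build_part_dictionary(part_features, k, n_iter=20, seed=0):
    """Cluster flattened part features (N x D) into k centres (k x D)."""
    rng = np.random.default_rng(seed)
    feats = np.asarray(part_features, dtype=np.float64)
    centres = feats[rng.choice(len(feats), size=k, replace=False)]
    for _ in range(n_iter):
        # Assign each feature to its nearest centre (Euclidean distance).
        d = ((feats[:, None, :] - centres[None, :, :]) ** 2).sum(-1)
        labels = d.argmin(1)
        # Recompute each centre; keep the old centre if its cluster is empty.
        for j in range(k):
            if (labels == j).any():
                centres[j] = feats[labels == j].mean(0)
    return centres

# One dictionary per part and per scale, e.g. keyed by (scale, part).
rng = np.random.default_rng(1)
dictionary = {
    (1, part): build_part_dictionary(rng.normal(size=(200, 64)), k=8)
    for part in ("left_eye", "right_eye", "nose", "mouth")
}
```

In the full system this would be repeated for each of the four scales, with separate K1..K4 values for the four parts.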
This embodiment extracts features at S scales, where S is 4; the network structure comprises convolutional layers C1 to C16 and pooling operations A1 to A4:
the convolution layer C1 performs a first convolution on a high-definition input sample image, and performs a first activation operation;
convolutional layer C2 performs a second convolution on the output of convolutional layer C1, a second activation operation;
pooling operation A1 performs a first pooling operation on the output of convolutional layer C2;
convolutional layer C3 performs a third convolution on the output of pooling operation A1, a third activation operation;
convolutional layer C4 performs a fourth convolution on the output of convolutional layer C3, a fourth activation operation;
the output of convolutional layer C4 is the high-definition face part feature with a scale of 1;
pooling layer A2 performs a fifth activation operation on the output of convolutional layer C4, a second pooling operation;
convolutional layer C5 performs a fifth convolution operation, a sixth activation operation, on the output of pooling layer A2;
convolutional layer C6 performs a sixth convolution operation, a seventh activation operation on the convolutional layer C5 output;
convolutional layer C7 performs the seventh convolution operation, the eighth activation operation on the convolutional layer C6 output;
convolutional layer C8 performs an eighth convolution operation on the output of convolutional layer C7;
the output of convolutional layer C8 is the high-definition face part feature with a scale of 2;
the pooling layer A3 performs a ninth activation operation and a third pooling operation on the output of the convolutional layer C8;
convolutional layer C9 performs the ninth convolution operation, the tenth activation operation on the output of pooling layer A3;
the convolutional layer C10 performs the tenth convolution operation, the eleventh activation operation on the output of the convolutional layer C9;
the convolutional layer C11 performs the eleventh convolution operation, the twelfth activation operation on the output of the convolutional layer C10;
convolutional layer C12 performs a twelfth convolution operation on the output of convolutional layer C11;
the output of convolutional layer C12 is the high-definition face part feature with a scale of 3;
pooling layer A4 performs a thirteenth activation operation, a fourth pooling operation on the output of convolutional layer C12;
convolutional layer C13 performs a thirteenth convolution operation, a fourteenth activation operation on the output of pooling layer A4;
convolutional layer C14 performs a fourteenth convolution operation, a fifteenth activation operation on the output of convolutional layer C13;
the convolutional layer C15 performs a fifteenth convolution operation, a sixteenth activation operation on the output of the convolutional layer C14;
convolutional layer C16 performs a sixteenth convolution operation on the output of convolutional layer C15;
the output of convolutional layer C16 is the high-definition face part feature with a scale of 4.
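The sixteen-layer, four-scale encoder above can be miniaturized into a toy sketch. The essential point it illustrates is that a 2 × 2 max-pool sits between consecutive scales, so each scale's feature map has half the spatial resolution of the previous one. The single 3 × 3 convolution per scale and the channel-free (H, W) maps are simplifications, not the VggFace structure.

```python
# Toy NumPy encoder: conv + ReLU at each scale, max-pool between scales,
# so a 64x64 input yields features at 64, 32, 16 and 8 pixels.
import numpy as np

def conv3x3(x, w):
    """'Same' 3x3 convolution of a (H, W) map with one (3, 3) kernel."""
    h, wd = x.shape
    p = np.pad(x, 1)
    out = np.zeros_like(x)
    for i in range(h):
        for j in range(wd):
            out[i, j] = (p[i:i + 3, j:j + 3] * w).sum()
    return out

def maxpool2(x):
    """2x2 max-pooling with stride 2."""
    h, w = x.shape
    return x[:h - h % 2, :w - w % 2].reshape(h // 2, 2, w // 2, 2).max((1, 3))

def encode(img, kernels):
    """Return features at scales 1..4 (pool moves to the next coarser scale)."""
    feats, x = [], img
    for k in kernels:
        x = np.maximum(conv3x3(x, k), 0.0)  # conv + ReLU
        feats.append(x)
        x = maxpool2(x)
    return feats

img = np.random.default_rng(0).normal(size=(64, 64))
kernels = [np.full((3, 3), 1 / 9.0)] * 4   # illustrative box kernels
scales = encode(img, kernels)
```

The same sketch applies unchanged to the degraded-image branch described later, since the two encoders share the layer layout.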
Still further, as shown in fig. 2, the facial image restoration module 200 includes:
a primary face feature extraction module: sequentially performing convolution, activation, pooling, convolution, activation and convolution operations on a degraded face image to be restored to obtain degraded face features with the scale of 1;
a secondary face feature extraction module: sequentially performing pooling, activation, convolution, activation and convolution operations on the degraded human face features with the scale of 1 to obtain degraded human face features with the scale of 2;
the three-level face feature extraction module: sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the degraded human face features with the scale of 2 to obtain degraded human face features with the scale of 3;
four-level face feature extraction module: sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the degraded human face features with the scale of 3 to obtain degraded human face features with the scale of 4;
the activation operation adopts a ReLU activation function, and the pooling operation adopts maximum pooling operation;
the first and second convolution operations each use 64 convolution kernels of 3 × 3 with a stride of 1;
the third and fourth convolution operations each use 128 convolution kernels of 3 × 3 with a stride of 1;
the fifth to ninth convolution operations each use 256 convolution kernels of 3 × 3 with a stride of 1;
the tenth to sixteenth convolution operations each use 512 convolution kernels of 3 × 3 with a stride of 1;
the dictionary feature guidance enhancement module with the scale of 1: the face feature dictionary fusion method is used for fusing the degraded face features with the scale of 1 and the face part feature dictionary with the scale of 1 to obtain first-level face features to be restored after part enhancement;
scale 2 dictionary feature guidance enhancement module: the face feature dictionary fusion method is used for fusing the degraded face features with the scale of 2 and the face part feature dictionary with the scale of 2 to obtain second-level face features to be restored after part enhancement;
scale 3 dictionary feature guidance enhancement module: the face feature dictionary fusion method is used for fusing the degraded face features with the scale of 3 and the face part feature dictionary with the scale of 3 to obtain three-level face features to be restored after part enhancement;
the dictionary feature guidance enhancement module with the scale of 4: the face feature dictionary fusion method is used for fusing the degraded face features with the scale of 4 and the face part feature dictionary with the scale of 4 to obtain four-level face features to be restored after part enhancement;
a fourth-level reconstruction module: used for performing affine transformation on the scale-4 degraded face features and the fourth-level face features to be restored, and feeding the transformation result into the network decoder features to obtain fourth-level reconstruction result features;
a third-level reconstruction module: used for performing affine transformation on the fourth-level reconstruction result features and the third-level face features to be restored, and feeding the transformation result into the network decoder features to obtain third-level reconstruction result features;
a secondary reconstruction module: used for performing affine transformation on the third-level reconstruction result features and the second-level face features to be restored, and feeding the transformation result into the network decoder features to obtain second-level reconstruction result features;
a primary reconstruction module: used for performing affine transformation on the second-level reconstruction result features and the first-level face features to be restored, and feeding the transformation result into the network decoder features to obtain a first-level reconstruction result image;
an output module: and the primary reconstruction result image is output as a guide restoration result image.
In this embodiment, the feature extraction process of the four face feature extraction modules specifically includes:
the first-level human face feature extraction module comprises the following steps:
the convolution layer C1 performs the first convolution on the degraded human face image to be restored, and performs the first activation operation;
convolutional layer C2 performs a second convolution on the output of convolutional layer C1, a second activation operation;
pooling operation A1 performs a first pooling operation on the output of convolutional layer C2;
convolutional layer C3 performs a third convolution on the output of pooling operation A1, a third activation operation;
convolutional layer C4 performs a fourth convolution on the output of convolutional layer C3, a fourth activation operation;
the output of convolutional layer C4 is the degraded face feature with a scale of 1;
the secondary face feature extraction module comprises the following steps:
pooling layer A2 performs a fifth activation operation on the output of convolutional layer C4, a second pooling operation;
convolutional layer C5 performs a fifth convolution operation, a sixth activation operation, on the output of pooling layer A2;
convolutional layer C6 performs a sixth convolution operation, a seventh activation operation on the convolutional layer C5 output;
convolutional layer C7 performs the seventh convolution operation, the eighth activation operation on the convolutional layer C6 output;
convolutional layer C8 performs an eighth convolution operation on the output of convolutional layer C7;
the output of convolutional layer C8 is the degraded face feature with a scale of 2;
the pooling layer A3 performs a ninth activation operation and a third pooling operation on the output of the convolutional layer C8;
convolutional layer C9 performs the ninth convolution operation, the tenth activation operation on the output of pooling layer A3;
the convolutional layer C10 performs the tenth convolution operation, the eleventh activation operation on the output of the convolutional layer C9;
the convolutional layer C11 performs the eleventh convolution operation, the twelfth activation operation on the output of the convolutional layer C10;
convolutional layer C12 performs a twelfth convolution operation on the output of convolutional layer C11;
the output of convolutional layer C12 is the degraded face feature with a scale of 3;
pooling layer A4 performs a thirteenth activation operation, a fourth pooling operation on the output of convolutional layer C12;
convolutional layer C13 performs a thirteenth convolution operation, a fourteenth activation operation on the output of pooling layer A4;
convolutional layer C14 performs a fourteenth convolution operation, a fifteenth activation operation on the output of convolutional layer C13;
the convolutional layer C15 performs a fifteenth convolution operation, a sixteenth activation operation on the output of the convolutional layer C14;
convolutional layer C16 performs a sixteenth convolution operation on the output of convolutional layer C15;
the output of convolutional layer C16 is the degraded face feature with a scale of 4.
Still further, with reference to fig. 2, the processing procedure of the dictionary feature guidance enhancement module with scale 1, the processing procedure of the dictionary feature guidance enhancement module with scale 2, the processing procedure of the dictionary feature guidance enhancement module with scale 3, and the processing procedure of the dictionary feature guidance enhancement module with scale 4 are the same; taking the dictionary feature guidance enhancement module with the scale of 1 as an example for explanation:
the scale-1 dictionary feature guidance enhancement module comprises:
face part feature extraction module: used for obtaining the part features of the left eye, right eye, nose and mouth from the scale-1 degraded face features according to the face key points;
the dictionary feature self-adaptive normalization module: performing self-adaptive normalization operation on the part characteristics of the left eye, the right eye, the nose and the mouth in the face part characteristic dictionary with the scale of 1 by combining the left eye dictionary, the right eye dictionary, the nose dictionary and the mouth dictionary to obtain normalized dictionary characteristics;
and traversing the dictionary module: traversing in the normalized dictionary features to obtain corresponding features closest to the features of the left eye, the right eye, the nose and the mouth as matching dictionary features;
a confidence prediction module: the system is used for predicting confidence according to residual errors between the part features of the left eye, the right eye, the nose and the mouth and the corresponding matched dictionary features to obtain self-adaptive fusion features of all parts;
a restoration module: and the method is used for replacing the self-adaptive fusion features of all parts into the degraded human face image to be restored according to the human face key points to obtain the first-level human face features to be restored. Specifically, the enhanced features of each part are replaced with corresponding features in the features of the input image according to the positions detected by the key points of the human face, so as to obtain the enhanced next-stage human face features to be restored.
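The crop-and-replace mechanics of the restoration module can be sketched as below; the box coordinates stand in for the face-key-point detections, and the enhancement of the cropped feature is outside this snippet. All names and sizes are illustrative, not the patent's configuration.

```python
# Hedged sketch of the "replace back" step: a part feature is cropped at the
# key-point location, enhanced elsewhere, then pasted over the same region of
# the input feature map.
import numpy as np

def crop_part(feature_map, box):
    """Cut feature_map[y0:y1, x0:x1] out as the part feature."""
    y0, x0, y1, x1 = box
    return feature_map[y0:y1, x0:x1].copy()

def paste_part(feature_map, part_feature, box):
    """Overwrite the same region with the (enhanced) part feature."""
    y0, x0, y1, x1 = box
    out = feature_map.copy()
    out[y0:y1, x0:x1] = part_feature
    return out

fm = np.zeros((8, 8))
box = (1, 2, 3, 5)                    # (y0, x0, y1, x1) from key points
part = crop_part(fm, box) + 1.0       # stand-in for dictionary enhancement
restored = paste_part(fm, part, box)
```

In the real module this happens once per part (left eye, right eye, nose, mouth) and per scale, with RoIAlign in place of the integer crop.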
In this embodiment, the four dictionary feature guidance enhancement modules add the enhanced features to the network decoding features through feature affine transformation, and the four enhancement modules respectively operate in feature spaces of different scales, so that a coarse-to-fine guidance restoration result can be obtained.
Still further, the method for obtaining the normalized dictionary features by the dictionary feature adaptive normalization module comprises the following steps:
performing an adaptive normalization operation on the dictionary features of the left eye, right eye, nose and mouth:

$$\widehat{Dic}_{s,c}^{\,k} \;=\; \sigma\!\left(F_{d,s}^{c}\right)\cdot\frac{Dic_{s,c}^{k}-\mu\!\left(Dic_{s,c}^{k}\right)}{\sigma\!\left(Dic_{s,c}^{k}\right)} \;+\; \mu\!\left(F_{d,s}^{c}\right)$$

wherein $\widehat{Dic}_{s,c}^{\,k}$ is the normalized dictionary feature; $Dic_{s,c}^{k}$ is the $k$-th clustering center, constructed offline, of the $c$-th part feature at scale $s$; $F_{d,s}^{c}$ is the $c$-th part feature of the degraded face image to be restored at scale $s$; $\sigma$ is the standard-deviation operation and $\mu$ is the mean operation; $c \in \{\text{left eye}, \text{right eye}, \text{nose}, \text{mouth}\}$ and $s = 1$;
based on the above operation, an adaptive normalization operation is performed on all dictionary features of each part c.
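A minimal sketch of this adaptive normalization, assuming an AdaIN-style reading of the operation (each dictionary atom is re-styled to the mean and standard deviation of the degraded input's part feature):

```python
# Re-style a dictionary atom to the statistics of the input part feature:
# sigma(F) * (Dic - mu(Dic)) / sigma(Dic) + mu(F). Epsilon is an assumption
# added for numerical safety.
import numpy as np

def adaptive_normalize(dic_atom, input_part, eps=1e-8):
    mu_d, sd_d = dic_atom.mean(), dic_atom.std()
    mu_f, sd_f = input_part.mean(), input_part.std()
    return sd_f * (dic_atom - mu_d) / (sd_d + eps) + mu_f

rng = np.random.default_rng(0)
atom = rng.normal(5.0, 3.0, size=(4, 4))       # illustrative dictionary atom
part = rng.normal(0.0, 1.0, size=(4, 4))       # illustrative degraded feature
styled = adaptive_normalize(atom, part)
```

After this transform the atom shares the input's first- and second-order statistics, which is what lets atoms built from high-definition images be compared against degraded features.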
The process of traversing the dictionary module to obtain the matching dictionary features comprises the following steps:
calculating, for each of the left-eye, right-eye, nose and mouth part features, the closest dictionary feature:

$$k^{*} \;=\; \arg\max_{k}\;\left\langle F_{d,s}^{c},\, \widehat{Dic}_{s,c}^{\,k} \right\rangle$$

wherein the inner product $\langle\cdot,\cdot\rangle$ gives the confidence of the matching dictionary feature; the inner product operation can be implemented by a convolution operation with the bias fixed to 0.
And outputting the similarity between the part of the current input image and each clustering center of the dictionary, and selecting the feature with the highest score, namely the most similar feature as the matching feature.
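The dictionary traversal can be sketched as a plain inner-product search; the zero-bias convolution mentioned above reduces to exactly this dot product when the atom and the part feature have the same spatial size. Shapes are illustrative.

```python
# Score every normalized atom against the input part feature by inner product
# and return the index of the highest-scoring (most similar) atom.
import numpy as np

def match_atom(input_part, normalized_atoms):
    f = np.asarray(input_part, dtype=float).ravel()
    scores = np.array([np.dot(f, np.asarray(a, dtype=float).ravel())
                       for a in normalized_atoms])
    return int(scores.argmax()), scores

# Toy data: atom 1 is identical to the input, atom 2 is its negation.
part = np.ones(4)
atoms = [np.zeros(4), np.ones(4), -np.ones(4)]
best, scores = match_atom(part, atoms)
```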
Still further, the process of obtaining the adaptive fusion characteristics of each part by the confidence prediction module includes:
$$\hat{F}_{d,s}^{c} \;=\; F_{d,s}^{c} \;+\; \widehat{Dic}_{s,c}^{\,k^{*}} \cdot \mathcal{C}\!\left(F_{d,s}^{c} - \widehat{Dic}_{s,c}^{\,k^{*}};\, \Theta_{C}\right)$$

wherein $\hat{F}_{d,s}^{c}$ is the adaptive fusion feature; $\widehat{Dic}_{s,c}^{\,k^{*}}$ is the closest matched dictionary feature; $\mathcal{C}(\cdot;\Theta_{C})$ is the confidence prediction network and $\Theta_{C}$ its learnable parameters; the confidence prediction network comprises two layers of 3 × 3 convolutions with a stride of 1;
the primary reconstruction module obtains a scale parameter $\alpha$ and a shift parameter $\beta$ from the second-level reconstruction result features and the first-level face features to be restored, each through two layers of 3 × 3 convolutions with a stride of 1, and computes the first-level reconstruction result image through a spatial feature transform:

$$SFT_{s} \;=\; \alpha \odot F_{s} \;+\; \beta$$

wherein $F_{s}$ denotes the decoder features at scale $s$ and $\odot$ denotes element-wise multiplication.
In the embodiment, for the matched dictionary features, a confidence coefficient is predicted according to a residual error between the dictionary features and the input part features, and the confidence coefficient is applied to the dictionary features and is added back to the input part features.
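A hedged sketch of the two operations just described: residual-driven confidence fusion, and SFT-style affine modulation. The learned confidence network is replaced by a caller-supplied toy function of the residual, and alpha/beta are passed in directly rather than predicted by convolutions; both substitutions are illustrative simplifications.

```python
# Residual-based confidence fusion plus spatial feature transform, with toy
# stand-ins for the learned sub-networks.
import numpy as np

def fuse_with_confidence(input_part, matched_atom, conf_fn):
    """F_out = F + conf(F - Dic*) * Dic*, with conf_fn replacing the net."""
    conf = conf_fn(input_part - matched_atom)
    return input_part + conf * matched_atom

def sft(decoder_feat, alpha, beta):
    """Spatial feature transform: element-wise scale and shift."""
    return alpha * decoder_feat + beta

f = np.arange(6.0).reshape(2, 3)
atom = np.ones((2, 3))
fused_zero = fuse_with_confidence(f, atom, lambda r: 0.0)   # confidence 0
fused_half = fuse_with_confidence(f, atom, lambda r: 0.5)   # confidence 0.5
modulated = sft(f, 2.0, 1.0)
```

With zero confidence the input passes through untouched, which matches the intent that a mildly degraded input should lean less on the dictionary.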
Still further, the restoration model further comprises a training-time constraint: the training network constrains the learning of the whole network through a reconstruction loss, which measures an $\ell_{2}$ loss between the first-level reconstruction result image and its corresponding non-degraded high-definition image in both the pixel space and the feature space:

$$\mathcal{L}_{rec} \;=\; \lambda_{l2}\left\|\hat{I} - I^{h}\right\|_{2}^{2} \;+\; \sum_{m}\frac{\lambda_{pm}}{C_{m}H_{m}W_{m}}\left\|\Psi_{m}\!\left(\hat{I}\right) - \Psi_{m}\!\left(I^{h}\right)\right\|_{2}^{2}$$

wherein $\lambda_{l2}$ is the pixel-space loss weight; $\lambda_{pm}$ is the feature-space loss weight; $\hat{I}$ is the first-level reconstruction result image; $I^{h}$ is the corresponding non-degraded high-definition image; $C_{m}$, $H_{m}$ and $W_{m}$ are, in order, the number of channels, the height and the width of the $m$-th layer features of the first-level reconstruction result image; and $\Psi_{m}$ extracts the $m$-th layer convolution features from a pre-trained face recognition network;
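The reconstruction loss can be sketched as below; the pre-trained face recognition network is replaced by caller-supplied feature extractors, and the loss weights are illustrative defaults rather than the patent's values.

```python
# Pixel-space L2 term plus a per-layer feature-space term, each feature term
# normalized by the feature size (standing in for C*H*W).
import numpy as np

def reconstruction_loss(pred, target, feat_layers, lam_l2=1.0, lam_pm=1.0):
    loss = lam_l2 * ((pred - target) ** 2).sum()
    for psi in feat_layers:                      # stand-ins for Psi_m
        fp, ft = psi(pred), psi(target)
        loss += lam_pm * ((fp - ft) ** 2).sum() / fp.size
    return float(loss)

target = np.ones((4, 4))
layers = [lambda x: x[::2, ::2]]                 # toy "feature extractor"
same = reconstruction_loss(target, target, layers)
diff = reconstruction_loss(target + 1.0, target, layers)
```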
the training network further comprises a multi-scale discriminant loss function:
the guided restoration result image is down-sampled by factors $r \in \{1, 2, 4, 8\}$ to obtain 4 groups of images with different resolutions, and the loss is computed in a hinge-loss manner through four discriminator networks. For the learning of the discriminator networks, the loss is defined as:

$$\mathcal{L}_{adv,D} \;=\; -\sum_{r=1}^{R}\left(\mathbb{E}_{I^{h}\downarrow r \,\sim\, P\left(I^{h}\downarrow r\right)}\!\left[\min\!\left(0,\,-1 + D_{r}\!\left(I^{h}{\downarrow}r\right)\right)\right] + \mathbb{E}\!\left[\min\!\left(0,\,-1 - D_{r}\!\left(\hat{I}{\downarrow}r\right)\right)\right]\right)$$

wherein $R$ is the upper limit of the scale; $D_{r}$ is the discriminator at scale $r$; $I^{h}{\downarrow}r$ is the non-degraded high-definition image down-sampled by a factor of $r$; ${\downarrow}r$ denotes down-sampling by a factor of $r$; $\mathbb{E}$ denotes expectation; and $P\!\left(I^{h}{\downarrow}r\right)$ is the distribution of $I^{h}{\downarrow}r$;
for the learning of the generative network under the discriminator-network constraints, the loss $\mathcal{L}_{adv,G}$ is defined as:

$$\mathcal{L}_{adv,G} \;=\; -\sum_{r=1}^{R}\lambda_{a,r}\;\mathbb{E}_{I^{d}\sim P\left(I^{d}\right)}\!\left[D_{r}\!\left(\Phi\!\left(I^{d}, L_{d}, Dic;\, \Theta\right){\downarrow}r\right)\right]$$

wherein $\lambda_{a,r}$ is the weight of the discriminator network at scale $r$; $L_{d}$ denotes the face key points; $Dic$ is the constructed face dictionary; $\Theta$ is the learnable model parameters; $I^{d}$ is the degraded face image to be restored; $P(I^{d})$ is the distribution of $I^{d}$; and $\Phi$ is the restoration module;
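A sketch of the multi-scale adversarial terms, read here as hinge losses (an assumption about the loss form); discriminator outputs are represented as plain arrays of per-sample scores, one array per scale, instead of real networks.

```python
# Multi-scale hinge losses over toy discriminator scores.
import numpy as np

def d_hinge_loss(real_scores, fake_scores):
    """Discriminator hinge loss, summed over scales."""
    loss = 0.0
    for dr, df in zip(real_scores, fake_scores):
        loss += np.maximum(0.0, 1.0 - dr).mean() + np.maximum(0.0, 1.0 + df).mean()
    return float(loss)

def g_hinge_loss(fake_scores, weights):
    """Generator loss: -sum_r lambda_r * mean(D_r(fake))."""
    return float(-sum(w * df.mean() for w, df in zip(weights, fake_scores)))

# One scale, scores already past the hinge margins.
real = [np.array([2.0, 3.0])]
fake = [np.array([-2.0])]
d_loss = d_hinge_loss(real, fake)
g_loss = g_hinge_loss(fake, [1.0])
```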
Still further, the training network performs end-to-end training, using the Adam optimization algorithm, on all network structures other than the first-level, second-level, third-level and fourth-level face feature extraction modules.
Further, the degraded face image to be restored is obtained by sequentially blurring, down-sampling, adding noise to, and JPEG-compressing a high-definition face image. The blurring process uses Gaussian blur and motion blur, with the Gaussian blur kernel standard deviation taken from $\{1{:}0.1{:}P\}$; down-sampling uses bicubic down-sampling with sampling scale $s \in \{1{:}0.1{:}S\}$; noise is added as Gaussian white noise with noise level $n \in \{0, 1{:}0.1{:}N\}$; the JPEG compression quality parameter is $q \in \{0, 10{:}0.1{:}Q\}$; where $P \geq 5$, $S \geq 8$, $N \geq 15$ and $Q \geq 80$;
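The degradation pipeline can be sketched as below. To keep the snippet dependency-free, JPEG compression and motion blur are omitted, naive strided down-sampling stands in for bicubic down-sampling, and all parameter values are illustrative, not the patent's sampling grids.

```python
# Toy synthesis of a degraded training image: Gaussian blur -> down-sample ->
# additive Gaussian white noise, clipped to the 8-bit range.
import numpy as np

def gaussian_kernel(size=5, sigma=1.5):
    ax = np.arange(size) - size // 2
    k = np.exp(-(ax[:, None] ** 2 + ax[None, :] ** 2) / (2 * sigma ** 2))
    return k / k.sum()

def degrade(img, sigma=1.5, scale=2, noise_level=5.0, seed=0):
    k = gaussian_kernel(5, sigma)
    p = np.pad(img, 2, mode="edge")
    blurred = np.zeros_like(img)
    h, w = img.shape
    for i in range(h):
        for j in range(w):
            blurred[i, j] = (p[i:i + 5, j:j + 5] * k).sum()
    small = blurred[::scale, ::scale]            # naive down-sampling
    rng = np.random.default_rng(seed)
    noisy = small + rng.normal(0.0, noise_level, small.shape)
    return np.clip(noisy, 0.0, 255.0)

hd = np.full((16, 16), 128.0)                    # illustrative flat image
lq = degrade(hd)
```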
the whole network is trained through the constructed low-quality degraded human face image to be restored and the corresponding high-definition human face image, and the obtained trained network is used for restoring the low-quality image.
On the basis of existing convolutional-neural-network-based face image restoration systems, the restoration system of the invention provides the construction of a multi-scale face part feature dictionary and the transfer of the feature dictionary to the degraded image. First, each face part is extracted from a large number of high-definition face images, and the features of each part at different scales are obtained by k-means clustering. For each part of a degraded image there are thus K high-definition part dictionaries that can be used for guided enhancement. For each scale, a part-adaptive normalization operation is first applied to normalize the dictionary features, which handles the inconsistent distributions of the degraded image and the dictionary features. The whole normalized dictionary is then traversed to obtain the part feature most similar to the input feature as guidance. To handle the varying degree to which different degraded inputs need the dictionary, a confidence is predicted from the residual between the matched feature and the input feature, so that the dictionary feature is applied in a targeted manner. Finally, the multi-scale dictionary fusion features help the network learn detail features from coarse to fine.
The method is suitable for any face repairing scene.
The face feature extraction module adopted by the invention comprises but is not limited to the use of a Vggface model.
The invention assists image enhancement in the convolutional neural network by constructing a high-quality dictionary, including obtaining a plurality of high-definition part features by k-means clustering.
The present invention is implemented by convolution operation and multilayer convolution operation in a neural network structure, and is not limited to a neural network.
Although the invention herein has been described with reference to particular embodiments, it is to be understood that these embodiments are merely illustrative of the principles and applications of the present invention. It is therefore to be understood that numerous modifications may be made to the illustrative embodiments and that other arrangements may be devised without departing from the spirit and scope of the present invention as defined by the appended claims. It should be understood that features described in different dependent claims and herein may be combined in ways different from those described in the original claims. It is also to be understood that features described in connection with individual embodiments may be used in other described embodiments.
Claims (10)
1. A face image restoration system based on a multi-scale face part feature dictionary is characterized by comprising:
face feature dictionary offline generation module (100): used for respectively extracting high-definition face part features from each sample image in the high-definition face image data set, and obtaining a face part feature dictionary by applying k-means clustering to the extraction results;
face image restoration module (200): used for extracting features of the degraded face image to be restored, and fusing the feature extraction result with the face part feature dictionary to obtain part-enhanced face features to be restored; and reconstructing the face features to be restored to obtain a guided restoration result image.
2. The system for restoring a human face image based on a multi-scale human face feature dictionary according to claim 1, wherein the human face feature dictionary obtained by the human face feature dictionary off-line generation module (100) comprises M scales, and M is an integer greater than or equal to 1.
3. The system for restoring a human face image based on a multi-scale human face feature dictionary according to claim 2, wherein when M is 4, the process of processing the sample image by using the VggFace model comprises:
sequentially performing convolution, activation, pooling, convolution, activation, convolution and activation operations on each sample image to obtain high-definition face part features with the scale of 1;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 1 to obtain high-definition face part features with the scale of 2;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 2 to obtain high-definition face part features with the scale of 3;
sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the high-definition face features with the scale of 3 to obtain high-definition face part features with the scale of 4;
the activation operation adopts a ReLU activation function, and the pooling operation adopts maximum pooling operation;
the first and second convolution operations each use 64 convolution kernels of 3 × 3 with a stride of 1;
the third and fourth convolution operations each use 128 convolution kernels of 3 × 3 with a stride of 1;
the fifth to ninth convolution operations each use 256 convolution kernels of 3 × 3 with a stride of 1;
the tenth to sixteenth convolution operations each use 512 convolution kernels of 3 × 3 with a stride of 1;
the high-definition face part feature with the scale of 1, the high-definition face part feature with the scale of 2, the high-definition face part feature with the scale of 3 and the high-definition face part feature with the scale of 4 are respectively processed through a dictionary generation module to obtain a face part feature dictionary with the corresponding scale;
the process of processing the input data by the dictionary generation module comprises the following steps:
acquiring high-definition face part features with the scale of 1, high-definition face part features with the scale of 2, high-definition face part features with the scale of 3 or high-definition face part features with the scale of 4;
then, carrying out region alignment operation: acquiring positions of a left eye, a right eye, a nose and a mouth of the high-definition face position characteristics by adopting a face key point detection algorithm; cutting the left eye, the right eye, the nose and the mouth from the corresponding high-definition human face part features in a RoIAlign mode according to the obtained positions of all parts to obtain the part features of the left eye, the right eye, the nose and the mouth as extraction results;
respectively obtaining K1 clustering centers of the left eye, K2 clustering centers of the right eye, K3 clustering centers of the nose and K4 clustering centers of the mouth of all the part characteristics of all the parts in the extraction result in a K-means clustering mode; wherein K1 cluster centers correspond to the left-eye dictionary, K2 cluster centers correspond to the right-eye dictionary, K3 cluster centers correspond to the nose dictionary, and K4 cluster centers correspond to the mouth dictionary; k1, K2, K3 and K4 are all greater than or equal to 1;
the face part feature dictionary with the scale of 1 is obtained corresponding to the high-definition face part feature with the scale of 1, the face part feature dictionary with the scale of 2 is obtained corresponding to the high-definition face part feature with the scale of 2, the face part feature dictionary with the scale of 3 is obtained corresponding to the high-definition face part feature with the scale of 3, and the face part feature dictionary with the scale of 4 is obtained corresponding to the high-definition face part feature with the scale of 4.
4. The face image restoration system based on the multi-scale face part feature dictionary according to claim 3, wherein the face image restoration module (200) comprises:
a primary face feature extraction module: sequentially performing convolution, activation, pooling, convolution, activation and convolution operations on the degraded face image to be restored to obtain degraded face features at scale 1;
a secondary face feature extraction module: sequentially performing pooling, activation, convolution, activation and convolution operations on the degraded face features at scale 1 to obtain degraded face features at scale 2;
a three-level face feature extraction module: sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the degraded face features at scale 2 to obtain degraded face features at scale 3;
a four-level face feature extraction module: sequentially performing activation, pooling, convolution, activation, convolution, activation and convolution operations on the degraded face features at scale 3 to obtain degraded face features at scale 4;
the activation operation adopts a ReLU activation function, and the pooling operation adopts maximum pooling operation;
the first and second convolutions each use 64 kernels of size 3 × 3 with stride 1;
the third and fourth convolutions each use 128 kernels of size 3 × 3 with stride 1;
the fifth through ninth convolutions each use 256 kernels of size 3 × 3 with stride 1;
the tenth through sixteenth convolutions each use 512 kernels of size 3 × 3 with stride 1;
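The four-level encoder above is VGG-style: stride-1 3 × 3 convolutions with 64/128/256/512 kernels per level plus one pooling step per level. A quick shape trace makes the multi-scale feature pyramid concrete; the 2 × 2/stride-2 max-pool and the 256 × 256 input size are assumptions, since the claim does not state them.

```python
def encoder_shapes(h, w, channels=(64, 128, 256, 512)):
    """Trace (level, channels, height, width) through the four-level
    encoder: one assumed 2x2 max-pool per level halves the resolution,
    while stride-1 3x3 convolutions set the channel count."""
    shapes = []
    for level, c in enumerate(channels, start=1):
        h, w = h // 2, w // 2          # one pooling step per level
        shapes.append((level, c, h, w))
    return shapes

pyramid = encoder_shapes(256, 256)
# scale 1 -> 64 channels at 128x128, ..., scale 4 -> 512 channels at 16x16
```

This is why the dictionaries must be built at four scales: each guidance module fuses with features at a different resolution and channel width.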
a scale-1 dictionary feature guidance enhancement module: configured to fuse the degraded face features at scale 1 with the face part feature dictionary at scale 1 to obtain part-enhanced first-level face features to be restored;
a scale-2 dictionary feature guidance enhancement module: configured to fuse the degraded face features at scale 2 with the face part feature dictionary at scale 2 to obtain part-enhanced second-level face features to be restored;
a scale-3 dictionary feature guidance enhancement module: configured to fuse the degraded face features at scale 3 with the face part feature dictionary at scale 3 to obtain part-enhanced third-level face features to be restored;
a scale-4 dictionary feature guidance enhancement module: configured to fuse the degraded face features at scale 4 with the face part feature dictionary at scale 4 to obtain part-enhanced fourth-level face features to be restored;
a fourth-level reconstruction module: configured to perform affine transformation on the degraded face features at scale 4 and the fourth-level face features to be restored, and to feed the transformation result into the network decoding features to obtain fourth-level reconstruction result features;
a third-level reconstruction module: configured to perform affine transformation on the fourth-level reconstruction result features and the third-level face features to be restored, and to feed the transformation result into the network decoding features to obtain third-level reconstruction result features;
a secondary reconstruction module: configured to perform affine transformation on the third-level reconstruction result features and the second-level face features to be restored, and to feed the transformation result into the network decoding features to obtain second-level reconstruction result features;
a primary reconstruction module: configured to perform affine transformation on the second-level reconstruction result features and the first-level face features to be restored, and to feed the transformation result into the network decoding features to obtain the first-level reconstruction result image;
an output module: configured to output the first-level reconstruction result image as the guided restoration result image.
5. The face image restoration system based on the multi-scale face part feature dictionary according to claim 4, wherein the dictionary feature guidance enhancement modules at scales 1, 2, 3 and 4 process their input data identically; the scale-1 dictionary feature guidance enhancement module is described as an example:
the scale-1 dictionary feature guidance enhancement module comprises:
a face part feature extraction module: configured to obtain the part features of the left eye, right eye, nose and mouth from the degraded face features at scale 1 according to the face key points;
a dictionary feature adaptive normalization module: configured to perform an adaptive normalization operation on the left-eye, right-eye, nose and mouth entries of the face part feature dictionary at scale 1, using the left-eye, right-eye, nose and mouth dictionaries, to obtain normalized dictionary features;
a dictionary traversal module: configured to traverse the normalized dictionary features and select, for each of the left eye, right eye, nose and mouth, the feature closest to the corresponding part feature as the matched dictionary feature;
a confidence prediction module: configured to predict a confidence from the residual between each part feature and its matched dictionary feature, and thereby obtain an adaptive fusion feature for each part;
a restoration module: configured to place the adaptive fusion features of each part back into the degraded face features according to the face key points, to obtain the first-level face features to be restored.
6. The face image restoration system based on the multi-scale face part feature dictionary according to claim 5, wherein the dictionary feature adaptive normalization module obtains the normalized dictionary features as follows:
performing the adaptive normalization operation on the part features of the left eye, right eye, nose and mouth:

$$\hat{D}_{s,c}^{k} = \sigma\left(F_{s,c}\right)\,\frac{D_{s,c}^{k}-\mu\left(D_{s,c}^{k}\right)}{\sigma\left(D_{s,c}^{k}\right)} + \mu\left(F_{s,c}\right)$$

wherein $\hat{D}_{s,c}^{k}$ is the normalized dictionary feature; $D_{s,c}^{k}$ is the $k$-th cluster centre of the $c$-th part feature at scale $s$ in the offline-constructed dictionary; $F_{s,c}$ is the $c$-th part feature of the degraded face image to be restored at scale $s$; $\sigma(\cdot)$ is the standard-deviation operation and $\mu(\cdot)$ the mean operation; $c \in \{\text{left eye}, \text{right eye}, \text{nose}, \text{mouth}\}$ and here $s = 1$;
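One reading of this adaptive normalization is AdaIN-style: each dictionary atom is re-normalised to carry the mean and standard deviation of the degraded part feature, so the subsequent matching compares structure rather than illumination or contrast. A numpy sketch under that assumption (the `eps` stabiliser is an added assumption, not in the claim):

```python
import numpy as np

def adaptive_norm(dic_atom, degraded_feat, eps=1e-5):
    """Re-normalise one dictionary atom D^k to the statistics of the
    degraded part feature F: sigma(F) * (D - mu(D)) / sigma(D) + mu(F)."""
    normalised = (dic_atom - dic_atom.mean()) / (dic_atom.std() + eps)
    return degraded_feat.std() * normalised + degraded_feat.mean()

rng = np.random.default_rng(0)
atom = rng.normal(5.0, 3.0, size=(8, 4, 4))   # one cluster centre D^k_{s,c}
feat = rng.normal(0.0, 1.0, size=(8, 4, 4))   # degraded part feature F_{s,c}
norm_atom = adaptive_norm(atom, feat)          # now shares feat's statistics
```

After this step every atom in the dictionary has (approximately) the same first- and second-order statistics as the degraded feature it will be compared against.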
the dictionary traversal module obtains the matched dictionary features as follows:
calculating, for each of the left eye, right eye, nose and mouth, the normalized dictionary feature closest to the part feature:

$$k^{*} = \arg\min_{k}\left\|F_{s,c}-\hat{D}_{s,c}^{k}\right\|_{2}, \qquad \hat{D}_{s,c}^{*} = \hat{D}_{s,c}^{k^{*}}$$
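The traversal step amounts to a nearest-neighbour search over the K normalised atoms. A minimal sketch, assuming L2 distance is the "closeness" measure the claim intends:

```python
import numpy as np

def match_dictionary(feat, norm_dict):
    """Return the index and atom of the normalised dictionary entry with
    the smallest L2 distance to the degraded part feature."""
    diff = (norm_dict - feat[None]).reshape(len(norm_dict), -1)
    dists = (diff ** 2).sum(axis=1)
    k = int(dists.argmin())
    return k, norm_dict[k]

rng = np.random.default_rng(0)
dictionary = rng.normal(size=(16, 8, 4, 4))               # K = 16 atoms
feat = dictionary[7] + 0.01 * rng.normal(size=(8, 4, 4))  # near atom 7
k_star, matched_atom = match_dictionary(feat, dictionary)
```

With only K atoms per part and per scale, a brute-force scan like this is cheap; no approximate search structure is needed.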
7. The face image restoration system based on the multi-scale face part feature dictionary according to claim 6, wherein
the confidence prediction module obtains the adaptive fusion feature of each part as follows:

$$\tilde{F}_{s,c} = F_{s,c} + \hat{D}_{s,c}^{*}\odot C\!\left(F_{s,c}-\hat{D}_{s,c}^{*};\,\Theta_{C}\right)$$

wherein $\tilde{F}_{s,c}$ is the adaptive fusion feature; $\hat{D}_{s,c}^{*}$ is the closest matched dictionary feature; $C(\cdot;\Theta_{C})$ is the confidence prediction network with learnable parameters $\Theta_{C}$; the confidence prediction network comprises two 3 × 3 convolution layers with stride 1;
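The idea of the confidence-weighted fusion can be sketched without the learned network: where the degraded feature already agrees with the matched atom the residual is small, so little dictionary detail is injected. This is one plausible reading; the fixed sigmoid of the residual below is a stand-in for the claim's two-layer 3 × 3 confidence network, and the residual-interpolation form is an assumption, not the patent's exact formula.

```python
import numpy as np

def fuse_with_confidence(feat, matched_atom):
    """Inject matched-dictionary detail into the degraded feature,
    weighted per element by a confidence derived from the residual.
    The sigmoid here is a fixed stand-in for the learned network C."""
    residual = matched_atom - feat
    confidence = 1.0 / (1.0 + np.exp(-np.abs(residual)))  # in (0.5, 1)
    return feat + residual * confidence

rng = np.random.default_rng(0)
feat = rng.normal(size=(8, 4, 4))          # degraded part feature
atom = rng.normal(size=(8, 4, 4))          # matched, normalised atom
fused = fuse_with_confidence(feat, atom)   # lies between feat and atom
```

Because the weight is element-wise, trustworthy regions of the degraded feature are preserved while badly degraded regions are pulled toward the high-quality dictionary atom.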
the primary reconstruction module obtains a scale parameter $\alpha$ and a shift parameter $\beta$ from the secondary reconstruction result image and the first-level face features to be restored through two 3 × 3 convolution layers with stride 1, and computes the primary reconstruction result image by a spatial feature transform (SFT):

$$\mathrm{SFT}(F) = \alpha \odot F + \beta$$
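The SFT step itself is just an element-wise scale-and-shift of the decoder feature; in the claim, $\alpha$ and $\beta$ come from the two learned convolutions, whereas this sketch passes them in directly as given arrays.

```python
import numpy as np

def sft(feat, alpha, beta):
    """Spatial feature transform: element-wise modulation alpha * F + beta.
    In the patent, alpha/beta are produced by two 3x3 stride-1 convs."""
    return alpha * feat + beta

feat = np.ones((8, 4, 4))                # a decoder feature map F
alpha = np.full((8, 4, 4), 2.0)          # illustrative scale map
beta = np.full((8, 4, 4), -1.0)          # illustrative shift map
out = sft(feat, alpha, beta)             # 2 * 1 - 1 = 1 everywhere
```

The same transform is applied at every reconstruction level, only with different learned $\alpha$, $\beta$ and feature resolutions.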
8. The face image restoration system based on the multi-scale face part feature dictionary according to claim 7, wherein the restoration model further comprises a training-network constraint: the training network constrains the learning of the whole network through a reconstruction loss, which comprises pixel-space and feature-space losses between the primary reconstruction result image and its corresponding non-degraded high-definition image:

$$\ell_{rec} = \lambda_{l2}\left\|\hat{I}-I^{h}\right\|_{2}^{2} + \lambda_{pm}\sum_{m}\frac{1}{C_{m}H_{m}W_{m}}\left\|\Psi_{m}\!\left(\hat{I}\right)-\Psi_{m}\!\left(I^{h}\right)\right\|_{2}^{2}$$

wherein $\lambda_{l2}$ is the pixel-space loss weight; $\lambda_{pm}$ is the feature-space loss weight; $\hat{I}$ is the primary reconstruction result image; $I^{h}$ is the corresponding non-degraded high-definition image; $C_{m}$, $H_{m}$ and $W_{m}$ are respectively the number of feature channels, height and width at the $m$-th layer; and $\Psi_{m}$ yields the $m$-th layer convolution features of a pre-trained face recognition network;
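A direct numpy transcription of this two-term loss, with toy arrays standing in for the images and for the feature maps that a pre-trained recognition network $\Psi$ would produce:

```python
import numpy as np

def reconstruction_loss(pred, target, feats_pred, feats_target,
                        lam_l2=1.0, lam_pm=1.0):
    """Pixel-space l2 term plus feature-space (perceptual) terms, each
    feature term normalised by its layer's C_m * H_m * W_m."""
    loss = lam_l2 * float(((pred - target) ** 2).sum())
    for fp, ft in zip(feats_pred, feats_target):
        c, h, w = fp.shape
        loss += lam_pm * float(((fp - ft) ** 2).sum()) / (c * h * w)
    return loss

# toy example: 3x8x8 "images" and one 4x2x2 "feature layer"
pred = np.zeros((3, 8, 8)); target = np.ones((3, 8, 8))
feats_pred = [np.zeros((4, 2, 2))]; feats_target = [np.ones((4, 2, 2))]
loss = reconstruction_loss(pred, target, feats_pred, feats_target)
# pixel term 3*64 = 192, feature term 16/16 = 1 -> loss = 193
```

The per-layer normalisation keeps the perceptual terms comparable across layers of very different sizes, so $\lambda_{pm}$ controls the balance globally.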
the training network further comprises a multi-scale discriminant loss function:
the guided restoration result image is down-sampled by factors $r \in \{1, 2, 4, 8\}$ to obtain 4 groups of images at different resolutions, and the loss is computed by four discrimination networks in a hinge-loss manner; for learning of the discrimination networks, the loss $\ell_{adv,D}$ is defined as:

$$\ell_{adv,D} = -\sum_{r=1}^{R}\left(\mathbb{E}_{I^{h}_{\downarrow r}\sim\mathbb{P}\left(I^{h}_{\downarrow r}\right)}\left[\min\left(0,\,-1+D_{r}\!\left(I^{h}_{\downarrow r}\right)\right)\right] + \mathbb{E}_{\hat{I}_{\downarrow r}\sim\mathbb{P}\left(\hat{I}_{\downarrow r}\right)}\left[\min\left(0,\,-1-D_{r}\!\left(\hat{I}_{\downarrow r}\right)\right)\right]\right)$$

wherein $R$ is the upper limit of the scale; $D_{r}$ is the discriminator at scale $r$; $I^{h}_{\downarrow r}$ is the non-degraded high-definition image down-sampled by a factor of $r$, and $\downarrow r$ denotes down-sampling by $r$; $\mathbb{E}$ is the expectation and $\mathbb{P}(\cdot)$ the corresponding distribution;
for learning of the generating network under the discrimination-network constraint, the loss $\ell_{adv,G}$ is defined as:

$$\ell_{adv,G} = -\sum_{r=1}^{R}\mathbb{E}_{\hat{I}_{\downarrow r}}\left[D_{r}\!\left(\hat{I}_{\downarrow r}\right)\right]$$
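The two multi-scale adversarial losses above can be sketched given the discriminator outputs at each scale. This assumes the "changeloss" of the translation is the standard hinge loss; the scalar per-scale scores below stand in for real discriminator outputs.

```python
import numpy as np

def d_hinge_loss(d_real_outputs, d_fake_outputs):
    """Multi-scale discriminator hinge loss, summed over scales r:
    max(0, 1 - D_r(real)) + max(0, 1 + D_r(fake))."""
    return sum(np.maximum(0.0, 1.0 - dr).mean()
               + np.maximum(0.0, 1.0 + df).mean()
               for dr, df in zip(d_real_outputs, d_fake_outputs))

def g_hinge_loss(d_fake_outputs):
    """Generator loss under the discriminator constraint: -sum E[D_r(fake)]."""
    return -sum(df.mean() for df in d_fake_outputs)

# one illustrative score per scale r in {1, 2, 4, 8}
d_real = [np.array([2.0])] * 4    # confidently real -> no hinge penalty
d_fake = [np.array([-2.0])] * 4   # confidently fake -> no hinge penalty
loss_d = d_hinge_loss(d_real, d_fake)   # 0.0 for these saturated scores
loss_g = g_hinge_loss(d_fake)           # -4 * (-2.0) = 8.0
```

The hinge form saturates once the discriminator is confident by a margin of 1, which is why `loss_d` vanishes on the toy scores above while the generator loss stays large.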
9. The face image restoration system based on the multi-scale face part feature dictionary according to claim 8, wherein the training network adopts the Adam optimization algorithm to train, end to end, all network structures other than the first-, second-, third- and fourth-level face feature extraction modules.
10. The face image restoration system based on the multi-scale face part feature dictionary according to claim 9, wherein the degraded face image to be restored is obtained by sequentially performing blurring, down-sampling, noise addition and JPEG compression on a high-definition face image; the blur processing applies Gaussian blur and motion blur, with the standard deviation of the Gaussian blur drawn from the range parameterised by P; the down-sampling uses bicubic down-sampling with sampling scale S ∈ {1:0.1:S}; the noise addition uses white Gaussian noise with noise level N ∈ {0, 1:0.1:N}; the JPEG compression uses quality parameter Q ∈ {0, 10:0.1:Q}; wherein P ≥ 5, S ≥ 8, N ≥ 15 and Q ≥ 80;
the whole network is trained on the constructed low-quality degraded face images to be restored and their corresponding high-definition face images; the trained network is then used to restore low-quality images.
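The degradation pipeline of claim 10 (blur, down-sample, add noise, JPEG-compress) can be sketched with numpy alone. The JPEG stage is omitted here because it needs an image codec; the 3-sigma kernel radius and the nearest-neighbour resize standing in for bicubic are assumptions of this sketch, not the patent's choices.

```python
import numpy as np

def degrade(img, blur_sigma=2.0, scale=4, noise_level=10.0, seed=0):
    """Blur -> down-sample -> add white Gaussian noise, on one 2-D channel.
    (The patent's final JPEG compression step is omitted here.)"""
    rng = np.random.default_rng(seed)
    # separable Gaussian blur; kernel radius of 3*sigma is an assumption
    r = int(3 * blur_sigma)
    x = np.arange(-r, r + 1)
    k = np.exp(-x ** 2 / (2 * blur_sigma ** 2))
    k /= k.sum()
    blurred = np.apply_along_axis(lambda v: np.convolve(v, k, 'same'), 0, img)
    blurred = np.apply_along_axis(lambda v: np.convolve(v, k, 'same'), 1, blurred)
    # nearest-neighbour stand-in for the patent's bicubic down-sampling
    low = blurred[::scale, ::scale]
    # additive white Gaussian noise, clipped back to valid intensities
    noisy = low + rng.normal(0.0, noise_level, low.shape)
    return np.clip(noisy, 0.0, 255.0)

high = np.full((64, 64), 128.0)    # toy high-definition channel
degraded = degrade(high, blur_sigma=2.0, scale=4, noise_level=10.0)
```

Sampling the blur, scale, noise and quality parameters from the ranges in claim 10 yields a diverse pool of (degraded, high-definition) training pairs for the end-to-end training of claim 9.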
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010779169.7A CN111768354A (en) | 2020-08-05 | 2020-08-05 | Face image restoration system based on multi-scale face part feature dictionary |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111768354A true CN111768354A (en) | 2020-10-13 |
Family
ID=72729707
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010779169.7A Pending CN111768354A (en) | 2020-08-05 | 2020-08-05 | Face image restoration system based on multi-scale face part feature dictionary |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111768354A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103996024A (en) * | 2014-05-13 | 2014-08-20 | 南京信息工程大学 | Bayesian estimation sparse representation face recognition method based on dictionary reconstruction |
US20180268203A1 (en) * | 2017-03-17 | 2018-09-20 | Nec Laboratories America, Inc. | Face recognition system for face recognition in unlabeled videos with domain adversarial learning and knowledge distillation |
CN110288697A (en) * | 2019-06-24 | 2019-09-27 | 天津大学 | 3D face representation and method for reconstructing based on multiple dimensioned figure convolutional neural networks |
CN111260577A (en) * | 2020-01-15 | 2020-06-09 | 哈尔滨工业大学 | Face image restoration system based on multi-guide image and self-adaptive feature fusion |
Non-Patent Citations (1)
Title |
---|
XIAOMING LI et al.: "Blind Face Restoration via Deep Multi-scale Component Dictionaries", HTTPS://ARXIV.ORG/PDF/2008.00418.PDF, 2 August 2020 (2020-08-02), pages 6-10 * |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113128624A (en) * | 2021-05-11 | 2021-07-16 | 山东财经大学 | Graph network face recovery method based on multi-scale dictionary |
CN113554569A (en) * | 2021-08-04 | 2021-10-26 | 哈尔滨工业大学 | Face image restoration system based on double memory dictionaries |
CN113554569B (en) * | 2021-08-04 | 2022-03-08 | 哈尔滨工业大学 | Face image restoration system based on double memory dictionaries |
CN113688752A (en) * | 2021-08-30 | 2021-11-23 | 厦门美图宜肤科技有限公司 | Face pigment detection model training method, device, equipment and storage medium |
WO2023029233A1 (en) * | 2021-08-30 | 2023-03-09 | 厦门美图宜肤科技有限公司 | Face pigment detection model training method and apparatus, device, and storage medium |
CN113688752B (en) * | 2021-08-30 | 2024-02-02 | 厦门美图宜肤科技有限公司 | Training method, device, equipment and storage medium for face color detection model |
CN114170108A (en) * | 2021-12-14 | 2022-03-11 | 哈尔滨工业大学 | Natural scene image blind restoration system based on human face degradation model migration |
CN114170108B (en) * | 2021-12-14 | 2024-04-12 | 哈尔滨工业大学 | Natural scene image blind restoration system based on face degradation model migration |
CN116452466A (en) * | 2023-06-14 | 2023-07-18 | 荣耀终端有限公司 | Image processing method, device, equipment and computer readable storage medium |
CN116452466B (en) * | 2023-06-14 | 2023-10-20 | 荣耀终端有限公司 | Image processing method, device, equipment and computer readable storage medium |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20201013 |