CN109712069B - Face image multilayer reconstruction method based on CCA space

Info

Publication number: CN109712069B
Authority: CN (China)
Application number: CN201811322383.9A
Other languages: Chinese (zh)
Other versions: CN109712069A
Prior art keywords: resolution, low, dictionary, face image, image
Inventors: 郭立君, 李小宝, 张�荣, 姚正元
Current assignee: Ningbo University
Original assignee: Ningbo University
Legal status: Active
Application filed by Ningbo University; priority to CN201811322383.9A; publication of CN109712069A; application granted; publication of CN109712069B.

Landscapes

  • Image Analysis (AREA)

Abstract

The invention discloses a face image multilayer reconstruction method based on CCA space. First, a large block size is used to partition the training low-resolution face images, the training high-resolution face images and the tested low-resolution face image, yielding low-resolution and high-resolution dictionaries. Next, a first CCA mapping is applied to the two types of dictionaries, and the two once-mapped dictionaries are sparsely updated. The two updated dictionaries are then inversely mapped back to image space, and a second CCA mapping is applied to the two back-mapped dictionaries. The dictionaries are then reordered by computing the Euclidean distance between each column vector in the two re-mapped dictionaries and the column vector of the corresponding image block of the tested low-resolution image, and a one-layer reconstructed high-resolution face image is obtained with a smoothness-based super-resolution reconstruction method. Finally, a smaller block size is selected, the process is repeated, and the one-layer reconstructed high-resolution image is introduced as a constraint, yielding the two-layer reconstructed high-resolution face image. The advantage is an effective reconstruction.

Description

Face image multilayer reconstruction method based on CCA space
Technical Field
The invention relates to a face image reconstruction technology, in particular to a face image multilayer reconstruction method based on a CCA (Canonical Correlation Analysis) space.
Background
Face recognition in video surveillance is an important but difficult and complex problem. For a face image in a surveillance video, the edge structure information is unclear and the details are blurred owing to insufficient light, an excessive distance between the face and the camera, and similar factors. To address the problems of unclear edge structure and blurred detail, prior information must be used to improve the resolution of the face image, that is, a high-resolution (HR) face image must be reconstructed from the observed low-resolution (LR) face image. This is the super-resolution (SR) reconstruction problem for face images; the reconstructed high-resolution face image provides more detailed information for face recognition and analysis.
Super-resolution reconstruction of face images has attracted wide attention in the field of computer vision, and how to better reconstruct face images has been studied in depth by research groups at home and abroad. For example, J. Jiang, R. Hu, Z. Wang, and Z. Han, "Noise robust face hallucination via locality-constrained representation", IEEE Trans. Multimedia, vol. 16, no. 5, pp. 1268-1281, Aug. 2014, propose a block-based locality-constrained representation model (LCR); their reconstruction results show that the effect of noise on the super-resolution reconstruction process can be reduced. Building on this, Jiang J, Ma J, Chen C, et al., "Noise Robust Face Image Super-Resolution Through Smooth Sparse Representation", IEEE Transactions on Cybernetics, 2017, pp. 1-12, provide a smoothness-based super-resolution reconstruction method (SSR), which achieves a certain smoothing and denoising effect.
The reconstruction techniques proposed by Jiang et al. rest on the assumption that the high-resolution dictionary and the low-resolution dictionary are highly correlated and have similar structural distributions, so that face images can be reconstructed on this basis. However, on the one hand, although face images share a certain similarity in structure and content, high-resolution and low-resolution dictionaries constructed directly in image space cannot satisfy the high-correlation condition, so the reconstruction effect is not ideal. On the other hand, the above techniques all adopt a single-layer reconstruction mode, that is, reconstruction with a fixed block size. In block-based reconstruction, the block size chosen for the observed low-resolution face image is crucial: if the block size is small, the number of blocks is large and the detail of the reconstructed high-resolution face image is rich, but its structural information is hard to capture; if the block size is large, the number of blocks is small and the structural information is easy to capture, but the detail is not rich. Therefore, a new face image reconstruction method needs to be researched for low-resolution face images with unclear edge structure and blurred details.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a face image multilayer reconstruction method based on CCA space, such that the high-resolution face image obtained with this reconstruction method has clear edge structure information, clear details and a good reconstruction effect.
The technical scheme adopted by the invention to solve the above technical problem is as follows: a face image multilayer reconstruction method based on CCA space, characterized by comprising the following steps:
Step one: select a face image database containing at least two low-resolution face images together with the high-resolution face image corresponding to each of them. Record the nth low-resolution face image in the database and its corresponding high-resolution face image as $I_n^L$ and $I_n^H$ respectively, and record the tested low-resolution face image as $I_t^L$; here n is a positive integer, $1 \le n \le N$, N denotes the total number of low-resolution face images contained in the database, $N \ge 2$, the widths of $I_n^L$, $I_n^H$ and $I_t^L$ are all W, and their heights are all H.
Step two: using a sliding-window technique, divide each low-resolution face image in the database into $S_1$ mutually overlapping image blocks of size $k_1 \times k_1$, recording the $s_1$th image block of $I_n^L$ as $B_{n,s_1}^L$. Likewise, divide the high-resolution face image corresponding to each low-resolution face image in the database into $S_1$ mutually overlapping image blocks of size $k_1 \times k_1$, recording the $s_1$th image block of $I_n^H$ as $B_{n,s_1}^H$; and divide $I_t^L$ into $S_1$ mutually overlapping image blocks of size $k_1 \times k_1$, recording the $s_1$th image block of $I_t^L$ as $B_{t,s_1}$. Here the sliding window has size $k_1 \times k_1$ with $k_1 = 5, 7, 9$ or $11$, the sliding step is 1 pixel, $S_1 = (W - k_1 + 1) \times (H - k_1 + 1)$, and $s_1$ is a positive integer, $1 \le s_1 \le S_1$.
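The sliding-window blocking of step two can be sketched as follows (a minimal NumPy illustration on a hypothetical toy image; the patent itself prescribes only the window size and the 1-pixel step):

```python
import numpy as np

def extract_patches(img, k):
    """Extract all mutually overlapping k x k blocks with a 1-pixel step,
    as in step two. Returns an array of shape (S, k, k) with
    S = (W - k + 1) * (H - k + 1)."""
    H, W = img.shape
    patches = [img[i:i + k, j:j + k]
               for i in range(H - k + 1)
               for j in range(W - k + 1)]
    return np.stack(patches)

img = np.arange(36, dtype=float).reshape(6, 6)  # toy 6x6 "face image"
P = extract_patches(img, 5)
print(P.shape)  # (4, 5, 5): S = (6-5+1) * (6-5+1) = 4
```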
Step three: arrange the pixel values of all pixels in each image block of each low-resolution face image in the database into a corresponding column vector, recording the column vector corresponding to $B_{n,s_1}^L$ as $b_{n,s_1}^L$. Likewise, arrange the pixel values of all pixels in each image block of each corresponding high-resolution face image into a corresponding column vector, recording the column vector corresponding to $B_{n,s_1}^H$ as $b_{n,s_1}^H$; and arrange the pixel values of all pixels in each image block of $I_t^L$ into a corresponding column vector, recording the column vector corresponding to $B_{t,s_1}$ as $b_{t,s_1}$. Then let the column vectors corresponding to the image blocks at the same position in all low-resolution face images in the database form a low-resolution dictionary, giving $S_1$ low-resolution dictionaries, and record the low-resolution dictionary formed from the column vectors of the $s_1$th image blocks of all low-resolution face images as $D_{s_1}^L$. Likewise, the column vectors corresponding to the image blocks at the same position in all high-resolution face images form a high-resolution dictionary, giving $S_1$ high-resolution dictionaries, and the high-resolution dictionary formed from the column vectors of the $s_1$th image blocks of all high-resolution face images is recorded as $D_{s_1}^H$. Here $b_{n,s_1}^L$, $b_{n,s_1}^H$ and $b_{t,s_1}$ all have dimension $(k_1 \times k_1) \times 1$; $D_{s_1}^L$ and $D_{s_1}^H$ both have dimension $(k_1 \times k_1) \times N$; the nth column vector of $D_{s_1}^L$ is $b_{n,s_1}^L$, and the nth column vector of $D_{s_1}^H$ is $b_{n,s_1}^H$.
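The position-wise dictionary construction of step three — one $(k_1 \cdot k_1) \times N$ dictionary per block position, whose nth column is the vectorised block of the nth training image — can be sketched as follows (the array sizes and random data are illustrative only):

```python
import numpy as np

# Hypothetical training set: N images, each already cut into S blocks of
# size k x k at the same S sliding-window positions.
N, S, k = 4, 3, 5
rng = np.random.default_rng(0)
blocks = rng.random((N, S, k, k))  # blocks[n, s] = s-th block of image n

# One dictionary per block position s (step three): column n is the
# vectorised s-th block of training image n, so each dictionary is (k*k) x N.
dictionaries = [blocks[:, s].reshape(N, k * k).T for s in range(S)]
print(len(dictionaries), dictionaries[0].shape)  # 3 (25, 4)
```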
Step four: compute the projection matrix corresponding to each low-resolution dictionary and to each high-resolution dictionary, recording the projection matrix corresponding to $D_{s_1}^L$ as $V_{s_1}^L$ and the projection matrix corresponding to $D_{s_1}^H$ as $V_{s_1}^H$. Here $V_{s_1}^L$ and $V_{s_1}^H$ both have dimension $L \times (k_1 \times k_1)$, where L denotes the dimension of the CCA space, $L \in \{1, 2, \ldots, k_1 \times k_1\}$.
Step five: map each low-resolution dictionary from image space into the CCA space to obtain a corresponding once-mapped low-resolution dictionary, recording the once-mapped low-resolution dictionary corresponding to $D_{s_1}^L$ as $\tilde{D}_{s_1}^L = V_{s_1}^L D_{s_1}^L$. Likewise, map each high-resolution dictionary from image space into the CCA space, recording the once-mapped high-resolution dictionary corresponding to $D_{s_1}^H$ as $\tilde{D}_{s_1}^H = V_{s_1}^H D_{s_1}^H$. Here $\tilde{D}_{s_1}^L$ and $\tilde{D}_{s_1}^H$ both have dimension $L \times N$.
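Steps four and five compute CCA projection matrices for each low/high-resolution dictionary pair and map the dictionaries into the shared L-dimensional space. A minimal textbook CCA via whitening and SVD is sketched below; the ridge term `reg` and the toy dictionaries are assumptions of this sketch, not prescribed by the patent:

```python
import numpy as np

def cca_projections(DL, DH, L, reg=1e-6):
    """Return (VL, VH), each L x d, so VL @ DL and VH @ DH live in the
    shared L-dimensional CCA space. DL, DH are d x N dictionaries whose
    columns are vectorised blocks. `reg` is a small stabiliser."""
    X = DL - DL.mean(axis=1, keepdims=True)
    Y = DH - DH.mean(axis=1, keepdims=True)
    N = X.shape[1]
    Cxx = X @ X.T / N + reg * np.eye(X.shape[0])
    Cyy = Y @ Y.T / N + reg * np.eye(Y.shape[0])
    Cxy = X @ Y.T / N

    def inv_sqrt(C):
        # inverse square root of a symmetric positive-definite matrix
        w, V = np.linalg.eigh(C)
        return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

    Wx, Wy = inv_sqrt(Cxx), inv_sqrt(Cyy)
    U, _, Vt = np.linalg.svd(Wx @ Cxy @ Wy)
    return (Wx @ U[:, :L]).T, (Wy @ Vt.T[:, :L]).T

rng = np.random.default_rng(1)
DL, DH = rng.random((25, 40)), rng.random((25, 40))  # hypothetical pair
VL, VH = cca_projections(DL, DH, L=10)
print(VL.shape, (VL @ DL).shape)  # (10, 25) (10, 40)
```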
Step six: compute the sparse coefficient vector of each once-mapped low-resolution dictionary, recording the sparse coefficient vector of $\tilde{D}_{s_1}^L$ as $\alpha_{s_1}$, obtained by

$\alpha_{s_1} = \arg\min_{\alpha} \{ \| V_{s_1}^L b_{t,s_1} - \tilde{D}_{s_1}^L \alpha \|_2^2 + \lambda_1 \| \alpha \|_1 \}$.

Then perform one sparse update of each once-mapped low-resolution dictionary using its sparse coefficient vector, recording the updated dictionary of $\tilde{D}_{s_1}^L$ as $\tilde{D}_{s_1}^{L*}$: if the nth element of $\alpha_{s_1}$ is a non-zero element, the nth column vector of $\tilde{D}_{s_1}^L$ is extracted, and all column vectors extracted from $\tilde{D}_{s_1}^L$ form $\tilde{D}_{s_1}^{L*}$ in their original order. Likewise, perform one sparse update of each once-mapped high-resolution dictionary, recording the updated dictionary of $\tilde{D}_{s_1}^H$ as $\tilde{D}_{s_1}^{H*}$: if the nth element of $\alpha_{s_1}$ is a non-zero element, the nth column vector of $\tilde{D}_{s_1}^H$ is extracted, and all column vectors extracted from $\tilde{D}_{s_1}^H$ form $\tilde{D}_{s_1}^{H*}$ in their original order. Here $\alpha_{s_1}$ has dimension $N \times 1$; $\arg\min(\cdot)$ denotes taking the minimizer of the residual; $\| \cdot \|_2$ is the $\ell_2$-norm operator and $\| \cdot \|_1$ the $\ell_1$-norm regularization operator; $\lambda_1$ is a constant, $\lambda_1 \in (0, 1)$; $\tilde{D}_{s_1}^{L*}$ and $\tilde{D}_{s_1}^{H*}$ both have dimension $L \times M$, where M denotes the total number of non-zero elements in $\alpha_{s_1}$, $1 \le M < N$.
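Step six keeps only the dictionary columns selected by an l1-regularised sparse code (the non-zero entries of the coefficient vector). The patent does not name a solver, so the sketch below uses a generic ISTA proximal-gradient loop on hypothetical data:

```python
import numpy as np

def sparse_code_ista(D, y, lam, n_iter=500):
    """Approximately solve argmin_w ||y - D w||_2^2 + lam * ||w||_1 by
    ISTA (proximal gradient). A generic solver standing in for the l1
    problem of step six."""
    step = 1.0 / np.linalg.norm(D, 2) ** 2   # 1 / Lipschitz const (up to factor 2)
    w = np.zeros(D.shape[1])
    for _ in range(n_iter):
        g = w + step * D.T @ (y - D @ w)                            # gradient step
        w = np.sign(g) * np.maximum(np.abs(g) - step * lam / 2, 0)  # soft-threshold
    return w

rng = np.random.default_rng(2)
D = rng.standard_normal((10, 30))       # toy once-mapped dictionary
w_true = np.zeros(30); w_true[[3, 17]] = [1.5, -2.0]
y = D @ w_true                          # toy mapped test-block vector
w = sparse_code_ista(D, y, lam=0.1)
keep = np.nonzero(np.abs(w) > 1e-3)[0]  # non-zero entries select columns
D_updated = D[:, keep]                  # the sparsely updated dictionary
print(D_updated.shape[0])  # 10
```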
Step seven: inversely map the updated dictionary of each once-mapped low-resolution dictionary from the CCA space back to image space to obtain a corresponding back-mapped low-resolution dictionary, recording the back-mapped low-resolution dictionary corresponding to $\tilde{D}_{s_1}^{L*}$ as $\bar{D}_{s_1}^L$. Likewise, inversely map the updated dictionary of each once-mapped high-resolution dictionary from the CCA space back to image space, recording the back-mapped high-resolution dictionary corresponding to $\tilde{D}_{s_1}^{H*}$ as $\bar{D}_{s_1}^H$. Here $\bar{D}_{s_1}^L$ and $\bar{D}_{s_1}^H$ both have dimension $(k_1 \times k_1) \times M$.
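Step seven maps the updated dictionaries back from the CCA space to image space. The patent does not specify how this inverse mapping is computed; one common choice, shown here on hypothetical data, is the Moore-Penrose pseudo-inverse of the projection matrix:

```python
import numpy as np

# Back-mapping a CCA-space dictionary to image space (step seven) via the
# pseudo-inverse of the projection matrix; the use of pinv is an
# assumption of this sketch.
rng = np.random.default_rng(6)
V = rng.random((10, 25))              # projection matrix, L x (k*k)
D_cca = rng.random((10, 4))           # updated dictionary in CCA space, L x M
D_back = np.linalg.pinv(V) @ D_cca    # back-mapped dictionary, (k*k) x M
print(D_back.shape)  # (25, 4)
```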
Step eight: compute the projection matrix corresponding to each back-mapped low-resolution dictionary and to each back-mapped high-resolution dictionary, recording the projection matrix corresponding to $\bar{D}_{s_1}^L$ as $U_{s_1}^L$ and the projection matrix corresponding to $\bar{D}_{s_1}^H$ as $U_{s_1}^H$. Here $U_{s_1}^L$ and $U_{s_1}^H$ both have dimension $L \times (k_1 \times k_1)$, where L denotes the dimension of the CCA space, $L \in \{1, 2, \ldots, k_1 \times k_1\}$.
Step nine: map each back-mapped low-resolution dictionary from image space into the CCA space to obtain a corresponding re-mapped low-resolution dictionary, recording the re-mapped low-resolution dictionary corresponding to $\bar{D}_{s_1}^L$ as $\hat{D}_{s_1}^L = U_{s_1}^L \bar{D}_{s_1}^L$. Likewise, map each back-mapped high-resolution dictionary from image space into the CCA space, recording the re-mapped high-resolution dictionary corresponding to $\bar{D}_{s_1}^H$ as $\hat{D}_{s_1}^H = U_{s_1}^H \bar{D}_{s_1}^H$. Also map each image block of $I_t^L$ from image space into the CCA space to obtain a corresponding once-mapped block, recording the once-mapped block corresponding to $B_{t,s_1}$ as $\hat{B}_{t,s_1}$, and arrange the values of the once-mapped block corresponding to each image block into a corresponding column vector, recording the column vector corresponding to $\hat{B}_{t,s_1}$ as $\hat{b}_{t,s_1}$. Here $\hat{D}_{s_1}^L$ and $\hat{D}_{s_1}^H$ both have dimension $L \times M$, and $\hat{b}_{t,s_1}$ has dimension $L \times 1$.
Step ten: compute the Euclidean distance between each column vector in each re-mapped low-resolution dictionary and the column vector of the once-mapped block corresponding to each image block of $I_t^L$: for $\hat{D}_{s_1}^L$, compute the Euclidean distance between each of its column vectors and $\hat{b}_{t,s_1}$, obtaining M Euclidean distances. Then sort the M Euclidean distances obtained for each re-mapped low-resolution dictionary from largest to smallest; then, according to this ordering, adjust the positions of all column vectors in each re-mapped low-resolution dictionary and recombine them into a corresponding recombined low-resolution dictionary, recording the recombined low-resolution dictionary corresponding to $\hat{D}_{s_1}^L$ as $R_{s_1}^L$: the 1st column vector of $R_{s_1}^L$ has the largest Euclidean distance to $\hat{b}_{t,s_1}$, and its last column vector has the smallest. Here $R_{s_1}^L$ has dimension $L \times M$.
Likewise, compute the Euclidean distance between each column vector in each re-mapped high-resolution dictionary and $\hat{b}_{t,s_1}$, obtaining M Euclidean distances for $\hat{D}_{s_1}^H$; sort them from largest to smallest; and, according to this ordering, adjust the positions of all column vectors in each re-mapped high-resolution dictionary and recombine them into a corresponding recombined high-resolution dictionary, recording the recombined high-resolution dictionary corresponding to $\hat{D}_{s_1}^H$ as $R_{s_1}^H$: the 1st column vector of $R_{s_1}^H$ has the largest Euclidean distance to $\hat{b}_{t,s_1}$, and its last column vector has the smallest. Here $R_{s_1}^H$ has dimension $L \times M$.
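The distance-based reordering of step ten — dictionary columns sorted so the column farthest from the mapped test-block vector comes first and the nearest comes last — can be sketched as follows (toy shapes, hypothetical data):

```python
import numpy as np

rng = np.random.default_rng(3)
D = rng.random((10, 8))   # re-mapped dictionary, L x M
p = rng.random(10)        # mapped test-block column vector, L x 1

dist = np.linalg.norm(D - p[:, None], axis=0)  # M Euclidean distances
order = np.argsort(dist)[::-1]                 # largest distance first
D_sorted = D[:, order]                         # recombined dictionary

# first column is farthest from p, last column is nearest
print(np.linalg.norm(D_sorted[:, 0] - p) >= np.linalg.norm(D_sorted[:, -1] - p))  # True
```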
step eleven: computing
Figure BDA00018578385400000615
Will @, for each image block of the first sparse coefficient vector>
Figure BDA00018578385400000616
Is marked as £ the first sparse coefficient vector of>
Figure BDA00018578385400000617
Figure BDA00018578385400000618
By passing
Figure BDA00018578385400000633
Calculating to obtain; will then->
Figure BDA00018578385400000619
The high-resolution face image obtained after reconstruction in one layer is recorded as ^ er>
Figure BDA00018578385400000620
Will->
Figure BDA00018578385400000621
In and->
Figure BDA00018578385400000622
The area corresponding to the position is marked as->
Figure BDA00018578385400000623
Will->
Figure BDA00018578385400000624
The column vector formed by arranging the pixel values of all the pixel points is recorded as
Figure BDA00018578385400000625
Wherein it is present>
Figure BDA00018578385400000626
Has dimension of M × 1,m which is a positive integer, M is more than or equal to 1 and less than or equal to M, and λ 2 And λ 3 Are all constant, λ 2 ∈(0,1),λ 3 ∈(0,1),/>
Figure BDA00018578385400000627
Represents->
Figure BDA00018578385400000628
The mth element of (4), is selected>
Figure BDA00018578385400000629
Represents->
Figure BDA00018578385400000630
The mth column vector of (4), based on the number of cells in the column->
Figure BDA00018578385400000631
Represents->
Figure BDA00018578385400000632
M-1 element of (1);
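Assuming the SSR-style objective of step eleven (an l2 data term plus a λ2 ridge penalty and a λ3 first-difference smoothness penalty, consistent with the constants λ2, λ3 and the m / m-1 indices named in the text; the patent's own formula images are not recoverable, so this reading is an assumption), the coefficients have a closed form, sketched here on hypothetical data:

```python
import numpy as np

def ssr_coefficients(DL, p, lam2, lam3):
    """Closed-form minimiser of
        ||p - DL c||^2 + lam2 ||c||^2 + lam3 * sum_m (c_m - c_{m-1})^2.
    The difference penalty couples neighbouring coefficients, which is
    why step ten orders the dictionary columns by distance first."""
    M = DL.shape[1]
    T = np.diff(np.eye(M), axis=0)  # (M-1) x M first-difference operator
    A = DL.T @ DL + lam2 * np.eye(M) + lam3 * T.T @ T
    return np.linalg.solve(A, DL.T @ p)

rng = np.random.default_rng(4)
DL, DH = rng.random((10, 6)), rng.random((10, 6))  # recombined dictionaries
p = rng.random(10)                                  # mapped test-block vector
c = ssr_coefficients(DL, p, lam2=0.1, lam3=0.1)
hr_patch_vec = DH @ c   # reconstructed high-resolution block vector
print(hr_patch_vec.shape)  # (10,)
```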
Step twelve: change the size of the sliding window to $k_2 \times k_2$. Then, following the process of step two through step ten in the same manner, obtain $S_2$ recombined low-resolution dictionaries and $S_2$ recombined high-resolution dictionaries, recording the $s_2$th recombined low-resolution dictionary as $R_{s_2}^L$ and the $s_2$th recombined high-resolution dictionary as $R_{s_2}^H$. Then compute the second sparse coefficient vector of each image block of $I_t^L$, recording the second sparse coefficient vector of the $s_2$th image block $B_{t,s_2}$ of $I_t^L$ as $c_{s_2}'$, obtained by

$c_{s_2}' = \arg\min_{c} \{ \| \hat{b}_{t,s_2} - R_{s_2}^L c \|_2^2 + \lambda_2 \| c \|_2^2 + \lambda_3 \sum_{m=2}^{M} (c_m - c_{m-1})^2 + \lambda_4 \| U_{s_2}^H q_{s_2} - R_{s_2}^H c \|_2^2 \}$.

Then record the high-resolution face image obtained after two-layer reconstruction of $I_t^L$ as $\hat{I}^H_{(2)}$, record the region of $\hat{I}^H_{(2)}$ corresponding in position to $B_{t,s_2}$ as $\hat{I}^H_{(2),s_2}$, and take the column vector formed by arranging the pixel values of all pixels in $\hat{I}^H_{(2),s_2}$ to be $R_{s_2}^H c_{s_2}' = \sum_{m=1}^{M} c_m d_m$. Here $R_{s_2}^L$ and $R_{s_2}^H$ both have dimension $L \times M$; $k_2 = 3, 5, 7$ or $9$ with $1 < k_2 < k_1$; $S_2$ denotes the total number of mutually overlapping image blocks of size $k_2 \times k_2$ into which each low-resolution face image in the database, its corresponding high-resolution face image and $I_t^L$ are divided by the sliding-window technique, $S_2 = (W - k_2 + 1) \times (H - k_2 + 1)$, $1 \le s_2 \le S_2$; $c_{s_2}'$ has dimension $M \times 1$; $\hat{b}_{t,s_2}$ is obtained in the same manner as $\hat{b}_{t,s_1}$; $U_{s_2}^H$ is the projection matrix obtained in the same manner as $U_{s_1}^H$; $q_{s_2}$ denotes the column vector formed by arranging the pixel values of all pixels in the region of $\hat{I}^H_{(1)}$ corresponding in position to $B_{t,s_2}$; $c_m$ and $c_{m-1}$ denote the mth and (m-1)th elements of $c_{s_2}'$; $\lambda_4$ is a constant, $\lambda_4 \in (0,1)$; $d_m$ denotes the mth column vector of $R_{s_2}^H$.
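One plausible reading of the second-layer objective of step twelve adds a λ4-weighted consistency term tying the new coefficients to the layer-one reconstruction of the same region (the constants λ2-λ4 are named in the text, but the exact formula images are not recoverable, so this closed form is an assumption of the sketch):

```python
import numpy as np

def layer2_coefficients(DL, DH, p, q, lam2, lam3, lam4):
    """Minimise the step-eleven objective plus lam4 * ||q - DH c||^2,
    where q is the layer-one reconstruction of the same region mapped
    into the CCA space. Hypothetical objective; data are toy values."""
    M = DL.shape[1]
    T = np.diff(np.eye(M), axis=0)  # first-difference smoothness operator
    A = DL.T @ DL + lam2 * np.eye(M) + lam3 * T.T @ T + lam4 * DH.T @ DH
    b = DL.T @ p + lam4 * DH.T @ q
    return np.linalg.solve(A, b)

rng = np.random.default_rng(5)
DL, DH = rng.random((10, 6)), rng.random((10, 6))
p, q = rng.random(10), rng.random(10)
c = layer2_coefficients(DL, DH, p, q, 0.1, 0.1, 0.1)
print(c.shape)  # (6,)
```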
Compared with the prior art, the invention has the advantages that:
1) The method maps both the low-resolution dictionary and the high-resolution dictionary from image space into the CCA space, which strengthens the correlation between the two dictionaries. Meanwhile, redundant and noisy information in the dictionaries would harm the noise robustness and the reconstruction quality; therefore the sparsely updated once-mapped low- and high-resolution dictionaries are back-mapped from the CCA space to image space, and the back-mapped dictionaries are then mapped from image space into the CCA space again. That is, two CCA mappings are applied, which improves the noise robustness and the reconstruction effect of the method.
2) The method adopts a two-layer reconstruction mode. The tested low-resolution face image is first divided into larger image blocks for one-layer reconstruction, so as to capture the structural information of the reconstructed high-resolution face image; the next reconstruction pass then uses smaller blocks, with the high-resolution face image reconstructed in the first layer serving as a constraint, to reconstruct the detail information. The high-resolution face image obtained by two-layer reconstruction has clear edge structure information, clear details and a good reconstruction effect.
Drawings
FIG. 1 is a general flow diagram of the process of the present invention;
FIG. 2 is a noisy face image;
FIG. 3 is a high-resolution face image obtained by reconstructing the noisy face image shown in FIG. 2 using the existing smoothness-based super-resolution reconstruction method (SSR);
FIG. 4 is a high-resolution face image obtained by reconstructing the noisy face image shown in FIG. 2 by adding one CCA mapping to the existing smoothness-based super-resolution reconstruction method (SSR) with single-layer, i.e., one-layer, reconstruction;
FIG. 5 is a high-resolution face image obtained by reconstructing the noisy face image shown in FIG. 2 by adding two CCA mappings to the existing smoothness-based super-resolution reconstruction method (SSR) with single-layer, i.e., one-layer, reconstruction;
FIG. 6 is a high-resolution face image reconstructed from the noisy face image shown in FIG. 2 by the method of the present invention;
FIG. 7 is the real high-resolution face image corresponding to the noisy face image shown in FIG. 2.
Detailed Description
The invention is described in further detail below with reference to the accompanying drawings and embodiments.
The invention provides a human face image multilayer reconstruction method based on a CCA space, the general flow block diagram of which is shown in figure 1, and the method comprises the following steps:
Step one: select a face image database containing at least two low-resolution face images together with the high-resolution face image corresponding to each of them. Record the nth low-resolution face image in the database and its corresponding high-resolution face image as $I_n^L$ and $I_n^H$ respectively, and record the tested low-resolution face image as $I_t^L$; here n is a positive integer, $1 \le n \le N$, N denotes the total number of low-resolution face images contained in the database, $N \ge 2$, for example N = 360; the widths of $I_n^L$, $I_n^H$ and $I_t^L$ are all W, and their heights are all H.
Step two: using a sliding-window technique, divide each low-resolution face image in the database into $S_1$ mutually overlapping image blocks of size $k_1 \times k_1$, recording the $s_1$th image block of $I_n^L$ as $B_{n,s_1}^L$. Likewise, divide the high-resolution face image corresponding to each low-resolution face image in the database into $S_1$ mutually overlapping image blocks of size $k_1 \times k_1$, recording the $s_1$th image block of $I_n^H$ as $B_{n,s_1}^H$; and divide $I_t^L$ into $S_1$ mutually overlapping image blocks of size $k_1 \times k_1$, recording the $s_1$th image block of $I_t^L$ as $B_{t,s_1}$. Here the sliding window has size $k_1 \times k_1$ with $k_1 = 5, 7, 9$ or $11$ (in this embodiment $k_1 = 5$), the sliding step is 1 pixel, $S_1 = (W - k_1 + 1) \times (H - k_1 + 1)$, and $s_1$ is a positive integer, $1 \le s_1 \le S_1$.
Step three: arranging the pixel values of all pixel points in each image block in each low-resolution face image in a face image database to form corresponding column vectors, and arranging the pixel values of all pixel points in each image block in each low-resolution face image in the face image database to form a column vector
Figure BDA00018578385400000915
The corresponding column vector is marked +>
Figure BDA00018578385400000916
Similarly, arranging the pixel values of all pixel points in each image block in the high-resolution face image corresponding to each low-resolution face image in the face image database to form a corresponding column vector, and combining->
Figure BDA00018578385400000917
The corresponding column vector is marked +>
Figure BDA00018578385400000918
Will->
Figure BDA00018578385400000919
The pixel values of all the pixel points in each image block are arranged to form a corresponding column vector, and the ^ is greater than or equal to>
Figure BDA00018578385400000920
The corresponding column vector is marked +>
Figure BDA00018578385400000921
Then, column vectors corresponding to image blocks at the same position in all low-resolution face images in the face image database form a low-resolution dictionary to form S 1 A low resolution dictionary for mapping the s-th image of all low resolution face images in the face image database 1 A low-resolution dictionary formed by column vectors corresponding to image blocks is recorded as->
Figure BDA0001857838540000101
Similarly, column vectors corresponding to image blocks at the same position in all high-resolution face images in the face image database form a high-resolution dictionary to form S 1 A high resolution dictionary for comparing the s-th of all high resolution face images in the face image database 1 The high-resolution dictionary formed by the column vectors corresponding to the image blocks is marked as->
Figure BDA0001857838540000102
Wherein it is present>
Figure BDA0001857838540000103
Figure BDA0001857838540000104
Are all (k) 1 ×k 1 )×1,/>
Figure BDA0001857838540000105
Are all (k) 1 ×k 1 )×N,/>
Figure BDA0001857838540000106
Is/is for the nth column vector of>
Figure BDA0001857838540000107
The nth column vector ofIs->
Figure BDA0001857838540000108
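The dictionary construction above can be sketched as follows (an illustrative example, assuming NumPy; the variable names are mine, not the patent's):

```python
import numpy as np

rng = np.random.default_rng(0)
N, k = 4, 5                       # 4 training image pairs, 5 x 5 blocks
# one k x k block per training image, all taken at the same position s1
low_blocks  = [rng.random((k, k)) for _ in range(N)]
high_blocks = [rng.random((k, k)) for _ in range(N)]

# each block is flattened into a (k*k) x 1 column vector; stacking the
# vectors of all N images at one position gives one position's dictionary
D_l = np.column_stack([b.reshape(-1) for b in low_blocks])   # (k*k) x N
D_h = np.column_stack([b.reshape(-1) for b in high_blocks])  # (k*k) x N
print(D_l.shape)  # (25, 4)
```

One such pair (D_l, D_h) exists for every block position, S1 pairs in total.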
Step four: calculating the projection matrix corresponding to each low-resolution dictionary and each high-resolution dictionary respectively, and calculating the projection matrix
Figure BDA0001857838540000109
The corresponding projection matrix is recorded as +>
Figure BDA00018578385400001010
Will->
Figure BDA00018578385400001011
The corresponding projection matrix is recorded as +>
Figure BDA00018578385400001012
Wherein it is present>
Figure BDA00018578385400001026
And &>
Figure BDA00018578385400001013
All dimensions of (a) are L × (k) 1 ×k 1 ) L represents the dimension of CCA space, and L is equal to {1,2, …, k 1 ×k 1 };/>
Figure BDA00018578385400001014
And &>
Figure BDA00018578385400001015
Reference may be made to David R.Hardoon et al.Canonical Correlation Analysis: an Overview with Application to Learning Methods [ J]Neural Computation,2004,2639-2664 (David R-Hayton et al, canonical correlation analysis: overview applied to learning methods [ J]Neural computation,2004, 2639-2664).
Step five: mapping each low-resolution dictionary from image space to CCA space to obtain corresponding primary mapping low-resolution dictionary, and mapping each low-resolution dictionary to CCA space
Figure BDA00018578385400001016
The corresponding one-time mapped low resolution dictionary is &>
Figure BDA00018578385400001017
Similarly, mapping each high-resolution dictionary from the image space to the CCA space to obtain a corresponding primary mapping high-resolution dictionary, and mapping each high-resolution dictionary to the CCA space
Figure BDA00018578385400001018
Corresponding one-time mapped high resolution dictionary +>
Figure BDA00018578385400001019
Wherein, is +>
Figure BDA00018578385400001020
And
Figure BDA00018578385400001021
are all L N.
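A minimal regularized-CCA sketch in the spirit of the Hardoon et al. reference (the function name, the regularization constant `reg`, and the whitening-plus-SVD route are my assumptions; the patent only cites the paper):

```python
import numpy as np

def cca_projections(X, Y, L, reg=1e-3):
    """Return L-dimensional CCA projection matrices (P_x, P_y) for paired
    data. X, Y: d x N matrices whose columns are paired samples."""
    Xc = X - X.mean(axis=1, keepdims=True)
    Yc = Y - Y.mean(axis=1, keepdims=True)
    n = X.shape[1]
    Cxx = Xc @ Xc.T / n + reg * np.eye(X.shape[0])   # regularized covariances
    Cyy = Yc @ Yc.T / n + reg * np.eye(Y.shape[0])
    Cxy = Xc @ Yc.T / n
    Wx = np.linalg.inv(np.linalg.cholesky(Cxx))      # whitening transforms
    Wy = np.linalg.inv(np.linalg.cholesky(Cyy))
    U, s, Vt = np.linalg.svd(Wx @ Cxy @ Wy.T)        # whitened cross-covariance
    Px = U[:, :L].T @ Wx                             # L x d projection for X
    Py = Vt[:L, :] @ Wy                              # L x d projection for Y
    return Px, Py

rng = np.random.default_rng(1)
Z = rng.random((3, 50))                              # shared latent signal
X = rng.random((25, 3)) @ Z + 0.01 * rng.random((25, 50))
Y = rng.random((25, 3)) @ Z + 0.01 * rng.random((25, 50))
Px, Py = cca_projections(X, Y, L=3)
print(Px.shape, Py.shape)   # (3, 25) (3, 25)
```

Applying Px to a (k1×k1)×N dictionary gives the L×N once-mapped dictionary, matching the dimensions stated above.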
Step six: calculating the sparse coefficient vector of each primary mapping low-resolution dictionary
Figure BDA00018578385400001022
Is marked as->
Figure BDA00018578385400001023
Figure BDA00018578385400001024
By counting->
Figure BDA00018578385400001025
Obtaining; then carrying out once sparse updating on the once mapped low-resolution dictionary by using the sparse coefficient vector of each once mapped low-resolution dictionary to obtain the updated dictionary of each once mapped low-resolution dictionary, and then judging whether the updated dictionary is the same as the updated dictionary or not>
Figure BDA0001857838540000111
Updated dictionary record>
Figure BDA0001857838540000112
If/or>
Figure BDA0001857838540000113
Will be a non-zero element, will->
Figure BDA0001857838540000114
The nth column vector of (a) is extracted and the slave is then taken>
Figure BDA0001857838540000115
All column vectors extracted in (are) formed in the original order->
Figure BDA0001857838540000116
Similarly, performing sparse update once on each once-mapped high-resolution dictionary to obtain a dictionary updated by each once-mapped high-resolution dictionary, and judging whether the updated dictionary is matched with the updated dictionary or not>
Figure BDA0001857838540000117
The updated dictionary is recorded as +>
Figure BDA0001857838540000118
If/or>
Figure BDA0001857838540000119
Will be a non-zero element, will->
Figure BDA00018578385400001110
The nth column vector of (a) is extracted and the slave is then taken>
Figure BDA00018578385400001111
All column vectors extracted in (are) formed in the original order->
Figure BDA00018578385400001112
Wherein the content of the first and second substances,
Figure BDA00018578385400001113
dimension of (a) is Nx 1, argmin () represents solving the residual minimum value, the symbol "| | | | non-conducting phosphor 2 Is "is 2 Norm regular term operation symbol, symbol "| | | | non-woven phosphor 1 Is "as 1 Norm regular term operator sign, λ 1 Is a constant, λ 1 E (0,1), generally given as λ 1 =0.1,0.3,0.5, in this example λ 1 =0.3,/>
Figure BDA00018578385400001114
And &>
Figure BDA00018578385400001115
Has dimension L × M, M denotes->
Figure BDA00018578385400001116
The total number of the non-zero elements in the alloy is more than or equal to 1, and M is less than N.
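The sparse coding and column selection above can be sketched with a generic ISTA solver (the patent does not specify the optimiser; `ista_lasso` and all variable names are mine, and NumPy is assumed):

```python
import numpy as np

def ista_lasso(A, y, lam, iters=2000):
    """Minimise ||y - A w||_2^2 + lam * ||w||_1 by ISTA."""
    step = 0.5 / np.linalg.norm(A, 2) ** 2            # 1 / Lipschitz const of gradient
    w = np.zeros(A.shape[1])
    for _ in range(iters):
        g = w - step * 2 * A.T @ (A @ w - y)          # gradient step on the l2 term
        w = np.sign(g) * np.maximum(np.abs(g) - step * lam, 0.0)  # soft-threshold
    return w

rng = np.random.default_rng(2)
A = rng.standard_normal((12, 40))                     # once-mapped dictionary, L x N
w_true = np.zeros(40)
w_true[[3, 17, 31]] = [1.0, -0.8, 0.6]
y = A @ w_true                                        # mapped test-block vector
w = ista_lasso(A, y, lam=0.1)
keep = np.flatnonzero(np.abs(w) > 1e-6)               # indices of non-zero coefficients
A_upd = A[:, keep]                                    # updated dictionary: L x M, M < N
print(A_upd.shape[0] == 12, 1 <= A_upd.shape[1] < 40)  # True True
```

The same index set `keep` selects the columns of both the low-resolution and the high-resolution once-mapped dictionaries, preserving their pairing.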
Step seven: the updated dictionary of each once-mapped low-resolution dictionary is reversely mapped back to the image space from the CCA space to obtain a corresponding reflection low-resolution dictionary, and the dictionary is subjected to image matching
Figure BDA00018578385400001117
The corresponding retroreflection low resolution dictionary is &>
Figure BDA00018578385400001118
Similarly, each updated dictionary with the once mapping high-resolution dictionary is inversely mapped from the CCA space to the image space to obtain a corresponding reflection high-resolution dictionary, and the dictionary is combined>
Figure BDA00018578385400001119
The corresponding reflection high resolution dictionary is recorded as>
Figure BDA00018578385400001120
Wherein it is present>
Figure BDA00018578385400001121
And
Figure BDA00018578385400001122
has a dimension of (k) 1 ×k 1 )×M。
Step eight: calculating the projection matrix corresponding to each back mapping low-resolution dictionary and each back mapping high-resolution dictionary respectively, and calculating the projection matrix corresponding to each back mapping low-resolution dictionary and each back mapping high-resolution dictionary
Figure BDA00018578385400001123
The corresponding projection matrix is recorded as +>
Figure BDA00018578385400001124
Will->
Figure BDA00018578385400001125
The corresponding projection matrix is recorded as +>
Figure BDA00018578385400001126
Wherein it is present>
Figure BDA00018578385400001127
And &>
Figure BDA00018578385400001128
All dimensions of (a) are L × (k) 1 ×k 1 ) L represents the dimension of CCA space, and L is equal to {1,2, …, k 1 ×k 1 };/>
Figure BDA00018578385400001129
And
Figure BDA00018578385400001130
reference may be made to David R.Hardoon et al.Canonical Correlation Analysis: an Overview with Application to Learning Methods [ J]Neural Computation,2004,2639-2664 (David R-Hayton et al, canonical correlation analysis: overview applied to learning methods [ J]Neural computation,2004, 2639-2664).
Step nine: each is reflectedMapping the low-resolution dictionary from the image space to the CCA space to obtain a corresponding remapped low-resolution dictionary, and mapping the remapped low-resolution dictionary to the CCA space
Figure BDA0001857838540000121
Corresponding remap low resolution dictionary is recorded as>
Figure BDA0001857838540000122
Figure BDA0001857838540000123
Mapping each reflection high-resolution dictionary from the image space to the CCA space to obtain a corresponding re-mapping high-resolution dictionary, and combining>
Figure BDA0001857838540000124
Corresponding remapped high resolution dictionary
Figure BDA0001857838540000125
Will->
Figure BDA0001857838540000126
Each image block in (1) is mapped to CCA space from image space to obtain
Figure BDA0001857838540000127
Will ∑ be based on a corresponding primary mapped block of each image block in the image block>
Figure BDA0001857838540000128
The corresponding one-time mapping block is marked as->
Figure BDA0001857838540000129
Will->
Figure BDA00018578385400001210
The pixel values of all pixel points in the primary mapping block corresponding to each image block are arranged to form a corresponding column vector, and the column vector is obtained
Figure BDA00018578385400001211
The corresponding column vector is marked +>
Figure BDA00018578385400001212
Wherein it is present>
Figure BDA00018578385400001213
And &>
Figure BDA00018578385400001214
Are all LxM->
Figure BDA00018578385400001215
Has dimension of L × 1.
Step ten: computing the sum of vectors per column in each remapped low resolution dictionary
Figure BDA00018578385400001216
Is used for ^ ing the Euclidean distance of the column vector corresponding to the primary mapping block corresponding to each image block in the>
Figure BDA00018578385400001217
Calculate->
Figure BDA00018578385400001218
Each column vector of (1)
Figure BDA00018578385400001219
Is based on the Euclidean distance of->
Figure BDA00018578385400001220
Obtaining M Euclidean distances; then, aiming at each M Euclidean distances obtained by re-mapping the low-resolution dictionary, sequencing the M Euclidean distances from large to small; then according to the magnitude sequence of M Euclidean distances obtained by mapping the low-resolution dictionaries again, carrying out position adjustment on all column vectors in the low-resolution dictionaries mapped again, recombining to obtain corresponding recombined low-resolution dictionaries, and combining>
Figure BDA00018578385400001221
The corresponding recombined low resolution dictionary is marked>
Figure BDA00018578385400001222
The 1 st column vector and->
Figure BDA00018578385400001223
Has the largest Euclidean distance and is greater than or equal to>
Figure BDA00018578385400001224
And the last column vector and->
Figure BDA00018578385400001225
Has the smallest euclidean distance of (c); wherein it is present>
Figure BDA00018578385400001226
Dimension (d) is L M.
Similarly, calculate the sum of each column vector in each remapped high resolution dictionary
Figure BDA00018578385400001227
The euclidean distance of the column vector corresponding to the primary mapping block corresponding to each image block in (b), for +>
Figure BDA0001857838540000131
Calculate->
Figure BDA0001857838540000132
Each column vector of (1)
Figure BDA0001857838540000133
Is based on the Euclidean distance of->
Figure BDA0001857838540000134
Obtaining M Euclidean distances; then, sequencing the M Euclidean distances from large to small according to the M Euclidean distances obtained by re-mapping the high-resolution dictionary; according to each secondMapping the high-resolution dictionary to obtain the magnitude sequence of M Euclidean distances, performing position adjustment on all column vectors in each re-mapped high-resolution dictionary, recombining to obtain a corresponding recombined high-resolution dictionary, and then combining>
Figure BDA0001857838540000135
The corresponding recombined high-resolution dictionary is marked>
Figure BDA0001857838540000136
The 1 st column vector and->
Figure BDA0001857838540000137
Has the largest Euclidean distance and is greater than or equal to>
Figure BDA0001857838540000138
And/or the last column vector in (b)>
Figure BDA0001857838540000139
Has the smallest euclidean distance; wherein it is present>
Figure BDA00018578385400001310
Dimension (d) is L M.
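The distance-based column reordering above reduces to an `argsort`; a minimal sketch assuming NumPy (variable names are mine):

```python
import numpy as np

rng = np.random.default_rng(3)
R = rng.random((12, 8))        # re-mapped dictionary, L x M
z = rng.random(12)             # mapped test-block vector, L x 1

d = np.linalg.norm(R - z[:, None], axis=0)   # M Euclidean distances, one per column
order = np.argsort(-d)                       # descending: largest distance first
Q = R[:, order]                              # reorganised dictionary

# first column is farthest from z, last column is nearest
assert np.isclose(np.linalg.norm(Q[:, 0] - z), d.max())
assert np.isclose(np.linalg.norm(Q[:, -1] - z), d.min())
```

This ordering is what makes the smoothness term in the next step meaningful: adjacent columns have similar distances to the test block, so adjacent coefficients can be expected to be similar.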
Step eleven: computing
Figure BDA00018578385400001311
Will @, for each image block of the first sparse coefficient vector>
Figure BDA00018578385400001312
Is marked as £ the first sparse coefficient vector of>
Figure BDA00018578385400001313
Figure BDA00018578385400001314
By passing
Figure BDA00018578385400001315
Calculating to obtain; will then->
Figure BDA00018578385400001316
The high-resolution face image obtained after reconstruction in one layer is recorded as ^ er>
Figure BDA00018578385400001317
Will->
Figure BDA00018578385400001318
In and->
Figure BDA00018578385400001319
The area corresponding to the position is marked as->
Figure BDA00018578385400001320
Will->
Figure BDA00018578385400001321
The column vector formed by arranging the pixel values of all the pixel points is recorded as
Figure BDA00018578385400001322
Wherein it is present>
Figure BDA00018578385400001323
The dimension of M is multiplied by 1,m is a positive integer, M is more than or equal to 1 and less than or equal to M, and lambda is 2 And λ 3 Are all constant, λ 2 E (0,1), generally given as λ 2 =0.1,0.3,0.5, in this example λ 2 =0.3,λ 3 E (0,1), in this example λ 3 =0.001,/>
Figure BDA00018578385400001324
Represents->
Figure BDA00018578385400001325
The mth element of (1), in>
Figure BDA00018578385400001326
Represents->
Figure BDA00018578385400001334
The mth column vector of (4), based on the number of cells in the column->
Figure BDA00018578385400001327
Represents->
Figure BDA00018578385400001328
The m-1 th element in (1).
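A sketch of one way to minimise an objective of this form, assuming NumPy (the original formula is rendered as images, so the objective below follows the smoothness-plus-sparsity reading given above; the solver choice is mine). The smoothness term is quadratic, so it folds into the data term and ISTA handles only the l1 part:

```python
import numpy as np

def smooth_sparse_code(Q, z, lam2, lam3, iters=2000):
    """Minimise ||z - Q c||_2^2 + lam2 * sum_m (c_m - c_{m-1})^2 + lam3 * ||c||_1."""
    M = Q.shape[1]
    D = np.diff(np.eye(M), axis=0)           # first-difference operator, (M-1) x M
    H = Q.T @ Q + lam2 * D.T @ D             # combined quadratic part
    b = Q.T @ z
    step = 0.5 / np.linalg.norm(H, 2)        # 1 / Lipschitz const of the gradient
    c = np.zeros(M)
    for _ in range(iters):
        g = c - step * 2 * (H @ c - b)       # gradient step on both quadratic terms
        c = np.sign(g) * np.maximum(np.abs(g) - step * lam3, 0.0)  # soft-threshold
    return c

rng = np.random.default_rng(4)
Q = rng.standard_normal((12, 8))             # reorganised dictionary, L x M
c_true = np.array([0., 0., 0.3, 0.5, 0.6, 0.4, 0., 0.])   # smooth and sparse
z = Q @ c_true                               # mapped test-block vector
c = smooth_sparse_code(Q, z, lam2=0.3, lam3=0.001)
print(np.linalg.norm(z - Q @ c) < 0.5 * np.linalg.norm(z))  # True
```

Because the columns of Q were sorted by distance in step ten, penalising (c_m − c_{m−1})^2 encourages neighbouring atoms to receive similar weights.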
Step twelve: the size of the sliding window is changed to k 2 ×k 2 (ii) a Then S is obtained in the same manner according to the process from step two to step ten 2 Reorganized low resolution dictionary and S 2 Reorganize the high resolution dictionary into s 2 The recombined low resolution dictionary is noted
Figure BDA00018578385400001329
Will be(s) 2 Recombined high-resolution dictionary is marked as>
Figure BDA00018578385400001330
Then count>
Figure BDA00018578385400001331
Will ∑ the second sparse coefficient vector of each image block of>
Figure BDA00018578385400001332
S of (1) 2 Image block>
Figure BDA00018578385400001333
Is recorded as a second sparse coefficient vector
Figure BDA0001857838540000141
Figure BDA0001857838540000142
By passing
Figure BDA0001857838540000143
Calculating to obtain; then will be
Figure BDA0001857838540000144
The high-resolution face image obtained after the two-layer reconstruction is recorded as ^ er>
Figure BDA0001857838540000145
Will->
Figure BDA0001857838540000146
In and->
Figure BDA0001857838540000147
The area corresponding to the position is marked as->
Figure BDA0001857838540000148
Will->
Figure BDA0001857838540000149
The column vector formed by arranging the pixel values of all the pixel points is recorded as ^ er>
Figure BDA00018578385400001410
Figure BDA00018578385400001411
Wherein it is present>
Figure BDA00018578385400001412
And &>
Figure BDA00018578385400001413
Dimension of L × M, k 2 =3,5,7,9 and 1 < k 2 <k 1 If k is 1 K is taken out if =5 2 =3, if k 1 K is taken out if =7 2 =3 or k 2 If k is =5 1 K is taken out of the equation of k =9 2 =3 or k 2 =5 or k 2 =7, if k 1 K is taken out of the equation of k =11 2 =3 or k 2 =5 or k 2 =7 or k 2 =9,S 2 The method comprises the steps of representing, by adopting a sliding window technology, each low-resolution face image in a face image database and a corresponding high-resolution face image,/>
Figure BDA00018578385400001414
Divided into mutually overlapping dimensions of size k 2 ×k 2 Total number of image blocks, S 2 =(W-k 2 +1)×(H-k 2 +1),1≤s 2 ≤S 2 ,/>
Figure BDA00018578385400001415
Has dimension of Mx 1->
Figure BDA00018578385400001416
To be according to capturing>
Figure BDA00018578385400001417
Is obtained in the same manner, the projection matrix is evaluated>
Figure BDA00018578385400001418
Represents->
Figure BDA00018578385400001419
The pixel values of all the pixel points in the column are arranged to form a corresponding column vector, and then the column vector is compared with the pixel value of the corresponding pixel point>
Figure BDA00018578385400001420
Represents->
Figure BDA00018578385400001421
The mth element of (4), is selected>
Figure BDA00018578385400001422
Represents->
Figure BDA00018578385400001423
The m-1 th element of (4), is selected>
Figure BDA00018578385400001424
Represents->
Figure BDA00018578385400001425
M-th column vector of (1) 4 Is a constant, λ 4 E (0,1), generally given as λ 4 =0.1,0.3,0.5, in this example λ 4 =0.3,/>
Figure BDA00018578385400001426
In accordance with the obtaining>
Figure BDA00018578385400001427
In the same manner, a projection matrix obtained in the same manner>
Figure BDA00018578385400001428
Represents->
Figure BDA00018578385400001429
In and->
Figure BDA00018578385400001430
The pixel values of all the pixel points in the corresponding area are arranged to form a corresponding column vector, and the corresponding column vector is greater than or equal to>
Figure BDA00018578385400001431
Represents->
Figure BDA00018578385400001432
The m-th column vector of (1).
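The layer-two objective differs from layer one only by an extra quadratic term tying the result to the layer-one reconstruction; under the same reading of the garbled formula as above (solver choice and names mine, NumPy assumed), the sketch extends directly:

```python
import numpy as np

def two_layer_code(Ql, Qh, z, t, lam2, lam3, lam4, iters=2000):
    """Layer-two coding sketch: the layer-one result enters as an extra
    quadratic constraint lam4 * ||t - Qh c||_2^2, where t is the mapped
    layer-one region."""
    M = Ql.shape[1]
    D = np.diff(np.eye(M), axis=0)                      # first-difference operator
    H = Ql.T @ Ql + lam2 * D.T @ D + lam4 * Qh.T @ Qh   # all quadratic terms combined
    b = Ql.T @ z + lam4 * Qh.T @ t
    step = 0.5 / np.linalg.norm(H, 2)
    c = np.zeros(M)
    for _ in range(iters):
        g = c - step * 2 * (H @ c - b)
        c = np.sign(g) * np.maximum(np.abs(g) - step * lam3, 0.0)
    return c

rng = np.random.default_rng(5)
Ql, Qh = rng.standard_normal((12, 8)), rng.standard_normal((12, 8))
c_true = np.array([0., 0., 0.3, 0.5, 0.6, 0.4, 0., 0.])
z, t = Ql @ c_true, Qh @ c_true              # consistent low/high-res observations
c = two_layer_code(Ql, Qh, z, t, lam2=0.3, lam3=0.001, lam4=0.3)
print(np.linalg.norm(z - Ql @ c) < 0.5 * np.linalg.norm(z))  # True
```

The λ4 term is what makes the small-block layer honour the structure already recovered by the large-block layer.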
To further illustrate the feasibility and effectiveness of the method of the present invention, experiments were conducted on the method of the present invention.
Here, the method of the present invention was tested on the FEI face data set. The FEI face data set contains two high-resolution face images for each of 200 different persons (100 men and 100 women): one with a neutral expression and one with a smiling expression. Each high-resolution face image in the FEI face data set was down-sampled to obtain a corresponding low-resolution face image of size 30 × 25, and Gaussian noise with standard deviation σ = 10 or σ = 30 was added to all low-resolution face images. In the experiment, 360 high-resolution face images of 180 persons, together with the low-resolution face image corresponding to each of them, were randomly selected to form the training set, and the 40 low-resolution face images of the remaining 20 persons were used as the tested low-resolution face images.
To verify its effectiveness, the method of the present invention was compared with other excellent face super-resolution methods, in particular the smoothness-based super-resolution reconstruction method (SSR) proposed in Jiang J, Ma J, Chen C, et al., "Noise Robust Face Image Super-Resolution Through Smooth Sparse Representation", IEEE Transactions on Cybernetics, 2017, PP(99): 1-12; with a method that adds one CCA mapping to SSR and reconstructs in a single layer (only one image-block size is considered during reconstruction); and with a method that adds two CCA mappings to SSR and reconstructs in a single layer. The method of the present invention adds two CCA mappings and two-layer reconstruction to SSR: two image-block sizes are considered during reconstruction, the large-size image blocks are reconstructed first, and the reconstruction result of the large-size blocks is then used as a constraint for the second reconstruction with the small-size blocks. The image blocks of the first division of the low-resolution and high-resolution face images in the training set and of the tested low-resolution face images are of size 5 × 5, and those of the second division are of size 3 × 3.
The tested low-resolution face images were reconstructed with the smoothness-based super-resolution reconstruction method (SSR), with the method that adds one CCA mapping to SSR and reconstructs in a single layer (denoted "CCA single layer"), with the method that adds two CCA mappings to SSR and reconstructs in a single layer (denoted "2CCA single layer"), and with the method of the present invention. Table 1 gives the average PSNR and SSIM of the high-resolution face images obtained after reconstructing the 40 tested low-resolution face images with each of these methods under different noise levels (σ = 10 and σ = 30). As the data listed in Table 1 show, under severe noise the method of the present invention improves PSNR by 0.75 and SSIM by 0.0675 relative to the SSR method; it also outperforms the single-layer reconstruction methods, i.e., the CCA single-layer and 2CCA single-layer methods, on both the PSNR index and the SSIM index.
Table 1: average PSNR and SSIM of the high-resolution face images obtained by reconstructing the 40 tested low-resolution face images with each of the above methods under different noise levels (σ = 10 and σ = 30).
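For reference, the PSNR figures quoted above follow the standard definition; a minimal sketch assuming NumPy (the `psnr` helper is mine, not from the patent):

```python
import numpy as np

def psnr(ref, img, peak=255.0):
    """Peak signal-to-noise ratio in dB between a reference image and a
    reconstruction (standard definition, as reported in Table 1)."""
    mse = np.mean((np.asarray(ref, float) - np.asarray(img, float)) ** 2)
    return float(10 * np.log10(peak ** 2 / mse))

ref = np.full((30, 25), 100.0)      # a flat 30 x 25 reference image
noisy = ref + 10.0                  # constant error of 10 grey levels, so MSE = 100
print(round(psnr(ref, noisy), 2))   # 28.13
```

SSIM is computed analogously per image pair and averaged over the 40 test images.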
FIG. 2 shows a noisy face image; FIG. 3 shows the high-resolution face image reconstructed from the noisy face image of FIG. 2 by the existing smoothness-based super-resolution reconstruction method (SSR); FIG. 4 shows the high-resolution face image reconstructed from the noisy face image of FIG. 2 by adding one CCA mapping to SSR with single-layer (i.e., one-layer) reconstruction; FIG. 5 shows the high-resolution face image reconstructed from the noisy face image of FIG. 2 by adding two CCA mappings to SSR with single-layer reconstruction; FIG. 6 shows the high-resolution face image reconstructed from the noisy face image of FIG. 2 by the method of the present invention; and FIG. 7 shows the real high-resolution face image corresponding to the noisy face image of FIG. 2. Comparing FIG. 3, FIG. 4, FIG. 5 and FIG. 6 with FIG. 7, it is evident that the high-resolution face image of FIG. 6 has sharp edge structure and clear details, that its reconstruction quality is good, and that it is the closest to the real high-resolution face image of FIG. 7.

Claims (1)

1. A face image multilayer reconstruction method based on CCA space, characterized by comprising the following steps:
Step one: select a face image database containing at least two low-resolution face images and the high-resolution face image corresponding to each low-resolution face image; record the n-th low-resolution face image in the face image database as I_l^n and its corresponding high-resolution face image as I_h^n, and record the tested low-resolution face image as I_t; wherein n is a positive integer, 1 ≤ n ≤ N, N denotes the total number of low-resolution face images contained in the face image database, N ≥ 2, the widths of I_l^n, I_h^n and I_t are all W, and the heights of I_l^n, I_h^n and I_t are all H;
step two: dividing each low-resolution face image in face image database into S by adopting sliding window technology 1 Each overlapping with a dimension of k 1 ×k 1 Image blocks of
Figure FDA0001857838530000018
S of (1) 1 Each image block is recorded as>
Figure FDA0001857838530000019
Similarly, a sliding window technology is adopted to divide the high-resolution face image corresponding to each low-resolution face image in the face image database into S 1 Each overlapping with a dimension of k 1 ×k 1 Will->
Figure FDA00018578385300000110
S of (1) 1 Each image block is recorded as>
Figure FDA00018578385300000111
Will be picked up using a sliding window technique>
Figure FDA00018578385300000112
Is divided into 1 Each overlapping with a dimension of k 1 ×k 1 Will->
Figure FDA00018578385300000113
S of (1) 1 Each image block is recorded as>
Figure FDA00018578385300000114
Wherein the size of the sliding window is k 1 ×k 1 ,k 1 =5,7,9,11, sliding step of sliding window is 1 pixel point, S 1 =(W-k 1 +1)×(H-k 1 +1),s 1 Is a positive integer, s is not less than 1 1 ≤S 1
Step three: arranging the pixel values of all pixel points in each image block in each low-resolution face image in a face image database to form corresponding column vectors, and arranging the pixel values of all pixel points in each image block in each low-resolution face image in the face image database to form a column vector
Figure FDA00018578385300000115
The corresponding column vector is marked +>
Figure FDA00018578385300000116
Similarly, arranging the pixel values of all pixel points in each image block in the high-resolution face image corresponding to each low-resolution face image in the face image database to form a corresponding column vector, and then judging whether the pixel values of all pixel points in each image block in the high-resolution face image correspond to the low-resolution face images in the face image database or not>
Figure FDA00018578385300000117
The corresponding column vector is marked as +>
Figure FDA00018578385300000118
Will be/are>
Figure FDA00018578385300000119
The pixel values of all the pixel points in each image block are arranged to form a corresponding column vector, and the ^ is greater than or equal to>
Figure FDA00018578385300000120
The corresponding column vector is marked as +>
Figure FDA00018578385300000121
Then, column vectors corresponding to image blocks at the same position in all low-resolution face images in the face image database form a low-resolution dictionary to form S 1 A low resolution dictionary for mapping the s-th image to all low resolution face images in the face image database 1 A low-resolution dictionary formed by column vectors corresponding to image blocks is recorded as->
Figure FDA0001857838530000021
Similarly, column vectors corresponding to image blocks at the same position in all high-resolution face images in the face image database form a high-resolution dictionary, and form S 1 A high resolution dictionary for comparing the s-th of all high resolution face images in the face image database 1 The high-resolution dictionary formed by the column vectors corresponding to the image blocks is marked as->
Figure FDA0001857838530000022
Wherein it is present>
Figure FDA0001857838530000023
Figure FDA0001857838530000024
Are all (k) 1 ×k 1 )×1,/>
Figure FDA0001857838530000025
Are all (k) 1 ×k 1 )×N,/>
Figure FDA0001857838530000026
Is/is->
Figure FDA0001857838530000027
Is/is->
Figure FDA0001857838530000028
Step four: calculating the projection matrix corresponding to each low-resolution dictionary and each high-resolution dictionary respectively, and calculating the projection matrix
Figure FDA0001857838530000029
The corresponding projection matrix is recorded as +>
Figure FDA00018578385300000210
Will->
Figure FDA00018578385300000211
The corresponding projection matrix is recorded as +>
Figure FDA00018578385300000212
Wherein it is present>
Figure FDA00018578385300000213
And &>
Figure FDA00018578385300000214
All dimensions of (a) are L × (k) 1 ×k 1 ) L represents the dimension of CCA space, and L is equal to {1,2, …, k 1 ×k 1 };
Step five: mapping each low-resolution dictionary from an image space to a CCA space to obtain a corresponding primary mapping low-resolution dictionary, and mapping each low-resolution dictionary to a CCA space
Figure FDA00018578385300000215
The corresponding one-time mapped low resolution dictionary is &>
Figure FDA00018578385300000216
Figure FDA00018578385300000217
Similarly, mapping each high-resolution dictionary from the image space to the CCA space to obtain a corresponding once-mapped high-resolution dictionary, and combining>
Figure FDA00018578385300000218
Corresponding one-time mapped high resolution dictionary +>
Figure FDA00018578385300000219
Figure FDA00018578385300000220
Wherein +>
Figure FDA00018578385300000221
And &>
Figure FDA00018578385300000222
The dimensions of (A) are all L multiplied by N;
step six: calculate each primary mapSparse coefficient vectors of the low resolution dictionary
Figure FDA00018578385300000223
Is marked as->
Figure FDA00018578385300000224
By counting->
Figure FDA00018578385300000225
Obtaining; then carrying out once sparse updating on the once mapped low-resolution dictionary by using the sparse coefficient vector of each once mapped low-resolution dictionary to obtain the updated dictionary of each once mapped low-resolution dictionary, and then judging whether the updated dictionary is the same as the updated dictionary or not>
Figure FDA00018578385300000226
The updated dictionary is recorded as
Figure FDA00018578385300000227
If/or>
Figure FDA00018578385300000228
Will be a non-zero element, will->
Figure FDA00018578385300000229
The nth column vector of (a) is extracted and the slave is then taken>
Figure FDA00018578385300000230
All column vectors extracted in (are) formed in the original order->
Figure FDA00018578385300000231
Similarly, sparsely updating each once-mapped high-resolution dictionary to obtain a dictionary updated by each once-mapped high-resolution dictionary, and combining>
Figure FDA0001857838530000031
The updated dictionary is recorded as +>
Figure FDA0001857838530000032
If/or>
Figure FDA0001857838530000033
Will be a non-zero element, will->
Figure FDA0001857838530000034
The nth column vector of (a) is extracted and the slave is then taken>
Figure FDA0001857838530000035
All column vectors extracted in (a) are formed in original order->
Figure FDA0001857838530000036
Wherein it is present>
Figure FDA0001857838530000037
Dimension of (a) is Nx 1, argmin () represents solving the residual minimum value, the symbol "| | | | non-conducting phosphor 2 Is "is 2 Norm regular term operation symbol, symbol "| | | | non-woven phosphor 1 Is "is 1 Norm regular term operator sign, λ 1 Is a constant, λ 1 ∈(0,1),/>
Figure FDA0001857838530000038
And &>
Figure FDA0001857838530000039
Has dimension L × M, M denotes->
Figure FDA00018578385300000310
The total number of the non-zero elements in the alloy is more than or equal to 1, and M is less than N;
step seven: the updated dictionary of each once-mapped low-resolution dictionary is reversely mapped from the CCA space back to the image spaceTo obtain a corresponding reflection low resolution dictionary
Figure FDA00018578385300000311
The corresponding retroreflection low resolution dictionary is &>
Figure FDA00018578385300000312
Similarly, the updated dictionary of each once-mapping high-resolution dictionary is back-mapped to the image space from the CCA space to obtain a corresponding reflection high-resolution dictionary, and the dictionary is/are>
Figure FDA00018578385300000313
The corresponding reflection high resolution dictionary is recorded as>
Figure FDA00018578385300000314
Wherein it is present>
Figure FDA00018578385300000315
And &>
Figure FDA00018578385300000316
Has a dimension of (k) 1 ×k 1 )×M;
Step eight: calculating the projection matrix corresponding to each back mapping low-resolution dictionary and each back mapping high-resolution dictionary respectively, and calculating the projection matrix corresponding to each back mapping low-resolution dictionary and each back mapping high-resolution dictionary
Figure FDA00018578385300000317
The corresponding projection matrix is recorded as +>
Figure FDA00018578385300000318
Will->
Figure FDA00018578385300000319
the corresponding projection matrix is denoted
Figure FDA00018578385300000320
where
Figure FDA00018578385300000321
and
Figure FDA00018578385300000322
each have dimension L × (k₁ × k₁); L denotes the dimension of the CCA space, L ∈ {1, 2, …, k₁ × k₁};
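As an illustrative aid (not part of the claims), projection matrices of shape L × (k₁ × k₁) that map vectorized image blocks into a shared CCA space can be computed from paired low/high-resolution dictionaries with a standard CCA solver. The function name, the regularizer `reg`, and the SVD-based formulation below are assumptions; the patent's exact computation is given by its formula images.

```python
import numpy as np

def cca_projections(X, Y, L, reg=1e-6):
    """Compute CCA projection matrices for paired column-sample matrices.

    X: d1 x M low-resolution dictionary (columns are vectorized blocks)
    Y: d2 x M high-resolution dictionary (same column pairing)
    L: dimension of the shared CCA space
    Returns Wx (L x d1) and Wy (L x d2) such that Wx @ X and Wy @ Y are
    maximally correlated, matching the L x (k1*k1) shape in step eight.
    """
    Xc = X - X.mean(axis=1, keepdims=True)
    Yc = Y - Y.mean(axis=1, keepdims=True)
    M = X.shape[1]
    Cxx = Xc @ Xc.T / (M - 1) + reg * np.eye(X.shape[0])
    Cyy = Yc @ Yc.T / (M - 1) + reg * np.eye(Y.shape[0])
    Cxy = Xc @ Yc.T / (M - 1)

    def inv_sqrt(C):
        # inverse matrix square root via eigendecomposition (C is SPD)
        w, V = np.linalg.eigh(C)
        return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

    Kx, Ky = inv_sqrt(Cxx), inv_sqrt(Cyy)
    U, s, Vt = np.linalg.svd(Kx @ Cxy @ Ky)
    Wx = (Kx @ U[:, :L]).T          # L x d1
    Wy = (Ky @ Vt.T[:, :L]).T       # L x d2
    return Wx, Wy
```

Mapping a dictionary into the CCA space is then a single matrix product (e.g. `Wx @ X`), and "back-mapping" as in step seven would use a (pseudo-)inverse of the same projection.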
Step nine: map each back-mapped low-resolution dictionary from the image space to the CCA space to obtain the corresponding re-mapped low-resolution dictionary; for
Figure FDA00018578385300000323
the corresponding re-mapped low-resolution dictionary is denoted
Figure FDA00018578385300000324
Figure FDA00018578385300000325
Similarly, map each back-mapped high-resolution dictionary from the image space to the CCA space to obtain the corresponding re-mapped high-resolution dictionary; for
Figure FDA00018578385300000326
the corresponding re-mapped high-resolution dictionary is denoted
Figure FDA00018578385300000327
Figure FDA00018578385300000328
Then map each image block of
Figure FDA00018578385300000329
from the image space to the CCA space, obtaining for
Figure FDA00018578385300000330
a once-mapped block corresponding to each of its image blocks; the once-mapped block corresponding to
Figure FDA00018578385300000331
is denoted
Figure FDA00018578385300000332
Arrange, for each image block of
Figure FDA0001857838530000041
the pixel values of all pixel points in its once-mapped block into a corresponding column vector; the column vector corresponding to
Figure FDA0001857838530000042
is denoted
Figure FDA0001857838530000043
Figure FDA0001857838530000044
where
Figure FDA0001857838530000045
and
Figure FDA0001857838530000046
each have dimension L × M, and
Figure FDA0001857838530000047
has dimension L × 1;
Step ten: calculate the Euclidean distance between each column vector in each re-mapped low-resolution dictionary and
Figure FDA0001857838530000048
the column vector of the once-mapped block corresponding to each image block; that is, for
Figure FDA0001857838530000049
calculate the Euclidean distance between
Figure FDA00018578385300000410
and each column vector in
Figure FDA00018578385300000411
, i.e. the Euclidean distance from
Figure FDA00018578385300000412
obtaining M Euclidean distances; then, for each re-mapped low-resolution dictionary, sort its M Euclidean distances from largest to smallest; then, according to this ordering, rearrange the positions of all column vectors in each re-mapped low-resolution dictionary to obtain the corresponding recombined low-resolution dictionary; for
Figure FDA00018578385300000413
the corresponding recombined low-resolution dictionary is denoted
Figure FDA00018578385300000414
; the 1st column vector and
Figure FDA00018578385300000415
have the largest Euclidean distance, while in
Figure FDA00018578385300000416
the last column vector and
Figure FDA00018578385300000417
have the smallest Euclidean distance; where
Figure FDA00018578385300000418
has dimension L × M;
similarly, calculate the sum of each column vector in each remapped high resolution dictionary
Figure FDA00018578385300000419
the column vector of the once-mapped block corresponding to each image block; that is, for
Figure FDA00018578385300000420
calculate the Euclidean distance between
Figure FDA00018578385300000421
and each column vector in
Figure FDA00018578385300000422
, i.e. the Euclidean distance from
Figure FDA00018578385300000423
obtaining M Euclidean distances; then, for each re-mapped high-resolution dictionary, sort its M Euclidean distances from largest to smallest; then, according to this ordering, rearrange the positions of all column vectors in each re-mapped high-resolution dictionary to obtain the corresponding recombined high-resolution dictionary; for
Figure FDA00018578385300000424
the corresponding recombined high-resolution dictionary is denoted
Figure FDA00018578385300000425
; the 1st column vector and
Figure FDA00018578385300000426
have the largest Euclidean distance, while in
Figure FDA00018578385300000427
the last column vector and
Figure FDA00018578385300000428
have the smallest Euclidean distance; where
Figure FDA00018578385300000429
has dimension L × M;
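As an illustrative aid (not part of the claims), the column reordering of step ten — first column farthest from the query vector, last column closest — is a simple argsort over per-column Euclidean distances. The function name below is an assumption.

```python
import numpy as np

def reorder_dictionary(D, q):
    """Reorder the columns of dictionary D (L x M) by descending Euclidean
    distance to the query vector q (length L), as in step ten: the first
    column ends up farthest from q, the last column closest to q."""
    dists = np.linalg.norm(D - q[:, None], axis=0)   # M distances, one per column
    order = np.argsort(-dists)                        # largest distance first
    return D[:, order], dists[order]
```

Here `q` plays the role of the column vector of the once-mapped block of the tested low-resolution image, and `D` a re-mapped dictionary; the returned matrix is the recombined dictionary.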
Step eleven: calculate, for each image block of
Figure FDA0001857838530000051
its first sparse coefficient vector; the first sparse coefficient vector of
Figure FDA0001857838530000052
is denoted
Figure FDA0001857838530000053
which is obtained by solving
Figure FDA0001857838530000054
; then, for
Figure FDA0001857838530000055
the high-resolution face image obtained after one-layer reconstruction is denoted
Figure FDA0001857838530000056
; the region in
Figure FDA0001857838530000057
corresponding to the position of
Figure FDA0001857838530000058
is denoted
Figure FDA0001857838530000059
; and the column vector formed by arranging the pixel values of all pixel points of
Figure FDA00018578385300000510
is denoted
Figure FDA00018578385300000511
Figure FDA00018578385300000512
where
Figure FDA00018578385300000513
has dimension M × 1; m is a positive integer, 1 ≤ m ≤ M; λ₂ and λ₃ are constants, λ₂ ∈ (0,1), λ₃ ∈ (0,1);
Figure FDA00018578385300000514
denotes the m-th element of
Figure FDA00018578385300000515
,
Figure FDA00018578385300000516
denotes the m-th column vector of
Figure FDA00018578385300000517
,
Figure FDA00018578385300000518
denotes the (m−1)-th element of
Figure FDA00018578385300000519
;
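As an illustrative aid (not part of the claims), once each block of the tested image has been reconstructed, a full face image must be assembled from the mutually overlapping blocks. One common merging rule, shown below, averages the per-block reconstructions at every pixel; the function name and the averaging rule are assumptions — the patent's exact smoothness-based merging is defined by its formula images.

```python
import numpy as np

def assemble_from_blocks(blocks, W, H, k):
    """Reassemble a W x H image from its overlapping k x k blocks taken with
    stride 1 (row-major order), averaging contributions at each pixel."""
    acc = np.zeros((W, H))   # accumulated pixel values
    cnt = np.zeros((W, H))   # number of blocks covering each pixel
    idx = 0
    for i in range(W - k + 1):
        for j in range(H - k + 1):
            acc[i:i + k, j:j + k] += blocks[idx]
            cnt[i:i + k, j:j + k] += 1.0
            idx += 1
    return acc / cnt
```

Because interior pixels are covered by several blocks while corner pixels are covered by one, dividing by the per-pixel count keeps the reconstruction unbiased across the image.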
Step twelve: change the size of the sliding window to k₂ × k₂; then, following the process of step two through step ten, obtain S₂ recombined low-resolution dictionaries and S₂ recombined high-resolution dictionaries in the same manner; the s₂-th recombined low-resolution dictionary is denoted
Figure FDA00018578385300000520
and the s₂-th recombined high-resolution dictionary is denoted
Figure FDA00018578385300000521
; then calculate the second sparse coefficient vector of each image block of
Figure FDA00018578385300000522
; for the s₂-th image block of
Figure FDA00018578385300000523
, i.e.
Figure FDA00018578385300000524
, the second sparse coefficient vector is denoted
Figure FDA00018578385300000525
which is obtained by solving
Figure FDA00018578385300000526
; then, for
Figure FDA00018578385300000527
the high-resolution face image obtained after two-layer reconstruction is denoted
Figure FDA00018578385300000528
; the region in
Figure FDA00018578385300000529
corresponding to the position of
Figure FDA00018578385300000530
is denoted
Figure FDA00018578385300000531
; and the column vector formed by arranging the pixel values of all pixel points of
Figure FDA00018578385300000532
is denoted
Figure FDA00018578385300000533
Figure FDA00018578385300000534
where
Figure FDA00018578385300000535
and
Figure FDA00018578385300000536
each have dimension L × M; k₂ = 3, 5, 7, or 9, with 1 < k₂ < k₁; S₂ denotes the total number of mutually overlapping image blocks of size k₂ × k₂ into which each low-resolution face image in the face image database, its corresponding high-resolution face image, and
Figure FDA00018578385300000537
are divided using the sliding-window technique, S₂ = (W − k₂ + 1) × (H − k₂ + 1), 1 ≤ s₂ ≤ S₂;
Figure FDA0001857838530000061
has dimension M × 1;
Figure FDA0001857838530000062
is the projection matrix obtained in the same manner as
Figure FDA0001857838530000063
;
Figure FDA0001857838530000064
denotes the column vector formed by arranging the pixel values of all pixel points in
Figure FDA0001857838530000065
,
Figure FDA0001857838530000066
denotes the m-th element of
Figure FDA0001857838530000067
,
Figure FDA0001857838530000068
denotes the (m−1)-th element of
Figure FDA0001857838530000069
,
Figure FDA00018578385300000610
denotes the m-th column vector of
Figure FDA00018578385300000611
; λ₄ is a constant, λ₄ ∈ (0,1);
Figure FDA00018578385300000612
is the projection matrix obtained in the same manner as
Figure FDA00018578385300000613
;
Figure FDA00018578385300000614
denotes the column vector formed by arranging the pixel values of all pixel points in the region of
Figure FDA00018578385300000615
corresponding to the position of
Figure FDA00018578385300000616
;
Figure FDA00018578385300000617
denotes the m-th column vector of
Figure FDA00018578385300000618
.
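As an illustrative aid (not part of the claims), the sliding-window blocking used throughout — mutually overlapping k × k blocks extracted with stride 1 from a W × H image, giving S = (W − k + 1) × (H − k + 1) blocks — can be sketched as follows. The function name is an assumption.

```python
import numpy as np

def sliding_blocks(img, k):
    """Extract all mutually overlapping k x k blocks (stride 1) from a
    W x H image, so the block count is S = (W - k + 1) * (H - k + 1)."""
    W, H = img.shape
    blocks = [img[i:i + k, j:j + k]
              for i in range(W - k + 1)
              for j in range(H - k + 1)]
    return np.stack(blocks)          # shape: (S, k, k)
```

With the first-layer window k₁ and the second-layer window k₂ (k₂ < k₁), this yields S₁ and S₂ blocks respectively; vectorizing each block column-wise gives the (k × k) × 1 column vectors used to build the dictionaries.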
CN201811322383.9A 2018-11-08 2018-11-08 Face image multilayer reconstruction method based on CCA space Active CN109712069B (en)


Publications (2)

Publication Number Publication Date
CN109712069A CN109712069A (en) 2019-05-03
CN109712069B true CN109712069B (en) 2023-04-07




