PLAUSIBLE AGEING OF THE HtJMAN FACE
This invention relates to the generation of facial images, using digital representations of facial features. In particular, the invention relates to the generation of a facial image which is based 5 on a known facial image and to which an ageing algorithm has been applied.
There has been much work into the digital representation of facial images. One known method involves use of a statistical model of facial appearance. This model captures both shape and texture properties of a human face, and an individual face is represented as a highly 10 compact vector of parameters. A close resemblance to a real face can be provided with as few as 25 parameters.
The known statistical model is described further in the article by T.F.Cootes, GJ. Edwards and C.J.Taylor entitled "Active Appearance Models", EEEE PAMI, Vol.23, No.6, pp.681-685, 15 2001.
The representation of the human face as a vector of parameters opens up many possibilities. One proposed use of the statistical model is to generate a resemblance of a facial image from memory, for example for use by the police. In this proposed use of the statistical model, an 0 evolutionary algorithm is used to iteratively modify an initial facial image (which may be randomly generated) to reach a target facial image.
This invention is directed to the processing of a digital image of a human face so as to change the apparent age of the face. There are a number of possible applications for such a process. 5 For missing persons applications, a plausible and near photo-realistic facial appearance can be generated of individuals for whom no current photographic images exist. Such a process can also be used to produce a plausible and near photo-realistic facial appearance of individuals as they were at an earlier age. Such a method could be used, for example, in the attempt to convict or exonerate individuals accused of war crimes in an earlier epoch.
30
The effect of ageing on a human face is not, however, easily predicted or easily modelled, and a reliable method of processing facial images to represent ageing has not been achieved, hi particular, the ageing effect as a child matures to adulthood is particularly difficult to model.
According to the invention, there is provided a facial image generation system, comprising: means for receiving a first vector representation of a first facial image of a first individual having an age in a first age range; means for receiving data derived from second vector representations of a plurality of facial images of a first group of individuals having ages in the first age range; means for receiving data derived from third vector representations of a plurality of facial images of a second group of individuals having ages in a second age range which does not overlap with the first age range; means for receiving data derived from fourth vector representations of a plurality of facial images of a third group of individuals which are related to the individuals of the second group and of a different generation to the individuals of the second group; and a processor for processing the data derived from the first, second, third and fourth vector representations to generate an output vector representation which comprises a vector representation of an estimate of the facial image of the first individual in the second age range.
This apparatus uses samples of facial images of the starting age (the first age range) and the target age (the second age range) in order to model the ageing process on the vector representations of a given facial image, hi addition, samples of facial images are used to derive a relationship between the facial images of different generations of the same family. This relationship can then be used to ensure that an ageing algorithm (to an older, or to a younger age) can be applied which is consistent both with a general change in appearance between the starting age and the target age, but also to provide consistency of the estimated facial image with other family members of the first individual.
The second, third and fourth vector representations are preferably of individuals having the same gender and of the same racial origin as the first individual. The second and third vector representations are preferably of individuals not related to the first individual, and these may
therefore comprise general samples which can be used for processing any starting image of the corresponding age range. The fourth group of individuals preferably include a parent or child of the first individual. The image for this parent or child can then be used to ensure that the ageing algorithm also takes into account the way ageing effects are specific to individual families.
The second age range may be higher than the first age range, so that an image of a child can for example be used to generate an image of an adult. For example, the first age range can be a pre-pubescent age range, and the second age range can be a pubescent or adult age range.
The processor is adapted to apply a transformation to the first vector representation using a first scaling vector representing the difference between an average of the second vector representations and an average of the third vector representations. This difference can be considered to represent a general ageing effect. The processor is preferably further adapted to apply a transformation to the first vector representation using a second scaling vector representing the difference between the vector representation of the parent or child of the first individual and the first vector representation. This second scaling vector represents a difference between different generations of the family of the first individual. The two scaling vectors thus enable family resemblances to be maintained whilst also applying general ageing effects.
The transformation may comprise a linear transformation using weighted values of the first and second scaling vectors.
For example, the weighting values can be selected to maximise the product of (i) the probability of the output vector representation belonging to the second group of individuals with (ii) the probability of the output vector representation being a parent or child of the known parent or child of the first individual. This aims to ensure that the generated facial image is probabilistically a good fit into the second group of individuals of the target age, and also probabilistically remains a good fit into the family of the first individual.
The invention also provides a method of generating a facial image, comprising: receiving a first vector representation of a first facial image of a first individual having an age in a first age range; receiving data derived from second vector representations of a plurality of facial images of a first group of individuals having ages in the first age range; receiving data derived from third vector representations of a plurality of facial images of a second group of individuals having ages in a second age range which does not overlap with the first age range; receiving data derived from fourth vector representations of a plurality of facial images of a third group of individuals which are related to the individuals of the second group and of a different generation to the individuals of the second group; and processing the data derived from the, first, second, third and fourth vector representations to generate an output vector representation which comprises a vector representation of an estimate of the facial image of the first individual in the second age range.
An example of the invention will now be described in detail with reference to the accompanying drawings, in which:
Figure 1 shows the data used in the method and apparatus of the invention; Figure 2 shows the information derived from the input data in the method and apparatus of the invention; and
Figure 3 shows an apparatus of the invention.
The invention provides a method of producing a statistically optimal estimate of the facial appearance of a first individual at a different age.
An original face is thus to be age-transformed from its current age of ynow to a target age of yτ . To produce such an optimal estimate, the invention utilises the following data in digital form:
-The original face. This group (of one) will be referred to by the symbol O.
-A selection of faces of approximately the same age, gender and racial origin as the original face. This group is referred to by the symbol S.
-A selection of faces corresponding to the parent(s) or sibling(s) of the group S. These faces therefore represent individuals of a different generation to the faces in group S. This group is referred to by the symbol P.
-A selection of faces of the same age, gender and racial origin as the original face having the (approximate) target age yτ . This group is referred to by the symbol T.
This data is schematically summarised in Figure 1.
The first step in the procedure of the invention is to build a statistical appearance model using all the faces in the groups O5P5T and S. The precise method for calculating such an appearance model is described in detail in the article by T.F.Cootes, GJ. Edwards and C. J.Taylor entitled "Active Appearance Models", referenced above. The central result of constructing such an appearance model as is that each example face in any of the the groups O5P5T and S can be parametrically encoded as a compact vector of numerical parameters which retains all the important shape and textural information in the facial appearance. There will typically be 25-60 such parameters.
Making the general assumption that N such parameters are sufficient to encode the facial appearance of the faces to the required accuracy, an appearance vector can be denoted by x - [X1, x2,- - -xN\ .
Each appearance vector, consisting of N numerical parameters, may be considered to occupy a certain location in an abstract parameter space of N dimensions, the magnitude of the kth component xk thereby corresponding to the extension along the kth axis of this abstract space.
Altering any of the components in an appearance vector thus moves to a different position in this abstract space and alters the facial appearance of the individual. The parameter space
effectively defines a very large number of different facial appearances each of which corresponds to a specific point location encoded by the vector x = [^1 , x2 , • • • xN ] .
This invention provides a means to move from one location in the parameter space to another location effectively aging the facial appearance in a way which is consistent with information about parental similarity in facial appearance and typical aging effects for the subject's age, gender and racial origin.
For this explanation: The appearance vector of the original face (i.e. the single subject in group O) is denoted by c = [C15C2,- -c^] .
The appearance vector of the transformed face is denoted by x = [xvx2,- - -xN] . This is the output of the system of the invention.
The appearance vector of a parent (or child if the arm is to reduce the age) of the subj ect is denoted by xP = ϊxp ι , xp 2 ,■■■ xp N 1.
μτ = — ∑(xr ) represents the average ox prototype appearance vector of the target
age group T where (xτ) is the appearance vector of the kth member of group T.
represents the average or prototype appearance vector of the current
age group S where (x
s J is the appearance vector of the kth member of group S.
Using these definitions it is possible to conceptually represent the transformation process by a vector diagram as shown in Figure 2.
The aim is to transform c to a new vector x in a way which gives optimal results. The preferred implementation of the invention provides a linear transformation of the form:
X = C + W,S + W2V (1)
where the free parameters to be determined are W1 and W2 . This transformation models the aging process as a weighted combination of two basic effects:
i) The tendency for an individual to resemble a parent or sibling as defined by the vector s . ii) The tendency for an individual to follow a prototypical dominant aging process defined by representative samples of people of the same gender and ethnic origin at both the current and target ages. This is defined by the vector v .
As shown in Figure 2, the vector v is a normalised vector representing the difference between the average vectors of groups S and T. The vector s is the difference between the starting vector c and the known parent (or sibling) vector.
To determine the appropriate weighting parameters W1 SLHdW2 , the following criterion for optimality are selected:
The transformed vector x (which represents the facial image after the change in age) should, as far as possible, be both a typical member of the target age group T and a typical member of the parent (sibling) - child distribution as determined by corresponding pairs in the groups P and T.
Specifically, the density functions for both groups are estimated and an estimator for x is obtained which is maximum likelihood for simultaneously originating from both distributions.
It is a direct result of the calculation of a statistical appearance model as proposed (i.e. on groups O5P5S and T together), that the distribution of the appearance model parameters over the whole space is independent, multivariate normal. Indeed, the independent distribution of
the model parameters is precisely the aim of the calculation. Thus the model parameters (here denoted generally by y = [jFpJV • '>v] ) over the whole space is of the form:
where C is a diagonal covariance matrix.
The respective distributions of the parameters over the two sub-spaces described above (i.e. target age group T and parent to child at target age group) both take a similar multivariate normal form but are not independent. Thus, the distribution over the target age group is:
PTCX) = Nn 1/2 exp --(X - ^)Γ C;' (X - ^) (3)
where μτ and C7. are the mean vector and covariance matrix of the target group T respectively. The covariance matrix C7. in this case will not be of diagonal form.
This equation essentially provides the probability of a given vector x being a member of the group T based on the mean and covariance values.
Similarly, the distribution for the difference in appearance vector between members of the target age group and their parents Δ = x - xp is given by:
/>Δ (Δ) =
2π N/2 17reχpj ~(Δ-//Δ)rc;1 (Δ-/.Δ) (4)
where μ& andCΔ are the mean vector and covariance matrix respectively.
This equation essentially provides the probability of a given difference vector Δ being a suitable difference between a member of age group T and their parent (or child).
It is important to recognise that the free parameters of these two density functions μτ,Cτ , μA andCΔ are not known a priori and must be estimated from the example faces available in a training sample. These values can all be fixed, however, so that they are stored for use in many different ageing operations, and provided samples are stored for the appropriate age ranges, all that is needed to implement the ageing algorithm is the starting image and an image of the parent or child.
The maximum likelihood estimator xfor the facial appearance at the target age (where the subject has a parent with appearance vector Xp) is that vector x which maximises the product pT(x)pA(A) of the two density functions defined by equations 3 and 4. The exponential nature of these distributions allows this to be equivalently expressed as a minirnisation problem, namely, to find the appearance vector x which minimises the cost function:
Substituting equations 1,3 and 4 into equation 5, a function of the two free variables W1 and W2 is obtained:
Q(wι,w2) = kϊ(i-μτJ Cϊ(l-μτ) + (A-μA)T ' C£(Δ-/ZΔ) (6)
where k is a multiplicative constant and Δ = x - X^
To find the optimal solution for x , standard methods of calculus can be used. In particular, partial derivatives of Q with respect to the variables W1 and W2 are obtained, and these are set to zero to obtain a pair of simultaneous equations in W1 and W2 which can be solved trivially.
These equations are:
where the coefficients a, b, c, γx and ^2 are known quantities given by -
a = s 2, s , O = S ∑, v, c =v ∑ v with Z = C;1 +Q1
In this way, the optimal estimates of the weighting parameters W1 and w2 are obtained which can be substituted into equation 1 to produce the transformed (aged) facial appearance.
Figure 3 shows the apparatus 10 of the invention. The method of the invention is essentially implemented as software running on a suitable processor. As shown, the apparatus of the invention needs to receive as inputs the mean vectors for the groups S and T and the difference vectors Δ, the covariance matrix for the groups T and the difference vectors Δ, the starting vector c, and the corresponding parent vector xP.. Of these, all of the data may be pre- available and stored within the system other than the starting vector and the parent vector.
The apparatus may have a display for displaying the obtained vector x.
The generation of the appearance vectors corresponding to known images (such as c) can be achieved in known manner so that image vectors can be created by the system starting from known physical images of the individuals which are entered into the system. The system of the invention may additionally include such capability as to operate an iterative process to arrive at a good likeness on a display to a known image and to generate the appearance vector for this and systems are known for generating the vectors in this way.
Various modifications will be apparent to those skilled in the art.