CN108022308A - Face alignment method based on three-dimensional face model fitting - Google Patents

Face alignment method based on three-dimensional face model fitting

Info

Publication number
CN108022308A
CN108022308A (application CN201711238476.9A)
Authority
CN
China
Prior art keywords
face
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201711238476.9A
Other languages
Chinese (zh)
Inventor
夏春秋 (Xia Chunqiu)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Vision Technology Co Ltd
Original Assignee
Shenzhen Vision Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Vision Technology Co Ltd filed Critical Shenzhen Vision Technology Co Ltd
Priority to CN201711238476.9A
Publication of CN108022308A
Legal status: Withdrawn

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 19/00 Manipulating 3D models or images for computer graphics
    • G06T 19/20 Editing of 3D images, e.g. changing shapes or colours, aligning objects or positioning parts
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/22 Matching criteria, e.g. proximity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 10/00 Arrangements for image or video recognition or understanding
    • G06V 10/40 Extraction of image or video features
    • G06V 10/46 Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V 10/462 Salient features, e.g. scale invariant feature transforms [SIFT]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Multimedia (AREA)
  • Architecture (AREA)
  • Computer Graphics (AREA)
  • Computer Hardware Design (AREA)
  • Image Analysis (AREA)

Abstract

The present invention proposes a face alignment method based on three-dimensional face model fitting. Its main contents include: the 3D face representation, the structure of the convolutional neural network, the landmark fitting constraint, the contour fitting constraint, and the scale-invariant feature transform (SIFT) pairing constraint. The process is as follows: a convolutional neural network is trained to fit a dense 3D face shape to a single input face image; the network learns a nonlinear mapping function from the input image to the corresponding projection parameters and shape parameters; the estimated parameters can be used to construct the dense 3D face shape, and multiple constraints are then applied using the dense 3D shape representation. The three-dimensional face model fitting algorithm used by the present invention adopts multiple constraints and exploits multiple data sets: it not only aligns a limited number of facial landmarks, but also conforms to the facial contour and SIFT feature points, thereby improving face alignment accuracy, reducing computation cost, and greatly improving the efficiency of face alignment.

Description

Face alignment method based on three-dimensional face model fitting
Technical Field
The invention relates to the field of face alignment, in particular to a face alignment method based on three-dimensional face model fitting.
Background
With the development of computer technology, human biological characteristics have come to be used for identification. Compared with other biometric technologies, face recognition has advantages such as convenient feature acquisition and a short recognition cycle. A face recognition system can be divided into the following steps: face detection, face alignment, feature extraction, and recognition. The alignment of faces across different images in a face image sequence affects the accuracy of face recognition and has become an important problem for face recognition systems. Face alignment can be used for facial organ localization and tracking, where accurately locating each part of the face allows the corresponding part features to be extracted. After faces are aligned, the aligned face shapes can be used to analyze expression states, with applications in fields such as analysis of children's points of interest, user satisfaction surveys, and expression-based lie detection. Face alignment can also be used to generate face cartoons and sketches, producing pictures required by a user in combination with a mobile phone photo editor. In addition, face alignment can be used for 3D cartoon simulation, gender identification, age inference from facial aging or rejuvenation, virtual reality and augmented reality, and so on. However, many face alignment methods cannot exploit multiple data sets because each data set has different annotations, and face alignment is computationally expensive and difficult to apply efficiently.
The invention proposes a face alignment method based on three-dimensional face model fitting. A convolutional neural network is trained to fit a dense 3D face shape to a single input face image; the network learns a nonlinear mapping function from the input image to the corresponding projection parameters and shape parameters; the estimated parameters can be used to construct the dense 3D face shape, and multiple constraints are then applied using the dense three-dimensional shape representation. The three-dimensional face model fitting algorithm adopts multiple constraints and exploits multiple data sets: it not only aligns a limited number of facial landmarks, but also conforms to the face contour and scale-invariant feature transform feature points, thereby improving face alignment accuracy, reducing computation cost, and greatly improving the efficiency of face alignment.
Disclosure of Invention
In view of the problems that multiple data sets cannot be utilized and that the computation cost is high, the invention aims to provide a face alignment method based on three-dimensional face model fitting: a convolutional neural network is trained to fit a dense 3D face shape to a single input face image; the network learns a nonlinear mapping function from the input image to the corresponding projection parameters and shape parameters; the estimated parameters can be used to construct the dense 3D face shape, and multiple constraints are then applied using the dense three-dimensional shape representation.
In order to solve the above problems, the present invention provides a face alignment method based on three-dimensional face model fitting, which mainly comprises:
(I) the 3D face representation;
(II) the structure of the convolutional neural network (CNN);
(III) the landmark fitting constraint (LFC);
(IV) the contour fitting constraint (CFC);
(V) the scale-invariant feature transform (SIFT) pairing constraint (SPC).
Wherein, the face alignment method trains a convolutional neural network (CNN) to fit a dense 3D face shape to a single input face image; multiple constraints, namely the landmark fitting constraint, the contour fitting constraint, and the SIFT pairing constraint, are imposed using the dense three-dimensional shape representation.
Wherein, the 3D face representation expresses the dense three-dimensional shape of the face as S, containing the three-dimensional positions of Q vertices:

$$S=\begin{pmatrix} x_{1} & x_{2} & \cdots & x_{Q} \\ y_{1} & y_{2} & \cdots & y_{Q} \\ z_{1} & z_{2} & \cdots & z_{Q} \end{pmatrix} \quad (1)$$

To compute S for a face, S is represented in terms of a basis of 3D model shapes:

$$S=\bar{S}+\sum_{i=1}^{N_{id}} p_{id}^{i} S_{id}^{i}+\sum_{i=1}^{N_{exp}} p_{exp}^{i} S_{exp}^{i} \quad (2)$$

where the face shape S is the sum of the mean shape $\bar{S}$ and the weighted principal component analysis shape bases: identity bases $S_{id}^{i}$ and expression bases $S_{exp}^{i}$ with corresponding weights $p_{id}^{i}$ and $p_{exp}^{i}$. The $N_{id}=199$ identity bases represent variations such as tall or short, light or heavy, and male or female; the $N_{exp}=29$ expression bases represent expression changes such as mouth opening, smiling, and kissing. Each basis has Q = 53215 vertices, which correspond one-to-one with the vertices of all other bases.
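As an illustration of equations (1) and (2), here is a minimal numpy sketch; the function name, array names, and the toy dimensions in the usage example are assumptions for illustration, not data shipped with the patent.

```python
import numpy as np

def construct_shape(mean_shape, id_bases, exp_bases, p_id, p_exp):
    """Equation (2): S = S_bar + sum_i p_id^i S_id^i + sum_i p_exp^i S_exp^i.

    mean_shape: (3, Q) mean face shape S_bar
    id_bases:   (N_id, 3, Q) identity shape bases
    exp_bases:  (N_exp, 3, Q) expression shape bases
    p_id, p_exp: weight vectors of length N_id and N_exp
    Returns the dense 3D shape S of shape (3, Q), as in equation (1).
    """
    S = mean_shape.copy()
    S += np.tensordot(p_id, id_bases, axes=1)    # weighted identity bases
    S += np.tensordot(p_exp, exp_bases, axes=1)  # weighted expression bases
    return S

# Hypothetical usage with tiny random placeholder bases (the patent's model
# has Q = 53215 vertices, N_id = 199 and N_exp = 29 bases):
rng = np.random.default_rng(0)
q, n_id, n_exp = 100, 5, 3
S = construct_shape(rng.normal(size=(3, q)),
                    rng.normal(size=(n_id, 3, q)),
                    rng.normal(size=(n_exp, 3, q)),
                    rng.normal(size=n_id), rng.normal(size=n_exp))
print(S.shape)  # (3, 100)
```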
Further, for the dense three-dimensional shape, a subset of N vertices of the dense 3D face corresponds to the positions of 2D landmarks on the image; their projections are collected as

$$U=\begin{pmatrix} u_{1} & u_{2} & \cdots & u_{N} \\ v_{1} & v_{2} & \cdots & v_{N} \end{pmatrix} \quad (3)$$

The dense shape of the 2D face can be estimated from the 3D face shape under a weak perspective projection; the projection matrix has 6 degrees of freedom, modeling scale, the rotation angles (pitch α, yaw β, and roll γ), and the translation $(t_x, t_y)$. The transformed dense face shape A can be expressed as

$$A = M \cdot \begin{pmatrix} S \\ \mathbf{1}^{\top} \end{pmatrix} \quad (4)$$

where $M \in \mathbb{R}^{3\times 4}$ is the projection matrix with entries $m = [m_1, \ldots, m_{12}]$; A can then be orthographically projected onto the 2D plane to obtain U:

$$U = \mathrm{Pr} \cdot A \quad (5)$$

Since the z translation $(m_{12})$ falls outside the range of interest, it is defined as 0; the orthographic projection can be represented as the matrix

$$\mathrm{Pr}=\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}$$

Further, for the orthographic projection, given the properties of the projection matrix, the normalized third row of the projection matrix can be expressed as the cross product of its normalized first two rows:

$$\left[\bar{m}_{9}, \bar{m}_{10}, \bar{m}_{11}\right] = \left[\bar{m}_{1}, \bar{m}_{2}, \bar{m}_{3}\right] \times \left[\bar{m}_{4}, \bar{m}_{5}, \bar{m}_{6}\right] \quad (6)$$

Thus the dense shape of any 2D face can be determined from the first two rows of the projection parameters, $m = [m_1, \ldots, m_8]$, and the shape coefficients $p = [p_{id}, p_{exp}]$; the learning of dense three-dimensional shapes is thereby converted into the learning of m and p, which is far easier to manage in terms of dimensionality.
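The projection pipeline of equations (4)–(6) can be sketched as follows; the exact layout of m inside the 3×4 matrix is an assumption chosen to be consistent with equation (6), since the patent does not spell it out.

```python
import numpy as np

def build_projection_matrix(m):
    """Assemble the 3x4 weak perspective matrix from m = [m1..m12].

    Assumed layout (consistent with equation (6), whose rotation rows are
    m1..m3 and m4..m6): the first two rows carry the scaled rotation plus
    translations (t_x, t_y) = (m7, m8); the z translation m12 is fixed to 0.
    """
    m = np.asarray(m, dtype=float)
    return np.array([[m[0], m[1], m[2],  m[6]],
                     [m[3], m[4], m[5],  m[7]],
                     [m[8], m[9], m[10], 0.0]])   # m12 := 0

def project_shape(S, m):
    """Equations (4)-(5): A = M [S; 1], U = Pr A."""
    M = build_projection_matrix(m)
    A = M @ np.vstack([S, np.ones(S.shape[1])])   # (4): transformed 3D shape
    Pr = np.array([[1.0, 0.0, 0.0],
                   [0.0, 1.0, 0.0]])              # orthographic projection
    return Pr @ A                                 # (5): dense 2D shape

def third_row(m):
    """Equation (6): the normalized third row is the cross product of the
    normalized first two rotation rows, so only m1..m8 need be learned."""
    r1 = np.asarray(m[0:3], float)
    r2 = np.asarray(m[3:6], float)
    return np.cross(r1 / np.linalg.norm(r1), r2 / np.linalg.norm(r2))
```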
Wherein, for the structure of the convolutional neural network (CNN), the network learns a nonlinear mapping function f(Θ) from the input image I to the corresponding projection parameters m and shape parameters p; the estimated parameters can then be used to construct the dense 3D face shape.
The CNN has two branches, one for predicting m and the other for predicting p. The first three convolution blocks are shared by the two branches; after the third block, two independent convolution blocks extract task-specific features, and two fully connected layers map those features to the final outputs. Each convolution block is a stack of two convolution layers and a max pooling layer, and each convolution or fully connected layer is followed by a batch normalization layer and a rectified linear unit (ReLU) layer.
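A minimal PyTorch sketch of the described two-branch network follows; the channel widths, feature dimensions, and input size are illustrative assumptions, as the patent does not specify them.

```python
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    """One convolution block: two conv layers, each followed by batch
    normalization and ReLU (as described), then a max pooling layer."""
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, padding=1), nn.BatchNorm2d(c_out), nn.ReLU(),
        nn.Conv2d(c_out, c_out, 3, padding=1), nn.BatchNorm2d(c_out), nn.ReLU(),
        nn.MaxPool2d(2))

class DenseFittingCNN(nn.Module):
    """Two-branch CNN: three shared conv blocks, then one task-specific
    conv block and two fully connected layers per branch, predicting the
    projection parameters m (first two rows, 8-dim) and the shape
    parameters p (199 identity + 29 expression coefficients)."""
    def __init__(self, n_m=8, n_p=199 + 29):
        super().__init__()
        self.shared = nn.Sequential(conv_block(3, 32), conv_block(32, 64),
                                    conv_block(64, 128))
        def head(n_out):  # task-specific block + two FC layers
            return nn.Sequential(conv_block(128, 128), nn.Flatten(),
                                 nn.LazyLinear(256), nn.BatchNorm1d(256),
                                 nn.ReLU(), nn.Linear(256, n_out))
        self.head_m, self.head_p = head(n_m), head(n_p)

    def forward(self, img):
        feat = self.shared(img)
        return self.head_m(feat), self.head_p(feat)

# Hypothetical usage on a batch of 100x100 face crops:
m_hat, p_hat = DenseFittingCNN()(torch.randn(2, 3, 100, 100))
```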
To improve CNN learning, a loss function containing multiple constraints is used: the parameter constraint (PC) $J_{pr}$ minimizes the difference between the estimated parameters and the calibrated ground-truth parameters; the landmark fitting constraint (LFC) $J_{lm}$ reduces the alignment error of the 2D landmarks; the contour fitting constraint (CFC) $J_c$ matches the contour of the estimated three-dimensional shape to the contour pixels of the input image; and the SIFT pairing constraint (SPC) $J_s$ encourages matched SIFT feature points of two face images to correspond to the same 3D vertices.
The overall loss function is defined as

$$\arg\min_{\hat{m},\hat{p}} J = J_{pr} + \lambda_{lm} J_{lm} + \lambda_{c} J_{c} + \lambda_{s} J_{s} \quad (7)$$

where the parameter constraint (PC) loss function is the squared distance between the estimated and ground-truth parameters:

$$J_{pr} = \left\| \left[\hat{m}, \hat{p}\right] - \left[m, p\right] \right\|^{2} \quad (8)$$
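A short PyTorch sketch of equations (7) and (8); the λ weights are placeholders, since the patent does not state their values, and equation (8) is the reconstructed squared-distance form described above.

```python
import torch

def parameter_constraint(m_hat, p_hat, m_gt, p_gt):
    """Equation (8) (reconstructed): squared distance between the estimated
    and ground-truth projection/shape parameters."""
    return ((m_hat - m_gt) ** 2).sum() + ((p_hat - p_gt) ** 2).sum()

def total_loss(J_pr, J_lm, J_c, J_s, lam_lm=1.0, lam_c=1.0, lam_s=1.0):
    """Equation (7): J = J_pr + lam_lm*J_lm + lam_c*J_c + lam_s*J_s.
    The lambda weights here are illustrative placeholders."""
    return J_pr + lam_lm * J_lm + lam_c * J_c + lam_s * J_s
```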
Wherein, the landmark fitting constraint (LFC) aims to minimize the difference between the estimated 2D landmarks and the ground-truth 2D landmarks $U_{lm}$. Given 2D face images with labeled landmarks, the indices of the 3D face vertices that correspond anatomically to these landmarks are first labeled manually; the set of these indices is denoted $i_{lm}$. With the shape A computed from the estimates $\hat{m}$ and $\hat{p}$ according to formula (4), the 3D landmarks can be extracted from A as $A(:, i_{lm})$; projecting $A(:, i_{lm})$ onto the 2D plane, the LFC loss function is defined as

$$J_{lm} = \frac{1}{L} \left\| \mathrm{Pr}\, A(:, i_{lm}) - U_{lm} \right\|_{F}^{2} \quad (9)$$

where the subscript F denotes the Frobenius norm and L is the number of predefined landmarks.
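Equation (9) translates directly into code; a sketch, assuming A comes from the projection step above and that the index set i_lm is a precomputed tensor:

```python
import torch

def landmark_fitting_loss(A, lm_idx, U_lm):
    """Equation (9): mean squared Frobenius distance between the projected
    3D landmark vertices and the ground-truth 2D landmarks.

    A:      (3, Q) transformed dense shape from equation (4)
    lm_idx: length-L tensor of manually labeled landmark vertex indices i_lm
    U_lm:   (2, L) ground-truth 2D landmark positions
    """
    proj = A[:2, lm_idx]            # Pr A(:, i_lm): orthographic projection
    L = U_lm.shape[1]
    return ((proj - U_lm) ** 2).sum() / L
```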
Wherein, the contour fitting constraint (CFC) aims to minimize the error between the projected outer contour of the dense 3D shape and the corresponding contour pixels in the input face image; when the 3D shape is rendered onto the 2D plane, the outer contour can be regarded as the boundary between the background and the 3D face.
Further, using the contour fitting constraint requires the following three steps (a sketch of step (3) is given after these steps):
(1) Detect the true contour $U_c$ in the 2D face image: first, an off-the-shelf edge detector is used to detect contours on the face image; the detected edges are then refined by retaining only the edges within a narrow band defined by the contour landmarks. This preprocessing step is completed offline before training begins.
(2) Describe the silhouette vertices on the estimated three-dimensional shape A: the contour on the estimated shape A, computed from the estimated parameters $\hat{m}$ and $\hat{p}$, can be described as a set of boundary vertices. Representing shape A with a Delaunay triangulation, an edge of a triangle is defined as a boundary edge if the z components of the surface normals of its two adjacent faces have opposite signs; the vertices associated with such edges are defined as boundary vertices, and their index set is denoted $i_c$.
(3) Determine the correspondence between the true contour and the estimated contour, and back-propagate the fitting error: evaluating this constraint requires a point-to-point correspondence between $U_c$ and $A(:, i_c)$. Each contour pixel on the 2D image is matched to the closest point on the projected 3D shape contour, and the minimum distance is computed; the sum of all minimum distances is the CFC error, as shown in formula (10). To make the CFC loss differentiable, formula (10) is rewritten in terms of the vertex index of the nearest contour projection point, $k_0 = \arg\min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2$; once $k_0$ is determined, the CFC loss takes a form similar to equation (9):

$$J_c = \frac{1}{L} \sum_j \min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2 = \frac{1}{L} \sum_j \left\| \mathrm{Pr}\, A\!\left(:, \arg\min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2\right) - U_c(:,j) \right\|^2 \quad (10)$$

Although $i_c$ depends on the current estimates of {m, p}, for simplicity $i_c$ is treated as constant when performing back-propagation.
The scale-invariant feature transform (SIFT) pairing constraint (SPC) is characterized in that, given a pair of face images i and j, the SIFT points on the two face images are first detected and matched; the matched SIFT points are denoted $U_s^i$ and $U_s^j$.
For a perfectly dense face alignment, the matched SIFT points would overlap with exactly the same vertices of the estimated 3D face shapes, denoted $A^i$ and $A^j$. For each matched SIFT point, the 3D vertex whose projection coincides with the 2D SIFT point is found; collecting these indices over the $L_{ij}$ matches gives $i_s^i$:

$$i_s^i = \arg\min_{k} \left\| \mathrm{Pr}\, A^i(:,k) - U_s^i \right\|_{F}^{2} \quad (11)$$

On this basis, the loss function for the SPC is defined as

$$J_s\!\left(\hat{m}^j, \hat{p}^j, \hat{m}^i, \hat{p}^i\right) = \frac{1}{L_{ij}} \left( \left\| \mathrm{Pr}\, A^i(:, i_s^j) - U_s^i \right\|_{F}^{2} + \left\| \mathrm{Pr}\, A^j(:, i_s^i) - U_s^j \right\|_{F}^{2} \right) \quad (12)$$

where $A^i$ is computed from $\{m^i, p^i\}$ and $L_{ij}$ is the number of matched SIFT pairs; in effect, the SIFT points of one face are mapped to dense-shape vertices, and the distance between those vertices' projections and the matched SIFT points on the other face is computed.
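A PyTorch sketch of equations (11) and (12); it assumes the matched SIFT points were produced offline and that shapes are compared after orthographic projection:

```python
import torch

def nearest_vertex_indices(A, U_s):
    """Equation (11): for each 2D SIFT point, the index of the dense-shape
    vertex whose orthographic projection lies closest to it."""
    proj = A[:2]                                                 # Pr A, (2, Q)
    d2 = ((U_s.T[:, None, :] - proj.T[None, :, :]) ** 2).sum(-1) # (L_ij, Q)
    return d2.argmin(dim=1)                                      # (L_ij,)

def sift_pairing_loss(A_i, A_j, U_s_i, U_s_j):
    """Equation (12): matched SIFT points should correspond to the same
    3D vertices on both faces, so the indices found on one face are applied
    to the other face's shape and compared with that face's SIFT points."""
    L_ij = U_s_i.shape[1]
    idx_i = nearest_vertex_indices(A_i, U_s_i)      # i_s^i
    idx_j = nearest_vertex_indices(A_j, U_s_j)      # i_s^j
    term_i = ((A_i[:2, idx_j] - U_s_i) ** 2).sum()  # ||Pr A^i(:,i_s^j) - U_s^i||^2
    term_j = ((A_j[:2, idx_i] - U_s_j) ** 2).sum()  # ||Pr A^j(:,i_s^i) - U_s^j||^2
    return (term_i + term_j) / L_ij
```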
Drawings
Fig. 1 is a system framework diagram of the face alignment method based on three-dimensional face model fitting according to the present invention.
Fig. 2 shows the structure of the convolutional neural network of the face alignment method based on three-dimensional face model fitting.
Detailed Description
It should be noted that the embodiments in the present application, and the features of those embodiments, can be combined with each other without conflict. The present invention is further described in detail below with reference to the drawings and specific embodiments.
Fig. 1 is a system framework diagram of the face alignment method based on three-dimensional face model fitting according to the present invention. The method mainly comprises the 3D face representation, the structure of the convolutional neural network, the landmark fitting constraint, the contour fitting constraint, and the scale-invariant feature transform pairing constraint.
The face alignment method trains a convolutional neural network (CNN) to fit a dense 3D face shape to a single input face image; multiple constraints, namely the landmark fitting constraint, the contour fitting constraint, and the SIFT pairing constraint, are imposed using the dense three-dimensional shape representation.
The 3D face representation expresses the dense three-dimensional shape of the face as S, containing the three-dimensional positions of Q vertices:

$$S=\begin{pmatrix} x_{1} & x_{2} & \cdots & x_{Q} \\ y_{1} & y_{2} & \cdots & y_{Q} \\ z_{1} & z_{2} & \cdots & z_{Q} \end{pmatrix} \quad (1)$$

To compute S for a face, S is represented in terms of a basis of 3D model shapes:

$$S=\bar{S}+\sum_{i=1}^{N_{id}} p_{id}^{i} S_{id}^{i}+\sum_{i=1}^{N_{exp}} p_{exp}^{i} S_{exp}^{i} \quad (2)$$

where the face shape S is the sum of the mean shape $\bar{S}$ and the weighted principal component analysis shape bases: identity bases $S_{id}^{i}$ and expression bases $S_{exp}^{i}$ with corresponding weights $p_{id}^{i}$ and $p_{exp}^{i}$. The $N_{id}=199$ identity bases represent variations such as tall or short, light or heavy, and male or female; the $N_{exp}=29$ expression bases represent expression changes such as mouth opening, smiling, and kissing. Each basis has Q = 53215 vertices, which correspond one-to-one with the vertices of all other bases.
A subset of N vertices of the dense 3D face corresponds to the positions of 2D landmarks on the image; their projections are collected as

$$U=\begin{pmatrix} u_{1} & u_{2} & \cdots & u_{N} \\ v_{1} & v_{2} & \cdots & v_{N} \end{pmatrix} \quad (3)$$

The dense shape of the 2D face can be estimated from the 3D face shape under a weak perspective projection; the projection matrix has 6 degrees of freedom, modeling scale, the rotation angles (pitch α, yaw β, and roll γ), and the translation $(t_x, t_y)$. The transformed dense face shape A can be expressed as

$$A = M \cdot \begin{pmatrix} S \\ \mathbf{1}^{\top} \end{pmatrix} \quad (4)$$

where $M \in \mathbb{R}^{3\times 4}$ is the projection matrix with entries $m = [m_1, \ldots, m_{12}]$; A can then be orthographically projected onto the 2D plane to obtain U:

$$U = \mathrm{Pr} \cdot A \quad (5)$$

Since the z translation $(m_{12})$ falls outside the range of interest, it is defined as 0; the orthographic projection can be represented as the matrix

$$\mathrm{Pr}=\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}$$

Given the properties of the projection matrix, the normalized third row of the projection matrix can be expressed as the cross product of its normalized first two rows:

$$\left[\bar{m}_{9}, \bar{m}_{10}, \bar{m}_{11}\right] = \left[\bar{m}_{1}, \bar{m}_{2}, \bar{m}_{3}\right] \times \left[\bar{m}_{4}, \bar{m}_{5}, \bar{m}_{6}\right] \quad (6)$$

Thus the dense shape of any 2D face can be determined from the first two rows of the projection parameters, $m = [m_1, \ldots, m_8]$, and the shape coefficients $p = [p_{id}, p_{exp}]$; the learning of dense three-dimensional shapes is thereby converted into the learning of m and p, which is far easier to manage in terms of dimensionality.
Fig. 2 shows the structure of the convolutional neural network of the face alignment method based on three-dimensional face model fitting. The convolutional neural network learns a nonlinear mapping function f(Θ) from the input image I to the corresponding projection parameters m and shape parameters p; the estimated parameters can then be used to construct the dense 3D face shape.
The CNN has two branches, one for predicting m and the other for predicting p. The first three convolution blocks are shared by the two branches; after the third block, two independent convolution blocks extract task-specific features, and two fully connected layers map those features to the final outputs. Each convolution block is a stack of two convolution layers and a max pooling layer, and each convolution or fully connected layer is followed by a batch normalization layer and a rectified linear unit (ReLU) layer.
To improve CNN learning, a loss function containing multiple constraints is used: the parameter constraint (PC) $J_{pr}$ minimizes the difference between the estimated parameters and the calibrated ground-truth parameters; the landmark fitting constraint (LFC) $J_{lm}$ reduces the alignment error of the 2D landmarks; the contour fitting constraint (CFC) $J_c$ matches the contour of the estimated three-dimensional shape to the contour pixels of the input image; and the SIFT pairing constraint (SPC) $J_s$ encourages matched SIFT feature points of two face images to correspond to the same 3D vertices.
The overall loss function is defined as

$$\arg\min_{\hat{m},\hat{p}} J = J_{pr} + \lambda_{lm} J_{lm} + \lambda_{c} J_{c} + \lambda_{s} J_{s} \quad (7)$$

where the parameter constraint (PC) loss function is the squared distance between the estimated and ground-truth parameters:

$$J_{pr} = \left\| \left[\hat{m}, \hat{p}\right] - \left[m, p\right] \right\|^{2} \quad (8)$$
The landmark fitting constraint (LFC) aims to minimize the difference between the estimated 2D landmarks and the ground-truth 2D landmarks $U_{lm}$. Given 2D face images with labeled landmarks, the indices of the 3D face vertices that correspond anatomically to these landmarks are first labeled manually; the set of these indices is denoted $i_{lm}$. With the shape A computed from the estimates $\hat{m}$ and $\hat{p}$ according to formula (4), the 3D landmarks can be extracted from A as $A(:, i_{lm})$; projecting $A(:, i_{lm})$ onto the 2D plane, the LFC loss function is defined as

$$J_{lm} = \frac{1}{L} \left\| \mathrm{Pr}\, A(:, i_{lm}) - U_{lm} \right\|_{F}^{2} \quad (9)$$

where the subscript F denotes the Frobenius norm and L is the number of predefined landmarks.
The contour fitting constraint (CFC) aims to minimize the error between the projected outer contour of the dense 3D shape and the corresponding contour pixels in the input face image; when the 3D shape is rendered onto the 2D plane, the outer contour can be regarded as the boundary between the background and the 3D face.
The specific steps of the contour fitting constraint are as follows (a sketch of step (2) is given after these steps):
(1) Detect the true contour $U_c$ in the 2D face image: first, an off-the-shelf edge detector is used to detect contours on the face image; the detected edges are then refined by retaining only the edges within a narrow band defined by the contour landmarks. This preprocessing step is completed offline before training begins.
(2) Describe the silhouette vertices on the estimated three-dimensional shape A: the contour on the estimated shape A, computed from the estimated parameters $\hat{m}$ and $\hat{p}$, can be described as a set of boundary vertices. Representing shape A with a Delaunay triangulation, an edge of a triangle is defined as a boundary edge if the z components of the surface normals of its two adjacent faces have opposite signs; the vertices associated with such edges are defined as boundary vertices, and their index set is denoted $i_c$.
(3) Determine the correspondence between the true contour and the estimated contour, and back-propagate the fitting error: evaluating this constraint requires a point-to-point correspondence between $U_c$ and $A(:, i_c)$. Each contour pixel on the 2D image is matched to the closest point on the projected 3D shape contour, and the minimum distance is computed; the sum of all minimum distances is the CFC error, as shown in formula (10). To make the CFC loss differentiable, formula (10) is rewritten in terms of the vertex index of the nearest contour projection point, $k_0 = \arg\min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2$; once $k_0$ is determined, the CFC loss takes a form similar to equation (9):

$$J_c = \frac{1}{L} \sum_j \min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2 = \frac{1}{L} \sum_j \left\| \mathrm{Pr}\, A\!\left(:, \arg\min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2\right) - U_c(:,j) \right\|^2 \quad (10)$$

Although $i_c$ depends on the current estimates of {m, p}, for simplicity $i_c$ is treated as constant when performing back-propagation.
For the scale-invariant feature transform (SIFT) pairing constraint (SPC), given a pair of face images i and j, the SIFT points on the two face images are first detected and matched; the matched SIFT points are denoted $U_s^i$ and $U_s^j$.
For a perfectly dense face alignment, the matched SIFT points would overlap with exactly the same vertices of the estimated 3D face shapes, denoted $A^i$ and $A^j$. For each matched SIFT point, the 3D vertex whose projection coincides with the 2D SIFT point is found; collecting these indices over the $L_{ij}$ matches gives $i_s^i$:

$$i_s^i = \arg\min_{k} \left\| \mathrm{Pr}\, A^i(:,k) - U_s^i \right\|_{F}^{2} \quad (11)$$

On this basis, the loss function for the SPC is defined as

$$J_s\!\left(\hat{m}^j, \hat{p}^j, \hat{m}^i, \hat{p}^i\right) = \frac{1}{L_{ij}} \left( \left\| \mathrm{Pr}\, A^i(:, i_s^j) - U_s^i \right\|_{F}^{2} + \left\| \mathrm{Pr}\, A^j(:, i_s^i) - U_s^j \right\|_{F}^{2} \right) \quad (12)$$

where $A^i$ is computed from $\{m^i, p^i\}$ and $L_{ij}$ is the number of matched SIFT pairs; in effect, the SIFT points of one face are mapped to dense-shape vertices, and the distance between those vertices' projections and the matched SIFT points on the other face is computed.
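The offline SIFT detection and matching that the SPC presupposes could look as follows; OpenCV and Lowe's ratio test are assumed tooling choices, not named by the patent (images are expected as 8-bit grayscale arrays):

```python
import cv2
import numpy as np

def matched_sift_points(img_i, img_j, ratio=0.75):
    """Detect and match SIFT points between two face images, returning
    U_s^i and U_s^j as (2, L_ij) arrays of matched point positions."""
    sift = cv2.SIFT_create()
    kp_i, des_i = sift.detectAndCompute(img_i, None)
    kp_j, des_j = sift.detectAndCompute(img_j, None)
    matcher = cv2.BFMatcher()
    # Lowe's ratio test keeps only distinctive matches:
    good = [m for m, n in matcher.knnMatch(des_i, des_j, k=2)
            if m.distance < ratio * n.distance]
    U_i = np.array([kp_i[m.queryIdx].pt for m in good]).T
    U_j = np.array([kp_j[m.trainIdx].pt for m in good]).T
    return U_i, U_j
```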
It will be appreciated by persons skilled in the art that the invention is not limited to the details of the foregoing embodiments, and that the invention can be embodied in other specific forms without departing from its spirit or scope. In addition, various modifications and alterations of this invention may be made by those skilled in the art without departing from the spirit and scope of this invention, and such modifications and alterations should also be regarded as falling within the scope of this invention. It is therefore intended that the appended claims be interpreted as including the preferred embodiments and all such alterations and modifications as fall within the scope of the invention.

Claims (10)

1. A face alignment method based on three-dimensional face model fitting, characterized by mainly comprising: a 3D face representation (I); the structure of a convolutional neural network (CNN) (II); a landmark fitting constraint (LFC) (III); a contour fitting constraint (CFC) (IV); and a scale-invariant feature transform (SIFT) pairing constraint (SPC) (V).
2. The face alignment method as claimed in claim 1, characterized in that a convolutional neural network (CNN) is trained to fit dense 3D face shapes to a single input face image; multiple constraints, namely a landmark fitting constraint, a contour fitting constraint, and a SIFT pairing constraint, are imposed using the dense three-dimensional shape representation.
3. The 3D face representation (I) as claimed in claim 1, characterized in that the dense three-dimensional shape of the face is represented as S, containing the three-dimensional positions of Q vertices:

$$S=\begin{pmatrix} x_{1} & x_{2} & \cdots & x_{Q} \\ y_{1} & y_{2} & \cdots & y_{Q} \\ z_{1} & z_{2} & \cdots & z_{Q} \end{pmatrix} \quad (1)$$

to compute S for a face, S is represented in terms of a basis of 3D model shapes:

$$S=\bar{S}+\sum_{i=1}^{N_{id}} p_{id}^{i} S_{id}^{i}+\sum_{i=1}^{N_{exp}} p_{exp}^{i} S_{exp}^{i} \quad (2)$$

where the face shape S is the sum of the mean shape $\bar{S}$ and the weighted principal component analysis shape bases $S_{id}^{i}$ and $S_{exp}^{i}$ with corresponding weights $p_{id}^{i}$ and $p_{exp}^{i}$; the $N_{id}=199$ identity bases represent variations such as tall or short, light or heavy, and male or female; the $N_{exp}=29$ expression bases represent expression changes such as mouth opening, smiling, and kissing; each basis has Q = 53215 vertices, which correspond to the vertices on all other bases.
4. The dense three-dimensional shape as claimed in claim 3, characterized in that a subset of N vertices of the dense 3D face corresponds to the positions of 2D landmarks on the image:

$$U=\begin{pmatrix} u_{1} & u_{2} & \cdots & u_{N} \\ v_{1} & v_{2} & \cdots & v_{N} \end{pmatrix} \quad (3)$$

the dense shape of the 2D face can be estimated from the 3D face shape under a weak perspective projection, the projection matrix having 6 degrees of freedom that model scale, the rotation angles (pitch α, yaw β, and roll γ), and the translation $(t_x, t_y)$; the transformed dense face shape A can be expressed as

$$A = M \cdot \begin{pmatrix} S \\ \mathbf{1}^{\top} \end{pmatrix} \quad (4)$$

where $M \in \mathbb{R}^{3\times 4}$ is the projection matrix with entries $m = [m_1, \ldots, m_{12}]$, and A can be orthographically projected onto the 2D plane to obtain U:

$$U = \mathrm{Pr} \cdot A \quad (5)$$

since the z translation $(m_{12})$ falls outside the range of interest, it is defined as 0; the orthographic projection can be represented as the matrix

$$\mathrm{Pr}=\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}$$
5. The orthographic projection as claimed in claim 4, characterized in that, given the properties of the projection matrix, the normalized third row of the projection matrix is expressed as the cross product of its normalized first two rows:

$$\left[\bar{m}_{9}, \bar{m}_{10}, \bar{m}_{11}\right] = \left[\bar{m}_{1}, \bar{m}_{2}, \bar{m}_{3}\right] \times \left[\bar{m}_{4}, \bar{m}_{5}, \bar{m}_{6}\right] \quad (6)$$

thus the dense shape of any 2D face can be determined from the first two rows of the projection parameters, $m = [m_1, \ldots, m_8]$, and the shape coefficients $p = [p_{id}, p_{exp}]$; the learning of dense three-dimensional shapes is converted into the learning of m and p, which is far easier to manage in terms of dimensionality.
6. The structure (II) of the convolutional neural network (CNN) as claimed in claim 1, characterized in that a nonlinear mapping function f(Θ), with corresponding projection parameters m and shape parameters p, is learned from an input image I using the convolutional neural network; the estimated parameters can be used to construct the dense 3D face shape;
the CNN has two branches, one for predicting m and the other for predicting p; the first three convolution blocks are shared by the two branches; after the third block, two independent convolution blocks extract task-specific features, and two fully connected layers map those features to the final outputs; each convolution block is a stack of two convolution layers and a max pooling layer, and each convolution or fully connected layer is followed by a batch normalization layer and a rectified linear unit (ReLU) layer;
to improve CNN learning, a loss function containing multiple constraints is used: the parameter constraint (PC) $J_{pr}$ minimizes the difference between the estimated parameters and the calibrated ground-truth parameters; the landmark fitting constraint (LFC) $J_{lm}$ reduces the alignment error of the 2D landmarks; the contour fitting constraint (CFC) $J_c$ matches the contour of the estimated three-dimensional shape to the contour pixels of the input image; the SIFT pairing constraint (SPC) $J_s$ encourages matched SIFT feature points of two face images to correspond to the same 3D vertices;
the overall loss function is defined as

$$\arg\min_{\hat{m},\hat{p}} J = J_{pr} + \lambda_{lm} J_{lm} + \lambda_{c} J_{c} + \lambda_{s} J_{s} \quad (7)$$

where the parameter constraint (PC) loss function is the squared distance between the estimated and ground-truth parameters:

$$J_{pr} = \left\| \left[\hat{m}, \hat{p}\right] - \left[m, p\right] \right\|^{2} \quad (8)$$
7. The landmark fitting constraint (LFC) (III) as claimed in claim 1, characterized in that the LFC aims to minimize the difference between the estimated 2D landmarks and the ground-truth 2D landmarks $U_{lm}$; given 2D face images with labeled landmarks, the indices of the 3D face vertices that correspond anatomically to these landmarks are first labeled manually; the set of these indices is denoted $i_{lm}$; with the shape A computed from the estimates $\hat{m}$ and $\hat{p}$ according to formula (4), the 3D landmarks can be extracted from A as $A(:, i_{lm})$; projecting $A(:, i_{lm})$ onto the 2D plane, the LFC loss function is defined as

$$J_{lm} = \frac{1}{L} \left\| \mathrm{Pr}\, A(:, i_{lm}) - U_{lm} \right\|_{F}^{2} \quad (9)$$

where the subscript F denotes the Frobenius norm and L is the number of predefined landmarks.
8. The contour fitting constraint (CFC) (IV) as claimed in claim 1, characterized in that the CFC aims to minimize the error between the projected outer contour of the dense 3D shape and the corresponding contour pixels in the input face image; when the 3D shape is rendered onto the 2D plane, the outer contour can be regarded as the boundary between the background and the 3D face.
9. The specific steps of the contour fitting constraint as claimed in claim 8, characterized in that using the contour fitting constraint requires the following three steps:
(1) detecting the true contour $U_c$ in the 2D face image: first, an off-the-shelf edge detector is used to detect contours on the face image; the detected edges are then refined by retaining only the edges within a narrow band defined by the contour landmarks; this preprocessing step is completed offline before training begins;
(2) describing the silhouette vertices on the estimated three-dimensional shape A: the contour on the estimated shape A, computed from the estimated parameters $\hat{m}$ and $\hat{p}$, can be described as a set of boundary vertices; representing shape A with a Delaunay triangulation, an edge of a triangle is defined as a boundary edge if the z components of the surface normals of its two adjacent faces have opposite signs; the vertices associated with such edges are defined as boundary vertices, and their index set is denoted $i_c$;
(3) determining the correspondence between the true contour and the estimated contour, and back-propagating the fitting error: evaluating this constraint requires a point-to-point correspondence between $U_c$ and $A(:, i_c)$; each contour pixel on the 2D image is matched to the closest point on the projected 3D shape contour, and the minimum distance is computed; the sum of all minimum distances is the CFC error, as shown in formula (10); to make the CFC loss differentiable, formula (10) is rewritten in terms of the vertex index of the nearest contour projection point, $k_0 = \arg\min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2$; once $k_0$ is determined, the CFC loss takes a form similar to equation (9):

$$J_c = \frac{1}{L} \sum_j \min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2 = \frac{1}{L} \sum_j \left\| \mathrm{Pr}\, A\!\left(:, \arg\min_{k \in i_c} \left\| \mathrm{Pr}\, A(:,k) - U_c(:,j) \right\|^2\right) - U_c(:,j) \right\|^2 \quad (10)$$

although $i_c$ depends on the current estimates of {m, p}, for simplicity $i_c$ is treated as constant when performing back-propagation.
10. The scale-invariant feature transform (SIFT) pairing constraint (SPC) (V) as claimed in claim 1, characterized in that, given a pair of face images i and j, the SIFT points on the two face images are first detected and matched; the matched SIFT points are denoted $U_s^i$ and $U_s^j$;
for a perfectly dense face alignment, the matched SIFT points would overlap with exactly the same vertices of the estimated 3D face shapes, denoted $A^i$ and $A^j$; for each matched SIFT point, the 3D vertex whose projection coincides with the 2D SIFT point is found, and collecting these indices over the $L_{ij}$ matches gives $i_s^i$:

$$i_s^i = \arg\min_{k} \left\| \mathrm{Pr}\, A^i(:,k) - U_s^i \right\|_{F}^{2} \quad (11)$$

on this basis, the loss function for the SPC is defined as

$$J_s\!\left(\hat{m}^j, \hat{p}^j, \hat{m}^i, \hat{p}^i\right) = \frac{1}{L_{ij}} \left( \left\| \mathrm{Pr}\, A^i(:, i_s^j) - U_s^i \right\|_{F}^{2} + \left\| \mathrm{Pr}\, A^j(:, i_s^i) - U_s^j \right\|_{F}^{2} \right) \quad (12)$$

where $A^i$ is computed from $\{m^i, p^i\}$ and $L_{ij}$ is the number of matched SIFT pairs; the SIFT points of one face are mapped to dense-shape vertices, and the distance between those vertices' projections and the matched SIFT points on the other face is computed.
CN201711238476.9A 2017-11-30 2017-11-30 Face alignment method based on three-dimensional face model fitting Withdrawn CN108022308A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711238476.9A CN108022308A (en) 2017-11-30 2017-11-30 Face alignment method based on three-dimensional face model fitting

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711238476.9A CN108022308A (en) 2017-11-30 2017-11-30 Face alignment method based on three-dimensional face model fitting

Publications (1)

Publication Number Publication Date
CN108022308A true CN108022308A (en) 2018-05-11

Family

ID=62077788

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711238476.9A CN108022308A (en) 2017-11-30 2017-11-30 Face alignment method based on three-dimensional face model fitting

Country Status (1)

Country Link
CN (1) CN108022308A (en)

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
YAOJIE LIU et al.: "Dense Face Alignment", arXiv:1709.01442v1 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110008911A (en) * 2019-04-10 2019-07-12 北京旷视科技有限公司 Image processing method, device, electronic equipment and computer readable storage medium
CN110321822A (en) * 2019-06-24 2019-10-11 深圳爱莫科技有限公司 Face alignment initial method and device, storage medium based on closest retrieval
CN112001268A (en) * 2020-07-31 2020-11-27 中科智云科技有限公司 Face calibration method and device
CN112001268B (en) * 2020-07-31 2024-01-12 中科智云科技有限公司 Face calibration method and equipment

Similar Documents

Publication Publication Date Title
CN112766244B (en) Target object detection method and device, computer equipment and storage medium
Liu et al. Dense face alignment
US10380413B2 (en) System and method for pose-invariant face alignment
Prisacariu et al. Simultaneous monocular 2D segmentation, 3D pose recovery and 3D reconstruction
Sun et al. Depth estimation of face images using the nonlinear least-squares model
WO2014205768A1 (en) Feature and model mutual matching face tracking method based on increment principal component analysis
Sung et al. Pose-Robust Facial Expression Recognition Using View-Based 2D + 3D AAM
CN108022308A (en) Face alignment method based on three-dimensional face model fitting
CN106570460A (en) Single-image human face posture estimation method based on depth value
Wang et al. Joint head pose and facial landmark regression from depth images
CN116385660A (en) Indoor single view scene semantic reconstruction method and system
CN105678833A (en) Point cloud geometrical data automatic splicing algorithm based on multi-view image three-dimensional modeling
Li et al. Sparse-to-local-dense matching for geometry-guided correspondence estimation
CN102592309B (en) Modeling method of nonlinear three-dimensional face
CN114283265A (en) Unsupervised face correcting method based on 3D rotation modeling
Lladó et al. Non-rigid metric reconstruction from perspective cameras
Koo et al. Recovering the 3D shape and poses of face images based on the similarity transform
Chen et al. Learning shape priors for single view reconstruction
Muhle et al. The probabilistic normal epipolar constraint for frame-to-frame rotation optimization under uncertain feature positions
CN116681743A (en) One-stage point cloud registration method based on complex network theory, electronic equipment and storage medium
Gao et al. Estimation of 3D category-specific object structure: Symmetry, Manhattan and/or multiple images
Mai et al. Projective reconstruction of ellipses from multiple images
Lee Geometric optimization for computer vision
Cai et al. Two-view curve reconstruction based on the snake model
Bartoli On the non-linear optimization of projective motion using minimal parameters

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20180511