CN112307714A - Character style migration method based on double-stage deep network - Google Patents

Character style migration method based on double-stage deep network

Info

Publication number
CN112307714A
Authority
CN
China
Prior art keywords
picture
network
stylized
font
loss
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011210655.3A
Other languages
Chinese (zh)
Other versions
CN112307714B (en)
Inventor
陈金泽
李龙
吕奕杭
廖志寰
朱安娜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University of Technology WUT
Original Assignee
Wuhan University of Technology WUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University of Technology WUT filed Critical Wuhan University of Technology WUT
Priority to CN202011210655.3A priority Critical patent/CN112307714B/en
Publication of CN112307714A publication Critical patent/CN112307714A/en
Application granted granted Critical
Publication of CN112307714B publication Critical patent/CN112307714B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/109Font handling; Temporal or kinetic typography
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Image Processing (AREA)

Abstract

A character style migration method based on a two-stage deep network. A training data set A and a training data set B are first constructed. The de-stylized network is then trained on data set A to obtain a de-stylized network model. Next, a font migration network is trained on the de-stylized pictures produced by that model together with data set B, yielding a font migration network model that migrates a given font picture to a target reference font picture. Finally, a texture migration network is trained on the target reference font pictures and data set A, and the resulting texture migration network model produces the final character style migration result. The design achieves an excellent character style migration effect.

Description

Character style migration method based on double-stage deep network
Technical Field
The invention belongs to the field of deep learning and image style migration, and particularly relates to a character style migration method based on a two-stage deep network.
Background
Style migration of images refers to the task of transferring the style of one image onto another to synthesize a new artistic image. In recent years, with the continuous development of artificial intelligence technology and the global creative industry, style migration of text images has become a growing demand: people hope to generate more artistic fonts and apply them to design and publicity in industries such as commerce and culture.
Style migration of text images differs from that of ordinary images in that it involves both font migration and texture migration: the former transforms the font of characters while keeping their content, and the latter transforms the stylistic appearance of the characters. Manually synthesizing text images with a specific font and texture consumes a great deal of time and effort, so realizing text style migration with an automatic and efficient method has attracted wide attention. However, existing character style migration methods are limited to direct single-stage conversion, i.e. both the font and the texture of the characters are migrated at once in the same stage, and the results are often unsatisfactory.
Disclosure of Invention
The invention aims to overcome the problems in the prior art and provide a character style migration method based on a two-stage deep network with better migration effect.
In order to achieve the above purpose, the invention provides the following technical scheme:
a character style migration method based on a two-stage deep network sequentially comprises the following steps:
the method comprises the following steps of firstly, constructing a training data set A and a training data set B, wherein the training data set A comprises stylized character pictures with various textures and de-stylized character pictures corresponding to the stylized character pictures, and the training data set B comprises a reference font and de-stylized character pictures with other various fonts;
constructing a de-stylized network, and training the de-stylized network by adopting a training data set A to obtain a de-stylized network model for de-stylizing the stylized character and image with texture;
thirdly, constructing a font migration network, training the font migration network by using de-stylized pictures obtained by the de-stylized network model and a training data set B to obtain a font migration network model for realizing conversion and migration of various fonts, and then migrating a certain font picture to be converted into a target reference font picture by using the model;
and step four, constructing a texture migration network, training the texture migration network by using the target reference font picture and the training data set A generated in the step three to obtain a texture migration network model for realizing stylized texture rendering of the font picture, and finally obtaining a final result of character style migration by using the model.
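At inference time the two-stage pipeline described in steps three and four can be summarized by the following minimal sketch (PyTorch is assumed, and font_net / texture_net are placeholder names for the trained font migration and texture migration models; the patent does not prescribe an API):

```python
import torch

def migrate_character_style(source_font_img, target_font_label,
                            texture_style_img, font_net, texture_net):
    """Two-stage inference: stage one migrates the glyph to the reference font,
    stage two renders the stylized texture onto the migrated glyph."""
    with torch.no_grad():
        # Stage 1: font migration conditioned on the target font label c
        reference_font_img = font_net(source_font_img, target_font_label)
        # Stage 2: texture migration renders the texture style picture onto it
        stylized_result = texture_net(reference_font_img, texture_style_img)
    return stylized_result
```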
In step two, the de-stylized network includes an encoder E_X, an encoder E_Y and a decoder G_X, and training the de-stylized network with training data set A comprises the following steps in sequence:
2.1, the de-stylized network randomly selects an image pair (x, y) from training data set A and inputs it to the encoders E_X and E_Y, where y is a stylized character picture with texture and x is the de-stylized character picture corresponding to y;
2.2, the encoders E_X and E_Y map x and y to a shared feature space and encode them into respective feature maps; the feature loss L_feat is computed from the feature maps, and the encoders E_X and E_Y are trained with the objective of minimizing L_feat;
2.3, the feature map is passed to the decoder G_X to generate a reconstructed de-stylized character picture; the pixel loss L_pix is then computed from the reconstructed de-stylized character picture and picture x, and the decoder G_X is trained with the objective of minimizing L_pix so that the reconstructed de-stylized character picture is sufficiently close to picture x.
The de-stylized network also includes a discriminator D_X, and training the de-stylized network with training data set A further comprises:
2.4, the reconstructed de-stylized character picture is input to the discriminator D_X to judge its authenticity, the adversarial loss L_adv is computed, and optimization is performed with an Adam optimizer.
The total loss function optimized by the de-stylized network is:
L_1 = λ_feat·L_feat + λ_pix·L_pix + λ_adv·L_adv
L_feat = E_{x,y}[ ||S_X(E_Y(y)) − z||_1 ]
z = S_X(G_X(x))
L_pix = E_{x,y}[ ||G_X(E_Y(y)) − x||_1 ]
L_adv = E_x[ D_X(x) ] − E_y[ D_X(G_X(E_Y(y))) ] − λ_gp·E_x̂[ (||∇_x̂ D_X(x̂)||_2 − 1)^2 ]
In the above formulas, L_feat, L_pix and L_adv are the feature loss, the pixel loss and the adversarial loss respectively; λ_feat, λ_pix and λ_adv are the corresponding hyper-parameters; S_X is the sharing layer of G_X; z is the content feature; λ_gp is the penalty factor; and x̂ is sampled uniformly along the straight line between picture x and the reconstructed de-stylized character picture G_X(E_Y(y)).
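As an illustration only, the feature and pixel losses above could be computed as in the following PyTorch sketch; E_Y, G_X and S_X are assumed to be compatible nn.Module callables, and the adversarial term L_adv is added separately through a WGAN-GP critic (see the gradient-penalty sketch later in the description):

```python
import torch

def destylization_content_losses(x, y, E_Y, G_X, S_X):
    """Feature loss ||S_X(E_Y(y)) - z||_1 with z = S_X(G_X(x)), and pixel loss
    ||G_X(E_Y(y)) - x||_1, following the formulas above."""
    z = S_X(G_X(x))                          # content feature of the plain glyph x
    L_feat = (S_X(E_Y(y)) - z).abs().mean()  # 1-norm feature loss
    x_rec = G_X(E_Y(y))                      # reconstructed de-stylized picture
    L_pix = (x_rec - x).abs().mean()         # 1-norm pixel loss
    return L_feat, L_pix

# Total objective, with the weights used in the embodiment below
# (lambda_feat = lambda_pix = 100, lambda_adv = 1):
# L1 = 100 * L_feat + 100 * L_pix + 1 * L_adv
```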
In step three, the font migration network comprises a generator G and a discriminator D, and training the font migration network with the de-stylized pictures produced by the de-stylized network model and training data set B comprises the following steps in sequence:
3.1, the font migration network randomly selects a picture x from training data set B and inputs it to the generator G, which generates a false picture G(x, c) from picture x and a target font label c;
3.2, on the one hand, the false picture G(x, c) is fed back into the generator G to produce a reconstructed picture G(G(x, c), c'), with the de-stylized picture from the de-stylized network model serving as the target font picture supervising the reconstruction; the font classification loss L_f of the generator and the reconstruction loss L_rec are then computed, and the generator G is trained with the objective of minimizing L_f and L_rec. On the other hand, the false picture G(x, c) is input to the discriminator D to judge its authenticity and the font domain to which it belongs, the font classification loss L_r of the discriminator is computed, and the discriminator D is trained with the objective of minimizing L_r.
The loss functions optimized by the font migration network are:
L_2 = L_D + L_G
L_D = −L_adv + λ_1·L_r
L_G = L_adv + λ_1·L_f + λ_2·L_rec
L_adv = E_x[ D(x) ] − E_{x,c}[ D(G(x, c)) ] − λ_gp·E_x̂[ (||∇_x̂ D(x̂)||_2 − 1)^2 ]
L_r = E_{x,c'}[ −log D(c'|x) ]
L_f = E_{x,c}[ −log D(c|G(x, c)) ]
L_rec = E_{x,c}[ ||x − G(G(x, c), c')||_1 ]
In the above formulas, L_D is the discriminator loss and L_G is the generator loss; L_adv, L_r, L_f and L_rec are the adversarial loss, the font classification loss of the discriminator, the font classification loss of the generator and the reconstruction loss respectively; λ_1, λ_2 and λ_gp are the font classification loss hyper-parameter, the reconstruction loss hyper-parameter and the penalty factor of the adversarial loss; x̂ is sampled uniformly along the straight line between a real picture sample and the false picture G(x, c); and D(c'|x) is the probability distribution with which the discriminator D assigns the real picture sample to its original font domain c'.
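For illustration, the generator and discriminator objectives above could be assembled as in the following sketch; it assumes the discriminator returns both an adversarial score and font-class logits, omits the gradient-penalty term of L_adv for brevity, and uses illustrative hyper-parameter defaults rather than values fixed by the patent:

```python
import torch
import torch.nn.functional as F

def font_migration_losses(x, c, c_org, G, D, lambda1=1.0, lambda2=10.0):
    """Sketch of L_D and L_G for the font migration stage.
    x: input glyph batch, c: target font labels, c_org: original font labels.
    D(img) is assumed to return (adversarial score, font-class logits)."""
    fake = G(x, c)                                  # false picture G(x, c)
    src_real, cls_real = D(x)
    src_fake, cls_fake = D(fake)

    L_r = F.cross_entropy(cls_real, c_org)          # -log D(c'|x) on real pictures
    L_f = F.cross_entropy(cls_fake, c)              # -log D(c|G(x,c)) on fakes
    L_rec = (x - G(fake, c_org)).abs().mean()       # ||x - G(G(x,c), c')||_1

    # Wasserstein adversarial term (gradient penalty omitted here)
    L_adv = src_real.mean() - src_fake.mean()
    L_D = -L_adv + lambda1 * L_r                    # discriminator objective
    L_G = L_adv + lambda1 * L_f + lambda2 * L_rec   # generator objective
    return L_D, L_G
```

In practice the two objectives are minimized in alternating steps, detaching the false pictures when updating the discriminator, as described in the embodiment below.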
In step 3.1, the false picture G(x, c) is generated as follows: the picture x and the target font label c are first feature-mapped and fused, and the fused result is then passed into a deep convolutional network for training.
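One common way to realize this fusion, assumed here for illustration (the patent does not fix the exact mechanism), is to tile a one-hot encoding of the target font label spatially and concatenate it with the image channels before the generator:

```python
import torch

def fuse_font_label(x, c, num_fonts):
    """Concatenate a spatially tiled one-hot font label c with the picture x.
    x: (N, C, H, W) image batch; c: (N,) tensor of target font indices."""
    onehot = torch.zeros(x.size(0), num_fonts, device=x.device)
    onehot.scatter_(1, c.long().unsqueeze(1), 1.0)
    label_map = onehot.view(x.size(0), num_fonts, 1, 1).expand(
        -1, -1, x.size(2), x.size(3))
    return torch.cat([x, label_map], dim=1)   # input to the convolutional generator G
```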
In step four, the texture migration network comprises an encoder f, a decoder g and an AdaIN adaptive normalization layer between them. The encoder f and the decoder g are built with reference to the VGG-19 network structure: the encoder f uses the first L layers of a pre-trained VGG-19 network, and the decoder g mirrors the encoder f with all pooling layers replaced by upsampling layers;
the training of the texture migration network by using the target reference font picture and the training data set A generated in the third step sequentially comprises the following steps:
4.1, a font picture c and a texture style picture s are first mapped to the feature space by the encoder f to obtain f(c) and f(s), and the AdaIN adaptive normalization layer then performs a feature transformation on them to obtain the feature map t = AdaIN(f(c), f(s));
4.2, mapping the feature map t back to the original feature space by a decoder g to obtain a stylized result map g (t);
4.3, inputting the stylized result graph g (t) and the texture style picture s into an encoder f, and realizing the training of the texture migration network through the optimization of the loss function.
In step 4.3, the loss function is:
L_3 = L_c + λ·L_s
L_c = ||f(g(t)) − t||_2
L_s = Σ_{i=1}^{L} ( ||μ(φ_i(g(t))) − μ(φ_i(s))||_2 + ||σ(φ_i(g(t))) − σ(φ_i(s))||_2 )
In the above formulas, L_c is the content loss, L_s is the style loss, λ is the style loss hyper-parameter, φ_i denotes the i-th layer of the encoder f, and σ and μ are the variance and mean of each image channel respectively.
In step 4.1, the feature transformation performed by the AdaIN adaptive normalization layer is:
AdaIN(f(c), f(s)) = σ(f(s)) · ( (f(c) − μ(f(c))) / σ(f(c)) ) + μ(f(s))
In the above formula, σ and μ are the variance and mean of each image channel respectively.
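For illustration, the AdaIN transform and the losses above can be sketched as follows; enc_stages is assumed to be the list of encoder stages ending at the chosen VGG-19 layers, g is the mirrored decoder, and the style weight is illustrative:

```python
import torch
import torch.nn.functional as F

def adain(content_feat, style_feat, eps=1e-5):
    """Align the channel-wise mean/std of the content feature map with the style's."""
    c_mean = content_feat.mean(dim=(2, 3), keepdim=True)
    c_std = content_feat.std(dim=(2, 3), keepdim=True) + eps
    s_mean = style_feat.mean(dim=(2, 3), keepdim=True)
    s_std = style_feat.std(dim=(2, 3), keepdim=True) + eps
    return s_std * (content_feat - c_mean) / c_std + s_mean

def texture_migration_loss(c_img, s_img, enc_stages, g, style_weight=10.0):
    """L3 = L_c + lambda * L_s: content loss between f(g(t)) and t, style loss
    matching per-layer means and variances of the output and the style picture."""
    def encode_all(img):                      # activations after every encoder stage
        feats, h = [], img
        for stage in enc_stages:
            h = stage(h)
            feats.append(h)
        return feats
    t = adain(encode_all(c_img)[-1], encode_all(s_img)[-1])
    out = g(t)                                # stylized result picture g(t)
    out_feats, sty_feats = encode_all(out), encode_all(s_img)
    L_c = F.mse_loss(out_feats[-1], t)        # content loss against the AdaIN map t
    L_s = sum(F.mse_loss(o.mean(dim=(2, 3)), s.mean(dim=(2, 3))) +
              F.mse_loss(o.std(dim=(2, 3)), s.std(dim=(2, 3)))
              for o, s in zip(out_feats, sty_feats))
    return L_c + style_weight * L_s
```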
Compared with the prior art, the invention has the beneficial effects that:
the invention relates to a character style migration method based on a two-stage depth network, which comprises the steps of firstly constructing a training data set A and a training data set B, then training a de-stylized network by adopting the training data set A to obtain a de-stylized network model for de-stylizing a stylized character image with textures, then training a font migration network by utilizing the de-stylized network model to obtain a font migration network model for realizing conversion and migration of various fonts, migrating a certain font image to be converted into a target reference font image by utilizing the model, finally training the texture migration network by utilizing the target reference font image and the training data set A to obtain a texture migration network model for realizing stylized texture rendering of the font image, and obtaining a final character migration result by utilizing the model The texture is transferred in stages, namely the first stage of character font transfer is carried out, and then the second stage of character texture transfer is carried out, so that a better character style transfer effect can be obtained. Therefore, the invention can obtain better character style migration effect.
Drawings
FIG. 1 is an overall flow chart of the present invention.
Fig. 2 is a schematic diagram of a training data set a in the present invention.
Fig. 3 is a schematic diagram of a training data set B in the present invention.
FIG. 4 is a schematic diagram of a de-stylized network of the present invention.
Fig. 5 is a schematic structural diagram of a font migration network according to the present invention.
FIG. 6 is a schematic structural diagram of a texture migration network according to the present invention.
Detailed Description
The present invention will be further described with reference to the following detailed description and accompanying drawings.
Referring to fig. 1 to 6, a text style migration method based on a two-stage deep network sequentially includes the following steps:
the method comprises the following steps of firstly, constructing a training data set A and a training data set B, wherein the training data set A comprises stylized character pictures with various textures and de-stylized character pictures corresponding to the stylized character pictures, and the training data set B comprises a reference font and de-stylized character pictures with other various fonts;
constructing a de-stylized network, and training the de-stylized network by adopting a training data set A to obtain a de-stylized network model for de-stylizing the stylized character and image with texture;
thirdly, constructing a font migration network, training the font migration network by using de-stylized pictures obtained by the de-stylized network model and a training data set B to obtain a font migration network model for realizing conversion and migration of various fonts, and then migrating a certain font picture to be converted into a target reference font picture by using the model;
and step four, constructing a texture migration network, training the texture migration network by using the target reference font picture and the training data set A generated in the step three to obtain a texture migration network model for realizing stylized texture rendering of the font picture, and finally obtaining a final result of character style migration by using the model.
In step two, the de-stylized network includes an encoder E_X, an encoder E_Y and a decoder G_X, and training the de-stylized network with training data set A comprises the following steps in sequence:
2.1, the de-stylized network randomly selects an image pair (x, y) from training data set A and inputs it to the encoders E_X and E_Y, where y is a stylized character picture with texture and x is the de-stylized character picture corresponding to y;
2.2, the encoders E_X and E_Y map x and y to a shared feature space and encode them into respective feature maps; the feature loss L_feat is computed from the feature maps, and the encoders E_X and E_Y are trained with the objective of minimizing L_feat;
2.3, the feature map is passed to the decoder G_X to generate a reconstructed de-stylized character picture; the pixel loss L_pix is then computed from the reconstructed de-stylized character picture and picture x, and the decoder G_X is trained with the objective of minimizing L_pix so that the reconstructed de-stylized character picture is sufficiently close to picture x.
The de-stylized network also includes a discriminator D_X, and training the de-stylized network with training data set A further comprises:
2.4, the reconstructed de-stylized character picture is input to the discriminator D_X to judge its authenticity, the adversarial loss L_adv is computed, and optimization is performed with an Adam optimizer.
The total loss function optimized by the de-stylized network is:
L_1 = λ_feat·L_feat + λ_pix·L_pix + λ_adv·L_adv
L_feat = E_{x,y}[ ||S_X(E_Y(y)) − z||_1 ]
z = S_X(G_X(x))
L_pix = E_{x,y}[ ||G_X(E_Y(y)) − x||_1 ]
L_adv = E_x[ D_X(x) ] − E_y[ D_X(G_X(E_Y(y))) ] − λ_gp·E_x̂[ (||∇_x̂ D_X(x̂)||_2 − 1)^2 ]
In the above formulas, L_feat, L_pix and L_adv are the feature loss, the pixel loss and the adversarial loss respectively; λ_feat, λ_pix and λ_adv are the corresponding hyper-parameters; S_X is the sharing layer of G_X; z is the content feature; λ_gp is the penalty factor; and x̂ is sampled uniformly along the straight line between picture x and the reconstructed de-stylized character picture G_X(E_Y(y)).
In step three, the font migration network comprises a generator G and a discriminator D, and training the font migration network with the de-stylized pictures produced by the de-stylized network model and training data set B comprises the following steps in sequence:
3.1, the font migration network randomly selects a picture x from training data set B and inputs it to the generator G, which generates a false picture G(x, c) from picture x and a target font label c;
3.2, on the one hand, the false picture G(x, c) is fed back into the generator G to produce a reconstructed picture G(G(x, c), c'), with the de-stylized picture from the de-stylized network model serving as the target font picture supervising the reconstruction; the font classification loss L_f of the generator and the reconstruction loss L_rec are then computed, and the generator G is trained with the objective of minimizing L_f and L_rec. On the other hand, the false picture G(x, c) is input to the discriminator D to judge its authenticity and the font domain to which it belongs, the font classification loss L_r of the discriminator is computed, and the discriminator D is trained with the objective of minimizing L_r.
The loss functions optimized by the font migration network are:
L_2 = L_D + L_G
L_D = −L_adv + λ_1·L_r
L_G = L_adv + λ_1·L_f + λ_2·L_rec
L_adv = E_x[ D(x) ] − E_{x,c}[ D(G(x, c)) ] − λ_gp·E_x̂[ (||∇_x̂ D(x̂)||_2 − 1)^2 ]
L_r = E_{x,c'}[ −log D(c'|x) ]
L_f = E_{x,c}[ −log D(c|G(x, c)) ]
L_rec = E_{x,c}[ ||x − G(G(x, c), c')||_1 ]
In the above formulas, L_D is the discriminator loss and L_G is the generator loss; L_adv, L_r, L_f and L_rec are the adversarial loss, the font classification loss of the discriminator, the font classification loss of the generator and the reconstruction loss respectively; λ_1, λ_2 and λ_gp are the font classification loss hyper-parameter, the reconstruction loss hyper-parameter and the penalty factor of the adversarial loss; x̂ is sampled uniformly along the straight line between a real picture sample and the false picture G(x, c); and D(c'|x) is the probability distribution with which the discriminator D assigns the real picture sample to its original font domain c'.
In step 3.1, the false picture G(x, c) is generated as follows: the picture x and the target font label c are first feature-mapped and fused, and the fused result is then passed into a deep convolutional network for training.
In step four, the texture migration network comprises an encoder f, a decoder g and an AdaIN adaptive normalization layer between them. The encoder f and the decoder g are built with reference to the VGG-19 network structure: the encoder f uses the first L layers of a pre-trained VGG-19 network, and the decoder g mirrors the encoder f with all pooling layers replaced by upsampling layers;
the training of the texture migration network by using the target reference font picture and the training data set A generated in the third step sequentially comprises the following steps:
4.1, a font picture c and a texture style picture s are first mapped to the feature space by the encoder f to obtain f(c) and f(s), and the AdaIN adaptive normalization layer then performs a feature transformation on them to obtain the feature map t = AdaIN(f(c), f(s));
4.2, mapping the feature map t back to the original feature space by a decoder g to obtain a stylized result map g (t);
4.3, inputting the stylized result graph g (t) and the texture style picture s into an encoder f, and realizing the training of the texture migration network through the optimization of the loss function.
In step 4.3, the loss function is:
L_3 = L_c + λ·L_s
L_c = ||f(g(t)) − t||_2
L_s = Σ_{i=1}^{L} ( ||μ(φ_i(g(t))) − μ(φ_i(s))||_2 + ||σ(φ_i(g(t))) − σ(φ_i(s))||_2 )
In the above formulas, L_c is the content loss, L_s is the style loss, λ is the style loss hyper-parameter, φ_i denotes the i-th layer of the encoder f, and σ and μ are the variance and mean of each image channel respectively.
In step 4.1, the feature transformation performed by the AdaIN adaptive normalization layer is:
AdaIN(f(c), f(s)) = σ(f(s)) · ( (f(c) − μ(f(c))) / σ(f(c)) ) + μ(f(s))
In the above formula, σ and μ are the variance and mean of each image channel respectively.
The principle of the invention is illustrated as follows:
the invention provides a character style migration method based on a two-stage depth network, which is based on a de-stylized network consisting of two encoders, a decoder and a discriminator and realizes de-stylized processing of textured characters by optimizing characteristic loss, pixel loss and countermeasure loss; based on a font migration network of a generator and a discriminator, realizing the first-stage migration of the character fonts by optimizing the countermeasure loss and the font classification loss; based on a texture migration network of an encoder and a decoder with an AdaIN self-adaptive normalization layer, feature transformation is carried out through mean values and variances, content loss and style loss are optimized, and second-stage migration of character textures is achieved. The style migration character image obtained by the method has higher artistic effect, has wide application in the field of visual design, can be used for various aspects such as artistic image design, culture and commercial image propaganda, drawing text processing and the like, is not only suitable for digital and letter images, but also has better performance in the aspect of Chinese character migration.
Discriminator D_X: to make the de-stylized reconstruction more accurate, the invention adds a discriminator D_X to the de-stylized network, which judges the authenticity of the reconstructed picture.
Font classification loss of the discriminator, L_r = E_{x,c'}[ −log D(c'|x) ]: the input font image x should be converted to the output font image y and correctly classified into the target font domain c. D(c'|x) denotes the probability distribution with which the discriminator assigns a real sample to its original font domain c', and the goal of the discriminator D is to minimize this loss.
Font classification loss of the generator, L_f = E_{x,c}[ −log D(c|G(x, c)) ]: this loss optimizes the generator G so that the pictures it generates are classified into the target font domain c by the discriminator D.
Reconstruction loss, L_rec = E_{x,c}[ ||x − G(G(x, c), c')||_1 ]: to ensure that the generator G changes only the font-related information of the input picture while preserving its character content (rather than merely tricking the discriminator D), the generated G(x, c) is fed back into the generator G to produce G(G(x, c), c'), which should be as consistent as possible with picture x; a 1-norm is used to constrain this loss.
For the adversarial loss L_adv, a WGAN method with gradient penalty is adopted to alleviate the model collapse problem, namely:
L_adv = E_x[ D(x) ] − E_{x,c}[ D(G(x, c)) ] − λ_gp·E_x̂[ (||∇_x̂ D(x̂)||_2 − 1)^2 ]
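A minimal sketch of the gradient-penalty term used in this WGAN formulation (the critic is any module that outputs a scalar score per image; names and defaults are illustrative):

```python
import torch

def gradient_penalty(critic, real, fake, lambda_gp=10.0):
    """Penalize the critic's gradient norm at points sampled uniformly on the
    straight lines between real pictures and generated pictures."""
    alpha = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    x_hat = (alpha * real + (1.0 - alpha) * fake.detach()).requires_grad_(True)
    scores = critic(x_hat)
    grads = torch.autograd.grad(outputs=scores.sum(), inputs=x_hat,
                                create_graph=True)[0]
    grad_norm = grads.flatten(start_dim=1).norm(2, dim=1)
    return lambda_gp * ((grad_norm - 1.0) ** 2).mean()
```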
the texture migration network defines two kinds of penalties: content loss LcAnd style loss Ls. Content loss employing network output imagesExpressing Euclidean distance from the AdaIN layer output feature diagram, and aiming at enabling the final output content of the model to be close to the AdaIN layer output feature diagram t sufficiently so as to accelerate convergence speed; and the style loss is obtained by coding the image generating the result by a coder again, acquiring the mean value and the variance of the feature map of each layer of the VGG network, and performing Euclidean distance summation on the mean value and the variance of the layer corresponding to the real style map.
Example 1:
referring to fig. 1, a text style migration method based on a two-stage deep network is sequentially performed according to the following steps:
1. reference documents: yang S, Liu J, Wang W, et al. TET-GAN: Text Effects Transfer Via Stylation and Destylation [ J ].2018. constructing a training data set A and a training data set B, wherein the training data set A comprises stylized character pictures with various textures and de-stylized character pictures corresponding to the stylized character pictures, and the training data set B comprises de-stylized character pictures with a reference font and other various fonts (see fig. 2 and 3);
2. Construct a de-stylized network comprising an encoder E_X, an encoder E_Y, a decoder G_X and a discriminator D_X, where the encoders E_X and E_Y share the weights of their last layers; the network structure adopted by the de-stylized network is shown in Table 1:
Table 1: de-stylized network architecture
The de-stylized network employs a total loss function to be optimized as:
L_1 = λ_feat·L_feat + λ_pix·L_pix + λ_adv·L_adv
In the above formula, L_feat, L_pix and L_adv are the feature loss, the pixel loss and the adversarial loss respectively, and λ_feat, λ_pix and λ_adv are the corresponding hyper-parameters;
3. referring to fig. 4, training the de-stylized network using the training data set a specifically includes:
(1) The de-stylized network randomly selects an image pair (x, y) from training data set A and inputs it to the encoders E_X and E_Y, where y is a stylized character picture with texture and x is the de-stylized character picture corresponding to y;
(2) The encoders E_X and E_Y map x and y to a shared feature space and encode them into respective feature maps; the feature loss L_feat is computed from the feature maps, and the encoders E_X and E_Y are trained with the objective of minimizing L_feat.
The task of the encoders is to bring their result close to the ground truth of the content feature. Letting S_X denote the sharing layer of G_X, the content feature used for guidance is defined as z = S_X(G_X(x)); L_feat guides E_Y to strip the texture feature elements from the character image while retaining the core font information, and is defined as
L_feat = E_{x,y}[ ||S_X(E_Y(y)) − z||_1 ];
(3) The feature map is passed to the decoder G_X to generate a reconstructed de-stylized character picture; the pixel loss L_pix is then computed from the reconstructed picture and picture x, and the decoder G_X is trained with the objective of minimizing L_pix so that the reconstructed de-stylized character picture is sufficiently close to picture x.
The de-stylized network needs the generated reconstruction to be close to picture x, so a 1-norm is used as the pixel loss constraint, where the pixel loss is defined as:
L_pix = E_{x,y}[ ||G_X(E_Y(y)) − x||_1 ]
(4) The reconstructed de-stylized character picture is input to the discriminator D_X to judge its authenticity, and the adversarial loss L_adv is computed (in this network the adversarial loss is defined so as to guide G_X and E_Y to confuse D_X):
L_adv = E_x[ D_X(x) ] − E_y[ D_X(G_X(E_Y(y))) ] − λ_gp·E_x̂[ (||∇_x̂ D_X(x̂)||_2 − 1)^2 ]
In the above formula, λ_gp is the penalty factor and x̂ is sampled uniformly along the straight line between picture x and the reconstructed de-stylized character picture G_X(E_Y(y));
The losses are optimized with an Adam optimizer, with the learning rate set to 0.0002 and λ_feat = λ_pix = 100, λ_gp = 10, λ_adv = 1. This finally yields an encoder E_Y and a decoder G_X usable for de-stylization, whose generated images G_X(E_Y(y)) are sufficiently close to the de-stylized picture x;
4. Construct the font migration network, which comprises a generator G and a discriminator D. The generator G contains 2 convolutional layers, 6 residual layers and 2 deconvolutional layers, with normalization applied throughout. The overall flow of the generator G is: downsample the feature maps by a factor of 4, pass them through 6 residual blocks that keep the dimensions unchanged, upsample by a factor of 4 with a transposed convolution, and finally apply a size-preserving convolution with tanh as the output. The network structure adopted by the generator G is shown in Table 2:
table 2 structural table of generator G
Figure BDA0002758623170000132
Each convolutional layer uses 4 x 4 kernels with a stride of 2, so each convolution halves the spatial dimensions. The normalization layer performs instance normalization (IN) within each image channel, computing the statistics over H x W; since the generated result depends mainly on an individual image instance, batch normalization (BN) over the whole batch is unsuitable for image stylization, whereas normalizing over H x W speeds up model convergence and keeps image instances independent of one another. The activation function is LeakyReLU: because its output has a small gradient for negative inputs, the derivative is never zero, which reduces silent neurons, allows gradient-based learning, and avoids the problem of ReLU units that stop learning once their input falls into the negative interval.
Furthermore, to help avoid over-fitting, the desired feature mapping is no longer fitted directly by the stacked layers; instead they explicitly fit a residual mapping. If the desired mapping is H(x), the stacked non-linear layers fit F(x) = H(x) − x. The assumption is that the residual mapping is easier to optimize than the original one, i.e. fitting F(x) = H(x) − x is easier than fitting H(x) directly; in the extreme case where the desired mapping is the identity, the residual network only has to fit F(x) = 0, whereas a plain network would have to fit F(x) = x, and the former is clearly easier to optimize.
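A residual block of this kind could look like the following sketch (the use of instance normalization and LeakyReLU matches the description above, but the channel width and exact layer sizes are illustrative):

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Stacked convolutions fit the residual F(x) = H(x) - x; the skip connection
    adds x back, so the block outputs H(x) = F(x) + x."""
    def __init__(self, channels=256):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False),
            nn.InstanceNorm2d(channels, affine=True),
            nn.LeakyReLU(0.01, inplace=True),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False),
            nn.InstanceNorm2d(channels, affine=True),
        )

    def forward(self, x):
        return x + self.body(x)
```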
The discriminator D adopts a PatchGAN structure and classifies local image patches as real or fake, without using a normalization layer; the output of Conv1 represents the predicted probability of the target font, the output of Conv2 represents the real/fake judgment of the image, and the two output branches are in parallel;
the loss function to be optimized adopted by the font migration network is as follows:
L_2 = L_D + L_G
L_D = −L_adv + λ_1·L_r
L_G = L_adv + λ_1·L_f + λ_2·L_rec
In the above formulas, L_D is the discriminator loss and L_G is the generator loss; L_adv, L_r, L_f and L_rec are the adversarial loss, the font classification loss of the discriminator, the font classification loss of the generator and the reconstruction loss respectively; λ_1 and λ_2 are the font classification loss hyper-parameter and the reconstruction loss hyper-parameter;
5. referring to fig. 5, a font migration network is trained by using de-stylized pictures obtained by the de-stylized network model and a training data set B to obtain a font migration network model for realizing conversion and migration of multiple fonts, which specifically includes:
(1) The font migration network randomly selects a picture x from training data set B and inputs it to the generator G; the generator G feature-maps and fuses the picture x with the target font label c and passes the result into a deep convolutional network for training, generating a false picture G(x, c);
(2) On the one hand, the false picture G(x, c) is fed back into the generator G to produce a reconstructed picture G(G(x, c), c'), with the de-stylized picture from the de-stylized network model serving as the target font picture supervising the reconstruction; this ensures that the picture content is preserved during conversion and only the domain-specific part is changed. The font classification loss L_f of the generator and the reconstruction loss L_rec are then computed, and the generator G is trained with the objective of minimizing L_f and L_rec. On the other hand, the false picture G(x, c) is input to the discriminator D to judge its authenticity and the font domain to which it belongs, the font classification loss L_r of the discriminator is computed, and the discriminator D is trained with the objective of minimizing L_r. The losses are as follows:
L_adv = E_x[ D(x) ] − E_{x,c}[ D(G(x, c)) ] − λ_gp·E_x̂[ (||∇_x̂ D(x̂)||_2 − 1)^2 ]
L_r = E_{x,c'}[ −log D(c'|x) ]
L_f = E_{x,c}[ −log D(c|G(x, c)) ]
L_rec = E_{x,c}[ ||x − G(G(x, c), c')||_1 ]
In the above formulas, λ_gp is the penalty factor of the adversarial loss, x̂ is sampled uniformly along the straight line between a real picture sample and the false picture G(x, c), and D(c'|x) is the probability distribution with which the discriminator D assigns the real picture sample to its original font domain c';
The model is trained with an Adam optimizer using β_1 = 0.5 and β_2 = 0.999; images are flipped horizontally with probability 0.5 for data augmentation; one generator update is performed after every 5 discriminator updates; the batch size for all experiments is set to 16; all models are trained with a learning rate of 0.0001 for the first 10 epochs, and the learning rate is decayed linearly to 0 over the next 10 epochs;
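The update schedule described above can be sketched as follows; d_loss_fn and g_loss_fn stand for the L_D and L_G computations defined earlier, and the data loader is assumed to yield (picture, original-font label, target-font label) batches:

```python
import torch

def train_font_migration(G, D, loader, d_loss_fn, g_loss_fn,
                         epochs=20, lr=1e-4, n_critic=5):
    """Adam(beta1=0.5, beta2=0.999); constant lr for the first half of training,
    then (approximately) linear decay to 0; one generator step per n_critic
    discriminator steps."""
    opt_g = torch.optim.Adam(G.parameters(), lr=lr, betas=(0.5, 0.999))
    opt_d = torch.optim.Adam(D.parameters(), lr=lr, betas=(0.5, 0.999))
    step = 0
    for epoch in range(epochs):
        decay = 1.0 if epoch < epochs // 2 else 1.0 - (epoch - epochs // 2) / (epochs // 2)
        for group in opt_g.param_groups + opt_d.param_groups:
            group["lr"] = lr * decay
        for x, c_org, c_trg in loader:
            opt_d.zero_grad()
            d_loss_fn(G, D, x, c_org, c_trg).backward()   # discriminator objective L_D
            opt_d.step()
            step += 1
            if step % n_critic == 0:                      # 1 G update per 5 D updates
                opt_g.zero_grad()
                g_loss_fn(G, D, x, c_org, c_trg).backward()   # generator objective L_G
                opt_g.step()
```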
6. migrating a certain font picture to be converted into a target reference font picture by using the obtained font migration network model;
7. Construct the texture migration network, which comprises an encoder f, a decoder g and an AdaIN adaptive normalization layer between them. The encoder f and the decoder g are built with reference to the VGG-19 network structure: the encoder f uses the relu1_1 to relu4_1 portion of a pre-trained VGG-19 network, and the decoder g mirrors the encoder f with all pooling layers replaced by upsampling layers. The specific structure of the texture migration network is shown in Table 3:
table 3 texture migration network structure table
Figure BDA0002758623170000161
The convolution kernels of the network's convolutional layers are all 3 × 3 with a stride of 1, the window size of the MaxPool layers is 2 × 2, and the upsampling layers use nearest-neighbour interpolation;
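A single decoder stage consistent with this description might look like the following sketch (channel widths are illustrative; the pooling layer of the corresponding VGG block is replaced by nearest-neighbour upsampling):

```python
import torch.nn as nn

decoder_stage = nn.Sequential(
    nn.Upsample(scale_factor=2, mode="nearest"),          # replaces the MaxPool layer
    nn.Conv2d(512, 256, kernel_size=3, stride=1, padding=1),
    nn.ReLU(inplace=True),
)
```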
8. referring to fig. 6, a texture migration network is trained by using a target reference font picture and a training data set a to obtain a texture migration network model for implementing stylized texture rendering on a font picture, which specifically includes:
(1) A font picture c and a texture style picture s are first mapped to the feature space by the encoder f to obtain f(c) and f(s); the AdaIN adaptive normalization layer then performs a feature transformation on them to obtain the feature map t = AdaIN(f(c), f(s)):
AdaIN(f(c), f(s)) = σ(f(s)) · ( (f(c) − μ(f(c))) / σ(f(c)) ) + μ(f(s))
In the above formula, σ and μ are the variance and mean of each image channel respectively;
(2) mapping the feature map t back to the original feature space by a decoder g to obtain a stylized result map g (t);
(3) inputting the stylized result graph g (t) and the texture style picture s into an encoder f, and realizing the training of the texture migration network through the optimization of a loss function, wherein the loss function is as follows:
L_3 = L_c + λ·L_s
L_c = ||f(g(t)) − t||_2
L_s = Σ_{i=1}^{L} ( ||μ(φ_i(g(t))) − μ(φ_i(s))||_2 + ||σ(φ_i(g(t))) − σ(φ_i(s))||_2 )
In the above formulas, L_c is the content loss, L_s is the style loss, λ is the style loss hyper-parameter, φ_i denotes the i-th layer of the encoder f, and σ and μ are the variance and mean of each image channel respectively;
optimization loss the Adam optimizer is selected and the batch size is set to 8.

Claims (10)

1. A character style migration method based on a two-stage deep network is characterized in that:
the method comprises the following steps in sequence:
the method comprises the following steps of firstly, constructing a training data set A and a training data set B, wherein the training data set A comprises stylized character pictures with various textures and de-stylized character pictures corresponding to the stylized character pictures, and the training data set B comprises a reference font and de-stylized character pictures with other various fonts;
constructing a de-stylized network, and training the de-stylized network by adopting a training data set A to obtain a de-stylized network model for de-stylizing the stylized character and image with texture;
thirdly, constructing a font migration network, training the font migration network by using de-stylized pictures obtained by the de-stylized network model and a training data set B to obtain a font migration network model for realizing conversion and migration of various fonts, and then migrating a certain font picture to be converted into a target reference font picture by using the model;
and step four, constructing a texture migration network, training the texture migration network by using the target reference font picture and the training data set A generated in the step three to obtain a texture migration network model for realizing stylized texture rendering of the font picture, and finally obtaining a final result of character style migration by using the model.
2. The text style migration method based on the two-stage deep network as claimed in claim 1, wherein:
in step two, the de-stylized network includes an encoder EXEncoder EYAnd decoder GXThe method for training the de-stylized network by adopting the training data set A sequentially comprises the following steps:
2.1 de-stylized network randomly selects an image pair (x, y) from training data set A, and inputs them to encoder EXAnd EYWherein y is a stylized character picture with texture, and x is a de-stylized character picture corresponding to y;
2.2 encoder EXAnd EYMapping x and y to shared characteristic space, coding to generate respective characteristic diagram, and calculating characteristic loss L according to the characteristic diagramfeatAnd with LfeatTraining encoder E with minimum optimization for targetXAnd EY
2.3 transmitting the characteristic diagram to a decoder GXGenerating a reconstructed de-stylized text picture, and then calculating pixel loss L from the reconstructed de-stylized text picture and picture xpixAnd with LpixTraining decoder G with minimum for goal optimizationXThe reconstructed de-stylized text picture is brought sufficiently close to picture x.
3. The text style migration method based on the two-stage deep network as claimed in claim 2, wherein:
the de-stylized network also includes a discriminator DXThe training of the de-stylized network using the training data set a further comprises:
2.4 inputting the reconstructed de-stylized character picture into a discriminator DXDetermining the authenticity of the product, and calculating the resistance loss LadvAnd optimized using an Adam optimizer.
4. The method for text style migration based on the two-stage deep network as claimed in claim 3, wherein:
the de-stylized network employs a total loss function to be optimized as:
L1=λfeatLfeatpixLpixadvLadv
Lfeat=Ex,y[||SX(EY(y))-z||1]
z=SX(GX(x))
Lpix=Ex,y[||GX(EY(y))-x||1]
Figure FDA0002758623160000021
in the above formula, Lfeat、Lpix、LadvCharacteristic loss, pixel loss and contrast loss, λ, respectivelyfeat、λpix、λadvRespectively, characteristic loss, pixel loss, and superparametric of countermeasures loss, SXIs GXZ is a content feature, λgpIn order to be a penalty factor,
Figure FDA0002758623160000022
for de-stylized text picture G along picture x and reconstructedX(EY(y)) are uniformly sampled.
5. The character style migration method based on the two-stage deep network as claimed in any one of claims 1-4, wherein:
in the third step, the font migration network comprises a generator G and a discriminator D, and the font migration network trained by the de-stylized picture and the training data set B obtained by using the de-stylized network model sequentially comprises the following steps:
3.1, randomly selecting a picture x from a training data set B by a font migration network, inputting the picture x into a generator G, and generating a false picture G (x, c) by the generator G according to the picture x and a target font label c;
3.2, on one hand, the false picture G (x, c) is input into the generator G again to generate a reconstructed picture G (G (x, c)), the de-stylized picture of the de-stylized network model is taken as a target font picture to supervise in the reconstruction process, and then the font classification loss L of the generator is calculatedfAnd reconstruction loss LrecWith Lf、LrecOn the other hand, the false picture G (x, c) is input into a discriminator D to discriminate the authenticity of the false picture G and the font domain to which the picture belongs, and the font classification loss L of the discriminator is calculatedrAnd with LrAnd optimally training the discriminator D with the minimum as a target.
6. The text style migration method based on the two-stage deep network as claimed in claim 5, wherein:
the loss function to be optimized adopted by the font migration network is as follows:
L2=LD+LG
LD=-Ladv1Lr
LG=Ladv1Lf2Lrec
Figure FDA0002758623160000031
Lr=Ex,c'[-logD(c'|x)]
Lf=Ex,c'[-logD(c|G(x,c))]
Lrec=Ex,c[||x-G(G(x,c),c')||1]
in the above formula, LDFor discriminator loss, LGFor generator losses, Ladv、Lr、Lf、LrecRespectively, the countermeasure loss, the font classification loss of the discriminator, the font classification loss of the generator, the reconstruction loss, lambda1、λ2、λgpRespectively representing a font classification loss hyper-parameter, a reconstruction loss hyper-parameter and a penalty function for resisting loss,
Figure FDA0002758623160000032
to sample evenly along the straight line between the real picture sample and the false picture G (x, c), D (c '| x) is the probability distribution that the discriminator D attributes the real picture sample to the original font domain c'.
7. The text style migration method based on the two-stage deep network as claimed in claim 5, wherein:
in step 3.1, the generation method of the false picture G (x, c) is as follows: firstly, the image x and the target font label c are subjected to feature mapping and fusion, and then the image x and the target font label c are transmitted into a deep convolutional network for training.
8. The character style migration method based on the two-stage deep network as claimed in any one of claims 1-4, wherein:
in the fourth step, the texture migration network comprises an encoder f, a decoder g and an AdaIN self-adaptive normalization layer positioned between the encoder f and the decoder g, wherein the encoder f and the decoder g are constructed by taking a VGG-19 network structure as a reference, the encoder f selects a front L layer of a pre-trained VGG-19 network, the decoder g is a symmetric structure of the encoder f, and all pooling layers are replaced by upper sampling layers;
the training of the texture migration network by using the target reference font picture and the training data set A generated in the third step sequentially comprises the following steps:
4.1, a font picture c and a texture style picture s are first mapped to the feature space by the encoder f to obtain f(c) and f(s), and the AdaIN adaptive normalization layer then performs a feature transformation on them to obtain the feature map t = AdaIN(f(c), f(s));
4.2, mapping the feature map t back to the original feature space by a decoder g to obtain a stylized result map g (t);
4.3, inputting the stylized result graph g (t) and the texture style picture s into an encoder f, and realizing the training of the texture migration network through the optimization of the loss function.
9. The text style migration method based on the two-stage deep network as claimed in claim 8, wherein:
in step 4.3, the loss function is:
L3=Lc+λLs
Lc=||f(g(t))-t||2
Figure FDA0002758623160000041
in the above formula, LcFor content loss, LsFor style loss, λ is the hyper-parameter of style loss, φiIn the i-th layer of the encoder f, σ and μ are the variance and mean of each image channel, respectively.
10. The text style migration method based on the two-stage deep network as claimed in claim 8, wherein:
in step 4.1, the feature transformation formula of the AdaIN adaptive normalization layer is as follows:
Figure FDA0002758623160000051
in the above formula, σ and μ are the variance and mean of each image channel, respectively.
CN202011210655.3A 2020-11-03 2020-11-03 Text style migration method based on dual-stage depth network Active CN112307714B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011210655.3A CN112307714B (en) 2020-11-03 2020-11-03 Text style migration method based on dual-stage depth network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011210655.3A CN112307714B (en) 2020-11-03 2020-11-03 Text style migration method based on dual-stage depth network

Publications (2)

Publication Number Publication Date
CN112307714A true CN112307714A (en) 2021-02-02
CN112307714B CN112307714B (en) 2024-03-08

Family

ID=74332675

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011210655.3A Active CN112307714B (en) 2020-11-03 2020-11-03 Text style migration method based on dual-stage depth network

Country Status (1)

Country Link
CN (1) CN112307714B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112966470A (en) * 2021-02-23 2021-06-15 北京三快在线科技有限公司 Character generation method and device, storage medium and electronic equipment
CN113421318A (en) * 2021-06-30 2021-09-21 合肥高维数据技术有限公司 Font style migration method and system based on multitask generation countermeasure network
CN113553932A (en) * 2021-07-14 2021-10-26 同济大学 Calligraphy character erosion repairing method based on style migration
CN113554549A (en) * 2021-07-27 2021-10-26 深圳思谋信息科技有限公司 Text image generation method and device, computer equipment and storage medium
CN113807430A (en) * 2021-09-15 2021-12-17 网易(杭州)网络有限公司 Model training method and device, computer equipment and storage medium
CN114240735A (en) * 2021-11-17 2022-03-25 西安电子科技大学 Method, system, storage medium, computer device and terminal for transferring any style
CN114399427A (en) * 2022-01-07 2022-04-26 福州大学 Character effect migration method based on cyclic generation countermeasure network
CN115310405A (en) * 2022-07-21 2022-11-08 北京汉仪创新科技股份有限公司 Font replacement method, system, device and medium based on countermeasure generation network

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019025909A1 (en) * 2017-08-01 2019-02-07 3M Innovative Properties Company Neural style transfer for image varietization and recognition
CN110443864A (en) * 2019-07-24 2019-11-12 北京大学 A kind of characters in a fancy style body automatic generation method based on single phase a small amount of sample learning
CN110503598A (en) * 2019-07-30 2019-11-26 西安理工大学 The font style moving method of confrontation network is generated based on condition circulation consistency
IT201900002557A1 (en) * 2019-02-22 2020-08-22 Univ Bologna Alma Mater Studiorum IMAGE-BASED CODING METHOD AND SYSTEM

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019025909A1 (en) * 2017-08-01 2019-02-07 3M Innovative Properties Company Neural style transfer for image varietization and recognition
IT201900002557A1 (en) * 2019-02-22 2020-08-22 Univ Bologna Alma Mater Studiorum IMAGE-BASED CODING METHOD AND SYSTEM
CN110443864A (en) * 2019-07-24 2019-11-12 北京大学 A kind of characters in a fancy style body automatic generation method based on single phase a small amount of sample learning
CN110503598A (en) * 2019-07-30 2019-11-26 西安理工大学 The font style moving method of confrontation network is generated based on condition circulation consistency

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Zhang Jinglei; Hou Yawei: "Image style transfer based on an improved cycle generative adversarial network", Journal of Electronics & Information Technology, no. 05 *
Wang Xiaohong; Lu Hui; Ma Xiangcai: "Stylized calligraphy image generation based on generative adversarial networks", Packaging Engineering, no. 11, 10 June 2020 (2020-06-10) *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112966470A (en) * 2021-02-23 2021-06-15 北京三快在线科技有限公司 Character generation method and device, storage medium and electronic equipment
CN113421318A (en) * 2021-06-30 2021-09-21 合肥高维数据技术有限公司 Font style migration method and system based on multitask generation countermeasure network
CN113421318B (en) * 2021-06-30 2022-10-28 合肥高维数据技术有限公司 Font style migration method and system based on multitask generation countermeasure network
CN113553932A (en) * 2021-07-14 2021-10-26 同济大学 Calligraphy character erosion repairing method based on style migration
CN113553932B (en) * 2021-07-14 2022-05-13 同济大学 Calligraphy character erosion repairing method based on style migration
CN113554549A (en) * 2021-07-27 2021-10-26 深圳思谋信息科技有限公司 Text image generation method and device, computer equipment and storage medium
CN113554549B (en) * 2021-07-27 2024-03-29 深圳思谋信息科技有限公司 Text image generation method, device, computer equipment and storage medium
CN113807430A (en) * 2021-09-15 2021-12-17 网易(杭州)网络有限公司 Model training method and device, computer equipment and storage medium
CN113807430B (en) * 2021-09-15 2023-08-08 网易(杭州)网络有限公司 Model training method, device, computer equipment and storage medium
CN114240735B (en) * 2021-11-17 2024-03-19 西安电子科技大学 Arbitrary style migration method, system, storage medium, computer equipment and terminal
CN114240735A (en) * 2021-11-17 2022-03-25 西安电子科技大学 Method, system, storage medium, computer device and terminal for transferring any style
CN114399427A (en) * 2022-01-07 2022-04-26 福州大学 Character effect migration method based on cyclic generation countermeasure network
CN114399427B (en) * 2022-01-07 2024-06-28 福州大学 Word effect migration method based on loop generation countermeasure network
CN115310405A (en) * 2022-07-21 2022-11-08 北京汉仪创新科技股份有限公司 Font replacement method, system, device and medium based on countermeasure generation network

Also Published As

Publication number Publication date
CN112307714B (en) 2024-03-08

Similar Documents

Publication Publication Date Title
CN112307714A (en) Character style migration method based on double-stage deep network
AU2020100710A4 (en) A method for sentiment analysis of film reviews based on deep learning and natural language processing
CN109657156B (en) Individualized recommendation method based on loop generation countermeasure network
CN111079532B (en) Video content description method based on text self-encoder
CN111091045A (en) Sign language identification method based on space-time attention mechanism
Gai et al. New image denoising algorithm via improved deep convolutional neural network with perceptive loss
WO2022252272A1 (en) Transfer learning-based method for improved vgg16 network pig identity recognition
CN108830913B (en) Semantic level line draft coloring method based on user color guidance
CN110110323B (en) Text emotion classification method and device and computer readable storage medium
CN109934158B (en) Video emotion recognition method based on local enhanced motion history map and recursive convolutional neural network
CN114240735B (en) Arbitrary style migration method, system, storage medium, computer equipment and terminal
CN113780249B (en) Expression recognition model processing method, device, equipment, medium and program product
CN113095314B (en) Formula identification method, device, storage medium and equipment
CN114066871B (en) Method for training new coronal pneumonia focus area segmentation model
CN109711411B (en) Image segmentation and identification method based on capsule neurons
CN116468938A (en) Robust image classification method on label noisy data
CN114359631A (en) Target classification and positioning method based on coding-decoding weak supervision network model
CN113378949A (en) Dual-generation confrontation learning method based on capsule network and mixed attention
CN111768326A (en) High-capacity data protection method based on GAN amplification image foreground object
Shariff et al. Artificial (or) fake human face generator using generative adversarial network (GAN) machine learning model
WO2024060839A1 (en) Object operation method and apparatus, computer device, and computer storage medium
CN109284765A (en) The scene image classification method of convolutional neural networks based on negative value feature
CN117611838A (en) Multi-label image classification method based on self-adaptive hypergraph convolutional network
CN115280329A (en) Method and system for query training
CN111339734A (en) Method for generating image based on text

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant