CN115205420A - Method for generating ancient character fonts based on GAN network - Google Patents


Info

Publication number
CN115205420A
Authority
CN
China
Prior art keywords
font
data set
character
network
decoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210821980.6A
Other languages
Chinese (zh)
Inventor
孙金虎
李鹏
安宁
梁天冕
曹玉梅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shaanxi Normal University
Original Assignee
Shaanxi Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shaanxi Normal University
Priority to CN202210821980.6A
Publication of CN115205420A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06T: IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00: 2D [Two Dimensional] image generation
    • G06T11/20: Drawing from basic elements, e.g. lines or circles
    • G06T11/203: Drawing of straight lines or curves
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00: Computing arrangements based on biological models
    • G06N3/02: Neural networks
    • G06N3/08: Learning methods
    • G06T3/04

Abstract

The invention provides a method for generating ancient character fonts based on a GAN network, which comprises the following steps: S1: acquiring a font data set, the font data set comprising two parts, the first part being a font library data set in different styles and the second part being a character data set; S2: constructing a font style conversion network to convert Chinese character font styles and obtain font images; S3: performing detail reconstruction on the font images through a detail reconstruction network model that uses end-to-end deep convolution and fuses Chinese character stroke semantics with a deep generative network; S4: supplementing the detail reconstruction network model with adversarial training by using the features of different intermediate layers in the decoder; S5: sequentially inputting the font data set into the adversarially trained detail reconstruction network model and outputting the corresponding complete target font through the generator; S6: evaluating the font generation quality according to a model rating system.

Description

Method for generating ancient character fonts based on GAN network
Technical Field
The invention relates to the technical field of GAN network models, in particular to a method for generating ancient character fonts based on a GAN network.
Background
Characters are an important carrier of human civilization: they are cultural symbols, important marks of the progress of human civilization, and, from ancient times to the present, an essential means of recording, storing, and transmitting information. Unlike scripts such as English, Chinese characters are the only widely used characters today that are expressed in a two-dimensional form. Ancient Chinese characters are part of Chinese culture and carriers of Chinese civilization, and fonts are the carriers of these characters; the shape, proportion, details, and style of each character change with the application scenario. A font not only enhances readability, understandability, and credibility but also adds aesthetic appeal. Different fonts present different forms, adding a new mode of visual expression while lending the characters visual beauty and communicative value.
In the field of artificial intelligence and computer vision, the application and development of deep learning have achieved great success. With the continued rise of artificial intelligence, its technology not only improves productivity but also creates new products that enter people's lives. The problem of generating ancient Chinese character fonts is receiving more and more attention because the field is relatively specialized and the required hardware is comparatively inexpensive. In the font design process, artificial intelligence addresses the slow, time-consuming, and labor-intensive design of traditional ancient Chinese characters. Using a generative model and computation, the styles of a few font samples are learned, and other fonts with a consistent style are generated from them, greatly reducing repetitive work and improving the efficiency of font design.
Deep-learning-based methods can treat the task as an image-to-image translation problem, directly generating calligraphy images from standard font images with a deep neural network model; however, the generated fonts usually contain blurring and ghosting artifacts. For characters with complex structures and cursive handwriting styles, these methods often produce unreasonable strokes and incorrectly structured results. Because ancient Chinese characters are enormous in number and complex in structure, building a complete set of personalized Chinese characters remains a difficult task to date.
Disclosure of Invention
Technical problem to be solved
In view of the above technical problems, the invention provides a method for generating ancient character fonts based on a GAN network, which fuses Chinese character stroke semantics with a deep generative network so that the fonts generated by the model have more reasonable strokes.
(II) technical scheme
The invention provides a method for generating ancient character fonts based on a GAN network, which comprises the following steps:
s1: acquiring a font data set, wherein the font data set comprises two parts, the first part is a font library data set with different styles, and the second part is a character data set;
s2: constructing a font style conversion network, and realizing the conversion of Chinese character font styles to obtain font images;
s3: performing detail reconstruction on the font image through a detail reconstruction network model that uses end-to-end deep convolution and fuses Chinese character stroke semantics with a deep generative network;
s4: supplementing the detail reconstruction network model with adversarial training by using the features of different intermediate layers in the decoder;
s5: sequentially inputting the font data set into the adversarially trained detail reconstruction network model, and outputting the corresponding complete target font through the generator;
s6: and evaluating the font generation quality according to a model rating system.
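The flow of steps S1 to S6 can be sketched as follows. All function bodies here are hypothetical stand-ins (the patent's actual networks and rating system are not reproduced), used only to show how the stages connect.

```python
# Toy end-to-end sketch of steps S1-S6; every function is a plain-Python
# stand-in, not the patent's implementation.

def acquire_font_dataset():
    # S1: a font-library part (different styles) and a character part
    return {"font_libraries": ["style_a", "style_b"], "characters": ["一", "永"]}

def style_transfer(dataset):
    # S2: stand-in for the font style conversion network (generator G + discriminator D)
    return [f"{lib}:{ch}" for lib in dataset["font_libraries"] for ch in dataset["characters"]]

def detail_reconstruction(font_images):
    # S3/S4: stand-in for the detail reconstruction model refined by
    # supplementary adversarial training on intermediate decoder features
    return [img + ":refined" for img in font_images]

def evaluate(generated):
    # S6: stand-in quality score from a model rating system
    return len(generated)

dataset = acquire_font_dataset()               # S1
font_images = style_transfer(dataset)          # S2
generated = detail_reconstruction(font_images) # S3-S5
score = evaluate(generated)                    # S6
print(score)  # prints 4: two styles x two characters in this toy run
```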
In some embodiments of the present invention, the method for acquiring the font data set in step S1 is:
s11: randomly selecting a plurality of Chinese character libraries with different handwriting styles and design styles;
s12: dividing a character data set into two parts, wherein the first part is a character set selected according to the use frequency, and then selecting a plurality of characters to supplement the character set;
s13: selecting characters with complex font structures that together contain all 33 stroke types as the second part, obtaining the optimal input character set.
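Steps S11 to S13 can be sketched as a frequency-based selection followed by a greedy stroke-coverage pass. The toy corpus, the stroke-type table, and the reduced stroke-type count below are illustrative assumptions; the patent itself uses 33 stroke types and real font libraries.

```python
from collections import Counter

# Hypothetical sketch of S12-S13: pick a first character subset by corpus
# frequency, then add complex characters until all stroke types are covered.
corpus = "的一是了我不人在他有这上们来的一是的"
strokes_of = {"的": {1, 2, 3}, "一": {1}, "是": {2, 4}, "了": {5},
              "我": {3, 6}, "永": {1, 2, 3, 4, 5, 6}}  # toy stroke-type sets
ALL_STROKE_TYPES = set(range(1, 7))  # the patent uses 33 types; 6 here for brevity

# Part one: the most frequent characters
freq = Counter(corpus)
part_one = [ch for ch, _ in freq.most_common(3)]

# Part two: greedily add structurally complex characters until every
# stroke type appears at least once in the combined set
covered = set().union(*(strokes_of.get(ch, set()) for ch in part_one))
part_two = []
for ch, types in sorted(strokes_of.items(), key=lambda kv: -len(kv[1])):
    if not ALL_STROKE_TYPES <= covered:
        part_two.append(ch)
        covered |= types

input_charset = part_one + [c for c in part_two if c not in part_one]
```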
In some embodiments of the present invention, the font style conversion in step S2 uses a font image generator G and a font image discriminator D to form the font style conversion network; the font image generator G comprises an encoder and a decoder arranged in a U-Net structure with skip connections.
In some embodiments of the present invention, in step S5 the font data set is sequentially input into the adversarially trained detail reconstruction network model, and the corresponding complete target font is output through the generator. An encoder-decoder framework serves as the generator, and the font image size is 255 × 255. The encoder comprises 5 downsampling layers, each using a convolution layer with a 5 × 5 kernel and a stride of 2 followed by a ReLU activation function, and encodes the image into a feature vector. The classified strokes are processed with one-hot encoding to obtain class labels, a corresponding output channel is established for each class, a mapping is formed through spatial feature transformation, and the stroke semantic feature embedding vector is obtained through this mapping. The stroke semantic feature embedding vector is concatenated with the feature vector, and the concatenated vector is sent to the decoder, which comprises 5 upsampling layers, each using a deconvolution layer with a 5 × 5 kernel and a stride of 2 followed by a ReLU activation function, finally producing the output font image.
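The stated encoder geometry (a 255 × 255 input through 5 downsampling layers with 5 × 5 kernels and stride 2) implies the feature-map sizes below. The patent does not state the padding; a padding of 2 per layer, and the per-layer output paddings on the deconvolution side, are assumptions chosen so the decoder exactly mirrors the encoder.

```python
def conv_out(n, k=5, s=2, p=2):
    # standard convolution output-size formula: floor((n + 2p - k) / s) + 1
    return (n + 2 * p - k) // s + 1

def deconv_out(n, k=5, s=2, p=2, out_p=0):
    # transposed-convolution output-size formula: (n - 1)s - 2p + k + out_p
    return (n - 1) * s - 2 * p + k + out_p

# Encoder: 5 downsampling layers starting from the 255x255 font image
size = 255
encoder_sizes = []
for _ in range(5):
    size = conv_out(size)
    encoder_sizes.append(size)
print(encoder_sizes)  # [128, 64, 32, 16, 8] with the assumed padding of 2

# Decoder: 5 stride-2 deconvolution layers inverting the encoder sizes;
# the output paddings below are an assumption that makes the inversion exact
out_pads = [1, 1, 1, 1, 0]
size = 8
decoder_sizes = []
for op in out_pads:
    size = deconv_out(size, out_p=op)
    decoder_sizes.append(size)
print(decoder_sizes)  # [16, 32, 64, 128, 255]
```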
(III) advantageous effects
According to the technical scheme, the invention has at least one of the following beneficial effects:
(1) Chinese character stroke semantics are fused with the deep generative network, so the fonts generated by the model have more reasonable strokes.
(2) The invention uses the features extracted from different intermediate layers in the decoder to provide supplementary adversarial training for the model, making detailed local differences easier to detect and thereby better optimizing the generator.
(3) The invention inputs the font data set into the adversarially trained detail reconstruction network model, which can further improve the quality of the generated images.
Drawings
FIG. 1 is a flow diagram of font generation of the present invention.
Fig. 2 is a diagram of a font style conversion network architecture according to the present invention.
Detailed Description
To make the objects, technical solutions, and advantages of the present invention clearer, the method for generating ancient character fonts based on a GAN network provided by the invention is described in further detail below with reference to a specific embodiment and the accompanying drawings.
Examples
As shown in FIG. 1, the present invention provides a method for generating ancient character fonts based on a GAN network, which comprises the following steps:
s1: acquiring a font data set, wherein the font data set comprises two parts, the first part is a font library data set with different styles, and the second part is a character data set;
s2: constructing a font style conversion network, and realizing the conversion of Chinese character font styles to obtain font images;
s3: performing detail reconstruction on the font image through a detail reconstruction network model that uses end-to-end deep convolution and fuses Chinese character stroke semantics with a deep generative network;
s4: supplementing the detail reconstruction network model with adversarial training by using the features of different intermediate layers in the decoder;
s5: sequentially inputting the font data set into the adversarially trained detail reconstruction network model, and outputting the corresponding complete target font through the generator;
s6: and evaluating the font generation quality according to a model rating system.
The method for acquiring the font data set in step S1 includes: S11: randomly selecting a plurality of Chinese character libraries with different handwriting and design styles; S12: dividing the character data set into two parts, the first part being a character set selected according to usage frequency, supplemented by a plurality of additional characters; S13: selecting characters with complex font structures that together contain all 33 stroke types as the second part, obtaining the optimal input character set.
As shown in FIG. 2, the font style conversion in step S2 uses a font image generator G and a font image discriminator D to form the font style conversion network; the font image generator G comprises an encoder and a decoder arranged in a U-Net structure with skip connections. In step S5, the font data set is sequentially input into the adversarially trained detail reconstruction network model, and the corresponding complete target font is output through the generator. An encoder-decoder framework serves as the generator, and the font image size is 255 × 255. The encoder comprises 5 downsampling layers, each using a convolution layer with a 5 × 5 kernel and a stride of 2 followed by a ReLU activation function, and encodes the image into a feature vector. The classified strokes are processed with one-hot encoding to obtain class labels, a corresponding output channel is established for each class, a mapping is formed through spatial feature transformation, and the stroke semantic feature embedding vector is obtained through this mapping. The stroke semantic feature embedding vector is concatenated with the feature vector, and the concatenated vector is sent to the decoder, which comprises 5 upsampling layers, each using a deconvolution layer with a 5 × 5 kernel and a stride of 2 followed by a ReLU activation function, finally producing the output font image.
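The stroke-semantic embedding step described above can be sketched as follows. The embedding dimension, the fixed fold used as the "spatial feature transform", and the toy feature sizes are illustrative assumptions standing in for the learned mapping.

```python
# Hypothetical sketch: one-hot encode stroke-class labels, map them through a
# stand-in spatial feature transform, and concatenate the result with the
# encoder's feature vector before it enters the decoder.

NUM_STROKE_CLASSES = 33  # stroke types named in the patent

def one_hot(label, num_classes=NUM_STROKE_CLASSES):
    vec = [0.0] * num_classes
    vec[label] = 1.0
    return vec

def spatial_feature_transform(one_hot_vec, embed_dim=8):
    # stand-in for the learned mapping from a class label to a
    # stroke-semantic embedding; here a fixed linear fold for illustration
    return [sum(one_hot_vec[i::embed_dim]) for i in range(embed_dim)]

encoder_features = [0.1] * 512  # toy flattened encoder output
labels = [0, 4, 32]             # stroke classes present in the character

# Sum the per-class embeddings into one stroke-semantic embedding vector
embedding = [0.0] * 8
for lab in labels:
    sft = spatial_feature_transform(one_hot(lab))
    embedding = [a + b for a, b in zip(embedding, sft)]

decoder_input = encoder_features + embedding  # concatenation fed to the decoder
print(len(decoder_input))  # prints 520 (512 features + 8 embedding dims)
```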
The present embodiment has now been described in detail with reference to the accompanying drawings. From the above description, those skilled in the art should have a clear understanding of the present invention.
It is to be understood that the implementations not shown or described in the drawings or in the text of this specification are in a form known to those skilled in the art and are not described in detail. Furthermore, the above definitions of the various elements and methods are not limited to the specific structures, shapes, or configurations shown in the examples.
It is also noted that the illustrations herein may provide examples of parameters that include particular values, but that these parameters need not be exactly equal to the corresponding values, but may be approximated to the corresponding values within acceptable error tolerances or design constraints. Further, unless steps are specifically described or must occur in sequence, the order of the steps is not limited to that listed above and may be changed or rearranged as desired by the desired design. The embodiments described above may be mixed and matched with each other or with other embodiments based on design and reliability considerations, i.e., technical features in different embodiments may be freely combined to form further embodiments.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are only exemplary embodiments of the present invention, and are not intended to limit the present invention, and any modifications, equivalents, improvements and the like made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (4)

1. A method for generating ancient character fonts based on a GAN network is characterized by comprising the following steps:
s1: acquiring a font data set, wherein the font data set comprises two parts, the first part is a font library data set with different styles, and the second part is a character data set;
s2: constructing a font style conversion network, and realizing the conversion of Chinese character font styles to obtain font images;
s3: performing detail reconstruction on the font image through a detail reconstruction network model that uses end-to-end deep convolution and fuses Chinese character stroke semantics with a deep generative network;
s4: supplementing the detail reconstruction network model with adversarial training by using the features of different intermediate layers in the decoder;
s5: sequentially inputting the font data set into the adversarially trained detail reconstruction network model, and outputting the corresponding complete target font through the generator;
s6: and evaluating the font generation quality according to a model rating system.
2. The GAN network-based ancient character font generation method according to claim 1, wherein the method for acquiring the font data set in the step S1 is as follows:
s11: randomly selecting a plurality of Chinese character libraries with different handwriting styles and design styles;
s12: dividing a character data set into two parts, wherein the first part is a character set selected according to the use frequency, and then selecting a plurality of characters to supplement the character set;
s13: selecting characters with complex font structures that together contain all 33 stroke types as the second part, obtaining the optimal input character set.
3. The GAN network-based ancient character font generation method according to claim 1, wherein the font style conversion in step S2 uses a font image generator G and a font image discriminator D to form the font style conversion network; the font image generator G comprises an encoder and a decoder arranged in a U-Net structure with skip connections.
4. The GAN network-based ancient character font generation method according to claim 1, wherein in step S5 the font data set is sequentially input into the adversarially trained detail reconstruction network model, and the corresponding complete target font is output through the generator; an encoder-decoder framework serves as the generator; the encoder receives a font image of size 255 × 255 and comprises 5 downsampling layers, each using a convolution layer with a 5 × 5 kernel and a stride of 2 followed by a ReLU activation function, encoding the image into a feature vector; the classified strokes are processed with one-hot encoding to obtain class labels, a corresponding output channel is established for each class, a mapping is formed through spatial feature transformation, and the stroke semantic feature embedding vector is obtained through this mapping; the stroke semantic feature embedding vector is concatenated with the feature vector; the concatenated vector is sent to the decoder, which comprises 5 upsampling layers, each using a deconvolution layer with a 5 × 5 kernel and a stride of 2 followed by a ReLU activation function, finally producing the output font image.
Application CN202210821980.6A, priority date 2022-07-13, filed 2022-07-13: Method for generating ancient character fonts based on GAN network. Published as CN115205420A; status: Pending.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210821980.6A CN115205420A (en) 2022-07-13 2022-07-13 Method for generating ancient character fonts based on GAN network


Publications (1)

Publication Number Publication Date
CN115205420A 2022-10-18

Family

ID=83580362

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210821980.6A Pending CN115205420A (en) 2022-07-13 2022-07-13 Method for generating ancient character fonts based on GAN network

Country Status (1)

Country Link
CN (1) CN115205420A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117079263A (en) * 2023-10-16 2023-11-17 内江师范学院 Method, device, equipment and medium for extracting stele characters
CN117079263B (en) * 2023-10-16 2024-01-02 内江师范学院 Method, device, equipment and medium for extracting stele characters


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination