CN116009749A

CN116009749A - Handwritten character erasing method and system based on attention mechanism

Info

Publication number: CN116009749A
Application number: CN202211391605.9A
Authority: CN
Inventors: 杨辉; 黄家昌; 赵宝华; 邱道椿
Original assignee: Fujian Ecan Information Technology Co ltd
Current assignee: Fujian Ecan Information Technology Co ltd
Priority date: 2022-11-08
Filing date: 2022-11-08
Publication date: 2023-04-25

Abstract

The invention provides a handwritten character erasing method and a system based on an attention mechanism in the technical field of image processing, wherein the method comprises the following steps: step S10, obtaining a large number of original pictures containing handwritten characters and printing bodies, and constructing a picture set based on the original pictures; step S20, carrying out data enhancement processing and picture preprocessing on the picture set to obtain a training data set; step S30, creating an erasure model based on the attention mechanism; step S40, training the erasure model by using the training data set; and S50, automatically erasing the handwritten characters of the picture to be erased by using the trained erasing model. The invention has the advantages that: the accuracy and the efficiency of hand-written character erasure are greatly improved.

Description

Handwritten character erasing method and system based on attention mechanism

Technical Field

The invention relates to the technical field of image processing, in particular to a handwritten character erasing method and system based on an attention mechanism.

Background

Along with the progress of science and technology, OCR scanning is used in numerous scenes such as learning, working and the like, and through technology and algorithm, handwritten characters (handwriting) on a scanned paper document can be restored and repaired, namely, the handwritten characters on a picture are electronically erased, so that the appearance of the document is restored, the document does not need to be edited again, and the user experience is improved.

Conventionally, erasure of a handwritten character requires two steps, first, extraction of the handwritten character and then filling of the handwritten character area with pixel values. However, the conventional method is unsatisfactory in terms of erasing effect, and is too labor-consuming to erase manually using image editing software.

Therefore, how to provide a handwritten character erasing method and system based on an attention mechanism, so as to improve the accuracy and efficiency of handwritten character erasing, is a technical problem to be solved.

Disclosure of Invention

The invention aims to solve the technical problem of providing a handwritten character erasing method and a handwritten character erasing system based on an attention mechanism, which can improve the accuracy and efficiency of handwritten character erasing.

In a first aspect, the present invention provides a method for erasing handwritten characters based on an attention mechanism, including the steps of:

step S10, obtaining a large number of original pictures containing handwritten characters and printing bodies, and constructing a picture set based on the original pictures;

step S20, carrying out data enhancement processing and picture preprocessing on the picture set to obtain a training data set;

step S30, creating an erasure model based on the attention mechanism;

step S40, training the erasure model by using the training data set;

and S50, automatically erasing the handwritten characters of the picture to be erased by using the trained erasing model.

Further, the step S10 specifically includes:

obtaining a large number of original pictures containing handwritten characters and printing bodies, removing the handwritten characters in the original pictures by utilizing image editing software to obtain target pictures, carrying out mask processing on the target pictures to obtain mask pictures, and constructing a picture set based on the original pictures, the target pictures and the mask pictures.

Further, the step S20 specifically includes:

carrying out random rotation, random overturn, brightness transformation and random graying data enhancement processing on each picture in the picture set;

and setting a picture size, cutting the picture subjected to the data enhancement processing based on the picture size to complete picture preprocessing, and further obtaining a training data set.

Further, in the step S30, the erasure model is provided with a semantic segmentation module, a coarse erasure module and a fine erasure module;

the semantic segmentation module is used for carrying out semantic segmentation on the picture so as to distinguish the handwritten characters from the printed matter; the rough erasing module is used for erasing the handwritten characters of the picture once; the fine erasing module is used for secondarily erasing the handwritten characters of the picture;

the semantic segmentation module, the coarse erasure module and the fine erasure module are all based on a Unet framework of an attention mechanism; the semantic segmentation module and the rough erasure module share a feature extraction part; the high-level network layer of the semantic segmentation module and the rough erasure module is provided with a CBAM unit and an ASPP unit; the low-level network layer of the fine erase module is provided with Non-Local self-attention units, and the number of network characteristic channels is larger than that of the coarse erase module.

Further, the step S40 specifically includes:

inputting an original picture in the training data set into an erasure model for training, obtaining a predicted target picture and a predicted mask picture, calculating target loss values of the target picture and the predicted target picture in the training data set through a target loss function, calculating mask loss values of the mask picture and the predicted mask picture in the training data set through a mask loss function, accumulating the target loss values and the mask loss values to obtain a total loss value, and continuously training the erasure model until a preset training iteration number is reached;

the target loss function is an average absolute error function; the mask loss function is a cross entropy loss function.

In a second aspect, the present invention provides a handwriting character erasing system based on an attention mechanism, including the following modules:

the picture set construction module is used for acquiring a large number of original pictures containing handwritten characters and printing bodies and constructing a picture set based on the original pictures;

the training data set construction module is used for carrying out data enhancement processing and picture preprocessing on the picture set so as to obtain a training data set;

the erasure model creation module is used for creating an erasure model based on the attention mechanism;

the erasure model training module is used for training the erasure model by utilizing the training data set;

and the handwritten character erasing module is used for automatically erasing the handwritten characters of the picture to be erased by utilizing the trained erasing model.

Further, the picture set construction module is specifically configured to:

Further, the training data set construction module is specifically configured to:

Further, in the erasure model creation module, the erasure model is provided with a semantic segmentation module, a coarse erasure module and a fine erasure module;

Further, the erasure model training module is specifically configured to:

The invention has the advantages that:

the method comprises the steps of obtaining an original picture containing handwritten characters and a printing body, constructing a picture set, carrying out data enhancement processing and picture preprocessing on the picture set to obtain a training data set, training a created erasure model by utilizing the training data set, so that the erasure model has stronger robustness and generalization, creating the erasure model based on an attention mechanism, enhancing the characteristic extraction capability of the erasure model, setting an ASPP unit to promote the receptive fields of a semantic segmentation module and a rough erasure module, further improving the characteristic extraction capability, setting Non-Local self-attention units on a low-level network layer of the fine erasure module, and effectively improving the capability of the fine erasure module to obtain global information, greatly improving the learning capability of the module, automatically erasing the handwritten characters of the picture to be erased by the trained erasure model, and finally greatly improving the accuracy and efficiency of handwritten character erasure.

Drawings

The invention will be further described with reference to examples of embodiments with reference to the accompanying drawings.

FIG. 1 is a flow chart of a handwriting character erasing method based on an attention mechanism of the present invention.

FIG. 2 is a schematic diagram of a handwriting character erasing system based on an attention mechanism according to the present invention.

FIG. 3 is a schematic diagram of the erasure model of the present invention.

Fig. 4 is a schematic structural diagram of the semantic segmentation module and the rough erasure module of the present invention.

Fig. 5 is a schematic diagram of the structure of the fine erase module of the present invention.

Detailed Description

According to the technical scheme in the embodiment of the application, the overall thought is as follows: training an erasure model by utilizing a training data set obtained by data enhancement processing and picture preprocessing to improve robustness and generalization of the erasure model, creating the erasure model based on an attention mechanism, enhancing feature extraction capability, setting an ASPP unit to improve a receptive field of a module, setting a Non-Local self-attention unit and more network feature channel numbers at a low-level network layer of a fine erasure module, and effectively improving the capability and learning capability of the fine erasure module for acquiring global information, so that the trained erasure model can automatically erase handwritten characters of pictures to be erased, and further improving the erasure accuracy and efficiency of the handwritten characters.

Referring to fig. 1 to 5, a preferred embodiment of a handwriting character erasing method based on an attention mechanism of the present invention includes the following steps:

step S30, creating an erasure model based on the attention mechanism;

step S40, training the erasure model by using the training data set;

The step S10 specifically includes:

The image editing software can be selected as photoshop; and removing and filling the handwritten characters in the original picture pixel by utilizing the image editing software to obtain a target picture.

The mask processing specifically includes: setting a pixel threshold value, performing difference on pixel values of the original picture and the target picture to obtain an average difference value of three channels, updating the average difference value which is larger than or equal to the pixel threshold value to be 1, and updating the average difference value which is smaller than the pixel threshold value to be the pixel threshold value or keeping the original value.

The step S20 specifically includes:

the data enhancement processing can greatly improve the generalization capability and robustness of the model, is an indispensable picture preprocessing process before training, can enhance the segmentation capability of characters in different directions through random rotation and random overturn, and can ensure the accuracy when the model segments images with abnormal illumination or shadows through brightness conversion and graying.

Setting a picture size, cutting the picture subjected to the data enhancement processing based on the picture size to complete picture pretreatment, and further obtaining a training data set; the picture size may be set as desired, for example 768×768, to ensure consistency of the input picture size.

In the step S30, the erasure model is provided with a semantic segmentation module, a coarse erasure module and a fine erasure module; the fine erasing module is connected behind the coarse erasing module;

the semantic segmentation module, the coarse erasure module and the fine erasure module are all based on a Unet framework of an attention mechanism, and the Unet can achieve better balance between precision and speed; the semantic segmentation module and the rough erasure module share a feature extraction part; the high-level network layers of the semantic segmentation module and the rough erasure module are provided with a CBAM unit and an ASPP unit, the CBAM unit can make the model focus on a target area to be segmented, and the ASPP unit is used for expanding a receptive field to enhance the feature extraction capability so as to cope with detection areas with different sizes; the low-level network layer of the fine erasure module is provided with Non-Local self-attention units, and the number of network characteristic channels is larger than that of the coarse erasure module; since the degradation difference of different pictures is larger than the denoising task no matter the colors are enhanced or the handwritten characters are erased, and a larger receptive field is needed than the denoising (rough erasure module), the fine erasure module needs to perform downsampling for more times, a Non-Local self-attention unit is added to further increase the receptive field, and the number of network characteristic channels is increased to improve the learning capability.

The step S40 specifically includes:

inputting an original picture in the training data set into an erasure model for training, obtaining a predicted target picture and a predicted mask picture, calculating target loss values of the target picture and the predicted target picture in the training data set through a target loss function, calculating mask loss values of the mask picture and the predicted mask picture in the training data set through a mask loss function, accumulating the target loss values and the mask loss values to obtain a total loss value, and continuously training the erasure model until a preset training iteration number is reached, namely training until the model converges;

The step S50 specifically includes:

obtaining pictures to be erased, performing angle correction on the pictures to be erased, cutting the pictures to be erased based on a set picture size to obtain sub-pictures, inputting each sub-picture into a trained erasure model to perform handwriting character erasure, and then splicing the sub-pictures after erasure to complete handwriting character erasure of the pictures to be erased.

The size of the picture size is constrained by the size of the video memory, and in the case of video memory permission, a larger picture size (dividing size) can be adopted in order to increase the erasing speed.

The preferred embodiment of the handwriting character erasing system based on the attention mechanism comprises the following modules:

The picture set construction module is specifically configured to:

The training data set construction module is specifically configured to:

The erasure model creation module is provided with a semantic segmentation module, a coarse erasure module and a fine erasure module; the fine erasing module is connected behind the coarse erasing module;

The erasure model training module is specifically configured to:

The handwritten character erasing module is specifically used for:

In summary, the invention has the advantages that:

While specific embodiments of the invention have been described above, it will be appreciated by those skilled in the art that the specific embodiments described are illustrative only and not intended to limit the scope of the invention, and that equivalent modifications and variations of the invention in light of the spirit of the invention will be covered by the claims of the present invention.

Claims

1. A handwritten character erasing method based on an attention mechanism is characterized by comprising the following steps of: the method comprises the following steps:

step S30, creating an erasure model based on the attention mechanism;

step S40, training the erasure model by using the training data set;

2. A method for erasing handwritten characters based on an attention mechanism as recited in claim 1, wherein: the step S10 specifically includes:

3. A method for erasing handwritten characters based on an attention mechanism as recited in claim 1, wherein: the step S20 specifically includes:

4. A method for erasing handwritten characters based on an attention mechanism as recited in claim 1, wherein: in the step S30, the erasure model is provided with a semantic segmentation module, a coarse erasure module and a fine erasure module;

5. A method for erasing handwritten characters based on an attention mechanism as recited in claim 1, wherein: the step S40 specifically includes:

6. A handwriting character erasing system based on an attention mechanism, characterized in that: the device comprises the following modules:

7. An attention-based handwritten character erasing system as in claim 6, wherein: the picture set construction module is specifically configured to:

8. An attention-based handwritten character erasing system as in claim 6, wherein: the training data set construction module is specifically configured to:

9. An attention-based handwritten character erasing system as in claim 6, wherein: the erasure model creation module is provided with a semantic segmentation module, a coarse erasure module and a fine erasure module;

10. An attention-based handwritten character erasing system as in claim 6, wherein: the erasure model training module is specifically configured to: