CN112419159A

CN112419159A - Character image super-resolution reconstruction system and method

Info

Publication number: CN112419159A
Application number: CN202011417305.4A
Authority: CN
Inventors: 张晓东; 张月
Original assignee: Shanghai Internet Software Group Co ltd
Current assignee: Shanghai Internet Software Group Co ltd
Priority date: 2020-12-07
Filing date: 2020-12-07
Publication date: 2021-02-26

Abstract

The invention discloses a system and a method for reconstructing super-resolution of character images, wherein the method comprises the following steps: the feature extraction module extracts a set feature layer corresponding to the image to be processed; inputting the characteristic layer into a super-resolution image reconstruction module, carrying out up-sampling on the characteristic layer, and carrying out feature extraction on the up-sampled characteristic layer to obtain a reconstructed super-resolution character image; inputting the characteristic layer into a character recognition module, performing down-sampling on the characteristic layer, performing time sequence feature extraction on the feature layer after down-sampling, and performing character recognition on the extracted time sequence feature to obtain character contents in a character image to be processed; and inputting the characteristic layer into a super-resolution gradient map reconstruction module, up-sampling the characteristic layer, and extracting the characteristics of the up-sampled characteristic layer to obtain a reconstructed super-resolution gradient map. The multitask character image super-resolution reconstruction system and the multitask character image super-resolution reconstruction method can improve the definition and the reliability of the reconstructed character image.

Description

Character image super-resolution reconstruction system and method

Technical Field

The invention belongs to the technical field of image processing, relates to an image processing system, and particularly relates to a system and a method for reconstructing super-resolution of a character image.

Background

The deep neural network is a complex mathematical model, input data obtain corresponding output data through the deep neural network, a loss function is constructed through the difference between the output data and the marking data, the loss function calculates the gradient of parameters in the deep neural network, the parameters in the deep neural network are updated through gradient back propagation, and the difference between the output data and the marking data is continuously reduced through continuously updating the parameters. Wherein the input data and the marking data form training data required by deep neural network training, and the performance of the deep neural network is related to the structure of the neural network and the training data. The deep neural network has acquired performance superior to that of the traditional method in the fields of image, voice, natural language processing and the like, and is widely applied.

The image super-resolution reconstruction means reconstructing a corresponding high-resolution image from an observed low-resolution image. With the rapid development of the deep learning technology, the image super-resolution reconstruction method based on the deep neural network is the image super-resolution reconstruction method with the optimal performance at present.

The image super-resolution reconstruction method based on the deep neural network generally comprises two modules: the feature extraction module 21 and the super-resolution image reconstruction module 31 obtain the reconstructed super-resolution character image 41, during training, an image loss function 51 between the reconstructed super-resolution character image 41 and a high-resolution image corresponding to the character image 11 to be processed is calculated, image training gradient backward propagation is performed based on the image loss function 51, and parameters of the feature extraction module 21 and the super-resolution image reconstruction module 31 are updated, so that the feature extraction module 21 can extract image information of the image 11 to be processed, and the whole is as shown in fig. 1. The existing image super-resolution reconstruction method based on the deep neural network obtains good performance in natural image reconstruction. When the existing image super-resolution reconstruction method is directly used for character image super-resolution reconstruction, the reconstructed super-resolution character image has the problems of fuzzy character edge and low reliability:

compared with a natural image, the character image contains a large amount of gradient information, and when the existing image super-resolution reconstruction method is directly used for character image super-resolution reconstruction, the gradient information in the character image cannot be fully utilized, so that the character edge of the reconstructed super-resolution character image is fuzzy;

the super-resolution reconstruction is an ill-posed problem in nature, that is, for a low-resolution image, there are usually many high-resolution images corresponding to the low-resolution image, and the ill-posed problem may cause the change of the text content in the reconstructed super-resolution text image, so that the reconstructed super-resolution text image has low reliability.

In view of the above, there is an urgent need to design a new text image reconstruction method to overcome at least some of the above-mentioned defects of the existing text image reconstruction methods.

Disclosure of Invention

The invention provides a character image super-resolution reconstruction system and method, which can reconstruct a character image with reduced resolution into a character image with super-resolution and provide a clear and credible image for high-level tasks such as character detection and identification.

In order to solve the technical problem, according to one aspect of the present invention, the following technical solutions are adopted:

a text image super-resolution reconstruction system, the system comprising:

the characteristic extraction module is used for extracting a set characteristic layer corresponding to the image to be processed;

the super-resolution image reconstruction module is connected with the feature extraction module and is used for up-sampling the feature layer and extracting features of the up-sampled feature layer to obtain a reconstructed super-resolution character image;

the character recognition module is connected with the feature extraction module and used for down-sampling the feature layer, extracting time sequence features of the down-sampled feature layer and recognizing characters of the extracted time sequence features to obtain character contents in the character image to be processed;

and the super-resolution gradient map reconstruction module is connected with the feature extraction module and is used for up-sampling the feature layer and extracting features of the up-sampled feature layer to obtain a reconstructed super-resolution gradient map.

As an embodiment of the present invention, the system further includes:

the image loss function acquisition module is used for calculating an image loss function according to the super-resolution character image acquired by the super-resolution image reconstruction module;

the character loss function acquisition module is used for calculating a character loss function according to the character content acquired by the character recognition module;

the gradient loss function acquisition module is used for calculating a gradient loss function according to the super-resolution gradient map acquired by the super-resolution gradient map reconstruction module;

a loss function fusion module, configured to fuse the image loss function obtained by the image loss function obtaining module, the character loss function obtained by the character loss function obtaining module, and the gradient loss function obtained by the gradient loss function obtaining module to obtain a fusion loss function; and (4) training the multitask character image super-resolution reconstruction network by using the fusion loss function.

As an embodiment of the present invention, the feature extraction module is configured to obtain an advanced feature layer of a text image to be processed, where the advanced feature layer includes deep feature information of the text image to be processed;

the super-resolution image reconstruction module is used for carrying out up-sampling on the advanced feature layer by a deep neural network, carrying out feature extraction on the feature layer after up-sampling and obtaining features output by each layer of the deep neural network; determining the characteristics output by the last layer of deep neural network as a reconstructed super-resolution character image;

the character recognition module is used for carrying out down-sampling on the advanced feature layer by a deep neural network comprising a pooling layer, so that the height of the down-sampled feature layer is a set value; sending the down-sampled feature layer into a bidirectional LSTM network to extract time sequence features, and outputting the time sequence features of the character and image to be processed; further extracting the characteristics of the time sequence characteristics through a full connection layer and a softmax function, and determining the characteristics of the last layer as the character content of the character image to be processed;

the super-resolution gradient map reconstruction module is used for performing up-sampling on the advanced feature layer by a deep neural network, performing feature extraction on the up-sampled feature layer and obtaining features output by each layer of the deep neural network; and determining the characteristics output by the last layer of deep neural network as a reconstructed super-resolution gradient map.

As an embodiment of the present invention, the image loss function obtaining module is configured to reversely propagate the calculated image loss function to the feature extraction module through an image training gradient; the high-level feature layer extracted by the feature extraction module contains rich image information, so that the super-resolution character image reconstructed by the super-resolution reconstruction module is more vivid;

the character loss function acquisition module is used for reversely transmitting the calculated character loss function to the feature extraction module through a character training gradient, so that a feature layer extracted by the feature extraction module contains rich character information, the character content of the super-resolution character image reconstructed by the super-resolution image reconstruction module is more prepared, and the reliability of the reconstructed super-resolution character image is improved;

the gradient loss function acquisition module is used for reversely transmitting the calculated gradient loss function to the feature extraction module through a gradient training gradient, so that the high-level feature layer extracted by the feature extraction module contains rich gradient information, the super-resolution character image and the character edge reconstructed by the super-resolution image reconstruction module are clearer, and the definition of the reconstructed super-resolution character image is improved.

As an embodiment of the present invention, the image loss function obtaining module is configured to calculate an image loss function, and specifically includes: calculating L the reconstructed super-resolution character image and the high-resolution character image corresponding to the character image to be processed₁Loss, so that the reconstructed super-resolution character image has the pixel value of the corresponding high-resolution character image;

the text loss function obtaining module is used for calculating a text loss function, and specifically includes: calculating the CTC loss by using the character content of the character image to be processed acquired by the character recognition module and the corresponding marked character content, so that the character content recognized by the character recognition module is more correct;

the gradient loss function obtaining module is configured to calculate a gradient loss function, and specifically includes: calculating a gradient map of the high-resolution character image corresponding to the character image to be processed through a Sobel operator to obtain a target gradient map; calculating L by the target gradient map and the reconstructed super-resolution gradient map₁Loss, so that the reconstructed super-resolution gradient map has the pixel value of the target gradient map;

and the loss function fusion module performs weighted summation on the image loss function, the character loss function and the gradient loss function to obtain a fusion loss function.

According to one aspect of the invention, the following technical scheme is adopted: a text image super-resolution reconstruction method comprises the following steps:

the feature extraction module extracts a set feature layer corresponding to the image to be processed;

inputting the characteristic layer into a super-resolution image reconstruction module, carrying out up-sampling on the characteristic layer, and carrying out feature extraction on the up-sampled characteristic layer to obtain a reconstructed super-resolution character image;

inputting the characteristic layer into a character recognition module, performing down-sampling on the characteristic layer, performing time sequence feature extraction on the feature layer after down-sampling, and performing character recognition on the extracted time sequence feature to obtain character contents in a character image to be processed;

and inputting the characteristic layer into a super-resolution gradient map reconstruction module, up-sampling the characteristic layer, and extracting the characteristics of the up-sampled characteristic layer to obtain a reconstructed super-resolution gradient map.

As an embodiment of the present invention, the method further comprises:

the image loss function acquisition module calculates an image loss function according to the super-resolution character image acquired by the super-resolution image reconstruction module;

the character loss function acquisition module calculates a character loss function according to the character content acquired by the character recognition module;

the gradient loss function acquisition module calculates a gradient loss function according to the super-resolution gradient map acquired by the super-resolution gradient map reconstruction module;

the loss function fusion module fuses the image loss function acquired by the image loss function acquisition module, the character loss function acquired by the character loss function acquisition module and the gradient loss function acquired by the gradient loss function acquisition module to acquire a fusion loss function; and (4) training the multitask character image super-resolution reconstruction network by using the fusion loss function.

As an embodiment of the present invention, the feature extraction module obtains an advanced feature layer of the character image to be processed, where the advanced feature layer includes deep feature information of the character image to be processed;

the super-resolution image reconstruction module performs up-sampling on the advanced feature layer by a deep neural network, performs feature extraction on the up-sampled feature layer, and obtains features output by each layer of the deep neural network; determining the characteristics output by the last layer of deep neural network as a reconstructed super-resolution character image;

the character recognition module carries out down-sampling of the deep neural network comprising the pooling layer on the high-level feature layer, so that the height of the down-sampled feature layer is a set value; sending the down-sampled feature layer into a bidirectional LSTM network to extract time sequence features, and outputting the time sequence features of the character and image to be processed; further extracting the characteristics of the time sequence characteristics through a full connection layer and a softmax function, and determining the characteristics of the last layer as the character content of the character image to be processed;

the super-resolution gradient map reconstruction module performs up-sampling on the advanced feature layer by a deep neural network, performs feature extraction on the up-sampled feature layer, and obtains features output by each layer of the deep neural network; and determining the characteristics output by the last layer of deep neural network as a reconstructed super-resolution gradient map.

As an embodiment of the present invention, the image loss function obtaining module reversely propagates the calculated image loss function to the feature extraction module through an image training gradient; the high-level feature layer extracted by the feature extraction module contains rich image information, so that the super-resolution character image reconstructed by the super-resolution reconstruction module is more vivid;

the character loss function acquisition module reversely transmits the calculated character loss function to the feature extraction module through a character training gradient, so that a feature layer extracted by the feature extraction module contains rich character information, the character content of the super-resolution character image reconstructed by the super-resolution image reconstruction module is more prepared, and the reliability of the reconstructed super-resolution character image is improved;

the gradient loss function acquisition module reversely transmits the calculated gradient loss function to the feature extraction module through a gradient training gradient, so that the high-level feature layer extracted by the feature extraction module contains rich gradient information, the super-resolution character image and the character edge reconstructed by the super-resolution image reconstruction module are clearer, and the definition of the reconstructed super-resolution character image is improved.

As an embodiment of the present invention, the image loss function obtaining module calculates an image loss function, and specifically includes: calculating L the reconstructed super-resolution character image and the high-resolution character image corresponding to the character image to be processed₁Loss, so that the reconstructed super-resolution character image has the pixel value of the corresponding high-resolution character image;

the character loss function obtaining module calculates a character loss function, and specifically includes: calculating the CTC loss by using the character content of the character image to be processed acquired by the character recognition module and the corresponding marked character content, so that the character content recognized by the character recognition module is more correct;

the gradient loss function obtaining module calculates a gradient loss function, and specifically includes: treating the part to be treatedCalculating a gradient map of the high-resolution character image corresponding to the character image through a Sobel operator to obtain a target gradient map; calculating L by the target gradient map and the reconstructed super-resolution gradient map₁Loss, so that the reconstructed super-resolution gradient map has the pixel value of the target gradient map;

The invention has the beneficial effects that: the multitask character image super-resolution reconstruction system and the multitask character image super-resolution reconstruction method provided by the invention have the advantages that the character image with reduced resolution ratio is reconstructed into the super-resolution character image, the problems that when the existing image super-resolution reconstruction method based on the deep neural network is applied to character image reconstruction, the reconstructed super-resolution character image is fuzzy in character edge and low in character content reliability are solved, and a clear and credible image is provided for high-level tasks such as semantic analysis of the character image.

Compared with the existing image super-resolution reconstruction method based on the deep neural network, the method has the following two advantages:

(1) the reconstructed super-resolution character image has clear character edges:

according to the multitask character image super-resolution reconstruction method, the super-resolution gradient map reconstruction module is added on the basis of the super-resolution image reconstruction module in parallel, the gradient loss function is calculated, and when network parameters are updated, the gradient training gradient is propagated reversely, so that the high-level feature layer extracted by the feature extraction module contains rich gradient information, and the character edge of the super-resolution character image reconstructed by the super-resolution image reconstruction module is clearer.

(2) The reconstructed super-resolution character image has high character content reliability:

according to the multitask character image super-resolution reconstruction method, the character recognition module is added on the basis of the super-resolution image reconstruction module in parallel, the character loss function is calculated, and when network parameters are updated, the high-level feature layer extracted by the feature extraction module contains abundant character information through reverse propagation of the character training gradient, so that the character content of the super-resolution character image reconstructed by the super-resolution image reconstruction module is correct, and the reliability is high.

Drawings

Fig. 1 is a schematic composition diagram of a conventional text image super-resolution reconstruction system.

Fig. 2 is a schematic composition diagram of a super-resolution reconstruction system for text images according to an embodiment of the present invention.

Fig. 3 is a schematic composition diagram of a super-resolution reconstruction system for text images according to an embodiment of the present invention.

Fig. 4 is a schematic composition diagram of a super-resolution reconstruction system for text images according to an embodiment of the present invention.

Detailed Description

Preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

For a further understanding of the invention, reference will now be made to the preferred embodiments of the invention by way of example, and it is to be understood that the description is intended to further illustrate features and advantages of the invention, and not to limit the scope of the claims.

The description in this section is for several exemplary embodiments only, and the present invention is not limited only to the scope of the embodiments described. It is within the scope of the present disclosure and protection that the same or similar prior art means and some features of the embodiments may be interchanged.

The steps in the embodiments in the specification are only expressed for convenience of description, and the implementation manner of the present application is not limited by the order of implementation of the steps. The term "connected" in the specification includes both direct connection and indirect connection.

The invention discloses a character image super-resolution reconstruction system, and fig. 2 and 3 are schematic composition diagrams of the character image super-resolution reconstruction system in an embodiment of the invention; referring to fig. 2 and 3, the system includes: the system comprises a feature extraction module 1, a super-resolution image reconstruction module 2, a character recognition module 3 and a super-resolution gradient map reconstruction module 4; the feature extraction module 1 is respectively connected with the super-resolution image reconstruction module 2, the character recognition module 3 and the super-resolution gradient map reconstruction module 4.

The feature extraction module 1 is used for extracting a set feature layer corresponding to an image to be processed; the super-resolution image reconstruction module 2 is used for up-sampling the feature layer and extracting features of the up-sampled feature layer to obtain a reconstructed super-resolution character image; the character recognition module 3 is used for down-sampling the characteristic layer, extracting time sequence characteristics of the characteristic layer after down-sampling, and performing character recognition on the extracted time sequence characteristics to obtain character contents in the character image to be processed; the super-resolution gradient map reconstruction module 4 is used for up-sampling the feature layer, and extracting features of the up-sampled feature layer to obtain a reconstructed super-resolution gradient map.

In an embodiment of the invention, the feature extraction module is configured to obtain an advanced feature layer of the text image to be processed, where the advanced feature layer includes deep feature information of the text image to be processed. In one embodiment, the text image to be processed may be input to a feature extraction module in the ESRGAN generation network, thereby obtaining an advanced feature layer.

FIG. 4 is a schematic diagram illustrating a super-resolution reconstruction system for text images according to an embodiment of the present invention; referring to fig. 4, in an embodiment of the present invention, the super-resolution image reconstruction module 2 is configured to perform up-sampling on the feature layer by using a deep neural network, perform feature extraction on the up-sampled feature layer, and obtain features output by each layer of the deep neural network; and determining the characteristics output by the last layer of deep neural network as the reconstructed super-resolution character image.

The character recognition module 3 is used for performing down-sampling on the advanced feature layer by using a deep neural network comprising a pooling layer, so that the height of the down-sampled feature layer is a set value 1; sending the down-sampled feature layer into a bidirectional LSTM network to extract time sequence features, and outputting the time sequence features of the character and image to be processed; and further providing the characteristics of the time sequence characteristics through a full connection layer and a softmax function, and determining the characteristics of the last layer as the character content of the character image to be processed.

The super-resolution gradient map reconstruction module 4 is used for performing up-sampling of the deep neural network on the high-level feature layer, performing feature extraction on the up-sampled feature layer, and obtaining features output by each layer of the deep neural network; and determining the characteristics output by the last layer of deep neural network as a reconstructed super-resolution gradient map.

As shown in fig. 4, in an embodiment of the present invention, the system further includes an image loss function obtaining module 5, a text loss function obtaining module 6, a gradient loss function obtaining module 7, and a loss function fusing module 8.

The image loss function acquisition module 5 is used for calculating an image loss function according to the super-resolution character image acquired by the super-resolution image reconstruction module; the text loss function acquisition module 6 is used for calculating a text loss function according to the text content acquired by the text recognition module; the gradient loss function acquisition module 7 is used for calculating a gradient loss function according to the super-resolution gradient map acquired by the super-resolution gradient map reconstruction module. The loss function fusion module 8 is configured to fuse the image loss function acquired by the image loss function acquisition module 5, the text loss function acquired by the text loss function acquisition module 6, and the gradient loss function acquired by the gradient loss function acquisition module 7 to acquire a fusion loss function; and (4) training the multitask character image super-resolution reconstruction network by using the fusion loss function.

In an embodiment of the present invention, the image loss function obtaining module 5 is configured to calculate an image loss function, and specifically includes: calculating L the reconstructed super-resolution character image and the high-resolution character image corresponding to the character image to be processed₁And loss, so that the reconstructed super-resolution character image has the pixel value of the corresponding high-resolution character image.

The text loss function obtaining module 6 is configured to calculate a text loss function, and specifically includes: the character content of the character image to be processed acquired by the character recognition module and the corresponding marked character content are used for calculating the CTC loss, so that the character content recognized by the character recognition module is more correct.

The ladderThe degree loss function obtaining module 7 is configured to calculate a gradient loss function, and specifically includes: calculating a gradient map of the high-resolution character image corresponding to the character image to be processed through a Sobel operator to obtain a target gradient map; calculating L by the target gradient map and the reconstructed super-resolution gradient map₁And (4) losing, so that the reconstructed super-resolution gradient map has the pixel value of the target gradient map.

And the loss function fusion module 8 performs weighted summation on the image loss function, the character loss function and the gradient loss function to obtain a fusion loss function.

With reference to fig. 3 and 4, in an embodiment of the present invention, the image loss function obtaining module 5 is configured to reversely propagate the calculated image loss function to the feature extraction module through an image training gradient; the high-level feature layer extracted by the feature extraction module contains abundant image information, so that the super-resolution character image reconstructed by the super-resolution reconstruction module is more vivid.

The character loss function acquisition module 6 is used for reversely transmitting the calculated character loss function to the feature extraction module through a character training gradient, so that the feature layer extracted by the feature extraction module contains rich character information, thereby helping the super-resolution character image content reconstructed by the super-resolution image reconstruction module to be more ready and improving the reliability of the reconstructed super-resolution character image.

The gradient loss function acquisition module 7 is used for reversely transmitting the calculated gradient loss function to the feature extraction module through a gradient training gradient, so that the high-level feature layer extracted by the feature extraction module contains rich gradient information, thereby helping the super-resolution character image and the character edge reconstructed by the super-resolution image reconstruction module to be clearer and improving the definition of the reconstructed super-resolution character image.

The loss function fusion module 8 is further configured to reversely propagate the fusion loss function to the image loss function obtaining module 5, the text loss function obtaining module 6, and the gradient loss function obtaining module 7.

The invention also discloses a multitask character image super-resolution reconstruction method, which can refer to fig. 4, and the method comprises the following steps:

In an embodiment of the present invention, the feature extraction module obtains an advanced feature layer of the text image to be processed, where the advanced feature layer includes deep feature information of the text image to be processed.

In an embodiment of the present invention, the super-resolution image reconstruction module performs up-sampling on the feature layer by using a deep neural network, and performs feature extraction on the up-sampled feature layer to obtain features output by each layer of the deep neural network; and determining the characteristics output by the last layer of deep neural network as the reconstructed super-resolution character image.

The character recognition module carries out down-sampling of the deep neural network comprising the pooling layer on the high-level feature layer, so that the height of the down-sampled feature layer is a set value; sending the down-sampled feature layer into a bidirectional LSTM network to extract time sequence features, and outputting the time sequence features of the character and image to be processed; and further providing the characteristics of the time sequence characteristics through a full connection layer and a softmax function, and determining the characteristics of the last layer as the character content of the character image to be processed.

In an embodiment of the invention, the method further comprises a training process.

Referring to fig. 4, in an embodiment of the invention, the method further includes:

In an embodiment of the present invention, the image loss function obtaining module calculates an image loss function, which specifically includes: calculating L the reconstructed super-resolution character image and the high-resolution character image corresponding to the character image to be processed₁And loss, so that the reconstructed super-resolution character image has the pixel value of the corresponding high-resolution character image.

The character loss function obtaining module calculates a character loss function, and specifically includes: the character content of the character image to be processed acquired by the character recognition module and the corresponding marked character content are used for calculating the CTC loss, so that the character content recognized by the character recognition module is more correct.

The gradient loss function obtaining module calculates a gradient loss function, and specifically includes: calculating a gradient map of the high-resolution character image corresponding to the character image to be processed through a Sobel operator to obtain a target gradient map; calculating L by the target gradient map and the reconstructed super-resolution gradient map₁And (4) losing, so that the reconstructed super-resolution gradient map has the pixel value of the target gradient map.

the image loss function acquisition module reversely propagates the calculated image loss function to the feature extraction module through an image training gradient; the high-level feature layer extracted by the feature extraction module contains rich image information, so that the super-resolution character image reconstructed by the super-resolution reconstruction module is more vivid;

the gradient loss function acquisition module reversely transmits the calculated gradient loss function to the feature extraction module through a gradient training gradient, so that the high-level feature layer extracted by the feature extraction module contains rich gradient information, thereby helping the super-resolution character image and the character edge reconstructed by the super-resolution image reconstruction module to be clearer and improving the definition of the reconstructed super-resolution character image;

the loss function fusion module reversely propagates the fusion loss function to the image loss function acquisition module, the character loss function acquisition module and the gradient loss function acquisition module.

In summary, the multitask character image super-resolution reconstruction system and the multitask character image super-resolution reconstruction method provided by the invention reconstruct the character image with reduced resolution into the super-resolution character image, solve the problems that when the existing image super-resolution reconstruction method based on the deep neural network is applied to character image reconstruction, the reconstructed super-resolution character image has fuzzy character edges and low character content reliability, and provide clear and credible images for high-level tasks such as semantic analysis of the character image.

It should be noted that the present application may be implemented in software and/or a combination of software and hardware; for example, it may be implemented using Application Specific Integrated Circuits (ASICs), general purpose computers, or any other similar hardware devices. In some embodiments, the software programs of the present application may be executed by a processor to implement the above steps or functions. As such, the software programs (including associated data structures) of the present application can be stored in a computer-readable recording medium; such as RAM memory, magnetic or optical drives or diskettes, and the like. In addition, some steps or functions of the present application may be implemented using hardware; for example, as circuitry that cooperates with the processor to perform various steps or functions.

The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.

The description and applications of the invention herein are illustrative and are not intended to limit the scope of the invention to the embodiments described above. Effects or advantages referred to in the embodiments may not be reflected in the embodiments due to interference of various factors, and the description of the effects or advantages is not intended to limit the embodiments. Variations and modifications of the embodiments disclosed herein are possible, and alternative and equivalent various components of the embodiments will be apparent to those skilled in the art. It will be clear to those skilled in the art that the present invention may be embodied in other forms, structures, arrangements, proportions, and with other components, materials, and parts, without departing from the spirit or essential characteristics thereof. Other variations and modifications of the embodiments disclosed herein may be made without departing from the scope and spirit of the invention.

Claims

1. A text image super-resolution reconstruction system, the system comprising:

2. The text image super-resolution reconstruction system according to claim 1, wherein:

the system further comprises:

3. The text image super-resolution reconstruction system according to claim 1, wherein:

the character extraction module is used for acquiring an advanced feature layer of the character image to be processed, and the advanced feature layer comprises deep feature information of the character image to be processed;

4. The text image super-resolution reconstruction system according to claim 2, wherein:

the image loss function acquisition module is used for reversely transmitting the calculated image loss function to the feature extraction module through an image training gradient; the high-level feature layer extracted by the feature extraction module contains rich image information, so that the super-resolution character image reconstructed by the super-resolution reconstruction module is more vivid;

5. The text image super-resolution reconstruction system according to claim 2 or 4, wherein:

the image loss function obtaining module is used for calculating an image loss function, and specifically includes: calculating L the reconstructed super-resolution character image and the high-resolution character image corresponding to the character image to be processed₁Loss, so that the reconstructed super-resolution character image has the pixel value of the corresponding high-resolution character image;

the gradient loss functionThe number obtaining module is used for calculating a gradient loss function, and specifically comprises: calculating a gradient map of the high-resolution character image corresponding to the character image to be processed through a Sobel operator to obtain a target gradient map; calculating L by the target gradient map and the reconstructed super-resolution gradient map₁Loss, so that the reconstructed super-resolution gradient map has the pixel value of the target gradient map;

6. A character image super-resolution reconstruction method is characterized by comprising the following steps:

7. The super-resolution reconstruction method for text images according to claim 6, wherein:

the method further comprises:

8. The super-resolution reconstruction method for text images according to claim 6, wherein:

the character extraction module acquires an advanced feature layer of the character image to be processed, wherein the advanced feature layer comprises deep feature information of the character image to be processed;

9. The super-resolution reconstruction method for text images according to claim 6, wherein:

10. The super-resolution reconstruction method for text images according to claim 9, wherein:

the image loss function obtaining module calculates an image loss function, and specifically includes: calculating L the reconstructed super-resolution character image and the high-resolution character image corresponding to the character image to be processed₁Loss, so that the reconstructed super-resolution character image has the pixel value of the corresponding high-resolution character image;

the gradient loss function obtaining module calculates a gradient loss function, and specifically includes: calculating a gradient map of the high-resolution character image corresponding to the character image to be processed through a Sobel operator to obtain a target gradient map; calculating L by the target gradient map and the reconstructed super-resolution gradient map₁Loss, so that the reconstructed super-resolution gradient map has the pixel value of the target gradient map;