CN116757966A

CN116757966A - Image enhancement method and system based on multi-level curvature supervision

Info

Publication number: CN116757966A
Application number: CN202311036758.6A
Authority: CN
Inventors: 于亮亮; 李成华; 张阳
Original assignee: Zhongke Fangcun Zhiwei Nanjing Technology Co ltd
Current assignee: Zhongke Fangcun Zhiwei Nanjing Technology Co ltd
Priority date: 2023-08-17
Filing date: 2023-08-17
Publication date: 2023-09-15

Abstract

The invention provides an image enhancement method and system based on multi-level curvature supervision, belongs to the technical field of image data processing, and aims at the problems in the image denoising process in the prior art, denoising processing is carried out on image data by constructing a double-layer denoising network model, wherein a first layer in the double-layer denoising network model provides noise level estimation for a second layer so as to remove prior noise distribution. In addition, the second layer in the dual-layer denoising network model adopts an encoder-decoder architecture, and curvature is introduced into the encoder-decoder architecture and the curvature supervision module so as to obtain an efficient and flexible convolutional neural network. Aiming at different blind noise reconstruction models, the invention can obviously improve the image texture and the denoising strength in the directions of image denoising, deblurring, single image superdivision and the like.

Description

Image enhancement method and system based on multi-level curvature supervision

Technical Field

The invention relates to the technical field of image data processing, in particular to an image enhancement method and system based on multi-level curvature supervision.

Background

With the rapid development of 5G technology, the requirements of multiple fields such as medicine, weather, monitoring and the like on image quality are higher and higher, and the acquired images are unclean due to the fact that the actual imaging is subjected to external environment, object movement and the like, especially in the process of spreading through compression processing and the like, the pixel characteristics of the images are further lost, so that the method has important practical significance for researching an image denoising model.

In early research of an image denoising technology, the problem of additive noise is solved by constructing a maximum probability estimation model, but the method cannot establish an effective mapping relation under the condition of facing real complex noise, and has strong limitation in solving the problem of blind noise.

In recent years, with the application of deep learning in image processing, an image denoising model based on a convolutional neural network obtains a qualitative leap, but a large progress space still exists in the aspects of feature extraction and module optimization.

Disclosure of Invention

The invention aims to: an image enhancement method and system based on multi-level curvature supervision are provided to solve the above problems in the prior art.

The technical scheme is as follows: in a first aspect, an image enhancement method based on multi-level curvature supervision is provided, the method comprising the steps of:

step 1, constructing a double-layer denoising network model based on a neural network; the first layer of the double-layer denoising network model is used for executing mapping learning, and the second layer is used for executing image denoising processing;

step 2, the double-layer denoising network model receives original image data to be processed, executes convolution operation, and divides a convolution result into four branches to synchronously execute feature extraction operation;

step 3, fusing the feature extraction results of the four branches to obtain fusion features;

step 4, performing noise distribution and noise level learning mapping on the fusion characteristics based on a convolution layer to obtain noise level estimation and noise distribution estimation;

step 5, fusing the original image data with the noise distribution estimation, and calculating the corresponding curvature;

step 6, taking the curvature, noise level estimation and original image data as input data of a second layer of the double-layer denoising network model;

step 7, analyzing and processing the received data by adopting a four-layer neural network at the second layer of the double-layer denoising network model; the four layers of neural networks are embedded with CSAM models in each layer of neural networks through downsampling operation;

and 8, outputting an analysis processing result of the second layer of the double-layer denoising network model to obtain denoised image data.

In some implementations of the first aspect, before the dual-layer denoising network model performs image denoising, further includes: and evaluating the double-layer denoising network model by constructing an evaluation loss function, and optimizing parameters of the double-layer denoising network model according to an evaluation result to obtain the double-layer denoising network model with optimized final performance.

Wherein, the expression corresponding to the estimated loss function is:

in the method, in the process of the invention,representing noise level loss;representing noise distribution loss;representing an estimated loss function;representing the weight ratio of the adjusted noise distribution loss and the noise level loss.

The first layer of the dual-layer denoising network model sequentially comprises: conv convolution layer, four parallel block layers, convolution kernel and Conv layer composed of activation function. In the process of executing data processing, the Conv convolution layer executes convolution operation on the received original image data to be processed, divides a convolution result into four branches, synchronously inputs the four parallel layers of block layers to execute multi-scale feature extraction, and finally inputs the fused processing result into the Conv layer to execute learning mapping of noise distribution and noise level to obtain noise level estimation and noise distribution estimation.

In some implementations of the first aspect, the process of obtaining the curvature includes the steps of:

step 5.1, fusing the original image data with the noise distribution estimation to obtain fused image data;

step 5.2, selecting a local area with a preset size for each pixel in the fused image data to calculate the local geometric information of the current pixel;

step 5.3, calculating gradient information of pixel values in a selected pixel neighborhood window by using a Hessian matrix;

and 5.4, converting the calculated curvature value into a attention weight section of the pixel through a mapping function.

A process for performing data processing by a CSAM model, comprising the steps of:

step 7.1, performing 1×1 convolution on a curvature map obtained according to curvature;

step 7.2, performing sigmoid activation on the convolution result in the step 7.1;

step 7.3, fusing the activation result in the step 7.2 with the fusion characteristic;

step 7.4, generating curvature supervision attention map through convolution of the fusion result in the step 7.3 with the convolution kernel size of 1 multiplied by 1;

step 7.5, performing feature extraction on the original image data, and calculating attention weights of pixels of the original image data based on the curvature map;

and 7.6, judging whether the pixels in the current original image data belong to noise or effective information according to the curvature information of the pixels, and distributing corresponding weights.

In some implementations of the first aspect, the second layer of the two-layer denoising network model uses an encoder-decoder une convolutional neural network as a network architecture, including a ConvNeXt-Block structure. In the process of executing data processing by the ConvNeXt-Block structure, a data processing method combining a reverse bottleneck structure and deep convolution is adopted.

The ConvNeXt-Block structure specifically comprises: layerNorm normalization module, 7×7 large depth convolutional layer, 1×1 convolutional layer, simpleGate module. The process of executing the data processing comprises the following steps:

after passing through a layerNorm normalization module, a 7×7 large-depth convolution layer is utilized to enlarge a receptive field for a received image;

the characteristic dimension is increased by using a 1 multiplied by 1 convolution layer aiming at the data after the receptive field is enlarged, and the dimension is reduced by using the 1 multiplied by 1 convolution layer again after the data passes through a simpleGate module;

carrying out fusion processing on the result processed by the layerNorm normalization module and the image with reduced dimensionality, and sequentially inputting the fusion processing result into the layerNorm normalization module, the 1 multiplied by 1 convolution layer, the simpleGate module and the 1 multiplied by 1 convolution layer; and finally, fusing the fusion processing result with the processing result obtained at present to obtain the processing result of the ConvNeXt-Block structure, and outputting the processing result.

In a second aspect, an image enhancement system based on multi-level curvature supervision is provided, for implementing an image enhancement method, the system comprising the following modules:

the model construction module is used for constructing a double-layer denoising network model based on the neural network;

the feature extraction module is used for receiving the original image data and extracting features through parallel branch processing;

fusing the branch processing results to obtain a feature fusion module of fusion features;

a noise estimation module for performing noise level estimation and noise distribution estimation according to the obtained fusion characteristics;

a curvature calculation module for calculating a corresponding curvature according to the obtained noise distribution estimation and the original image data;

an image denoising module for performing image denoising processing by adopting an encoder-decoder structure according to curvature, noise level estimation and original image data;

and the image output module is used for outputting the denoised image data.

In a third aspect, an image enhancement device based on multi-level curvature supervision is presented, the device comprising: a processor and a memory storing computer program instructions, wherein the processor reads and executes the computer program instructions to implement an image enhancement method based on multi-level curvature supervision.

In a fourth aspect, a computer readable storage medium having computer program instructions stored thereon, which when executed by a processor, implement an image enhancement method based on multi-level curvature supervision is presented.

The beneficial effects are that: the invention provides an image enhancement method and system based on multi-level curvature supervision, which execute denoising processing on image data by constructing a double-layer denoising network model, wherein a first layer in the double-layer denoising network model provides noise level estimation for a second layer so as to perform priori noise distribution removal. In addition, the second layer in the dual-layer denoising network model adopts an encoder-decoder architecture, and curvature is introduced into the encoder-decoder architecture and the curvature supervision module so as to obtain an efficient and flexible convolutional neural network. Aiming at different blind noise reconstruction models, the invention can obviously improve the image texture and the denoising strength in the directions of image denoising, deblurring, single image superdivision and the like.

Drawings

FIG. 1 is a flow chart of data processing according to the present invention.

FIG. 2 is a Block diagram of the improved ConvNeXt-Block architecture of the present invention.

Fig. 3 is a block diagram of a CSAM curvature monitor attention module.

Detailed Description

In the following description, numerous specific details are set forth in order to provide a more thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the invention may be practiced without one or more of these details. In other instances, well-known features have not been described in detail in order to avoid obscuring the invention.

The existing stage has a certain theoretical and practical basis for the known noise model, and a better prediction model exists; on the other hand, when the unknown noise is faced, the denoising effect is greatly degraded, and based on the fact that the method disclosed by the invention is used for solving the problem by using a two-time end-to-end algorithm model. Firstly, estimating the noise distribution condition, taking the noise distribution condition as the input of a second-layer network, and further converting model training of unknown noise into Gaussian noise removal of known distribution condition.

Example 1

In one embodiment, aiming at the defects existing in the image denoising process in the prior art, an image enhancement method based on multi-level curvature supervision is provided, and denoising processing is carried out on image data by constructing a curvature supervision model, so that the processed image data meets the requirement of balancing denoising performance and texture detail.

Specifically, the image enhancement method based on multi-level curvature supervision is shown in fig. 1, and comprises the following steps:

step 1, constructing a double-layer denoising network model based on a neural network, performing mapping learning on a first layer, and performing image denoising processing on a second layer;

specifically, a first layer of the double-layer denoising network model acquires noise distribution estimation and noise level estimation according to a mapping relation; and a second layer of the double-layer denoising network model receives the data output by the first layer and adopts an encoder-decoder architecture to perform denoising processing.

Step 2, the double-layer denoising network model receives the original image data to be processed, executes convolution operation, and divides the convolution result into four branches to synchronously execute feature extraction operation;

specifically, the first layer of the dual-layer denoising network model sequentially comprises: the convolution kernel is a Conv convolution layer with a size of 3*3, a parallel four-layer block layer, and a Conv layer formed by the convolution kernel and an activation function. The four parallel block layers adopt convolution kernels with different sizes, and the sizes of the convolution kernels are 1*1, 3*3, 5*5 and 7*7 from top to bottom in sequence and are used for multi-scale feature extraction. In order to keep the feature sizes consistent, different channel numbers are set for convolution kernels with different sizes, the channel numbers are 64, 32, 16 and 8 from top to bottom in sequence, and after feature extraction is respectively executed, feature fusion of branches is carried out.

specifically, the curvature is calculated by measuring the curve or the curved surface bending degree to evaluate the local geometric characteristics of the pixels on the feature map, so that in order to improve the processing capability of the double-layer denoising network model on the noise image edge and the texture, the embodiment fuses the original image data and the noise distribution estimation.

First, for each pixel, selecting a local area with a fixed size to calculate local geometric information of the pixel; then, in the selected pixel neighborhood window, gradient information of pixel values is calculated by using a Hessian matrix, and curvature values are obtained; and finally, converting the calculated curvature value into an attention weight section of the pixel through a mapping function, limiting the curvature value in a preset section, and avoiding that the calculated curvature value section exceeds a preset range.

specifically, according to the noise distribution estimation obtained by the first layer of the double-layer denoising network model, the second layer of the double-layer denoising network model only needs to denoise the original image data according to the known noise distribution, and the output of the first layer is used as the input of the second layer, namely curvature, noise level and noise image are input into the second layer.

specifically, a 4-layer neural network is selected as a second layer of the double-layer denoising network model, and a CSAM (curvature supervised attention module) denoising model is added into each layer of network through a downsampling module, so that detail textures are reinforced while denoising is realized.

Specifically, the CSAM module mainly fuses the curvature map and the attention mechanism to enhance the weight of the effective features of the image, firstly, carries out 1×1 convolution on the curvature map, then carries out sigmoid activation, finally fuses the curvature map and the feature map obtained in the previous step, and generates curvature supervision attention map through convolution with the convolution kernel size of 1×1.

The CSAM attention mechanism calculates attention weight of image pixels based on a curvature graph after feature extraction, judges whether the pixels belong to noise or effective information according to curvature information of the pixels, and distributes corresponding weight, so that noise in an image can be processed pertinently for a subsequent denoising model, and the quality of a denoising image is improved.

In a further embodiment, the first layer in the two-layer denoising network model provides noise level estimation to the second layer for prior noise distribution removal, and the corresponding expression is:

；

in the method, in the process of the invention,representing a noise image;representing a clean image;representing noise level;representing a standard deviation radius ofIs a gaussian noise of (c).

Alternatively, the classical maximum a posteriori framework (MAP) may be determined by maximizing the posterior distribution functionTo estimateAnd since the maximum posterior distribution function is equal to the minimum log likelihood function, an estimate is obtainedThe calculation formula of (2), namely:

；

in the method, in the process of the invention,representation independent of noisy imagesIs a de-noised image of (2)Is a priori of (2); in addition, in the case of the optical fiber,wherein, the liquid crystal display device comprises a liquid crystal display device,the regularization term is represented as a function of the regularization term,is a preset positive integer, thus, estimatesIs given by a priori expression:

；

and providing noise level estimation to the second layer through the first layer in the double-layer denoising network model, and obtaining an priori expression to execute prior noise distribution removal.

Example two

In a further embodiment based on the first embodiment, in order to improve the denoising performance of the dual-layer denoising network model, an evaluation loss function is constructed to evaluate the dual-layer denoising network model, and parameter optimization of the dual-layer denoising network model is performed according to an evaluation result, so that the dual-layer denoising network model with optimized final performance is obtained.

Wherein, the expression corresponding to the estimated loss function is:

；

In the preferred embodiment, noise distribution lossAdopting a cross entropy loss function; noise level lossAn absolute value error loss function is employed.

In a further embodiment, the synthetic noise image set and the natural noise image set are employed to construct training data and test data for performance optimization of a two-layer denoising network model.

In a further embodiment, the performance of the optimized dual-layer denoising network model is evaluated by using a PSNR evaluation index result.

Example III

In one embodiment, the second layer of the dual-layer denoising network model takes an encoder-decoder Unet convolutional neural network as a network architecture, and comprises a ConvNeXt-Block structure; the structure of ConvNeXt-Block is shown in FIG. 2, the denoising sub-network in the double-layer denoising network takes an encoder-decoder Unet convolutional neural network as a network architecture, and a ConvNeXt module is integrated into UNet so as to improve the capability of feature extraction. The advantages of a reverse bottleneck structure and deep convolution are adopted, the network width is expanded, the condition of losing tiny features is avoided, and the diversity of feature information is enriched.

As shown in fig. 2, unlike the conventional bottleneck structure, the present invention uses the inverse bottleneck structure to expand the receptive field by 7×7 large-depth convolution, and then uses 1×1 convolution to make the feature dimension first and then make the feature dimension second, so as to enrich the feature information to ensure the expressive power of the model. Furthermore, a layerNorm module is used for normalizing the feature graphs of each dimension, so that the stability of feature data distribution is ensured; the convolution kernel size is 7 multiplied by 7, the large-depth convolution and the convolution kernel sizes of 2 are 1 multiplied by 1 are adopted to improve the feature extraction capability, and the input parameter quantity is greatly reduced under the condition of extremely little precision loss; in a preferred embodiment, the activation function selects a GELU, which increases the network computing speed while saving memory resources.

Example IV

In one embodiment, an image enhancement system is presented for implementing an image enhancement method based on multi-level curvature supervision, the system comprising the following modules: the system comprises a model construction module, a feature extraction module, a feature fusion module, a noise estimation module, a curvature calculation module, an image denoising module and an image output module.

The model building module is used for building a double-layer denoising network model based on a neural network; the feature extraction module is used for receiving the original image data and extracting features in a parallel branch processing mode; the feature fusion module is used for fusing the branch processing results to obtain fusion features; the noise estimation module performs noise level estimation and noise distribution estimation according to the obtained fusion characteristics; the curvature calculation module is used for fusing the obtained noise distribution estimation and the original image data to calculate the corresponding curvature; the image denoising module performs image denoising processing by adopting an encoder-decoder structure according to curvature, noise level estimation and original image data; the image output module is used for outputting the denoised image data.

In a further embodiment, a first layer of the dual layer denoising network model is used to perform map learning and a second layer is used to perform image denoising processing.

The first layer of the double-layer denoising network model sequentially comprises: the convolution kernel is a Conv convolution layer with a size of 3*3, a parallel four-layer block layer, and a Conv layer formed by the convolution kernel and an activation function. The four parallel block layers adopt convolution kernels with different sizes, and the sizes of the convolution kernels are 1*1, 3*3, 5*5 and 7*7 from top to bottom in sequence and are used for multi-scale feature extraction. In order to keep the feature sizes consistent, different channel numbers are set for convolution kernels with different sizes, the channel numbers are 64, 32, 16 and 8 from top to bottom in sequence, and after feature extraction is respectively executed, feature fusion of branches is carried out.

The second layer of the two-layer denoising network model adopts an encoder-decoder architecture to perform denoising processing.

In a further embodiment, in order to improve the denoising performance of the dual-layer denoising network model, the system further comprises a performance optimization module, wherein the performance optimization module evaluates the dual-layer denoising network model by constructing an evaluation loss function, and performs parameter optimization of the dual-layer denoising network model according to an evaluation result to obtain the dual-layer denoising network model with optimized final performance.

As described above, although the present invention has been shown and described with reference to certain preferred embodiments, it is not to be construed as limiting the invention itself. Various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. An image enhancement method based on multi-level curvature supervision is characterized by comprising the following steps:

2. The method of claim 1, wherein the dual layer denoising network model further comprises, before performing image denoising:

step 1.1, evaluating the double-layer denoising network model by constructing an evaluation loss function, and optimizing parameters of the double-layer denoising network model according to an evaluation result to obtain a double-layer denoising network model with optimized final performance;

the expression corresponding to the evaluation loss function is as follows:

；

in the method, in the process of the invention,representing noise level loss; />Representing noise distribution loss; />Representing an estimated loss function; />Representing the weight ratio of the adjusted noise distribution loss and the noise level loss.

3. The method of claim 1, wherein the first layer of the two-layer denoising network model comprises, in order: conv convolution layer, parallel four-layer block layer, convolution kernel and Conv layer formed by activation function;

and the Conv convolution layer executes convolution operation on the received original image data to be processed, divides the convolution result into four branches, synchronously inputs the four parallel block layers, executes multi-scale feature extraction, and finally inputs the fused processing result into the Conv layer, executes learning mapping of noise distribution and noise level, and obtains noise level estimation and noise distribution estimation.

4. The image enhancement method based on multi-level curvature supervision according to claim 1, wherein the process of acquiring the curvature comprises the steps of:

5. The image enhancement method based on multi-level curvature supervision according to claim 1, wherein the CSAM model performs a data processing procedure, comprising the steps of:

6. The image enhancement method based on multi-level curvature supervision according to claim 1, wherein the second layer of the dual-layer denoising network model uses an encoder-decoder une convolutional neural network as a network architecture, and comprises a ConvNeXt-Block structure;

in the process of executing data processing by the ConvNeXt-Block structure, a data processing method combining a reverse bottleneck structure and deep convolution is adopted.

7. The image enhancement method based on multi-level curvature supervision according to claim 6, wherein the convnex-Block structure specifically comprises: a layerNorm normalization module, a 7×7 large depth convolutional layer, a 1×1 convolutional layer, and a simpleGate module;

the process of executing the data processing comprises the following steps:

8. Image enhancement system based on multi-level curvature supervision for implementing an image enhancement method based on multi-level curvature supervision according to any one of the claims 1 to 7, comprising the following modules:

the model construction module is used for constructing a double-layer denoising network model based on a neural network;

the feature extraction module is arranged to receive the original image data and extract features in a parallel branch processing mode;

the feature fusion module is arranged for fusing the branch processing results to obtain fusion features;

a noise estimation module configured to perform noise level estimation and noise distribution estimation based on the obtained fusion features;

the curvature calculation module is used for carrying out fusion according to the obtained noise distribution estimation and the original image data and calculating the corresponding curvature;

an image denoising module configured to perform image denoising processing using an encoder-decoder structure according to curvature, noise level estimation, original image data;

and an image output module configured to output the denoised image data.

9. An image enhancement device based on multi-level curvature supervision, the device comprising:

a processor and a memory storing computer program instructions;

the processor reads and executes the computer program instructions to implement the multi-level curvature supervision based image enhancement method as claimed in any one of the claims 1 to 7.

10. A computer readable storage medium, having stored thereon computer program instructions which, when executed by a processor, implement the multi-level curvature supervision based image enhancement method according to any one of the claims 1 to 7.