CN111429546A

CN111429546A - Large-size image compressed sensing reconstruction method based on neural network

Info

Publication number: CN111429546A
Application number: CN202010298406.8A
Authority: CN
Inventors: 胡雪梅; 王鹏
Original assignee: Nanjing University
Current assignee: Nanjing University
Priority date: 2020-04-16
Filing date: 2020-04-16
Publication date: 2020-07-17

Abstract

The invention discloses a neural network-based large-size image compressed sensing reconstruction method. The method comprises the following steps: (1) carrying out block-dividing compression sampling on an original image through a measurement matrix to manufacture a training data set; (2) restoring the data from low dimensionality to high dimensionality by the compressed and sampled data through a full connection layer of a neural network; (3) splicing each block into an image according to the sequence of each block in the original image; (4) transferring the re-spliced image to an intensive connection convolution network consisting of convolution layers, and further reconstructing and recovering the image; (5) calculating the error between the reconstructed image and the original image by a loss function, and updating the parameters of the network through error reverse transfer; (6) and (5) repeating the steps (2) to (5) until the error between the reconstructed image and the original image is smaller, and finishing the training of the neural network. The method can complete the compression and reconstruction of larger images, and the accuracy of the reconstructed images and the suppression of image noise can both obtain better effects.

Description

Large-size image compressed sensing reconstruction method based on neural network

Technical Field

The invention relates to the fields of computational photography and signal processing, in particular to a large-size image compressed sensing reconstruction method based on a neural network.

Background

Compressive sensing is a technique for finding sparse solutions to underdetermined linear systems, applied in electronic engineering, especially in signal processing, for acquiring and reconstructing sparse or compressible signals. In recent years, in order to adapt to the fifth generation mobile communication system which is about to be popularized, a compression sensing technology is also applied to a wireless communication system in a large amount, and a great deal of attention and research are paid.

The compressed sensing problem is a challenging problem in the field of signal processing, and since a neural network can sufficiently learn prior information in an image, in recent years, reconstructing a compressed sensing signal through the neural network becomes a popular method. When the compressed sensing signal is reconstructed through the neural network comprising the full connection layer, because the compressed signal with larger dimension number needs a large amount of parameters when being input into the network, the phenomenon of gradient explosion or gradient disappearance easily occurs to the large amount of parameters, and the reconstruction cannot be completed. Therefore, when the compressed sensing signal is reconstructed by the traditional neural network, the dimensionality of the signal input into the network is small, the number of parameters is not excessive, and the reconstruction of the signal is smoothly completed. In the method, because the signal with smaller dimension can be processed, when the method is applied to the image field, the size of the processed image is smaller, and the reconstructed smaller image can be obviously blocked after being spliced into the original large image, so that the reconstruction quality is reduced. The method of firstly blocking, reconstructing and then splicing based on neural network reconstruction compressed sensing has poor reconstruction effect and has great limitation in practical application.

Aiming at the problems, a novel reconstruction means is needed, so that not only is the influence of excessive network parameters on neural network training avoided, but also the blocking phenomenon generated by splicing small-size images into large-size original images after reconstruction is eliminated, and a better reconstruction effect is finally obtained.

Disclosure of Invention

Aiming at the defects existing in the conventional neural network reconstruction method, the invention aims to provide a large-size image compressed sensing reconstruction method based on a neural network.

In order to achieve the purpose, the technical scheme adopted by the invention is as follows:

a large-size image compressed sensing reconstruction method based on a neural network comprises the following steps:

step 1, dividing an original image x into a series of square images x with the same size₁,x₂,…,x_iRespectively carrying out compression sampling processing on the segmented images by using a standard normally distributed measurement matrix to obtain data y after compression sampling₁,y₂,…,y_iAs training data, training data y is created₁,y₂,…,y_iTraining sets corresponding to the original images x one by one;

step 2, inputting the training data into a neural network, and training data y₁,y₂,…,y_iRespectively through the first layer of the neural network: linear full connected layer, such that training data y₁,y₂,…,y_iRecovery from Low dimensionality to Pre-sampling high dimensionality data x'₁,x′₂,…,x′_i；

Step 3, the high-dimensional data x 'obtained in the step 2'₁,x′₂,…,x′_iBlock image x according to its correspondence₁,x₂,…,x_iSplicing the positions in the original image x into an original image in sequence to obtain a primarily reconstructed image x';

step 4, inputting the image x' preliminarily reconstructed in the step 3 into a dense connection convolution network consisting of convolution layers, and further reconstructing and restoring the image to obtain the output of the network, namely a final reconstructed image;

step 5, calculating the error between the reconstructed image obtained in the step 4 and the original real image through a loss function, reversely transmitting the error, and updating the parameters of the neural network;

and 6, repeating the steps 2-5 until the error of the loss function calculation reaches an expected value, and finishing the training of the neural network.

Further, theIn step 1, the processing mode of compressed sampling after original image segmentation is as follows: y is_i＝Φx_iWherein x_i∈ (N × 1) is data obtained by rearranging elements after dividing an original image x with the size of N × N into a small-size histogram, i is 1,2,3 and …, and a measurement matrix is phi ∈ (m × N, m < N), the elements of the measurement matrix are random numbers in standard normal distribution, N is the dimension of the original image data, and m is the dimension of the compressed and sampled image data.

Further, in the step 2, the basic network form of the linear full connection is as follows: the node number of the input layer is the value of the dimension m of the image data after compression sampling, the node number of the output layer is the value of the dimension n of the original image data, and no activation function exists.

Further, in the step 3, the concrete process of splicing the block images into the original image according to the position sequence of the block images in the original image is as follows: step 31, firstly, the data x 'obtained in the step 2'₁,x′₂,…,x′_iElement arrangement transformation of

Step 32, according to (x)₁,x′₁),(x₂,x′₂),…,(x_i,x′_i) And (3) rearranging the results obtained in the step (2) by the one-to-one correspondence of the positions.

Further, in the step 4, the dense connection convolution network is composed of 4 same dense connection modules, each module outputs 64 channels as the input of the next dense connection module, the rear end of the dense connection convolution network is connected with a convolution layer with a convolution kernel size of 3 × 3, the input of the convolution layer is 64 channels, the output of the convolution layer is 1 channel, and the reconstructed image is obtained.

On the basis of solving the compressed sensing problem based on the neural network, the invention provides a novel solution in combination with the problem that training is difficult to complete due to the fact that too many parameters of the full connection layer in the neural network cause gradient explosion or gradient dispersion, so that the use of the parameters of the full connection layer is reduced, and the blocking phenomenon caused by direct reconstruction of images with smaller sizes is eliminated. Meanwhile, the invention adopts the intensive connection convolution network for training, improves the information transmission between networks, can effectively inhibit the noise of the reconstructed image and obtains a clearer image. Compared with the prior art, the method can obtain better image accuracy and lower image noise interference in the compression reconstruction of larger images, and can effectively deal with the challenge brought by the compression reconstruction of larger-size images in practical application.

Drawings

FIG. 1 is a schematic diagram of a neural network according to the present invention;

FIG. 2 is a flow chart of the method of the present invention.

Detailed Description

The invention will now be described in detail with reference to the drawings and specific embodiments, which are illustrative and are not to be construed as limiting the invention.

Referring to fig. 1 and fig. 2, the method for reconstructing a compressed sensing of a large-size image based on a neural network according to the present embodiment includes the following specific steps:

step 1, dividing an original large-size image x ∈ (N × N) to be compressed and sampled into a series of square graphs with the same size and relatively small size

The elements are rearranged to N × 1, and the segmented small-size images are respectively compressed and sampled by using a measuring matrix phi ∈ (m × N, m < N) of standard normal distribution, wherein N is the dimensionality of signal data before compression sampling, and m is the dimensionality of signal data after compression sampling, in the embodiment, the original image size N is 128, the segmented image size N is 16, and the dimensionality m of the signal data after compression sampling is 4, namely:

y_i＝Φx_i,i＝1,2,…(Φ∈4×16；x_i∈16×1)

obtaining the data y after compression sampling₁,y₂… training data y is created as training data₁,y₂… are training sets corresponding to the original images x one to one.

Step 2, inputting training data into the neural network, and compressing the sampled data y by blocks of the original image₁,y₂… through the first layer of the neural network: the structure of the linear full-connection layer is that the node number of the input layer is the dimension m of the signal data after compression sampling, the node number of the output layer is the dimension n of the original signal data, and no activation function exists. The original image is divided into blocks to compress the sampled data y₁,y₂… (4 × 1) reverts from low dimension to high dimension x 'before sampling'₁,x′₂… (16 × 1). the compressed reconstruction is carried out after the large-size image is divided into small blocks, namely, the use of parameters of a full connecting layer is reduced, gradient explosion or diffusion in the training process is avoided, and the reconstruction of the compressed image is also realized.

Step 3, the high-dimensional data x 'obtained in the step 2'₁,x′₂… Block image x according to its correspondence₁,x₂…, the original image x is sequentially stitched into the original image x to obtain the preliminary reconstructed image x', which includes the following steps:

step 31 x 'obtained in step 2'₁,x′₂… (16 × 1) element permutation is transformed to 4 × 4;

step 32 is according to (x)₁,x′₁),(x₂,x′₂) …, rearranging the results obtained in the step 2 by the one-to-one correspondence relationship of the positions, and completing the preliminary reconstruction of the large-size image compressed sensing.

Step 4, inputting the image x' preliminarily reconstructed in the step 3 into a dense connection convolution network consisting of convolution layers, and further reconstructing and recovering the image to obtain the output of the network, namely the final reconstructed image x_outputIn the present example, the dense connected convolution network is specifically structured as follows, the input of the first layer convolution of the dense connected convolution network composed of convolution layers is the image x' which is preliminarily reconstructed, the number of convolution kernels is 16, 16 feature maps are output, and batch normalization is performed, and then four groups of dense connected convolution modules are formed, wherein the dense connected convolution modules are structured as (Conv1+ BN + Re L U) - (Conv2+ BN + Re L U) - (Conv3+ BN + Re L U) - (Conv4+ BN + Re L U), and the dense connected convolution modules are all structured as followsThe number of convolution kernels connecting each layer of convolution is 16, 16 feature maps are output, and the output feature maps are input to each convolution layer behind the feature maps; after the intensive connection convolution module, a layer of convolution network is passed through, the number of convolution kernels is 1, 1 characteristic diagram is output, namely the reconstructed recovery image x_outputIn the example, the sizes of convolution kernels adopted by the convolution layers are all 3 × 3, and each layer of convolution is subjected to padding operation, so that the sizes of the images before and after convolution are not changed, and the sizes of the feature maps in the intensive connection module when the output feature maps are directly connected are the same.

And 5, calculating the error between the reconstructed image obtained in the step 4 and the original real image through a loss function, reversely transferring the error, updating a weight parameter w and a bias parameter bias of the neural network, wherein the adopted loss function is an MSE loss function, and the calculation mode is as follows:

MSE＝(x_output-x)²

wherein x_outputFor reconstructing the image, x is the original real image, and the optimizer for error reverse transfer updating the network parameters is Adam.

And 6, after the parameters of the neural network are updated, repeating the steps 2-5 until the error of the loss function calculation reaches an expected value, and finishing the training of the neural network. In this example, the specific network hyper-parameters used are as follows: number of iterations (epoch): 100, respectively; learning rate (learning rate): 0.0002; number of samples of one training (batch size): 32, a first step of removing the first layer; the image training set used was: INRIA Holidays dataset.

Claims

1. A large-size image compressed sensing reconstruction method based on a neural network is characterized by comprising the following steps:

step 1, dividing an original image x into a series of square images x with the same size₁，x₂，...，x_iRespectively carrying out compression sampling processing on the segmented images by using a standard normally distributed measurement matrix to obtain data y after compression sampling₁，y₂，...，y_iAs training data, training data y is created₁，y₂，...，y_iTraining sets corresponding to the original images x one by one;

step 2, inputting the training data into a neural network, and training data y₁，y₂，...，y_iRespectively through the first layer of the neural network: linear full connected layer, such that training data y₁，y₂，...，y_iRecovery from Low dimensionality to Pre-sampling high dimensionality data x'₁，x′₂，...，x′_i；

Step 3, the high-dimensional data x 'obtained in the step 2'₁，x′₂，...，x′_iBlock image x according to its correspondence₁，x₂，...，x_iSplicing the positions in the original image x into an original image in sequence to obtain a primarily reconstructed image x';

2. The method for compressed sensing reconstruction of large-size image based on neural network as claimed in claim 1, wherein in step 1, the processing mode of compressed sampling after segmentation of original image is:

y_i＝Φx_i

wherein x_i∈ (N ×) is data of rearranged elements after an original image x with the size of N × N is cut into a small-size histogram, i is 1,2, 3.

3. The method for compressed sensing reconstruction of large size image based on neural network as claimed in claim 2, wherein in said step 2, the basic network form of linear full connection is: the node number of the input layer is the value of the dimension m of the image data after compression sampling, the node number of the output layer is the value of the dimension n of the original image data, and no activation function exists.

4. The method according to claim 3, wherein the step 3 of splicing the segmented images into the original image according to the position sequence of the segmented images in the original image comprises the following specific steps:

step 31, firstly, the data x 'obtained in the step 2'₁，x′₂，...，x′_iElement arrangement transformation of

Step 32, according to (x)₁，x′₁)，(x₂，x′₂)，...，(x_i，x′_i) And (3) rearranging the results obtained in the step (2) by the one-to-one correspondence of the positions.

5. The method for compressed sensing reconstruction of a large-size image based on a neural network as claimed in claim 4, wherein in the step 4, the densely connected convolutional network is composed of 4 identical densely connected modules, each module outputs 64 channels as input of the next densely connected module, and the back end of the densely connected convolutional network is connected with a convolutional layer with a convolutional kernel size of 3 × 3, wherein the convolutional layer has 64 input channels and 1 output channel, and is the reconstructed image.