Detailed Description
Based on the general framework diagram of fig. 1, for a given remote sensing image, the thinning can be performed in two stages. The first stage is used for thinning the low frequency of the remote sensing image, and the second stage is used for thinning the detail image. Two stage embodiments are given below.
The first embodiment is as follows: the remote sensing image sparse estimation method based on the hybrid transformation, which is described in the embodiment, is realized according to the following steps:
step one, carrying out sparse estimation on the low frequency of a remote sensing image:
performing tensor product wavelet transformation on an original image; step one (two), performing multi-phase decomposition on each sub-band by adopting a p-fold decimation filter; step one (three), carrying out principal component transformation on the decomposed components; step one (four), reserving N for the transformed image1The larger transform coefficients, the other coefficients set to 0; obtaining a sparse estimation result, and carrying out inverse transformation on the result;
step two, carrying out sparse estimation on the high frequency of the remote sensing image:
subtracting the original image from the low-frequency image obtained in the first step to obtain a high-frequency image containing most textures and edges; step two, transforming the high-frequency image by using Tetrolet transformation, and further performing energy aggregation on the transformed high-frequency sub-band by using a self-adaptive decomposition method; step two (three), reserving N for the transformed image2The larger transform coefficients, the other coefficients set to 0; step two (four), according to the process of the self-adaptive decomposition, carry on the corresponding inverse transformation to the high-frequency subband that has carried on the self-adaptive decomposition, and then carry on the inverse transformation to the transformation image of the whole high-frequency image, receive the result of sparse estimation;
and step three, overlapping the results of the sparse estimation in the step one and the step two to obtain a final estimation image.
The second embodiment is as follows: the first difference between the present embodiment and the specific embodiment is: the sparse process of the low-frequency part of the remote sensing image is as follows: let the original image be
The specific implementation process of the step one (I) is as follows: let A denote two-dimensional tensor product wavelet transform and X denote XA as the result of X after tensor product wavelet transformTThen the transformed image is represented as:
XAT:=[(XAT)(1),(XAT)(2),…,(XAT)(N)]
wherein, "=" means "defined as", N means the number of wavelet sub-bands;
the specific implementation process of the step one (two) is as follows: for each sub-band (XA)
T)
(i)Generating a transform component using a p-fold filter, wherein i =1, …, N; the process is represented as:
symbol'
"represents the variables before the symbol after being processed, and can be represented as the variables after the symbol; permu represents the decomposition of sub-band and the recombination process of wavelet coefficient, firstly adopting p-fold filter to extract the coefficient for given wavelet sub-band, then recombining the extracted coefficient into transformation component according to the corresponding position;
the specific implementation process of the step one (third) is as follows: and performing principal component transformation on the transformation component generated by each wavelet sub-band: let B(i)A transform matrix representing the ith sub-band, then
if the transformed image is Y, then
Y:=[B(1)permu(XAT)(1),B(2)permu(XAT)(2),…,B(N)permu(XAT)(N)]
The specific implementation process of the step one (four) is that for all the coefficients in Y, the larger N is reserved
1One coefficient, and the other coefficient is set to 0, and the transformed image at this time can be expressed as
Then
Other steps are the same as in the first embodiment.
And performing wavelet transformation on the image. In order to eliminate the correlation of the coefficients in the sub-bands, a p-fold decimation filter is used to perform a polyphase decomposition on each sub-band, and the decomposed components are processedAnd performing principal component transformation. Thus, the image is equivalent to two times of decomposition, so that the energy is more concentrated and the sparsity is stronger. Finally, N is reserved for the transformation image with high energy concentration1And setting the other coefficients to be 0, namely performing sparse estimation. And according to the result of sparse estimation, performing inverse transformation on the process to obtain the approximation of the low-frequency image of the original image.
The third concrete implementation mode: the present embodiment differs from the first or second embodiment in that: after the step one (four) is finished, the step one (five) is executed: the process of estimating the low-frequency part of the remote sensing image is carried out according to the following steps:
step 1, applying principal component inverse transformation to each transformation component sequence
Where i =1,2, …, N, the process is represented as:
step 2, based on the process of the
step 1, recombining the inversely transformed component sequence, and marking as permu
-1Then inverse two-dimensional tensor product wavelet transform A is applied
-1Let the estimation result of the low frequency image be
Then
The other steps are the same as in the first or second embodiment.
The fourth concrete implementation mode: the difference between this embodiment mode and one of the first to third embodiment modes is: in the second step, the specific process of the sparsity of the high-frequency part of the remote sensing image is as follows:
the specific implementation process of the step two (one) comprises the following steps: from the original image X, and the estimation result of the low-frequency image
Generating a high frequency image, denoted X
DThe high frequency image is obtained according to the following formula:
the specific implementation process of the second step (II) is as follows: for high frequency chartImage
If J is the number of layers of Tetrolet transform decomposition, the number of the layers is the r-th layer, wherein r = 1. The Tetrolet decomposition is performed as follows:
(1) image processing method
Divided into 4
gamma 4 blocks Q
i,j,i=0,1,...,M
1/2
r+1-1,j=0,1,...,M
2/2
r+1-1; by Q
i,jRepresenting the divided blocks, wherein i, j represents that the block is in the ith row and the jth column in all the blocks;
(2) for each block, in 4 subsets of four-grid tiles
c =1,.. logue, 117, s =0,1,2,3, a tensor product wavelet transform is performed to obtain corresponding 4 low-frequency coefficients
And 12 Tetrolet coefficients
Wherein,
<math>
<mrow>
<msubsup>
<mi>x</mi>
<mi>D</mi>
<mrow>
<mi>r</mi>
<mo>,</mo>
<mrow>
<mo>(</mo>
<mi>c</mi>
<mo>)</mo>
</mrow>
</mrow>
</msubsup>
<mo>[</mo>
<mi>s</mi>
<mo>]</mo>
<mo>=</mo>
<munder>
<mi>Σ</mi>
<mrow>
<mrow>
<mo>(</mo>
<mi>m</mi>
<mo>,</mo>
<mi>n</mi>
<mo>)</mo>
</mrow>
<mo>∈</mo>
<msubsup>
<mi>I</mi>
<mi>s</mi>
<mi>C</mi>
</msubsup>
</mrow>
</munder>
<mi>ϵ</mi>
<mo>[</mo>
<mn>0</mn>
<mo>,</mo>
<mi>L</mi>
<mrow>
<mo>(</mo>
<mi>m</mi>
<mo>,</mo>
<mi>n</mi>
<mo>)</mo>
</mrow>
<mo>]</mo>
<msup>
<mi>x</mi>
<mrow>
<mi>r</mi>
<mo>-</mo>
<mn>1</mn>
</mrow>
</msup>
<mo>[</mo>
<mi>m</mi>
<mo>,</mo>
<mi>n</mi>
<mo>]</mo>
</mrow>
</math>
<math>
<mrow>
<msubsup>
<mi>h</mi>
<mi>l</mi>
<mrow>
<mi>r</mi>
<mo>,</mo>
<mrow>
<mo>(</mo>
<mi>c</mi>
<mo>)</mo>
</mrow>
</mrow>
</msubsup>
<mo>[</mo>
<mi>s</mi>
<mo>]</mo>
<mo>=</mo>
<munder>
<mi>Σ</mi>
<mrow>
<mrow>
<mo>(</mo>
<mi>m</mi>
<mo>,</mo>
<mi>n</mi>
<mo>)</mo>
</mrow>
<mo>∈</mo>
<msubsup>
<mi>I</mi>
<mi>s</mi>
<mi>C</mi>
</msubsup>
</mrow>
</munder>
<mi>ϵ</mi>
<mo>[</mo>
<mn>0</mn>
<mo>,</mo>
<mi>L</mi>
<mrow>
<mo>(</mo>
<mi>m</mi>
<mo>,</mo>
<mi>n</mi>
<mo>)</mo>
</mrow>
<mo>]</mo>
<msup>
<mi>x</mi>
<mrow>
<mi>r</mi>
<mo>-</mo>
<mn>1</mn>
</mrow>
</msup>
<mo>[</mo>
<mi>m</mi>
<mo>,</mo>
<mi>n</mi>
<mo>]</mo>
<mo>,</mo>
<mi>l</mi>
<mo>=</mo>
<mn>0</mn>
<mo>,</mo>
<mo>.</mo>
<mo>.</mo>
<mo>,</mo>
<mn>3</mn>
</mrow>
</math>
wherein, ε [ l, m]L, m = 0.. 3 is derived from the tensor product wavelet transform matrixTo select the optimal direction c*Optimizing the 12 Tetrolet coefficients and the template selected when the coefficients are minimum;
<math>
<mrow>
<msup>
<mi>c</mi>
<mo>*</mo>
</msup>
<mo>=</mo>
<mi>arg</mi>
<munder>
<mi>min</mi>
<mi>c</mi>
</munder>
<munderover>
<mi>Σ</mi>
<mrow>
<mi>l</mi>
<mo>=</mo>
<mn>1</mn>
</mrow>
<mn>3</mn>
</munderover>
<msub>
<mrow>
<mo>|</mo>
<mo>|</mo>
<msubsup>
<mi>H</mi>
<mi>l</mi>
<mrow>
<mi>r</mi>
<mo>,</mo>
<mrow>
<mo>(</mo>
<mi>c</mi>
<mo>)</mo>
</mrow>
</mrow>
</msubsup>
<mo>|</mo>
<mo>|</mo>
</mrow>
<mn>1</mn>
</msub>
<mtext>=arg</mtext>
<munder>
<mi>min</mi>
<mi>c</mi>
<mtext></mtext>
</munder>
<munderover>
<mi>Σ</mi>
<mrow>
<mi>l</mi>
<mo>=</mo>
<mn>1</mn>
</mrow>
<mn>3</mn>
</munderover>
<munderover>
<mi>Σ</mi>
<mrow>
<mi>s</mi>
<mo>=</mo>
<mn>0</mn>
</mrow>
<mn>3</mn>
</munderover>
<mo>|</mo>
<msubsup>
<mi>h</mi>
<mi>l</mi>
<mrow>
<mi>r</mi>
<mo>,</mo>
<mrow>
<mo>(</mo>
<mi>c</mi>
<mo>)</mo>
</mrow>
</mrow>
</msubsup>
<mo>[</mo>
<mi>s</mi>
<mo>]</mo>
<mo>|</mo>
<mo>-</mo>
<mo>-</mo>
<mo>-</mo>
<mrow>
<mo>(</mo>
<mn>3</mn>
<mo>)</mo>
</mrow>
</mrow>
</math>
for each block Q
i,jTo obtain the optimal Tetrolet decomposition
(3) Rearranging the low and high frequency subbands, a matrix of size 2 x 2,
also, in the same manner as above,
(4) after finding the optimal decomposition direction, saving the high frequency part coefficient and the direction c*For low frequency imagesThe Tetrolet decomposition continues until the J-1 layer ends. The other steps are the same as those in one of the first to third embodiments.
And subtracting the original image from the obtained low-frequency image to obtain a detail image containing most textures and edges. Because the Tetrolet transformation has better detail keeping capability, the Tetrolet transformation is adopted to transform the detail image. Aiming at the problem that the amplitude of the high-frequency sub-band coefficient is still larger after the detail image is transformed, the energy aggregation is further carried out by adopting a high-frequency sub-band self-adaptive decomposition method, so that most detail information is concentrated into a few coefficients, and more detail information is kept while the sparsity is improved. And performing inverse transformation according to the self-adaptive decomposition process, and performing inverse transformation on the transformed image of the whole detail image to obtain an approximate image of the detail image.
The fifth concrete implementation mode: the difference between this embodiment and one of the first to fourth embodiments is: the specific implementation process of the self-adaptive decomposition method in the second step (II) is as follows: the high-frequency sub-band after J-level Tetrolet transformation is set as HFm,dM denotes a scale, m =1, 2.., J, d denotes a direction, d =1,2,3, "1" denotes a horizontal direction, "2" denotes a vertical direction, "3" denotes a diagonal direction;
step 1, calculate each HFm,dAnd is denoted as Em,d;
<math>
<mrow>
<msub>
<mi>E</mi>
<mrow>
<mi>m</mi>
<mo>,</mo>
<mi>d</mi>
</mrow>
</msub>
<mo>=</mo>
<mfrac>
<mn>1</mn>
<mi>MN</mi>
</mfrac>
<munderover>
<mi>Σ</mi>
<mrow>
<mi>i</mi>
<mo>,</mo>
<mi>j</mi>
</mrow>
<mrow>
<mi>M</mi>
<mo>,</mo>
<mi>N</mi>
</mrow>
</munderover>
<msup>
<mrow>
<mi>h</mi>
<mrow>
<mo>(</mo>
<mi>i</mi>
<mo>,</mo>
<mi>j</mi>
<mo>)</mo>
</mrow>
</mrow>
<mn>2</mn>
</msup>
</mrow>
</math>
M, N denotes the size of the corresponding subband, h (i, j) denotes the coefficient value of the subband at position (i, j);
step 2, judging the importance of each sub-band, and if the importance is important, continuing decomposition; for the mth level subband, m =1,2, … …, J, the judgment and decomposition process is as follows:
calculating the average energy E of three high-frequency sub-bands at the mth level of the scalem,ave;
For each sub-band HF under the scalem,d,d=1,2,3,
If Em,d>=Em,ave
Then
Subband HFm,dIf the sub-band is judged to be important, performing Tetrolet decomposition on the sub-band once to generate four new sub-bands; recalculating energy for the four sub-bands and judging importance, and if so, continuing decomposition; the process is repeated until all high frequency sub-bands are inspected. The other steps are the same as those in one of the first to fourth embodiments.
The sixth specific implementation mode: the difference between this embodiment and one of the first to fifth embodiments is: in the second step, after the second (fourth) step is executed, the second (fifth) step is executed: estimating the high-frequency part of the image, wherein the process is carried out according to the following steps:
step 1, after the high-frequency sub-band is subjected to self-adaptive decomposition, most of energy of the high-frequency image is concentrated on a few coefficients, and N with a large absolute value is reserved2Setting the coefficients and setting the rest coefficients to be 0;
step 2, reserving N
2The method comprises the steps that (1) a transformed image with each coefficient is subjected to Tetrolet inverse transformation on a corresponding sub-band according to the inverse sequence of self-adaptive decomposition, and then the whole image is subjected to Tetrolet transformation; finally, a high-frequency diagram is obtainedAn estimated image of the image, the result of said estimated image being noted
The other steps are the same as those in one of the first to fifth embodiments.
The seventh embodiment: the difference between this embodiment and one of the first to sixth embodiments is: from the estimation of the low-frequency part of the image
And estimation result of high frequency part
Obtaining an estimated image of the original image X
Is marked as
The other steps are the same as those in one of the first to sixth embodiments.
Effect verification:
to verify the effect of the present invention, an experiment was performed using a remote sensing image, as shown in fig. 2 (a). For the proposed hybrid transformation method, when estimating the low frequency, 9/7 biorthogonal wavelet decomposition is adopted, the decomposition layer number is five levels, and the number of reserved coefficients N1 is 6000; when detail images are estimated, five-level Tetrolet transformation is adopted, and the number of reserved coefficients N2 is 1500. For comparison, 9/7 biorthogonal wavelets, the Tetrolet transform, the Curvelet transform, and the Contourlet transform were used for comparison with the proposed method, respectively, under the same conditions. For these four methods for comparison, 7500 coefficients are retained directly after the transform. The PSNR results of the reconstructed images and the partially enlarged views of the reconstructed images are shown in fig. 2(b) - (k), and the partially enlarged regions are indicated by white frames in the original image. As can be seen from fig. 2, the proposed hybrid method performs best, the Tetrolet transform method is second and the Curvelet method is worst.
To further demonstrate the effectiveness of the present invention, two remote sensing images (a) RS1 and (b) RS2 were used as test images, each having a size of 512 x 512(262144 coefficients), as shown in FIG. 3. In the experiment, the proposed hybrid transformation method is respectively compared with 9/7 tensor wavelets, Tetrolet transformation, Curvelet transformation and Contourlet transformation, and the used hierarchical layers are all 5 levels. For the proposed hybrid transformation method, let 9/7 number of wavelet transformation retention coefficients be N1, number of Tetrolet transformation retention coefficients be N2, under N1: N2=4:1 and N1: N2=3:1, PSNR, RMSE, MAE and MAD are respectively adopted as objective evaluation indexes, and SSIM is a subjective evaluation index for experiments. The results of the PSNR comparison are shown in tables 1 and 2, and the results of the SSIM comparison are shown in tables 3 and 4. Taking RS1 as an example, the comparison results of RMSE, MAE, and MAD are shown in fig. 4, and the comparison result of RS2 is similar thereto.
TABLE 1 PSNR results obtained with different methods (N1: N2=4:1)
TABLE 2 PSNR results obtained with different methods (N1: N2=3:1)
The SSIM index comparison (N) obtained by the method presented in Table 3 with four other different methods1:N2=4:1)
TABLE 4 SSIM index comparison (N) of the proposed method with four other different methods1:N2=3:1)
FIG. 5 shows LL as1The polyphase decomposition process of the wavelet sub-bands is given for example (assuming LL1Is 8 x 8). This process is implemented by a p-fold filter, where p is assumed to be 4. Will LL1Sub-blocks of size 2 x 2 are divided, different blocks being represented in different colours for visual perception, and the numbers in the blocks representing the positions of the coefficients. Firstly, extracting the coefficient of the upper left corner of each block, and placing the coefficients according to the sequence of the original blocks to obtain a first component; secondly, extracting the coefficient of the upper right corner of each block, and placing the coefficients according to the sequence of the original blocks to obtain a second component; and by analogy, finally, extracting the coefficient of the lower right corner of each block, and placing according to the sequence of the original blocks, thereby obtaining a fourth component. It can be seen that the adjacent four coefficients in each block are placed exactly at the same position in the four components. Therefore, the PCA is adopted to remove the redundancy among the components, namely the redundancy of adjacent wavelet coefficients. The polyphase decomposition process for the other subbands is similar to this process.
From these results, the proposed hybrid method is superior to the other four transformation methods in both subjective and objective aspects while retaining different numbers of coefficients, further proving the effectiveness of the proposed method.