CN114897751A - Infrared and visible light image perception fusion method based on multi-scale structural decomposition - Google Patents

Infrared and visible light image perception fusion method based on multi-scale structural decomposition

Info

Publication number
CN114897751A
Authority
CN
China
Prior art keywords
image
infrared
scale
visible light
contrast
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210381391.0A
Other languages
Chinese (zh)
Inventor
周志强
费二芳
缪玲娟
崔赛佳
叶何
李家琪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Technology BIT
Original Assignee
Beijing Institute of Technology BIT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Technology BIT filed Critical Beijing Institute of Technology BIT
Priority to CN202210381391.0A
Publication of CN114897751A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06T: IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 5/00: Image enhancement or restoration
    • G06T 5/50: Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • G06T 5/20: Image enhancement or restoration using local operators
    • G06T 5/70: Denoising; Smoothing
    • G06T 2207/00: Indexing scheme for image analysis or image enhancement
    • G06T 2207/10: Image acquisition modality
    • G06T 2207/10024: Color image
    • G06T 2207/10048: Infrared image
    • G06T 2207/20: Special algorithmic details
    • G06T 2207/20212: Image combination
    • G06T 2207/20221: Image fusion; Image merging

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Image Processing (AREA)

Abstract

The invention relates to an infrared and visible light image perception fusion method based on multi-scale structural decomposition, and belongs to the technical field of multi-sensor image fusion. The method fully considers relevant characteristics of the Human Visual System (HVS) and helps remedy potential shortcomings of current fusion research in visual information perception. Unlike other algorithms, the method constructs a multi-scale structural decomposition based on scale-aware edge preservation, which yields image structures at different scales: edge information is retained in each layer, and small-scale details can be regarded as structures with a fine spatial scale. In addition, the method fully considers pixel-level salient information and large-scale structural information during fusion, so a fused image with rich information and good visual perception can be obtained.

Description

Infrared and visible light image perception fusion method based on multi-scale structural decomposition
Technical Field
The invention relates to an infrared and visible light image perception fusion method based on multi-scale structural decomposition, and belongs to the technical field of multi-sensor image fusion.
Background
Image fusion technology is important in image processing and computer vision and is widely applied in fields such as military applications, remote sensing, medical image processing, and industrial inspection. Among these, infrared and visible image fusion has become one of the most studied branches because of its unique applications. Visible light images generally have higher resolution and contain important detail information of the scene, but their imaging quality is easily affected by external factors such as weather and illumination. In contrast, infrared images reflect the thermal radiation of the scene and contain information that is lost in visible light images, but their detail information is often poor. The information in visible light and infrared images is therefore complementary to a certain extent, and fusing them can produce a relatively complete mapping of the scene. One basic principle of infrared and visible image fusion is to preserve as much salient information from both images as possible. In addition, the fused image should introduce few artifacts and have good visual perception.
In general, infrared and visible image fusion comprises three important steps: feature extraction, formulation of a fusion strategy, and image reconstruction. Depending on the analysis tools used in this process, existing infrared and visible image fusion algorithms can be divided into six categories: multi-scale transform based methods, subspace based methods, sparse representation based methods, saliency based methods, deep learning based methods, and hybrid methods. Among them, the most widely studied and applied are fusion methods based on multi-scale transforms, which first decompose the source images with a transform technique to obtain multi-scale information and then fuse each scale with a chosen fusion strategy. The Laplacian pyramid is a classical transform commonly used for multi-scale decomposition, from which techniques such as the contrast pyramid, steerable pyramid, and morphological pyramid were derived. The wavelet transform, another important multi-scale decomposition tool, decomposes an image into a low-pass layer and detail layers in different directions, which helps reduce noise in the fused image. On this basis, researchers have proposed improved analysis tools with better decomposition performance, such as the discrete wavelet transform, contourlet transform, and shearlet transform. In addition, many edge-preserving filters, such as the bilateral filter and the guided filter, have been proposed and widely used for multi-scale decomposition; they preserve the spatial continuity of the image structure and reduce halos and artifacts. During fusion, the fusion weights are often determined by maximum-selection and weighted-average strategies, as in the sketch below. Toet et al. decompose the source images with a contrast pyramid transform and select the maximum contrast value as the fusion coefficient. Adu et al. propose using a weighted-average strategy to calculate the weight coefficients of the decomposed images and then fuse images of the same scale with those coefficients.
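As background, the following is a minimal sketch of classic Laplacian-pyramid fusion with max-selection of band-pass coefficients and averaging of the coarsest low-pass layer; the function names, level count, and OpenCV-based implementation are illustrative and are not taken from the patent.

```python
# Minimal sketch of classic pyramid-based fusion: Laplacian decomposition,
# max-selection on detail layers, averaging of the coarsest low-pass layer.
import cv2
import numpy as np

def laplacian_pyramid(img, levels=4):
    img = img.astype(np.float32)
    lows, bands = [img], []
    for _ in range(levels):
        down = cv2.pyrDown(lows[-1])
        up = cv2.pyrUp(down, dstsize=(lows[-1].shape[1], lows[-1].shape[0]))
        bands.append(lows[-1] - up)   # band-pass (detail) layer
        lows.append(down)             # low-pass layer
    return bands, lows[-1]

def fuse_pyramids(img_ir, img_vis, levels=4):
    b_ir, low_ir = laplacian_pyramid(img_ir, levels)
    b_vis, low_vis = laplacian_pyramid(img_vis, levels)
    # max-selection on details, simple average on the coarsest low-pass
    fused_bands = [np.where(np.abs(a) >= np.abs(b), a, b) for a, b in zip(b_ir, b_vis)]
    fused = (low_ir + low_vis) / 2.0
    for band in reversed(fused_bands):
        fused = cv2.pyrUp(fused, dstsize=(band.shape[1], band.shape[0])) + band
    return np.clip(fused, 0, 255).astype(np.uint8)
```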
Existing fusion methods focus more on preserving salient information or avoiding artifacts in the infrared and visible images, and pay less attention to perceptual issues and the characteristics of the Human Visual System (HVS). Considering the mechanisms of the HVS not only produces fusion results that are generally regarded as visually pleasing; more importantly, it can help address potential drawbacks of the current fusion framework. The image fusion process typically uses visual features from different source images, in particular comparing them to determine fusion weights or to obtain an appropriate information fusion strategy. However, visual features are susceptible to external physical conditions (e.g., ambient lighting and the characteristics of different sensors), which means these features are not on an equal and unambiguous footing when they are compared and fused, and this can degrade fusion quality. This is especially true for infrared and visible image fusion, where the response characteristics of the two sensors differ greatly and the visual information in the visible spectrum can be severely affected by changes in external lighting conditions.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the method transforms the physical intensities of the source images into the visual response space of the Human Visual System (HVS), so that input information from different source images can be compared and fused in a unified human visual response space in which all features share the same perceptual state. This eliminates external physical factors that could influence the fusion process, ultimately producing a fused image with rich information and good visual perception, remedying the current potential shortcomings and improving the fusion effect.
The technical scheme of the invention is as follows:
An infrared and visible light image perception fusion method based on multi-scale structural decomposition comprises the following steps:
Step 1: perform multi-scale structural decomposition of infrared and visible light images of the same scene based on a scale-aware edge-preserving (SAEP) filtering algorithm to obtain the infrared and visible multi-scale filtered images I_j^r and I_j^v, where j = 0, 1, ..., N and N is the number of scale layers.
Step 2: convert the infrared and visible multi-scale filtered images I_j^r and I_j^v obtained in step 1 into the visual response space of the HVS to obtain the multi-scale perceptual contrast Ĉ_j^r of the infrared image and the multi-scale perceptual contrast Ĉ_j^v of the visible light image.
The conversion method comprises the following steps:
(1) Calculate the multi-scale adaptive contrast R_j^r and R_j^v of the infrared and visible images according to the contrast sensitivity and local adaptation mechanisms of the HVS (the formulas are given as images in the original), where I_j^r is the j-th layer low-pass image of the infrared image, B_j^r is the j-th layer band-pass image of the infrared image, I_j^v is the j-th layer low-pass image of the visible image, and B_j^v is the j-th layer band-pass image of the visible image; t is an adaptive parameter with a set value, preferably t = 1, and α is an adjustment parameter, preferably α = 0.8.
(2) Calculate the initial multi-scale perceptual contrast C_j^r and C_j^v of the infrared and visible images according to the nonlinear transfer mechanism of the HVS. The transfer is applied piecewise: one expression is used when the adaptive contrast R_j^r (respectively R_j^v) is positive or zero, and a sign-inverted counterpart is used when it is negative (the expressions are given as images in the original). Here h is a threshold, preferably h = 0.5; c is a constant, set to 21.3; p takes a different value in each scale layer, and as j increases with N = 4 the values of p are 1.40, 1.15, 1.04 and 1.15.
(3) Apply noise and intensity-saturation suppression to the initial multi-scale perceptual contrast C_j^r and C_j^v obtained in (2) to obtain the final multi-scale perceptual contrast Ĉ_j^r of the infrared image and Ĉ_j^v of the visible light image.
The noise suppression method (formulas given as images in the original) uses a threshold th to distinguish noise from useful information, defined relative to the average gray value of the source image; since noise usually occurs in the small-scale layers, th is set accordingly (the specific setting is given as an image in the original).
The intensity saturation is suppressed as follows (formulas given as images in the original): unlike noise, overexposure often occurs in the large-scale layers; r is the overexposure suppression parameter and I_0 denotes the source image.
Step 3: according to the visual characteristics of the human eye, adaptively adjust the lowest-layer low-pass images I_N^r and I_N^v of the infrared and visible images, determine the fusion weight based on a saliency strategy, and obtain the fused infrared and visible bottom-layer low-pass image A_N^F.
The specific method comprises the following steps:
(1) Adaptively adjust the bottom-layer low-pass images I_N^r and I_N^v of the infrared and visible images to obtain the adjusted lowest-layer low-pass images A_N^r and A_N^v (the adjustment formulas are given as images in the original), where l is a threshold reflecting the average background brightness of the adjusted lowest-layer low-pass image and is set to 128.
(2) Determine the fusion weight w of the visible-light lowest-layer low-pass image A_N^v according to the saliency fusion strategy (the weight formula is given as an image in the original), where G(·) denotes a Gaussian filtering operation and S_N^v and S_N^r are the saliency maps of the lowest-layer low-pass images of the visible and infrared images, respectively. The saliency maps are computed from gray-level differences within the bottom-layer image region Ω (the expressions are given as images in the original): A_N^r(n) and A_N^r(k) denote the gray values of a pixel n and a neighbouring pixel k in the infrared bottom-layer image region Ω, and A_N^v(n) and A_N^v(k) denote the gray values of a pixel n and a neighbouring pixel k in the visible bottom-layer image region Ω.
The fused infrared and visible bottom-layer low-pass image A_N^F is then obtained by combining A_N^v and A_N^r with the weight w (the combination formula is given as an image in the original).
Step 4: determine the fusion weights of the multi-scale perceptual contrast Ĉ_j^r of the infrared image and Ĉ_j^v of the visible light image with a bidirectional saliency aggregation strategy, in which one direction combines pixel-level saliency from small to large scales and the other aggregates structural saliency in the opposite direction, and then obtain the fused image of the j-th layer perceptual contrast of the infrared and visible images.
The method comprises the following specific steps:
(1) For pixel-level saliency, aggregate the perceptual contrasts Ĉ_j^r and Ĉ_j^v from small to large scales to obtain the pixel-level saliency D_j^r of the j-th layer infrared perceptual contrast Ĉ_j^r and the pixel-level saliency D_j^v of the j-th layer visible perceptual contrast Ĉ_j^v (the aggregation formulas are given as images in the original).
(2) For structural saliency, aggregate the perceptual contrasts Ĉ_j^r and Ĉ_j^v from large to small scales, also taking into account the adjusted lowest-layer low-pass images A_N^r and A_N^v, to obtain the structural saliency G_j^r of the j-th layer infrared perceptual contrast Ĉ_j^r and the structural saliency G_j^v of the j-th layer visible perceptual contrast Ĉ_j^v (the aggregation formulas are given as images in the original). Here sf is the structural saliency function (its expression is given as an image in the original); γ is a balance parameter, with γ = 0.1; s_1 and s_2 are related to the eigenvalues of the gradient covariance matrix C, which is formed from I_x(X) and I_y(X), the gradients in the x and y directions of the pixels X inside a local window W_i.
(3) Compute the overall saliency S_j^r of the j-th layer infrared perceptual contrast Ĉ_j^r and the overall saliency S_j^v of the j-th layer visible perceptual contrast Ĉ_j^v (the formulas are given as images in the original), where β is a balance parameter, with β = 5; M_j^r is the saliency adjustment map of the j-th layer of the infrared image and M_j^v is the saliency adjustment map of the j-th layer of the visible light image, and their values are computed from the source infrared image I_0^r and the average gray value of the infrared image in the neighbourhood Ω (the expressions are given as images in the original). sg denotes a sigmoid function and u is a control parameter; the sigmoid takes different shapes for different u, and here u is set to 5.
(4) From the overall saliencies S_j^v and S_j^r of the perceptual contrast of each layer of the visible and infrared images, compute the fusion weight of each layer of visible perceptual contrast (the weight formula is given as an image in the original), where u takes a different value in each scale layer, with u = 0.1·2^(4-j).
The fused image C_j^F of the j-th layer perceptual contrast of the infrared and visible images is then obtained (the fusion formula is given as an image in the original).
Step 5: obtain the final fused image through the inverse transformation and reconstruction processes (the reconstruction formula is given as an image in the original), where A_N^F is the lowest-layer low-pass fused image and the fused band-pass images B_j^F result from applying the inverse of the transformation in step 2 to the fused perceptual-contrast images C_j^F of each layer.
Advantageous effects
1. The invention provides a novel sensing framework based on multi-scale structural decomposition, which is used for fusing infrared and visible light images. The proposed framework fully considers relevant characteristics of the HVS and can help solve potential drawbacks of current fusion studies in visual information perception.
2. The invention constructs a multi-scale structure decomposition method based on an SAEP filter to design a perception fusion framework. Compared with other algorithms, the method has excellent edge retention and scale perception characteristics, can obtain image structures of different scales, wherein edge information is kept in each layer, and small-scale details can be regarded as structures with fine spatial scales.
3. The invention provides a novel bidirectional significance aggregation algorithm for determining the fusion weight of multi-scale perception contrast, and the algorithm fully considers pixel-level significance information and large-scale structure information, so that a fusion image with rich information and good visual perception effect can be obtained.
4. The framework proposed by the present invention combines key characteristics of the HVS, including multi-scale processing channels, contrast sensitivity, local adaptation, and supra-threshold characteristics. All of these key HVS characteristics are integrated into the proposed framework to simulate the human visual response in complex scenes, creating a visual response space of the HVS represented by multi-scale perceptual contrast.
5. The present invention constructs a multi-scale structural decomposition by utilizing a scale-aware edge preservation (SAEP) filter that has good scale separation and edge preservation characteristics. By decomposition, an image structure of different scales is obtained, with edges remaining in each layer, and the details therein can be regarded as an image structure with a fine spatial scale.
6. In the fusion process, the invention proposes a two-way saliency aggregation strategy to fuse the perceptual contrast of each scale, one direction is aggregated from top to bottom in a scale space to obtain pixel-level saliency, and the other direction is aggregated reversely to calculate structural saliency. The two types of saliency are then combined and fusion weights are calculated according to the sigmoid function.
Drawings
FIG. 1 is a sigmoid function with u taking different values;
FIG. 2 is a flow chart of the fusion framework of the present invention;
FIG. 3 is a comparison of fused infrared and visible light images obtained by different methods, where (a) is the infrared image, (b) is the visible light image, (c) is the fused image obtained by the WLS method, (d) is the fused image obtained by the U2Fusion method, (e) is the fused image obtained by the IFCNN method, and (f) is the fused image obtained by the proposed method.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings.
The invention provides an infrared and visible light image perception fusion framework based on multi-scale structural decomposition, which draws on relevant mechanisms of the Human Visual System (HVS) to convert the source images into the visual response space of the HVS for comparison and fusion.
Based on this, the specific embodiments of the present invention are as follows.
Assume that the input infrared and visible images are I^r and I^v respectively; as shown in FIG. 2, the fusion steps are as follows:
Step 1: according to the multi-scale processing channels of the HVS, the invention constructs a multi-scale structural decomposition based on a scale-aware edge-preserving (SAEP) filtering algorithm to obtain the infrared and visible multi-scale filtered images I_j^r and I_j^v:
I_j = SAEP(I_{j-1}, λ_j, r_{j,0}, r_{j,1}), j = 1, 2, ..., N
where I_0 is the source image; λ is the global smoothing weight, with λ_1 = 0.1 and λ_{j+1} = λ_j + 0.9; r is a scale parameter, and image structures whose scale lies in [r_{j,0}, r_{j,1}] are smoothed. In this example, r_{1,0} = 0, r_{1,1} = 4, r_{j+1,1} = 2·r_{j,1}, r_{j+1,0} = r_{j,1}, and the number of filtering levels is N = 4.
Further, the infrared and visible band-pass images of each layer, B_j^r and B_j^v, are obtained by
B_j = I_{j-1} - I_j, j = 1, 2, ..., N
as sketched in the code below.
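The following is a minimal sketch of this step-1 decomposition. The SAEP filter itself is not a publicly available library routine, so an edge-preserving stand-in (OpenCV's bilateral filter with a growing spatial scale) is assumed in place of SAEP(I, λ_j, r_{j,0}, r_{j,1}); only the decomposition structure (low-pass stack I_j and band-pass layers B_j = I_{j-1} - I_j) follows the patent.

```python
# Sketch of the step-1 multi-scale structural decomposition. cv2.bilateralFilter is an
# ASSUMED stand-in for the SAEP filter; the growing parameter d loosely mimics the
# scale interval [r_{j,0}, r_{j,1}]. The low-pass/band-pass structure follows the text.
import cv2
import numpy as np

def multiscale_decompose(img, levels=4):
    img = img.astype(np.float32)
    lows = [img]                                  # I_0 is the source image
    for j in range(1, levels + 1):
        d = 2 ** (j + 1)                          # spatial scale grows per level
        smoothed = cv2.bilateralFilter(lows[-1], d=d,
                                       sigmaColor=25.0, sigmaSpace=float(d))
        lows.append(smoothed)                     # I_j: j-th low-pass layer
    bands = [lows[j - 1] - lows[j] for j in range(1, levels + 1)]   # B_j = I_{j-1} - I_j
    return lows, bands
```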
step 2: based on the correlation mechanisms such as contrast sensitivity, local adaptation and super-threshold characteristics of the HVS, the band-pass and low-pass images obtained by the multi-scale decomposition are converted into the visual response space of the HVS to obtain the multi-scale perception contrast of the infrared image
Ĉ_j^r and the multi-scale perceptual contrast Ĉ_j^v of the visible light image.
(1) Calculate the multi-scale adaptive contrast R_j^r and R_j^v of the infrared and visible images according to the contrast sensitivity and local adaptation mechanisms of the HVS (the formula is given as an image in the original), where I_j and B_j are the j-th layer low-pass and band-pass images of the infrared or visible image, respectively; t is an adaptive parameter that takes different values according to the characteristics of the infrared and visible images; α is an adjustment parameter, preferably α = 0.8.
(2) The adaptive contrast obtained in (1) adapts to human vision to some extent, but it is still not the perceptual contrast in the visual response space. The HVS exhibits a nonlinear transfer function that helps obtain the perceptual contrast in this unified space. The multi-scale perceptual contrast C_j^r and C_j^v of the infrared and visible images is obtained according to the HVS nonlinear transfer mechanism (the transfer formula is given as an image in the original), in which R_j is taken as positive; if it is negative, its absolute value is used as the input and the sign of the output is inverted. h is a threshold, preferably h = 0.5; c is a constant, set to 21.3; p differs between scale layers, and from the smallest to the largest scale its values are 1.40, 1.15, 1.04, 1.15, 1.35 and 1.93.
(3) Apply noise and intensity-saturation suppression to the perceptual contrast obtained above to get the final multi-scale perceptual contrast Ĉ_j^r of the infrared image and Ĉ_j^v of the visible light image. In general, smaller-scale layers contain more noise, while larger-scale layers suffer more from over-exposure, so different suppression methods are applied in different frequency layers.
The noise suppression method (formula given as an image in the original) uses a threshold th to distinguish noise from useful information, defined relative to the average gray value of the source image; the specific setting used in this example is given as an image in the original.
The intensity-saturation suppression method (formula given as an image in the original) uses an over-exposure suppression parameter r, with I_0 denoting the source image. A sketch of the whole step-2 conversion follows.
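Because the adaptive-contrast, nonlinear-transfer and suppression formulas are published only as equation images, the sketch below substitutes plausible stand-ins: band-pass divided by the locally adapted low-pass for the adaptive contrast, a signed power law with exponent p and constant c for the transfer, and a threshold tied to the mean gray value for the noise gate. Every formula in this block is an assumption that only mirrors the described structure, not the patent's actual equations.

```python
# Sketch of step 2 (conversion to perceptual contrast). All formulas below are ASSUMED
# stand-ins: R_j = B_j / (t + alpha * |I_j|), a signed power-law transfer c * |R|^p,
# and a mean-gray-based noise threshold. They reproduce the described structure only.
import numpy as np

def adaptive_contrast(band, low, t=1.0, alpha=0.8):
    # Band-pass response normalised by the locally adapted low-pass level.
    return band / (t + alpha * np.abs(low))

def perceptual_contrast(r, p, c=21.3):
    # Nonlinear transfer applied to |R|, with the sign restored afterwards,
    # matching the patent's sign-handling description.
    return np.sign(r) * c * np.abs(r) ** p

def suppress_noise(c_j, source, th_scale=0.02):
    # Zero out weak responses in the fine-scale layers; the threshold is tied to the
    # average gray value of the source image (the exact setting is not published).
    th = th_scale * source.mean()
    return np.where(np.abs(c_j) < th, 0.0, c_j)
```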
Step 3: the lowest low-pass layers of the infrared and visible images reflect the background information of the scene. According to the visual characteristics of the human eye, the lowest-layer low-pass images I_N^r and I_N^v of the infrared and visible images are adaptively adjusted, and the fusion weight is determined based on a saliency strategy:
(1) Adaptively adjust the bottom-layer low-pass images I_N^r and I_N^v of the infrared and visible images to obtain the adjusted lowest-layer low-pass images A_N^r and A_N^v (the adjustment formula is given as an image in the original), where l is a threshold reflecting the average background brightness of the adjusted lowest-layer low-pass image and is set to 128, and α is an adjustment parameter, preferably α = 0.8.
(2) Calculate the fusion weight w of the visible-light bottom-layer low-pass image A_N^v according to the saliency fusion strategy (the weight formula is given as an image in the original), where G(·) denotes a Gaussian filtering operation and S_N^v and S_N^r are the saliency maps of the lowest-layer low-pass images of the visible and infrared images, respectively; the saliency is computed from the gray-level differences between A_N(n) and A_N(k), the gray values of a pixel n and a neighbouring pixel k in the bottom-layer image region Ω (the exact expression is given as an image in the original).
Thus, the fused infrared and visible bottom-layer low-pass image A_N^F is obtained by combining A_N^v and A_N^r with the weight w (the combination formula is given as an image in the original); a sketch is given below.
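A minimal sketch of this step-3 fusion follows. It assumes the saliency map is a sum of absolute gray-level differences within a local neighbourhood (approximated via the deviation from a local mean), that the raw weight is a binary saliency comparison smoothed by Gaussian filtering, and that the fused layer is the convex combination w·A_N^v + (1 - w)·A_N^r; the patent's own expressions are published only as images.

```python
# Sketch of step 3 (fusion of the lowest low-pass layers). The saliency measure, the
# binary-then-smoothed weight, and the convex combination are ASSUMED readings of
# formulas that appear only as images in the patent.
import cv2
import numpy as np

def region_saliency(a_n, ksize=15):
    # Approximates sum_k |A_N(n) - A_N(k)| over a (ksize x ksize) neighbourhood by the
    # absolute deviation from the local mean, scaled by the number of neighbours.
    a_n = a_n.astype(np.float32)
    local_mean = cv2.blur(a_n, (ksize, ksize))
    return ksize * ksize * np.abs(a_n - local_mean)

def fuse_lowpass(a_vis, a_ir, sigma=5.0):
    a_vis = a_vis.astype(np.float32)
    a_ir = a_ir.astype(np.float32)
    s_vis, s_ir = region_saliency(a_vis), region_saliency(a_ir)
    raw = (s_vis >= s_ir).astype(np.float32)      # pick the more salient source
    w = cv2.GaussianBlur(raw, (0, 0), sigma)      # Gaussian-filtered weight, as described
    return w * a_vis + (1.0 - w) * a_ir
```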
Step 4: in the visual response space of the HVS, the perceptual contrast typically contains both fine pixel-level saliency information and structural information of the image. For the multi-scale perceptual contrast Ĉ_j^r of the infrared image and Ĉ_j^v of the visible light image, the invention therefore proposes a bidirectional saliency aggregation strategy to fully aggregate these features and determine the fusion weights from them. One direction is the combination of pixel-level saliency from small to large scales; the other is the aggregation of structural saliency in the opposite direction.
(1) For pixel-level saliency, the method accumulates the perceptual contrast from small to large scales to obtain the pixel-level saliency D_j of the j-th layer perceptual contrast C_j (the accumulation formula is given as an image in the original). The j-th layer pixel-level saliency thus contains the fine-grained information of the current layer and of the smaller-scale layers, so more complete details are retained and the final fused image is finer and smoother.
(2) For structural saliency, the invention aggregates the perceptual contrast from large to small scales; in addition, since the adjusted lowest low-pass layer image A_N contains the basic structural information of the source image, it is also taken into account to obtain a relatively complete structural saliency. The structural saliency G_j of the j-th layer perceptual contrast C_j is computed with a structural saliency function sf (both expressions are given as images in the original), which reflects structural information such as corners in the image; γ is a balance parameter, set in the invention to γ = 0.1; s_1 and s_2 are related to the eigenvalues of the gradient covariance matrix C, which is formed from I_x(X) and I_y(X), the gradients in the x and y directions of the pixels X inside a local window W_i. A sketch of this structural saliency measure follows.
(3) Compute the overall saliency S_j of the j-th layer perceptual contrast C_j:
S_j = M_j * (D_j + β*G_j)
where β is a balance parameter, with β = 5; * denotes element-by-element multiplication; and M_j is the saliency adjustment map, which helps the fused image capture more highlighted target information and less noise from the infrared image. For the infrared and visible images, M_j is computed from the source infrared image I_0^r and the average gray value of the infrared image in the neighbourhood Ω (the expressions are given as images in the original); sg denotes the sigmoid function, whose shape depends on the control parameter u, as shown in FIG. 1; here u = 5.
(4) From the overall saliencies S_j^v and S_j^r of the perceptual contrast of each layer of the visible and infrared images, the fusion weight of each layer of visible perceptual contrast is computed (the weight formula is given as an image in the original), where u takes a different value in each scale layer; in this example u = 0.1·2^(4-j).
Further, the fusion process of the infrared and visible perceptual-contrast layers yields the fused image C_j^F of the j-th layer perceptual contrast (the fusion formula is given as an image in the original); a sketch of the aggregation and weighting is given below.
Step 5: from the lowest-layer low-pass fused image A_N^F obtained in step 3 and the fused perceptual-contrast images C_j^F of each scale obtained in step 4 in the visual response space of the HVS, the final fused image is obtained through the inverse transformation and reconstruction processes (the reconstruction formula is given as an image in the original), where the fused band-pass images B_j^F result from applying the inverse of the transformation in step 2 to the fused perceptual-contrast images. A sketch of the reconstruction appears below.
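The reconstruction sketch below assumes that the inverse transformation simply inverts the stand-in transfer and normalisation used in the earlier step-2 sketch, and that reconstruction adds the recovered band-pass layers onto A_N^F from coarse to fine; the patent's own inverse formulas are published only as images.

```python
# Sketch of step 5 (reconstruction). Inverting the ASSUMED power-law transfer and the
# ASSUMED adaptive-contrast normalisation recovers fused band-pass layers B_j^F, which
# are then summed onto the fused low-pass A_N^F from coarse to fine.
import numpy as np

def inverse_perceptual_contrast(c_f, p, c=21.3):
    # Inverse of the assumed transfer c * |R|^p, sign preserved.
    return np.sign(c_f) * (np.abs(c_f) / c) ** (1.0 / p)

def reconstruct(a_n_fused, c_fused, lows_fused, p_values, t=1.0, alpha=0.8):
    # lows_fused: fused low-pass layers used to undo the assumed normalisation
    # R = B / (t + alpha * |I|); which low-pass the patent uses here is not published.
    fused = a_n_fused.copy()
    layers = zip(reversed(c_fused), reversed(lows_fused), reversed(p_values))
    for c_f, low, p in layers:
        r_f = inverse_perceptual_contrast(c_f, p)
        b_f = r_f * (t + alpha * np.abs(low))     # recovered fused band-pass B_j^F
        fused = fused + b_f                       # add band-pass back, coarse to fine
    return fused
```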
FIG. 3 compares the fused image obtained according to the invention with those of other methods: (a) is the infrared image, (b) is the visible light image, and (c), (d), (e) and (f) are the fusion results of the WLS method, the U2Fusion method, the IFCNN method and the proposed method, respectively. It can be seen that the proposed framework achieves better fusion results by performing fusion in a consistent and well-defined visual response space, since the relevant properties of the human visual system are fully taken into account.

Claims (10)

1. An infrared and visible light image perception fusion method based on multi-scale structural decomposition, characterized by comprising the following steps:
step 1, performing multi-scale structural decomposition of infrared and visible light images of the same scene to obtain the infrared and visible multi-scale filtered images I_j^r and I_j^v, where j = 0, 1, ..., N and N is the number of scale layers;
step 2, converting the infrared and visible multi-scale filtered images I_j^r and I_j^v obtained in step 1 into the visual response space of the Human Visual System (HVS) to obtain the multi-scale perceptual contrast Ĉ_j^r of the infrared image and the multi-scale perceptual contrast Ĉ_j^v of the visible light image;
step 3, adaptively adjusting the lowest-layer low-pass images I_N^r and I_N^v of the infrared and visible images, determining the fusion weight based on a saliency strategy, and obtaining the fused infrared and visible bottom-layer low-pass image A_N^F;
step 4, determining the fusion weights of the multi-scale perceptual contrast Ĉ_j^r of the infrared image and the multi-scale perceptual contrast Ĉ_j^v of the visible light image, and obtaining the fused image of the j-th layer perceptual contrast of the infrared and visible images;
step 5, obtaining the final fused image from the fused infrared and visible bottom-layer low-pass image A_N^F obtained in step 3 and the fused images of each layer of perceptual contrast obtained in step 4 through the inverse transformation and reconstruction processes.
2. The infrared and visible image perception fusion method based on multi-scale structural decomposition according to claim 1, characterized in that:
in the step 1, multi-scale structural decomposition is performed on the infrared and visible light images of the same scene based on a scale-aware edge preservation (SAEP) filtering algorithm.
3. The infrared and visible image perception fusion method based on multi-scale structural decomposition according to claim 1, characterized in that:
in the step 2, the specific method for converting the infrared and visible multi-scale filtered images I_j^r and I_j^v into the visual response space of the HVS to obtain the multi-scale perceptual contrast Ĉ_j^r of the infrared image and the multi-scale perceptual contrast Ĉ_j^v of the visible light image is:
(1) calculating the multi-scale adaptive contrast R_j^r and R_j^v of the infrared and visible images (the formulas are given as images in the original), where I_j^r is the j-th layer low-pass image of the infrared image, B_j^r is the j-th layer band-pass image of the infrared image, I_j^v is the j-th layer low-pass image of the visible image, and B_j^v is the j-th layer band-pass image of the visible image; t is an adaptive parameter with a set value, preferably t = 1, and α is an adjustment parameter, preferably α = 0.8;
(2) calculating the initial multi-scale perceptual contrast C_j^r and C_j^v of the infrared and visible images, using one expression when the adaptive contrast R_j^r (respectively R_j^v) is positive or zero and a sign-inverted counterpart when it is negative (the expressions are given as images in the original), where h is a threshold and c is a constant;
(3) applying noise and intensity-saturation suppression to the initial multi-scale perceptual contrast C_j^r and C_j^v obtained in (2) to obtain the final multi-scale perceptual contrast Ĉ_j^r of the infrared image and Ĉ_j^v of the visible light image, wherein the noise suppression (formulas given as images in the original) uses a threshold th defined relative to the average gray value of the source image, and the intensity-saturation suppression (formulas given as images in the original) uses an overexposure suppression parameter r, with I_0 denoting the source image.
4. The method of claim 3, characterized in that:
in the step 3, the method for adaptively adjusting the lowest-layer low-pass images I_N^r and I_N^v of the infrared and visible images, determining the fusion weight based on a saliency strategy, and obtaining the fused infrared and visible bottom-layer low-pass image A_N^F comprises the following steps:
(1) adaptively adjusting the bottom-layer low-pass images I_N^r and I_N^v of the infrared and visible images to obtain the adjusted lowest-layer low-pass images A_N^r and A_N^v (the adjustment formulas are given as images in the original), where l is a threshold;
(2) determining the fusion weight w of the visible-light lowest-layer low-pass image A_N^v according to the saliency fusion strategy (the weight formula is given as an image in the original), where G(·) denotes a Gaussian filtering operation and S_N^v and S_N^r are the saliency maps of the lowest-layer low-pass images of the visible and infrared images, respectively, computed as follows (the expressions are given as images in the original): A_N^r(n) and A_N^r(k) denote the gray values of a pixel n and a neighbouring pixel k in the infrared bottom-layer image region Ω, and A_N^v(n) and A_N^v(k) denote the gray values of a pixel n and a neighbouring pixel k in the visible bottom-layer image region Ω;
thus, the fused infrared and visible bottom-layer low-pass image A_N^F is obtained by combining A_N^v and A_N^r with the weight w (the combination formula is given as an image in the original).
5. The method of claim 4, characterized in that:
in the step 4, the method for determining the fusion weights of the multi-scale perceptual contrast Ĉ_j^r of the infrared image and the multi-scale perceptual contrast Ĉ_j^v of the visible light image with a bidirectional saliency aggregation strategy and obtaining the fused image of the j-th layer perceptual contrast of the infrared and visible images comprises the following steps:
(1) aggregating the perceptual contrasts Ĉ_j^r and Ĉ_j^v from small to large scales to obtain the pixel-level saliency D_j^r of the j-th layer infrared perceptual contrast Ĉ_j^r and the pixel-level saliency D_j^v of the j-th layer visible perceptual contrast Ĉ_j^v (the aggregation formulas are given as images in the original);
(2) aggregating the perceptual contrasts Ĉ_j^r and Ĉ_j^v from large to small scales to obtain the structural saliency G_j^r of the j-th layer infrared perceptual contrast Ĉ_j^r and the structural saliency G_j^v of the j-th layer visible perceptual contrast Ĉ_j^v (the aggregation formulas are given as images in the original), where sf is a structural saliency function (its expression is given as an image in the original), γ is a balance parameter, and s_1 and s_2 are obtained from the gradient covariance matrix formed from I_x(X) and I_y(X), the gradients in the x and y directions of the pixels X inside a local window W_i;
(3) computing the overall saliency S_j^r of the j-th layer infrared perceptual contrast Ĉ_j^r and the overall saliency S_j^v of the j-th layer visible perceptual contrast Ĉ_j^v (the formulas are given as images in the original), where β is a balance parameter, M_j^r is the saliency adjustment map of the j-th layer of the infrared image and M_j^v is the saliency adjustment map of the j-th layer of the visible light image, whose values are computed from the source infrared image I_0^r and the average gray value of the infrared image in the neighbourhood Ω (the expressions are given as images in the original), sg denotes a sigmoid function, and u is a control parameter;
(4) from the overall saliencies S_j^v and S_j^r of the perceptual contrast of each layer of the visible and infrared images, computing the fusion weight of each layer of visible perceptual contrast (the weight formula is given as an image in the original); the fused image C_j^F of the j-th layer perceptual contrast of the infrared and visible images is then obtained (the fusion formula is given as an image in the original).
6. The method of claim 5, characterized in that:
in the step 5, the final fused image is obtained through the inverse transformation and reconstruction processes (the reconstruction formula is given as an image in the original), where A_N^F is the bottom-most low-pass fused image and the fused band-pass images B_j^F result from applying the inverse of the transformation in step 2 to the fused perceptual-contrast images C_j^F of each layer.
7. The method of claim 6, wherein the method comprises:
h=0.5。
8. the method of claim 6, wherein the method comprises:
c=21.3。
9. the method of claim 6, wherein the method comprises:
in different scale layers, the values of p are different; as j increases from small to large, and with N = 4, the values of p are respectively 1.40, 1.15, 1.04 and 1.15.
10. The method of claim 6, wherein the method comprises:
l=128,β=5,γ=0.1。
CN202210381391.0A 2022-04-12 2022-04-12 Infrared and visible light image perception fusion method based on multi-scale structural decomposition Pending CN114897751A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210381391.0A CN114897751A (en) 2022-04-12 2022-04-12 Infrared and visible light image perception fusion method based on multi-scale structural decomposition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210381391.0A CN114897751A (en) 2022-04-12 2022-04-12 Infrared and visible light image perception fusion method based on multi-scale structural decomposition

Publications (1)

Publication Number Publication Date
CN114897751A true CN114897751A (en) 2022-08-12

Family

ID=82718480

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210381391.0A Pending CN114897751A (en) 2022-04-12 2022-04-12 Infrared and visible light image perception fusion method based on multi-scale structural decomposition

Country Status (1)

Country Link
CN (1) CN114897751A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115797244A (en) * 2023-02-07 2023-03-14 中国科学院长春光学精密机械与物理研究所 Image fusion method based on multi-scale direction co-occurrence filter and intensity transmission


Similar Documents

Publication Publication Date Title
CN111583123A (en) Wavelet transform-based image enhancement algorithm for fusing high-frequency and low-frequency information
CN101770639B (en) Enhancement method of low-illumination image
CN104835130A (en) Multi-exposure image fusion method
CN107274365A (en) A kind of mine image intensification method based on unsharp masking and NSCT algorithms
Song et al. High dynamic range infrared images detail enhancement based on local edge preserving filter
CN108921809B (en) Multispectral and panchromatic image fusion method based on spatial frequency under integral principle
CN113222877B (en) Infrared and visible light image fusion method and application thereof in airborne photoelectric video
CN104537678B (en) A kind of method that cloud and mist is removed in the remote sensing images from single width
Li et al. Underwater image high definition display using the multilayer perceptron and color feature-based SRCNN
Student Study of image fusion-techniques method and applications
Gao et al. Improving the performance of infrared and visible image fusion based on latent low-rank representation nested with rolling guided image filtering
Jian et al. Infrared and visible image fusion based on deep decomposition network and saliency analysis
CN113298147B (en) Image fusion method and device based on regional energy and intuitionistic fuzzy set
Karalı et al. Adaptive image enhancement based on clustering of wavelet coefficients for infrared sea surveillance systems
CN107451986B (en) Single infrared image enhancement method based on fusion technology
CN114897751A (en) Infrared and visible light image perception fusion method based on multi-scale structural decomposition
CN110148083B (en) Image fusion method based on rapid BEMD and deep learning
CN104240208A (en) Uncooled infrared focal plane detector image detail enhancement method
CN111815550B (en) Infrared and visible light image fusion method based on gray level co-occurrence matrix
CN116309233A (en) Infrared and visible light image fusion method based on night vision enhancement
CN114066786A (en) Infrared and visible light image fusion method based on sparsity and filter
Thai et al. Performance evaluation of high dynamic range image tone mapping operators based on separable non-linear multiresolution families
CN116597146A (en) Semantic segmentation method for laser radar sparse point cloud data
CN110992287A (en) Method for clarifying non-uniform illumination video
CN110689510A (en) Sparse representation-based image fusion method introducing dictionary information

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination