Disclosure of Invention
The invention aims to overcome the defects of the prior art, provides a method for generating a high space-time resolution leaf area index product in a cloudy and foggy region, and effectively solves the problem of data missing in the cloudy and foggy region by integrating space-time fusion, data assimilation and deep learning technologies, and fills the blank of a high space-time resolution LAI product in the cloudy and foggy region.
In order to achieve the above purpose, the invention provides a method for generating a high space-time resolution leaf area index product in a cloudy and foggy region, which comprises the following steps:
S1:
S1-1, calculating a structural similarity index according to GLASS leaf area indexes aiming at images of three years before and after a Landsat target image polluted by cloud, searching a reference Landsat image which is most similar to a spatial structure of the target image, specifically, a Landsat image with the structural similarity index higher than 0.85 is considered to be similar to the target image structure, and a Landsat image which is completely clear and has the highest structural similarity index is selected as an optimal reference image;
S1-2, when the target influence is polluted by the cloud, if the optimal reference image of the S1-1 is provided, using FSDAF space-time technology to fuse Landsat with MODIS surface reflectivity to predict Landsat surface reflectivity density of the target date; if the optimal reference image conforming to the S1-1 is not provided, the original Landsat surface reflectivity of the target date is reserved;
S2, selecting pixels with the land surface reflectivity density higher than 60% as sampling points, uniformly selecting the sampling points for each surface coverage type by means of Global land 30 surface classification data, constructing a dynamic model from GLASS leaf area indexes and assimilating land surface area index reference sequences with 8-day time resolution for each sampling point by using a set Kalman filtering technology by taking the land surface reflectivity as observation;
S3, extracting a 2-year Landsat surface reflectivity time sequence matrix in a 1.5km multiplied by 1.5km area taking the sample point as a center from all the sample points as input, wherein the dimension of the matrix is 46 multiplied by 50, and the missing value is assigned to 0; according to the real missing condition of Landsat data after space-time fusion, the proportion of effective values in a Landsat earth surface reflectivity time sequence matrix is redistributed, and a sample point Landsat leaf area index reference sequence is taken as output to construct a CNN-BiGRU model;
S4, in the cloudy and foggy region, estimating pixels in all the S2, and sequentially applying FSDAF technology and a CNN-BiGRU model to estimate the high space-time resolution leaf area index of the cloudy and foggy region.
And S3, when the Landsat data of the cloudy and foggy region is obtained according to the step S1, the proportion of the effective reflectivity value in the Landsat earth surface reflectivity matrix is redistributed according to the actual missing condition, the reflectivity matrix is used as input, the Landsat leaf area index reference sequence corresponding to the sample point is used as output, and a CNN-BiGRU time sequence regression model comprehensively utilizing the space-time information is trained.
The beneficial technical effects of the invention are as follows:
The invention integrates space-time fusion, data assimilation and deep learning technologies, wherein the space-time fusion technology provides more learning information for the deep learning technology, the data assimilation technology provides a high-precision learning target for the deep learning technology, the deep learning technology effectively solves the problem of data missing in a cloudy and foggy area, and the three technologies complement each other to fill the blank of a high space-time resolution LAI product in the cloudy and foggy area.
According to the invention, space-time information constraint is introduced, so that the time change rule and the space dependence of the LAI are better modeled, the estimation precision of the LAI with high space-time resolution in the cloud and fog region is improved, and the estimation result has higher consistency with the ground reference LAI. Meanwhile, the LAI product with a long time sequence of 30m and 8 days can be accurately produced under all conditions in the cloudy and foggy region with high efficiency, the advantages of space-time fusion, data assimilation and deep learning technology can be integrated, the problem of Landsat data long time sequence deletion caused by the cloudy and foggy region is solved, the leaf area index product with long time sequence, high precision and high space-time resolution in the cloudy and foggy region is generated, and data support is provided for ecological environment monitoring and ecological process simulation in the cloudy and foggy region.
Detailed Description
Referring to fig. 1, the invention provides a method for generating a high space-time resolution leaf area index product in a cloudy and foggy region, which comprises the following steps:
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The embodiment of the invention provides a method for generating a high space-time resolution leaf area index product in a cloudy and foggy region, which comprises the following steps:
S1: aiming at a cloud polluted Landsat target image, calculating a structural similarity index by using a GLASS leaf area index, searching a reference Landsat image which is most similar to the spatial structure of the GLASS leaf area index, and fusing Landsat with MODIS ground surface reflectivity by using FSDAF space-time technology to improve the available Landsat ground surface reflectivity density;
S2: selecting available sample points with higher land surface reflectivity density, and generating a land surface index reference sequence from the GLASS leaf area index and the land surface reflectivity by using a set Kalman filtering technology;
S3: extracting a Landsat earth surface reflectivity time sequence matrix of 2 years in a region with 1.5km multiplied by 1.5km taking the sample point as the center as input, redistributing the proportion of effective values in the Landsat earth surface reflectivity time sequence matrix according to the real missing condition of Landsat data after space-time fusion, and constructing a CNN-BiGRU model by taking a sample point Landsat leaf area index reference sequence as output;
S4: and sequentially applying FSDAF technology and a CNN-BiGRU model in the cloudy and foggy region to estimate the high space-time resolution leaf area index of the cloudy and foggy region.
In one embodiment, step S1 further comprises:
When the GLASS leaf area index is used for calculating the structural similarity index, the time window is set to be three years before and after the target image, the Landsat image with the structural similarity index higher than 0.85 is considered to be similar to the target image structure, and the Landsat image with the complete clear sky and the highest structural similarity index is selected as the optimal reference image.
In the implementation process, a certain cloudy and foggy area is divided into a plurality of sinusoidal projection subareas, the Lansat earth surface reflectivity of a subarea, which is polluted by clouds, is called target Lansat earth surface reflectivity, the date is called target date, and the GLASS V6250 meter LAI matched with the target date and time and the target Lansat earth surface reflectivity space is called target GLASS LAI. And (3) taking the target date as the center, defining a time window of three years (seven years in total), acquiring GLASS LAI of other dates and calculating a structural similarity index with the target GLASS LAI. And (3) sequentially screening Landsat earth surface reflectivities of other dates according to the structural similarity structural indexes from high to low until the Landsat earth surface reflectivities which are completely clear sky and have the structural similarity indexes not smaller than 0.85 are found, and taking the Landsat earth surface reflectivities as reference Landsat earth surface reflectivities, wherein the corresponding dates are called reference dates.
And obtaining MODIS 500 m earth surface reflectivities of the reference date and the target date, which are spatially matched with the reference Landsat earth surface reflectivities and correspond to the wave bands, respectively serving as the reference and target MODIS earth surface reflectivities, applying FSDAF space-time fusion algorithm to input the reference Landsat earth surface reflectivities, the reference MODIS earth surface reflectivities and the target MODIS earth surface reflectivities, and predicting the target Landsat earth surface reflectivities.
If a sub-area polluted by the cloud on a specific date has a reference image which is completely clear and empty and has a structural similarity index not smaller than 0.85, a FSDAF space-time fusion algorithm is applied to predict the target Landsat ground surface reflectivity of the sub-area.
In one embodiment, step S2 includes:
and counting the density of available Landsat ground surface reflectivity in the time sequence of each year of each pixel in the whole cloudy and foggy area 2000-2022, screening pixels with the available Landsat ground surface reflectivity density higher than 60%, and uniformly selecting the same number of pixels with the available Landsat ground surface reflectivity density higher than 60% for each ground surface type as sampling points according to the classified products of 30 meters of the Global land 30.
And extracting GLASS V6 LAI time sequences of corresponding years of the sampling points for constructing a LAI dynamic model, simulating Landsat ground surface reflectivity by adopting an A Two-Layer Canopy Reflectance Model (ACRM) canopy reflectivity model, assimilating the GLASS V6 LAI and Landsat ground surface reflectivity by using a set Kalman filtering technology, and generating a 30 m 8 day LANDSAT LAI reference time sequence.
In one embodiment, step S3 includes:
and (3) extracting a corresponding Landsat surface reflectivity time series matrix for each sample point in the step S2. The time dimension is the current year and the previous year for two years, the space dimension is 1.5km×1.5km, the spectrum dimension is red, green, blue, near infrared and 2 short wave infrared for 6 wave bands, and the final reflectivity matrix is 6×46×50×50.
And (3) counting 20% -90% of the land surface reflectivity missing conditions of the cloudy and foggy area obtained in the step (S1), adjusting the use proportion of the effective reflectivity value in the land surface reflectivity matrix according to the real missing conditions, so as to simulate various missing conditions, uniformly distributing the adjusted available land surface reflectivity density at 10% -80%, and setting the invalid value in the reflectivity matrix to 0.
The reflectance matrix is used as input, LANDSAT LAI reference time sequences are used as output to form training samples, 85% of samples are randomly selected for model training, and the remaining 15% are used for model verification. Training LANDSAT LAI time series regression models by adopting a CNN-BiGRU algorithm which simultaneously considers space and time information, respectively training a model for the three sensors of TM, ETM and OLI, wherein the three models use the same parameter setting: the Adam gradient descent algorithm has a batch size of 100, a maximum number of iterations of 1000, an initial learning rate of 0.01, and a learning rate of 0.001 after 500 training, each training upsets the data set.
In one embodiment, step S4 includes:
When LANDSAT LAI time series of a certain pixel in a certain year are estimated, a corresponding Landsat surface reflectivity time series matrix is extracted. The time dimension is the current year and the previous year for two years, the space dimension is 1.5km×1.5km, the spectrum dimension is red, green, blue, near infrared and 2 short wave infrared for 6 wave bands, the final reflectivity matrix is 6×46×50×50, and the invalid value in the reflectivity matrix is set to 0. And (3) inputting a reflectivity matrix by using the CNN-BiGRU model obtained by training in the step (3) to obtain a LANDSAT LAI time series estimated value of the pixel for one year. The steps are repeatedly executed for each year of each pixel, so that the batch time sequence leaf area index batch production of each pixel is realized.
Finally, the Landsat leaf area index products of 2000-2022 years, 30 meters and 8 days in the cloudy and foggy region are obtained by using the steps S1 to S4.
The invention integrates the space-time fusion, data assimilation and deep learning technologies, fully utilizes multi-source remote sensing data, considers the space-time information constraint of vegetation LAI, and generates a long-time sequence high-precision space-time continuous 30-meter leaf area index product in a cloudy and foggy region.
The implementation flow of the method provided by the invention is shown in fig. 1, and a specific implementation mode is further described with reference to an embodiment. A method for generating a leaf area index product with high space-time resolution in a cloudy and foggy region specifically comprises the following steps:
Step 1, aiming at a Landsat target image polluted by cloud, calculating a structural similarity index by using a GLASS leaf area index, searching a reference Landsat image most similar to the spatial structure of the GLASS leaf area index, and fusing Landsat with an MODIS ground surface reflectivity by using FSDAF space-time technology to improve the available Landsat ground surface reflectivity density;
Taking an Chuan Dian region as an example, dividing the whole Chuan Dian region into 28.8km multiplied by 28.8km sinusoidal projection subregions, regarding a subregion with cloud pollution on a certain date as a target date, taking Landsat ground surface reflectivity images (6 wave bands of red, green, blue, near red, short wave infrared 1 and short wave infrared 2) with cloud pollution on the target date as target Landsat ground surface reflectivity, and acquiring GLASS V6250 m LAI images with the same size as the target Landsat ground surface reflectivity on the target date as target GLASS LAI. And (3) taking the target date as the center, defining a time window of three years (seven years in total), acquiring GLASS V6250 m LAI images with the same size as the target Landsat earth surface reflectivity on other dates (other dates except the target date in the time window), calculating a structural similarity index with the target GLASS LAI, and sequencing from large to small according to the structural similarity, wherein the structural similarity index is larger, so that the earth surface structural characteristics of the other dates are similar to the earth surface structural characteristics of the target date. And obtaining Landsat earth surface reflectivities with the same size as the target Landsat earth surface reflectivities of other dates, identifying cloud and clear sky pixels by using a Fmak 4.6 algorithm, sequentially screening Landsat earth surface reflectivities of other dates according to structural similarity structural indexes from high to low until the Landsat earth surface reflectivities of the complete clear sky with the structural similarity indexes not less than 0.85 are found, and taking the Landsat earth surface reflectivities as reference Landsat earth surface reflectivities, wherein the corresponding dates are called reference dates.
Obtaining an MODIS 500-meter earth surface reflectivity image with the same size as the reference Landsat earth surface reflectivity and corresponding to a wave band, taking the image as the reference MODIS earth surface reflectivity, obtaining an MODIS 500-meter earth surface reflectivity image with the same size as the target Landsat earth surface reflectivity and corresponding to the wave band, taking the image as the target MODIS earth surface reflectivity, applying FSDAF space-time fusion algorithm to input the reference Landsat earth surface reflectivity, the reference MODIS earth surface reflectivity and the target MODIS earth surface reflectivity, and predicting the target Landsat earth surface reflectivity.
In practice, for a target subarea polluted by cloud, if a completely clear sky is found and the Landsat surface reflectivity with the structural similarity index not smaller than 0.85 is used as a reference image, executing FSDAF space-time fusion algorithm to predict the Landsat surface reflectivity of the target date; if the land surface reflectivity meeting the condition is not used as the reference image, the original land surface reflectivity of the target date is reserved.
After the step is finished, the density of the available Landsat surface reflectivity in the Chuan Yunnan area is improved.
Step 2, selecting a sample point with higher available Landsat surface reflectivity density, and generating a Landsat leaf area index reference sequence from the GLASS leaf area index and the Landsat surface reflectivity by using a set Kalman filtering technology;
Counting the density of available Landsat ground surface reflectivity in a time sequence of each year of each pixel in 2000-2022 years of the whole Chuan Yunnan region, screening pixels with available Landsat ground surface reflectivity density higher than 60%, classifying products according to Global land30 ground surface 30 m, uniformly selecting equal quantity of pixels with available Landsat ground surface reflectivity density higher than 60% for each ground surface type as sampling points, and finally collecting 1257795 sampling points.
And extracting GLASS V6 LAI time sequences of corresponding years of the sampling points for constructing a LAI dynamic model, simulating Landsat ground surface reflectivity by adopting an A Two-Layer Canopy Reflectance Model (ACRM) canopy reflectivity model, assimilating the GLASS V6 LAI and Landsat ground surface reflectivity by using a set Kalman filtering technology, and generating a 30 m 8 day LANDSAT LAI reference time sequence.
Step 3, extracting a Landsat earth surface reflectivity time sequence matrix of 2 years in a region with 1.5km multiplied by 1.5km by taking the sample point as a center as input, redistributing the proportion of effective values in the Landsat earth surface reflectivity time sequence matrix according to the real missing condition of Landsat data of the whole Chuan Yunnan region after space-time fusion, and constructing a CNN-BiGRU model by taking a Landsat leaf area index reference sequence of the sample point as output;
extracting a corresponding Landsat earth surface reflectivity time sequence matrix of each sample point in the step 2, and particularly extracting Landsat earth surface reflectivity data in a region with 1.5km multiplied by 1.5km taking the sample point as a center in space, wherein the reflectivity matrix is 50 multiplied by 50; selecting a time sequence of two years of the year of the sample point and the previous year, wherein the number of Landsat ground surface reflectivities in each year is 23, and the reflectivity matrix is 46 multiplied by 50; spectrally, red, green, blue, near infrared and 2 short wave infrared are selected for a total of 6 band surface reflectivities, where the reflectivity matrix becomes 6 x 46 x 50.
The sampling points in the step 2 are obtained by screening pixels with the available Landsat surface reflectivity density higher than 60%, the available Landsat surface reflectivity density in the corresponding reflectivity matrix is also higher than 60%, and LANDSAT LAI reference time sequences generated on the sampling points in the step 2 have higher precision. However, the use proportion of the effective reflectivity values in the reflectivity matrix of the sample points needs to be adjusted according to the actual lack of the land surface reflectivity of Landsat in the Chuan-Dian region. Specifically, the defect condition of Landsat surface reflectivity of the Chuan Yunnan area obtained in the step 1 is counted to be 20% -90%, the use proportion of the effective reflectivity value in the Landsat surface reflectivity matrix is adjusted according to the actual defect condition, so that various defect conditions are simulated, and the adjusted available Landsat surface reflectivity density is uniformly distributed at 10% -80%.
And assigning 0 to the invalid earth surface reflectivity in the reflectivity matrix, and reserving the valid earth surface reflectivity value. And taking the earth surface reflectivity matrix as a model input, taking a corresponding LANDSAT LAI reference time sequence as a model output, and combining the model training samples. The CNN-BiGRU algorithm, which can consider both spatial and temporal information, is chosen to train a regression model that estimates LANDSAT LAI the time series from the surface reflectivity matrix, with model parameters set to: the Adam gradient descent algorithm has a batch size of 100, a maximum number of iterations of 1000, an initial learning rate of 0.01, and a learning rate of 0.001 after 500 training, each training upsets the data set. Training samples were randomly divided into two groups, of which 85% were used for model training and the remaining 15% were used for model independent verification.
For different Landsat sensors, including TM, ETM+, OLI, we trained three CNN-BiGRU models using the same procedure and parameter settings.
And 4, sequentially applying FSDAF technology and a CNN-BiGRU model in the Chuan-Dian region, and estimating the leaf area index with high space-time resolution in the Chun-Dian region.
And (3) executing the step 1 in the Chuan Yunnan region to improve the available Landsat ground surface reflectivity density.
When LANDSAT LAI time series of a certain pixel in a certain year are estimated, extracting a corresponding Landsat earth surface reflectivity time series matrix, specifically, extracting Landsat earth surface reflectivity data in a region with the pixel as a center of 1.5km multiplied by 1.5km in space, wherein the reflectivity matrix is 50 multiplied by 50; selecting a time sequence of two years from the year and the previous year, wherein the number of Landsat surface reflectivities in each year is 23, and the reflectivity matrix is 46×50×50; spectrally, red, green, blue, near infrared and 2 short wave infrared are selected for a total of 6 band surface reflectivities, where the reflectivity matrix becomes 6 x 46 x 50. And assigning an invalid earth surface reflectivity value in the earth surface reflectivity matrix to be 0, and reserving an effective value. And (3) inputting the final earth surface reflectivity matrix into the CNN-BiGRU model trained in the step (3) to obtain a LANDSAT LAI time series estimated value of the pixel in one year. The steps are repeatedly executed for each year of each pixel, so that the batch time sequence leaf area index batch production of each pixel is realized.
Based on the steps, finally, landsat leaf area index products of 2000-2022 years, 30m and 8 days in Chuan Yunnan region are generated.
The above is only a preferred embodiment of the present application, only for helping to understand the method and the core idea of the present application, and the scope of the present application is not limited to the above examples, and all technical solutions belonging to the concept of the present application belong to the scope of the present application. It should be noted that modifications and adaptations to the present application may occur to one skilled in the art without departing from the principles of the present application and are intended to be within the scope of the present application.
The invention fundamentally solves the problems that Landsat observation data available in a cloudy and foggy area in the prior art are very limited, the existing data assimilation algorithm needs high-quality and high-density Landsat observation to adjust the trend of the LAI dynamic model, otherwise, the estimation result has larger uncertainty to influence the estimation result, the time change rule and the space dependence of the LAI are better modeled by introducing space-time information constraint, the estimation precision of the LAI with high space-time resolution in the cloudy and foggy area is improved, and the estimation result has higher consistency with the ground reference LAI.