CN116881664A

CN116881664A - Soil texture prediction method based on local-global dependency relationship

Info

Publication number: CN116881664A
Application number: CN202310950055.8A
Authority: CN
Inventors: 李清亮; 张程; 上官微; 李叶光; 肖祺昀
Original assignee: Changchun Normal University
Current assignee: Changchun Normal University
Priority date: 2023-07-31
Filing date: 2023-07-31
Publication date: 2023-10-13
Anticipated expiration: 2043-07-31
Also published as: CN116881664B

Abstract

The application provides a soil texture prediction method based on local-global dependency relationship, which comprises the following steps: fitting the soil texture of the multi-layer soil depth by an equal-area secondary spline method; acquiring a main environment covariate, and screening out a relevant environment covariate with the absolute value of a pearson coefficient of soil texture in the whole soil section of each layer of soil depth being more than 0.02; re-ordering the related environment covariates according to the degree of correlation between the covariates and the soil texture to generate a first model input set; selecting the most relevant environment covariates with highest covariates and soil texture relativity, and sequentially ordering the rest relevant environment covariates according to the sequence with highest adjacent covariates relativity to generate a second model input set; and training a two-layer LSTM model by using the first model input set and the second model input set, establishing a relation model between the soil texture and the environment covariates by applying a full-connection layer FCN to the training result, and predicting the soil texture by using the relation model.

Description

Soil texture prediction method based on local-global dependency relationship

Technical Field

The application relates to the technical field of machine deep learning, in particular to a soil texture prediction method based on local-global dependency relationship.

Background

Accurate and effective soil texture mapping is critical to agricultural development and environmental activities. Current Digital Soil Mapping (DSM) methods lack accurate predictions due to the high degree of spatial heterogeneity of soil texture. Although feature engineering methods have been used to improve drawing accuracy by capturing complex earth-forming relationships, the calculation is time-consuming. Furthermore, the "discrete" features generated by the feature engineering method cannot reflect the interactions (or dependencies) of the environmental covariates. Thus, predictions of soil texture may be affected.

Disclosure of Invention

In order to solve the problems in the prior art, the application provides a new local-global dependency long-short-term memory DL (deep learning) model, firstly, a covariate reconstruction method is designed to generate a plurality of groups of inputs, then a multi-layer long-short-term memory model (LSTM) is adopted to extract the inherent correlation of environment covariates, and finally, the prediction is realized through a full-connection layer, so that the soil texture prediction of different soil depths is improved.

Specifically, the application provides a soil texture prediction method based on local-global dependency relationship, which comprises the following steps:

fitting the soil texture of the multi-layer soil depth by an equal-area secondary spline method;

acquiring a main environment covariate, screening out a related environment covariate with the absolute value of a pearson coefficient of soil texture being more than 0.02 in the whole soil section of each layer of soil depth, and removing redundant environment covariates affecting the prediction performance by using co-linearity analysis;

re-ordering the related environment covariates according to the relevance of the covariates and the soil texture according to the Pearson coefficient to generate a first model input set; selecting the most relevant environment covariates with highest covariates and soil texture relativity, and sequentially ordering the rest relevant environment covariates according to the sequence with highest adjacent covariates relativity to generate a second model input set;

and training two layers of long-short-term memory LSTM models by using the first model input set and the second model input set, establishing a relation model between soil texture and environment covariates by applying a full-connection layer FCN to training results, and predicting the soil texture by using the relation model.

As a further illustration of the present application, the soil texture of the multi-layered soil depth comprises 6 standard soil layer soil textures of 0-5cm, 5-15cm, 15-30cm, 30-60cm, 60-100cm and 100-200 cm; the soil texture is soil composition data with the sum of clay, sand and silt content being 100%.

As a further illustration of the present application, the environmental covariates include five soil texture influencing factors, climate, biology, topography, matrix and soil category.

As a further illustration of the present application, the selecting the most relevant environment covariates with the highest relevance between the covariates and the soil texture, and the remaining relevant environment covariates are sequentially ordered according to the order with the highest relevance between the adjacent covariates to generate a second model input set, including:

selecting the most relevant environment covariates with highest covariates and soil texture relativity as the model input data set of the last dimension;

calculating the covariates with the highest correlation degree with the most relevant environment covariates in other covariates to form secondary relevant environment covariates, and taking the secondary relevant environment covariates as a model input data set of the penultimate dimension;

and sequentially calculating covariates with highest correlation degree with the model input data set of the previous dimension, taking the covariates as the model input data set of the current dimension, and finally generating all second model input sets.

As a further illustration of the present application, the LSTM model is capable of calculating a mapping from environmental covariates to soil texture as follows:

i(n)＝σ(W _ih h(n-1)+W _ix x(n)+W _ix c(n-1)+b _i ) (1)

f(n)＝σ(W _fh h(n-1)+W _fx x(n)+W _fx c(n-1)+b _f ) (2)

o(n)＝σ(W _oh h(n-1)+W _ox x(n)+W _ox c(n)+b _o ) (4)

y＝W _yh h(n)+b _o (6)

where W is the weight matrix of the hidden state for the nth time step, b is the bias, σ (deg.) is the nonlinear activation function, tanh (deg.) is the hyperbolic tangent function,representing the multiplication of two vectors, x (t) and +.>Representing the input and output data, respectively, for the nth time step.

As a further illustration of the present application, the hyper-parameters recommended in the relational model are: epoch is 1000, hidden unit size is 128, sample data is 128, learning rate is 1e-4, time step is 5, LSTM layer number is 2.

Compared with the prior art, the application has the following beneficial technical effects:

the prediction method provided by the application improves the prediction capability of soil texture on the basis of considering inherent environment covariate dependency, so that the improved machine deep learning model provided by the application is helpful for more accurately and economically predicting the soil texture of China, ensures the applicability of the model in monitoring the soil texture change, and provides a new choice in use for a Digital Soil Mapping (DSM) method, in particular to an integrated machine learning model.

Drawings

FIG. 1 is a flow chart of two recombination methods provided by the present application;

fig. 2 is a diagram of an improved predictive model framework provided in accordance with the present application.

Detailed Description

In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of one or more embodiments. It may be evident, however, that such embodiment(s) may be practiced without these specific details.

Specific embodiments of the present application will be described in detail below with reference to the accompanying drawings.

The soil texture prediction method based on the local-global dependency relationship provided by the application comprises the following steps:

s110: the soil texture of the multi-layer soil depth was fitted by an equal area quadratic spline method.

The soil texture data in the present application is from the second national soil census data (official et al, 2013). Soil profiles with depths of about 1.5-2 meters are typically obtained according to standard field soil investigation methods (Zhang and Gong, 2012).

In order to meet the international soil mapping standard, the equal-area secondary spline function is used for fitting 0-5cm, 5-15cm, 15-30cm, 30-60cm, 60-100cm and 100-200cm of 6 standard soil layers. Soil texture is a component data of clay, sand and silt content totaling 100%. Thus, the logarithmic conversion process is used to address the compositional constraints of clay, sand and silt content. In Table 1, the clay, sand and silt contents range from 0 to 76.73%, 0.01% to 98.31% and 0 to 100%, respectively, with average values of 18.89%, 39.93% and 41.51%, respectively, and standard deviations of 10.97%, 20.98% and 15.41%, respectively, at depths of 0 to 5 cm. With increasing depth, the average clay content tends to increase, while the sand and silt content does not change much. The standard deviation in table 1 shows that the variability of sand content is highest. As depth increases, variability in clay, silt and sand content increases.

Table 1 statistical description of soil texture at different depths (weight percent)

S120: and acquiring main environment covariates, screening out related environment covariates with absolute values of pearson coefficients of soil textures greater than 0.02 in the whole soil section of each layer of soil depth, and removing redundant environment covariates affecting the prediction performance by using co-linearity analysis.

The environment covariates comprise five soil texture influencing factors including climate, biology, topography, matrix and soil category. Climate data were collected from Modis, woRldClim and chemical-close datasets. Biological data is represented primarily by data sources such as MODIS, GLOBELLAND, global accessibility map and GlobCoveR datasets. The MODIS data is an effective means of exploring biosphere variation, helping to understand the earth climate system, which is distributed at a spatial resolution of 11 km. The GLOBELLAND30 data is an important result of global and local global land cover remote sensing mapping and key technical research projects in China. It is the full coverage of the global land surface with a spatial resolution of 3030 meters. Global reachability map data describes urban or rural population gradients with spatial resolution of 1 km between 2000. GlobCoveR data is a global land cover map developed by the european space agency. The spatial resolution of the GlobCoveR data is chosen to be 11 km. The matrix data was collected from the american geological exploration institute earth science and environmental change science center (GECSC), based on the global lithology map database version 1.1. The relief data is primarily represented by the MERIT Digital Elevation Model (DEM) dataset, which is distributed at a spatial resolution of 100100. It takes into account various errors as an updated on-board DEM. Soil category data are mainly from coordinated world soil databases (spatial resolution 1 km) and soil grids (spatial resolution 250 meters), which are based on a linking method and a machine learning method, respectively.

S130: re-ordering the related environment covariates according to the relevance of the covariates and the soil texture according to the Pearson coefficient to generate a first model input set; and selecting the most relevant environment covariates with highest correlation degree of the covariates and the soil texture, and sequentially ordering the rest relevant environment covariates according to the sequence with highest correlation degree of the adjacent covariates to generate a second model input set.

The selected covariates and the most relevant environment covariates with highest soil texture relativity are sequentially ordered according to the sequence with highest adjacent covariates relativity to generate a second model input set, and the method comprises the following steps:

s131: and selecting the most relevant environment covariates with highest covariates and soil texture correlations as the model input data set of the last dimension.

S132: and calculating the covariates with the highest correlation degree with the most relevant environment covariates in other covariates to form secondary relevant environment covariates, and taking the secondary relevant environment covariates as the model input data set of the penultimate dimension.

S133: and sequentially calculating covariates with highest correlation degree with the model input data set of the previous dimension, taking the covariates as the model input data set of the current dimension, and finally generating all second model input sets.

As the soil texture is produced by complex activities of man and nature over time. In selecting relevant covariates and building predictive models, current research has focused mainly on the correlation between covariates and soil properties. However, it is not reasonable to predict the change of soil properties by only the relevant covariates. Taking soil and sand as an example; researchers have traditionally thought that high soil sand content is often accompanied by low variation in the amount of evapotranspiration. However, many studies have led to the opposite conclusion due to variations in land coverage and other complications. This also demonstrates that predictions made by the model can only be affected by virtue of the relationship between covariates and soil properties. Therefore, it is necessary to further study the potential relationship between the interaction between covariates and the change in soil properties, based on which the present application devised the two covariate reconstruction methods described above.

As shown in fig. 1, the first method is to rank the different covariates according to absolute values of pearson coefficients of soil texture. The most relevant covariates are placed in the last dimension, the last time step in the LSTM model. The second related covariate is placed in the penultimate dimension, the penultimate time step in the LSTM model, and the other covariates are analogized. It is assumed that if there is no correlation between adjacent covariates, the problem of gradient extinction may be faced during back propagation and further affect the prediction performance. To solve this problem, the present application also proposes a second reorganization method in which the present application also places the most relevant covariates (named a) in the last dimension. But instead of selecting the second related covariate, the covariate most related to a among the other covariates is calculated and placed in the penultimate dimension, and so on. In this way, it is ensured that adjacent covariates are correlated and the problem of gradient extinction is alleviated.

S140: and training two layers of long-short-term memory LSTM models by using the first model input set and the second model input set, establishing a relation model between soil texture and environment covariates by applying a full-connection layer FCN to training results, and predicting the soil texture by using the relation model.

The LSTM model is a recurrent neural network that maps variable-length environmental covariates to fixed-length vectors by recursively computing neighboring covariates. The model based can capture the dependency of covariates well. It calculates a mapping from environmental covariates to soil texture as follows:

i(n)＝σ(W _ih h(n-1)+W _ix x(n)+W _ix c(n-1)+b _i ) (1)

f(n)＝σ(W _fh h(n-1)+W _fx x(n)+W _fx c(n-1)+b _f ) (2)

o(n)＝σ(W _oh h(n-1)+W _ox x(n)+W _ox c(n)+b _o ) (4)

y＝W _yh h(n)+b _o (6)

According to the above equation, the feature vector in the input parameter of the nth covariate is calculated based on the nth covariate, the global (long-term) correlation from the last covariate, and the local (short-term) correlation. As shown in equation 3, the input data is applied to preserve important features from local dependencies and then mapped to local parts. Equation 2 shows that the mathematical procedure of hiding the state of the nth covariate is similar to the input parameters. It is used to remove useless features from the global dependencies and further obtain the global dependency part in equation 3. Meanwhile, the output mode of the nth argument is generated according to the new unit and then used to update the hidden state in equation 5. Finally, the soil geology is calculated from the updated hidden state in equation 6.

Although LSTM may well express the correlation of neighboring covariates. But correlations on different scales should also be considered so that the prediction quality can be improved. To this end, the present application also introduces relationships between secondary related environment covariates. Thus, a deep learning model with two LSTM layers and one FCN layer was designed. As shown in FIG. 2, the first layer is identical to the conventional LSTM model, showing local correlation between adjacent covariates. To further provide more potential information, the second LSTM layer representation shows global dependencies between covariates. The input data in the second LSTM layer is generated by the second reassembly method described above.

The prediction accuracy of the prediction method provided by the application is analyzed as follows:

the prediction accuracy is evaluated based on three common metrics as follows; interpretation of the variants (R) ² ) For verifying the percentage of variation interpreted by the predictive model. Root Mean Square Error (RMSE) is used to observe the ability to predict fluctuations. The gram-copta efficiency (KGE) represents the goodness of fit based on correlation, conditional and systematic bias. These indices are expressed as follows:

wherein y is _i And x _i The measured and predicted soil characteristics of the I-th section are respectively, and N is the total number of soil sections. In the KGE metric, R is the pearson coefficient between observed and predicted soil geology values. Alpha = sigma _x /σ _y Wherein σ is _x Sum sigma _y Standard deviations of predicted and observed soil geological values are represented, respectively. Beta = mu _x /μ _y Wherein μ is _x Sum mu _y Representing the average of the predicted and observed soil geology values.

To test the uncertainty of different predictive models, first, in the training process, a 10-fold cross validation method is applied to build a predictive model with optimal super parameters. In the process, an optimized model can be obtained and a deviation between predicted and observed soil characteristics can be generated from each verification intersection. All the deviations were used to fit quantile regression, forest models. Furthermore, a fit quantile regression forest model was used to calculate the percentile for each soil profile. Finally, the uncertainty is evaluated using a combination of Prediction Interval Coverage Probability (PICP) and average prediction interval (MPI). In the uncertainty index, we expect that 90% of the observed soil characteristics can fall within the 90% prediction interval. The index is expressed as follows:

PICP and MPI are respectively taken as confidence level and average width of the prediction interval. When the model reaches a similar PICP, the model with lower MPI has better predictive performance. As described in equation 10, y _i Is the soil characteristic observed in the ith section, and N is the total number of soil sections.And->Representing the lower and upper values of the prediction.

The entire soil profile for each layer depth of the present application was randomly divided into training (80%), validation (10%) and test (10%) groups when evaluated by the following test.

Super parameter evaluation:

many super-parameters in the deep learning model, such as the number of samples, epoch, learning rate, size of hidden units, and LSTM layer number, may affect prediction performance. Therefore, the application evaluates the influence of all super parameters, thereby obtaining the optimal prediction model. Soil clay with a soil depth of 0-5cm is exemplified.

The five indexes are used for a test meter2, performance of different superparameters. The larger period (2000) and smaller period (500) result in the model tending to slightly over fit (R ² 0.603) and under-fitting (R) ² 0.603), the parameters cannot converge to an optimal value and further affect the prediction performance. Furthermore, the different size hidden units represent different information content after being encoded in the model. The small information (hidden layer size 64) cannot fully reflect the complex relationship between covariates and soil clay (R ² 0.593). Although an excessive amount of information (hidden unit size of 192) can achieve better fit results in the training set, generalization ability in the invisible test set is poor (R ² 0.603).

The number of samples represents the number of soil sections selected during training in one epoch; setting the sample number (64) too small is that the model hardly converges to the global optimum (R ² 0.610), while too large a sample number (192) results in poor generalization ability (R ² Is 0.593). Furthermore, the learning rate is an important super-parameter that determines how the model weights are updated based on each prediction error. A lower learning rate (5 e-5) results in model retention (R ² 0.610) or requires a longer time cost to converge, while a higher learning rate (1 e-3) may miss the sweet spot (R ² 0.597) and further results in an unstable training process. Thus, the accuracy (R ² 0.614) is superior to the model with 4 time steps (R ² 0.608) and a model (R) with 6 time steps ² 0.608) the LSTM layer is the core of the model, which describes the local and global dependencies between covariates. The model of only one LSTM layer is the conventional LSTM model. Covariates in the LGD-LSTM model are selected based on correlations between covariates. However, in the conventional LSTM model, covariates are selected according to the correlation between covariates and soil properties. From Table 2 we can see that the number of LSTM layers is 2, which has the best performance (R ² Is 0.605 corresponds to 3 layers, R ² 0.610 for layer 1). This also demonstrates that capturing global-local correlations between covariates can provide potential information for improved prediction accuracy. Final resultThe super parameters recommended in the relationship model finally designed by the application are as follows: epoch is 1000, hidden unit size is 128, sample data is 128, learning rate is 1e-4, time step is 5, LSTM layer number is 2.

Table 2 0-results for soil clay at soil depths of 5cm with different super parameter sets for LS-LSTM. The best settings in each group are shown in bold.

Prediction accuracy analysis:

as shown in tables 3 to 5, the predicted performance of the soil texture varies from depth to depth. According to all precision indexes (R ² RMSE, KGE), the proposed deep learning model generally performs better than other predictive models. The improved LGD-LSTM model of the present application provides more accurate predictions than other LGD-LSTM models. On average, R ² About 5.66% increase, about 4.88% decrease in RMSE, and about 2.06% increase in KGE. For soil silt prediction, R ² About 2.64% increase, about 2.41% decrease in RMSE, and about 0.59% increase in KGE. This suggests that conventional DSM methods typically only consider the relationship between covariates and soil properties, whereas the present application introduces interactions between covariates that can provide more potential information that can affect soil property changes and further enhance predictive power. For soil clay prediction, the improved LGD-LSTM model provided by the application is generally R ² RMSE and KGE were 0.672 (4.02% increase over LSTM), 6.517% (3.55% decrease over LSTM) and 0.742 (1.48% increase over LSTM), respectively. For soil sand prediction, the average R of the improved LGD-LSTM model provided by the application ² RMSE and KGE were 0.725 (0.97% increase over LSTM), 10.882% (1.36% decrease over LSTM) and 0.767 (equal to LSTM), respectively. For soil silt prediction, the application provides a general R of an improved LGD-LSTM model ² RMSE and KGE were 0.661 (1.85% increase over LSTM), 8.956% (1.69% decrease over LSTM) and 0.681 (1.49% increase over LSTM), respectively. This shows that the improved LGD-LSTM model provided by the application has greater prediction advantages than other prediction models after considering the local and overall correlation between covariates and soil properties. It can thus be concluded that interactions between covariates play a greater role in the texture mapping of soil than local and global dependencies between covariates and soil properties.

It is well known that RF models are frequently applied and accepted in DSM, CNN models also exhibit reasonable performance in DSM. Furthermore, from tables 3 to 5, the improved LGD-LSTM model proposed by the present application has a significant improvement compared to the RF and CNN models. Soil clay prediction R compared to LSTM model ² The average improvement is 17.69%, the average reduction of RMSE is 12.39%, and the average improvement of KGE is 6.46%; soil sand prediction R ² On average 13.10% increase, 12.50% decrease in RMSE on average, 2.82% increase in KGE on average, soil silt predictive R ² Average increase 16.58%, average decrease 11.52% RMSE, average increase 4.77% KGE; furthermore, the prediction ability of the CNN model is worse than the RF model, because of inter-channel redundancy, CNN-based models have limited flexibility in DSM. The improved LGD-LSTM model proposed by the present application provides more options for the application of DSM, in particular integrated machine learning models.

Uncertainty analysis:

first, all models were tested for uncertainty using PICP and MPI with a confidence of 90%. Table 2 shows that the PICP of all models was close to the expected 90% confidence level. For soil clay prediction, the PICP of the improved LGD-LSTM model provided by the application is respectively 0.894 to 0.899 at all depths, the LGD-LSTM model is also 0.894 to 0.899, the LSTM model is 0.8999 to 0.913, the CNN model is 0.852 to 0.901, and the RF model is 0.876 to 0.901. For all depths, the highest PICP is implemented by the LSTM model. The LSTM model has relatively highest confidence compared to other predictive models, and consistent and stable soil clay can be predicted. According to MPI, for soil clay prediction, the improved LGD-LSTM model proposed by the present application predicts MPI ranging from 26.519% to 35.076%, LGD-LSTM model ranging from 27.826% to 34.151%, LSTM model ranging from 27.875% to 35.771%, CNN model ranging from 28.681% to 36.667%, and RF model ranging from 27.727% to 33.572, respectively. In general, the RF model produced an average value of MPI that was lower than other soil clay predictive models. LGD-LSTM has the narrowest MPI in all models for soil sand and silt predictions. Furthermore, in addition to soil clay and sand predictions at soil depths of 100-200cm, increases in MPI are typically accompanied by increases in soil depth. Overall, considering the interactions between covariates in the texture mapping of soil according to PICP and MPI can improve predictive ability and create relatively low uncertainty in predictions.

Table 3 accuracy of soil clay predictive model at all soil depths. The best index in the model is shown in bold.

/>

Table 4 is the same as table 3 but for soil sand prediction.

/>

Table 5 is the same as Table 3, but for soil silt prediction.

/>

It will be evident to those skilled in the art that the application is not limited to the details of the foregoing illustrative embodiments, and that the present application may be embodied in other specific forms without departing from the spirit or essential characteristics thereof.

The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive, the scope of the application being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference signs in the claims shall not be construed as limiting the claim concerned.

Furthermore, it is evident that the word "comprising" does not exclude other elements or steps, and that the singular does not exclude a plurality. A plurality of units or means recited in the system claims can also be implemented by means of software or hardware by means of one unit or means. The terms second, etc. are used to denote a name, but not any particular order.

Finally, it should be noted that the above-mentioned embodiments are merely for illustrating the technical solution of the present application and not for limiting the same, and although the present application has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications and equivalents may be made to the technical solution of the present application without departing from the spirit and scope of the technical solution of the present application.

Claims

1. A soil texture prediction method based on local-global dependency relationship, the method comprising:

2. The local-global dependency-based soil texture prediction method according to claim 1, wherein the soil texture of the multi-layered soil depth comprises 6 standard soil layer soil textures of 0-5cm, 5-15cm, 15-30cm, 30-60cm, 60-100cm and 100-200 cm; the soil texture is soil composition data with the sum of clay, sand and silt content being 100%.

3. The local-global dependency-based soil texture prediction method according to claim 1, wherein the environmental covariates comprise five soil texture influencing factors of climate, biology, topography, matrix and soil category.

4. The method for predicting soil texture based on local-global dependency relationship according to claim 1, wherein the selecting the most relevant environment covariates with highest correlation degree of the soil texture, and the remaining relevant environment covariates are sequentially ordered in the order of highest correlation degree of adjacent covariates to generate a second model input set, comprising:

5. The local-global dependency-based soil texture prediction method according to claim 1, wherein the LSTM model is capable of calculating a mapping from environmental covariates to soil texture as follows:

i(n)＝σ(W _ih h(n-1)+W _iχ χ(n)+W _iχ c(n-1)+b _i ) (1)

f(n)＝σ(W _fh h(n-1)+W _fx x(n)+W _fχ c(n-1)+b _f ) (2)

o(n)＝σ(W _oh h(n-1)+W _ox x(n)+W _ox c(n)+b _o ) (4)

y＝W _yh h(n)+b _o (6)

where W is the weight matrix of the hidden state for the nth time step, b is the bias, σ (deg.) is the nonlinear activation function, tanh (deg.) is the hyperbolic tangent function,representing the multiplication of two vectors, χ (t) and +.>Representing the input and output data, respectively, for the nth time step.

6. The local-global dependency relationship-based soil texture prediction method according to claim 1, wherein the super parameters recommended in the relationship model are: epoch is 1000, hidden unit size is 128, sample data is 128, learning rate is 1e-4, time step is 5, LSTM layer number is 2.