CN115266612A - A method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables - Google Patents
A method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables Download PDFInfo
- Publication number
- CN115266612A CN115266612A CN202210889387.5A CN202210889387A CN115266612A CN 115266612 A CN115266612 A CN 115266612A CN 202210889387 A CN202210889387 A CN 202210889387A CN 115266612 A CN115266612 A CN 115266612A
- Authority
- CN
- China
- Prior art keywords
- variables
- soil
- data
- resolution
- available phosphorus
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000002689 soil Substances 0.000 title claims abstract description 197
- 229910052698 phosphorus Inorganic materials 0.000 title claims abstract description 86
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 title claims abstract description 83
- 239000011574 phosphorus Substances 0.000 title claims abstract description 83
- 230000007613 environmental effect Effects 0.000 title claims abstract description 66
- 238000013507 mapping Methods 0.000 title claims abstract description 45
- 238000000034 method Methods 0.000 title claims abstract description 44
- 238000005070 sampling Methods 0.000 claims abstract description 17
- 238000011156 evaluation Methods 0.000 claims abstract description 16
- 238000007781 pre-processing Methods 0.000 claims abstract description 15
- 238000000605 extraction Methods 0.000 claims abstract description 13
- 238000012216 screening Methods 0.000 claims abstract description 9
- 238000010276 construction Methods 0.000 claims abstract description 4
- 238000012360 testing method Methods 0.000 claims abstract description 4
- 238000012937 correction Methods 0.000 claims description 16
- 238000001556 precipitation Methods 0.000 claims description 15
- 238000007637 random forest analysis Methods 0.000 claims description 15
- 230000000717 retained effect Effects 0.000 claims description 15
- 230000001419 dependent effect Effects 0.000 claims description 12
- 239000000126 substance Substances 0.000 claims description 9
- 239000002344 surface layer Substances 0.000 claims description 9
- 238000012549 training Methods 0.000 claims description 9
- 238000012795 verification Methods 0.000 claims description 9
- 230000008030 elimination Effects 0.000 claims description 7
- 238000003379 elimination reaction Methods 0.000 claims description 7
- 238000010220 Pearson correlation analysis Methods 0.000 claims description 6
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 claims description 6
- 238000004422 calculation algorithm Methods 0.000 claims description 6
- 238000004458 analytical method Methods 0.000 claims description 5
- 238000007689 inspection Methods 0.000 claims description 5
- 230000000875 corresponding effect Effects 0.000 claims description 4
- 238000005527 soil sampling Methods 0.000 claims description 4
- 230000003595 spectral effect Effects 0.000 claims description 4
- 229910052787 antimony Inorganic materials 0.000 claims description 3
- WATWJIUSRGPENY-UHFFFAOYSA-N antimony atom Chemical compound [Sb] WATWJIUSRGPENY-UHFFFAOYSA-N 0.000 claims description 3
- 238000004737 colorimetric analysis Methods 0.000 claims description 3
- 238000012821 model calculation Methods 0.000 claims description 3
- 229910052750 molybdenum Inorganic materials 0.000 claims description 3
- 239000011733 molybdenum Substances 0.000 claims description 3
- 230000008569 process Effects 0.000 claims description 3
- 229910000030 sodium bicarbonate Inorganic materials 0.000 claims description 3
- 235000017557 sodium bicarbonate Nutrition 0.000 claims description 3
- 230000009466 transformation Effects 0.000 claims description 3
- 230000002596 correlated effect Effects 0.000 claims description 2
- 238000011439 discrete element method Methods 0.000 claims 9
- 230000006872 improvement Effects 0.000 abstract description 6
- 238000011160 research Methods 0.000 abstract description 3
- 230000000694 effects Effects 0.000 description 5
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 230000003247 decreasing effect Effects 0.000 description 4
- 230000009467 reduction Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 239000005416 organic matter Substances 0.000 description 1
- 238000012876 topography Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/17—Systems in which incident light is modified in accordance with the properties of the material investigated
- G01N21/25—Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands
- G01N21/27—Colour; Spectral properties, i.e. comparison of effect of material on the light at two or more different wavelengths or wavelength bands using photo-electric detection ; circuits for computing concentration
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/75—Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated
- G01N21/77—Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated by observing the effect on a chemical indicator
- G01N21/78—Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated by observing the effect on a chemical indicator producing a change of colour
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/24—Earth materials
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01W—METEOROLOGY
- G01W1/00—Meteorology
- G01W1/02—Instruments for indicating weather conditions by measuring two or more variables, e.g. humidity, pressure, temperature, cloud cover or wind speed
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Analytical Chemistry (AREA)
- Environmental & Geological Engineering (AREA)
- Pathology (AREA)
- Immunology (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Environmental Sciences (AREA)
- Geology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Ecology (AREA)
- Biodiversity & Conservation Biology (AREA)
- Atmospheric Sciences (AREA)
- General Life Sciences & Earth Sciences (AREA)
- Plasma & Fusion (AREA)
- Remote Sensing (AREA)
- Food Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明涉及土壤有效磷研究技术领域,具体涉及一种基于高分辨率环境变量的南方丘陵地区耕地土壤有效磷制图方法,主要包括S1:土壤样点数据获取及建库、S2:环境数据的获取、S3:数据预处理、S4:环境变量的提取及筛选、S5:模型构建及精度评价与S6:预测与土壤样点采样时间同期的待测地区所有耕地的表层土壤(0‑15cm)有效磷含量:本发明提供的基于高分辨率环境变量的南方丘陵地区耕地土壤有效磷制图方法可以显著提升南方丘陵地区耕地土壤有效磷制图精度,以及快速实现南方丘陵地区大面积的耕地土壤有效磷含量的预测及其空间分布,适宜进一步推广应用。
The invention relates to the technical field of soil available phosphorus research, in particular to a method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables. , S3: Data preprocessing, S4: Extraction and screening of environmental variables, S5: Model construction and accuracy evaluation, and S6: Predicting available phosphorus in the surface soil (0-15cm) of all cultivated land in the test area at the same time as the sampling time of soil sample points Content: The method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables provided by the present invention can significantly improve the mapping accuracy of soil available phosphorus in cultivated land in southern hilly areas, and quickly realize the improvement of the content of available phosphorus in large-area cultivated land in southern hilly areas. The prediction and its spatial distribution are suitable for further promotion and application.
Description
技术领域technical field
本发明涉及土壤有效磷研究技术领域,具体涉及一种基于高分辨率环境变量的南方丘陵地区耕地土壤有效磷制图方法。The invention relates to the technical field of soil available phosphorus research, in particular to a method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables.
背景技术Background technique
传统的土壤有效磷含量分析与制图需要采集大量的土壤样点,往往费时、费力、成本高,还会增加生态环境风险,而且面向较大区域的制图精度较低。卫星遥感图像可反映土壤表层信息而被用于数字土壤制图研究,很多卫星遥感图像可免费获取,且遥感数据空间覆盖具有连续性,作为二次数据源可提高稀疏样点土壤属性的制图效率,利用卫星遥感图像联合其他环境变量成为稀疏土壤样点的较大区域数字土壤制图的重要手段。但是由于以前可以获取使用的免费遥感卫星数据空间分辨率较低,导致最终的数字土壤制图精度不够高。目前随着Sentinel-2卫星等空间分辨率更高且易获取的新遥感数据的出现,与常用的Landsat数据相比,Sentinel-2数据具有更高的空间分辨率和3个红边波段,而且红边波段在土壤属性,如有机质和全氮预测方面具有明显的优势。相关研究已经证实加入遥感数据可有效提高土壤磷素的预测精度。因此,引入Sentinel-2数据可提高土壤磷素制图的精度。Traditional analysis and mapping of soil available phosphorus content requires the collection of a large number of soil samples, which is often time-consuming, laborious, and costly. It also increases ecological and environmental risks, and the accuracy of mapping for larger areas is low. Satellite remote sensing images can reflect soil surface information and are used in digital soil mapping research. Many satellite remote sensing images can be obtained for free, and the spatial coverage of remote sensing data is continuous. As a secondary data source, it can improve the mapping efficiency of sparse sample soil properties. The use of satellite remote sensing images combined with other environmental variables has become an important means of digital soil mapping of large areas with sparse soil samples. However, due to the low spatial resolution of the previously available free remote sensing satellite data, the accuracy of the final digital soil mapping is not high enough. At present, with the emergence of new remote sensing data with higher spatial resolution and easy access, such as Sentinel-2 satellites, compared with commonly used Landsat data, Sentinel-2 data has higher spatial resolution and 3 red-edge bands, and The red-edge band has a clear advantage in predicting soil properties such as organic matter and total nitrogen. Relevant studies have confirmed that adding remote sensing data can effectively improve the prediction accuracy of soil phosphorus. Therefore, the introduction of Sentinel-2 data can improve the accuracy of soil phosphorus mapping.
此外,以往的数字土壤制图方面主要面向北方平坦地区的耕地,单个田块面积大,且成规模,其耕地土壤光谱特征和地形比较均匀,所以目前数字土壤有效磷制图方法效果较好。然而南方耕地多为丘陵地区,地形起伏较大,且耕地比较破碎,单个田块面积较小,采用传统的数字制图精度极差。因此,目前常用的数字土壤制图方法不适合南方丘陵地区。许多学者采用更高精度即更高空间分辨率的遥感变量或地形变量显著提升了数字土壤碳氮制图精度。In addition, the previous digital soil mapping was mainly for the cultivated land in the northern flat area. The single field area is large and large-scale, and the soil spectral characteristics and topography of the cultivated land are relatively uniform. Therefore, the current digital soil available phosphorus mapping method is better. However, most of the cultivated land in the south is hilly area, with large terrain undulations, and the cultivated land is relatively fragmented. The area of a single field is small, and the accuracy of traditional digital mapping is extremely poor. Therefore, the currently commonly used digital soil mapping methods are not suitable for the southern hilly areas. Many scholars have significantly improved the accuracy of digital soil carbon and nitrogen mapping by using remote sensing variables or topographic variables with higher precision, that is, higher spatial resolution.
然而,当前利用更高空间分辨率的遥感变量或地形变量提升数字土壤磷制图精度还是空白,因此,本专利利用更高空间分辨率的遥感变量、地形变量以期望提升数字土壤有效磷制图精度。However, it is still blank to use higher spatial resolution remote sensing variables or terrain variables to improve the accuracy of digital soil phosphorus mapping. Therefore, this patent uses higher spatial resolution remote sensing variables and terrain variables to improve the accuracy of digital soil available phosphorus mapping.
发明内容Contents of the invention
为解决上述问题,本发明提供了一种基于高分辨率环境变量的南方丘陵地区耕地土壤有效磷制图方法,以提升南方丘陵地区耕地表层(0-15cm)土壤有效磷制图精度。In order to solve the above problems, the present invention provides a method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables to improve the accuracy of mapping soil available phosphorus in the surface layer (0-15cm) of cultivated land in southern hilly areas.
为了解决上述技术问题,本发明采用的技术方案为:In order to solve the problems of the technologies described above, the technical solution adopted in the present invention is:
一种基于高分辨率环境变量的南方丘陵地区耕地土壤有效磷制图方法,包括以下步骤:A method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables, including the following steps:
S1、土壤样点数据获取及建库:获取待测地区耕地表层土壤样点的理化属性数据,并在ArcGIS10.2软件中建立土壤样点数据库;S1. Acquisition of soil sample point data and building a database: obtain the physical and chemical attribute data of the surface soil sample points of the cultivated land in the area to be tested, and establish a soil sample point database in ArcGIS10.2 software;
S2、环境数据的获取:获取与土壤采样点同期该地区的环境数据,包括中高分辨率的遥感变量、中高分辨率的地形数据以及气象数据;S2. Acquisition of environmental data: Acquire the environmental data of the area at the same time as the soil sampling point, including medium and high resolution remote sensing variables, medium and high resolution terrain data and meteorological data;
S3、数据预处理:将遥感数据、地形数据和气象数据统一投影到西安1980坐标系,并检查环境数据与土壤样点属性空间匹配的一致性,无误后进行空间属性连接,通过待测地区的行政区划图,利用ArcGIS10.2软件中的矢量裁剪功能,得到待测地区耕地表层土壤;S3. Data preprocessing: project the remote sensing data, terrain data and meteorological data to the Xi’an 1980 coordinate system, and check the consistency of the spatial matching between the environmental data and the soil sample point attributes, and connect the spatial attributes after correctness. Administrative division map, use the vector clipping function in ArcGIS10.2 software to get the cultivated land surface soil in the area to be measured;
S4、环境变量的提取及筛选:选取的预测变量包括遥感变量、地形变量、气象变量和土壤pH值;为简化模型输入,建模前对辅助预测因子进行优选,通过皮尔逊相关分析保留与土壤有效磷相关性显著的变量,通过后向剔除法,保留最终参与建模的辅助变量;S4. Extraction and screening of environmental variables: The selected predictive variables include remote sensing variables, topographical variables, meteorological variables and soil pH value; in order to simplify the model input, the auxiliary predictors are optimized before modeling, and the relationship between soil and soil is retained through Pearson correlation analysis. For the variables with significant correlation with available phosphorus, the auxiliary variables that finally participate in the modeling are retained through the backward elimination method;
S5、模型构建及精度评价:将待测地区预设个数耕地表层的土壤有效磷含量作为因变量,以上述筛选的中高分辨率的遥感变量、地形变量、气象变量和土壤pH值作为自变量,在Python平台基于中高分辨率的环境变量分别执行随机森林模型运算,自动筛选最佳预测精度的随机森林模型及其变量重要性分析,分别得到基于中高分辨率环境变量的与土壤样点采样时间同期的待测地区耕地表层土壤有效磷含量预期模型;S5. Model construction and accuracy evaluation: the soil available phosphorus content of the preset number of cultivated land surfaces in the area to be measured is used as the dependent variable, and the medium and high resolution remote sensing variables, topographic variables, meteorological variables and soil pH value screened above are used as independent variables On the Python platform, the random forest model calculation is performed based on the medium and high resolution environmental variables, and the random forest model with the best prediction accuracy and its variable importance analysis are automatically selected, and the sampling time of the soil sample points based on the medium and high resolution environmental variables are respectively obtained The predictive model of the available phosphorus content in the surface soil of the cultivated land in the area to be tested in the same period;
S6、预测与土壤样点采样时间同期的待测地区所有耕地的表层土壤(0-15cm)有效磷含量:根据高分辨率(10米)的环境变量构建的最佳模型及输入高精度(10米)的环境变量,在Python软件执行随机森林算法,预测同期待测地区所有的耕地表层土壤(0-15cm)有效磷含量,得到该时期下待测地区所有的耕地表层土壤(0-15cm)有效磷空间分布图。S6. Predict the available phosphorus content of the surface soil (0-15cm) of all cultivated land in the area to be tested at the same time as the sampling time of the soil sample points: the best model constructed according to the high-resolution (10m) environmental variables and input high-precision (10m) m) environment variable, execute the random forest algorithm in Python software, predict the available phosphorus content of all cultivated land surface soil (0-15cm) in the same area to be measured, and obtain all the cultivated land surface soil (0-15cm) in the area to be measured in this period Spatial distribution map of available phosphorus.
进一步的,S1具体包括如下步骤:Further, S1 specifically includes the following steps:
S11:研究待测区预设个数耕地表层土壤样点的理化属性数据,来源于国家农业农村部耕地质量检测与评价样点数据,包括土壤有效磷、土壤pH值、地理坐标、高程;S11: Study the physical and chemical attribute data of the preset number of cultivated land surface soil samples in the area to be measured, which comes from the data of the cultivated land quality inspection and evaluation sample points of the Ministry of Agriculture and Rural Affairs, including soil available phosphorus, soil pH value, geographical coordinates, and elevation;
S12:根据土壤样点的地理坐标和理化属性数据,在ArcGIS10.2软件中建立土壤样点数据库,得到西安1980坐标系投影下的土壤样点空间分布图。S12: According to the geographical coordinates and physical and chemical attribute data of the soil samples, the soil sample points database was established in ArcGIS10.2 software, and the spatial distribution map of the soil samples under the projection of Xi'an 1980 coordinate system was obtained.
进一步的,S11中土壤pH值采用酸度计法测定,土壤有效磷采用碳酸氢钠浸提-钼锑抗比色法测定。Further, the pH value of the soil in S11 was measured by the acidity meter method, and the available phosphorus in the soil was measured by the sodium bicarbonate leaching-molybdenum antimony anti-colorimetric method.
进一步的,S2中所述地形数据包括高分辨率(12.5米)的DEM和常用于数字土壤制图的空间分辨率为30米的DEM;Further, the topographic data described in S2 includes a high-resolution (12.5 meters) DEM and a DEM with a spatial resolution of 30 meters commonly used in digital soil mapping;
所述遥感数据包括高分辨率(10米)的4景Sentinel-2A遥感影像和常用于数字土壤制图的空间分辨率为30米的Landsat-8OLI影像;The remote sensing data includes 4 high-resolution (10 meters) Sentinel-2A remote sensing images and Landsat-8OLI images with a spatial resolution of 30 meters commonly used in digital soil mapping;
所述气象数据为待测地区历史月平均降水量和月平均气温的1km×1km的栅格数据。The meteorological data are 1km×1km grid data of historical monthly average precipitation and monthly average temperature in the area to be measured.
进一步的,S3具体包括如下步骤:Further, S3 specifically includes the following steps:
S31、遥感数据预处理:遥感数据预处理包括辐射定标、大气校正、几何精校正、图像镶嵌;对于研究区4景Sentinel-2A遥感影像,在SNAP软件分别对每景影像辐射定标再进行大气校正,将遥感图像的DN值转化为地表反射率,然后进行图像镶嵌拼接,接着在ENVI5.3软件中利用精纠正的SPOT5参考影像对其进行几何精纠正,并按照待测地区的行政区划图进行裁剪;Landsat-8OLI遥感影像为几何精纠正的影像,在ENVI5.3软件中进行大气校正,所有遥感影像几何精纠正的误差控制在1个像素以内,并统一投影到西安1980坐标系;S31. Remote sensing data preprocessing: remote sensing data preprocessing includes radiometric calibration, atmospheric correction, geometric fine correction, and image mosaic; for the Sentinel-2A remote sensing images of the 4 scenes in the study area, the SNAP software performs radiometric calibration for each scene separately. Atmospheric correction, the DN value of the remote sensing image is converted into the surface reflectance, and then the image is mosaiced and stitched, and then the finely corrected SPOT5 reference image is used in the ENVI5.3 software to perform geometric fine correction, and according to the administrative division of the area to be measured The image is cropped; the Landsat-8OLI remote sensing image is a geometrically corrected image, and the atmospheric correction is performed in the ENVI5.3 software. The geometrically corrected error of all remote sensing images is controlled within 1 pixel, and they are uniformly projected into the Xi'an 1980 coordinate system;
S32、地形变量的预处理:首先检查空间分辨率为12.5m和30m的DEM数据无误后,在ArcGIS10.2软件中,通过投影变换统一转换到西安1980坐标系,并利用最邻近元法将12.5m的DEM生产为空间分辨率为10m的DEM数据;S32. Preprocessing of terrain variables: firstly, after checking that the DEM data with spatial resolutions of 12.5m and 30m are correct, in ArcGIS10.2 software, they are uniformly converted to the Xi’an 1980 coordinate system through projection transformation, and the 12.5 The DEM of m is produced as DEM data with a spatial resolution of 10m;
S33、气象变量的预处理:利用arcgis10.2软件将气象数据统一投影到西安1980坐标系;S33. Preprocessing of meteorological variables: using arcgis10.2 software to project the meteorological data into the Xi'an 1980 coordinate system;
S34、在ArcGIS10.2软件中检查以上的遥感数据、地形数据、气象数据和土壤样点数据属性空间匹配的一致性,检查无误后进行空间数据与土壤属性建立地理连接。S34. In the ArcGIS10.2 software, check the consistency of the attribute spatial matching of the above remote sensing data, terrain data, meteorological data and soil sample point data, and establish a geographical connection between the spatial data and the soil attribute after the inspection is correct.
进一步的,S4中后向剔除法具体为:Further, the backward culling method in S4 is specifically:
依据建模时将每个变量依次排除模型后均方根误差的增减对变量进行筛选,RMSE增加则保留变量,反之剔除。The variables were screened according to the increase or decrease of the root mean square error after each variable was excluded from the model in turn during the modeling. If the RMSE increased, the variables were retained, and vice versa.
进一步的,S4具体包括如下步骤:Further, S4 specifically includes the following steps:
S41、遥感变量的提取:在ArcGIS10.2软件中分别从Sentinel-2A和Landsat-8OLI提取遥感变量计算用于数字土壤制图的46个光谱指数;S41. Extraction of remote sensing variables: extracting remote sensing variables from Sentinel-2A and Landsat-8OLI in ArcGIS10.2 software to calculate 46 spectral indices for digital soil mapping;
S42、地形变量的提取:在ArcGIS10.2软件分别从重采样后的10米分辨率的DEM和30米的DEM提取常用于数字土壤制图的8个地形变量;S42. Extraction of topographic variables: 8 topographic variables commonly used in digital soil mapping are extracted from the resampled 10-meter resolution DEM and 30-meter DEM in ArcGIS10.2 software;
S43、气象变量的提取:在ArcGIS10.2软件计算多年月平均降水量和月平均气温数据,再使用栅格计算器计算研究区多年月平均降水量和多年月平均温度的平均值,得到研究区多年的年均温和年均降水量数据,最后采用最近邻域法对年均降水量和年均温数据重采样至10米、30米空间分辨率的栅格数据;S43. Extraction of meteorological variables: Calculate the average monthly precipitation and average temperature data for many years in ArcGIS10.2 software, and then use the grid calculator to calculate the average value of the average monthly precipitation and average temperature for many years in the study area to obtain the study area Years of average annual temperature and average annual precipitation data, and finally use the nearest neighbor method to resample the annual average precipitation and annual average temperature data to raster data with a spatial resolution of 10 meters and 30 meters;
S44、土壤pH空间分布图的获取:在ArcGIS10.2软件中利用克里格插值法得到研究区土壤pH进行内插,获得其空间分布图;S44. Acquisition of the spatial distribution map of soil pH: use the Kriging interpolation method in the ArcGIS10.2 software to obtain the soil pH in the study area for interpolation to obtain its spatial distribution map;
S45、在ArcGIS10.2软件中将土壤样点与以上的遥感变量、地形变量、气象变量和土壤pH数据进行地理关联,得到研究区土壤样点对应的所有的环境变量;S45, in the ArcGIS10.2 software, the soil sample points are geographically correlated with the above remote sensing variables, terrain variables, meteorological variables and soil pH data, and all environmental variables corresponding to the soil sample points in the study area are obtained;
S46、参与建模的环境变量筛选:在Python平台执行皮尔逊相关分析,筛选与土壤有效磷相关性显著的变量,通过后向剔除法,依据建模时将每个变量依次排除模型后均方根误差(RMSE)的增减对变量进行筛选,RMSE增加则保留变量,反之剔除,保留最终参与建模的辅助变量,最终得到参与建模的样本集。S46. Screening of environmental variables involved in modeling: Perform Pearson correlation analysis on the Python platform to screen variables that have a significant correlation with soil available phosphorus. Through the backward elimination method, each variable is excluded from the model in turn according to the mean square of the model. The increase or decrease of the root error (RMSE) screens the variables, the RMSE increases and the variables are retained, otherwise, the auxiliary variables that are finally involved in the modeling are retained, and finally the sample set that participates in the modeling is obtained.
进一步的,S5具体包括如下步骤:Further, S5 specifically includes the following steps:
S51、构建训练样本集与验证样本集:按比例将上述样本集随机分成训练集和验证集,其中训练集的样本用于建模,验证集的样本用于检验模型预测精度;S51. Constructing a training sample set and a verification sample set: randomly divide the above sample set into a training set and a verification set in proportion, wherein the samples of the training set are used for modeling, and the samples of the verification set are used for testing the prediction accuracy of the model;
S52、确定模型的因变量和自变量:将待测地区土壤样点耕地表层的土壤有效磷含量作为因变量,将筛选过的显著性相关的高分辨率(10米)的遥感变量、高分辨率(10米)的地形变量、气象变量和土壤pH值作为自变量,在Python平台执行随机森林算法,自动筛选基于高精度环境变量的与土壤样点采样时间同期的待测地区耕地表层土壤有效磷含量预期模型;S52. Determine the dependent variable and independent variable of the model: use the soil available phosphorus content of the cultivated land surface of the soil sample point in the area to be measured as the dependent variable, and use the screened significant correlation high-resolution (10 meters) remote sensing variables, high-resolution The topographic variables, meteorological variables and soil pH value of the rate (10 meters) are used as independent variables, and the random forest algorithm is executed on the Python platform to automatically screen the effective soil surface soil of the cultivated land in the area to be measured based on the high-precision environmental variables at the same time as the sampling time of the soil samples. Phosphorus content prediction model;
S53、模型预测精度评价:通过R2(决定系数)、MAE(平均绝对误差)和RMSE(均方根误差)来筛选最佳预测模型,R2越接近1,MAE和RMSE越小,表明预测模型精度越高;其精度评价公式(1-3)为:S53, model prediction accuracy evaluation: filter the best prediction model by R 2 (coefficient of determination), MAE (mean absolute error) and RMSE (root mean square error), the closer R 2 is to 1, the smaller the MAE and RMSE are, indicating that the prediction The higher the accuracy of the model; the accuracy evaluation formula (1-3) is:
式中,n为采样点数量,Oi、Pi为采样点i的实测值、预测值,为实测值平均值;In the formula, n is the number of sampling points, O i and P i are the measured and predicted values of sampling point i, is the average value of the measured value;
S54、将土壤样点耕地表层(0-15cm)的土壤有效磷含量作为因变量,将筛选的显著性相关的中分辨率(30米)的遥感变量、中分辨率(30米)的地形变量、气象变量和土壤pH值作为自变量,重复执行S51一样的步骤,得到常用的基于中分辨率环境变量的与土壤样点采样时间同期的耕地表层土壤(0-15cm)有效磷含量的最佳随机森林预测模型;S54. Taking the soil available phosphorus content in the surface layer (0-15cm) of the cultivated land of the soil sample point as the dependent variable, and using the remote sensing variable with a moderate resolution (30 meters) and the terrain variable with a medium resolution (30 meters) that are significantly related to the screening , Meteorological variables and soil pH value as independent variables, repeat the same steps as S51, and obtain the best available phosphorus content of cultivated land surface soil (0-15cm) based on the medium-resolution environmental variables and the sampling time of the soil sample point at the same time. random forest prediction model;
S55、通过多次迭代RF模型取平均值得到预测土壤有效磷含量的过程中变量的相对重要性分值;S55. Obtain the relative importance score of variables in the process of predicting soil available phosphorus content by taking the average value of multiple iterations of the RF model;
基于分辨率为10m的环境变量建模的变量相对重要性显示,气象变量、地形变量、遥感变量和土壤pH值的相对重要性分值依次为30.64%、30.38%、22.87%和16.11%;按照单个变量重要程度从大到小依次为年均温、pH值、地形湿度指数、DEM值、年均降雨量、增强型植被指数、第一主成分和红边波段6;The relative importance of variables based on the modeling of environmental variables with a resolution of 10m shows that the relative importance scores of meteorological variables, topographical variables, remote sensing variables, and soil pH are 30.64%, 30.38%, 22.87%, and 16.11%; The order of importance of individual variables from large to small is annual average temperature, pH value, topographic humidity index, DEM value, annual average rainfall, enhanced vegetation index, first principal component and red edge band6;
基于空间分辨率为30米的环境变量的变量相对重要性显示,气象变量、地形变量、遥感变量和土壤pH值的相对重要性分值依次为25.86%、32.44%、21.31%和20.39%;按照重要程度从大到小依次为高程(DEM值)、pH值、年均降雨量、增强型植被指数、B5和第一主成分。The relative importance of variables based on environmental variables with a spatial resolution of 30 meters shows that the relative importance scores of meteorological variables, topographical variables, remote sensing variables and soil pH are 25.86%, 32.44%, 21.31% and 20.39% respectively; The order of importance from large to small is elevation (DEM value), pH value, average annual rainfall, enhanced vegetation index, B5 and the first principal component.
本发明的有益效果在于:The beneficial effects of the present invention are:
本发明提供的基于高分辨率环境变量的南方丘陵地区耕地土壤有效磷制图方法可以显著提升南方丘陵地区耕地土壤有效磷制图精度,以及快速实现南方丘陵地区大面积的耕地土壤有效磷含量的预测及其空间分布,适宜进一步推广应用。The method for mapping available soil phosphorus in cultivated land soil in southern hilly areas based on high-resolution environmental variables provided by the present invention can significantly improve the accuracy of mapping available phosphorus in cultivated land soil in southern hilly areas, and quickly realize the prediction and analysis of available phosphorus content in large-scale cultivated land soil in southern hilly areas. Its spatial distribution is suitable for further popularization and application.
附图说明Description of drawings
图1为耕地表层土壤(0-15cm)样点分布图;Fig. 1 is the sample point distribution map of cultivated land surface soil (0-15cm);
图2为基于研究区土壤有效磷含量空间分布图;Figure 2 is based on the spatial distribution of soil available phosphorus content in the study area;
图3为本发明的流程示意图。Fig. 3 is a schematic flow chart of the present invention.
具体实施方式Detailed ways
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the embodiments of the present invention. Obviously, the described embodiments are part of the present invention Examples, not all examples. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.
参照附图3所示,本实施例以福建省建瓯市耕地为对象,进行耕地土壤有效磷制图。具体步骤如下:Referring to the accompanying drawing 3, the present embodiment takes the cultivated land in Jian'ou City, Fujian Province as an object, and carries out mapping of available phosphorus in cultivated land soil. Specific steps are as follows:
S1,土壤样点数据获取及建库。获取2017年福建省建瓯市耕地表层(0-15cm)土壤样点的理化属性数据,并在ArcGIS10.2软件中建立土壤样点数据库。具体包括如下步骤:S1, Soil sample point data acquisition and database building. The physical and chemical property data of the cultivated land surface (0-15cm) soil sample points in Jian'ou City, Fujian Province in 2017 were obtained, and the soil sample point database was established in ArcGIS10.2 software. Specifically include the following steps:
S11,研究区共97个耕地表层土壤(0-20cm)样点的理化属性数据,来源于国家农业农村部2017年末耕地质量检测与评价样点数据,包括土壤有效磷、土壤pH值、地理坐标、高程等。其中,土壤pH值采用酸度计法测定,土壤有效磷采用碳酸氢钠浸提-钼锑抗比色法测定。剔除同期卫星遥感影像上受到云层影响的1个采样点,共采用96个土壤样点。S11, the physical and chemical attribute data of 97 cultivated land surface soil (0-20cm) samples in the study area are derived from the data of the cultivated land quality inspection and evaluation sample points at the end of 2017 by the Ministry of Agriculture and Rural Affairs of the People’s Republic of China, including soil available phosphorus, soil pH value, geographical coordinates , elevation, etc. Among them, the pH value of the soil was measured by the acidity meter method, and the available phosphorus in the soil was measured by the sodium bicarbonate leaching-molybdenum antimony anti-colorimetric method. A total of 96 soil sampling points were used, excluding one sampling point affected by clouds on the satellite remote sensing image in the same period.
S12,根据96个土壤样点的地理坐标和有效磷含量、pH值等理化属性,在ArcGIS10.2软件中建立土壤样点数据库,得到西安1980坐标系投影下的土壤样点空间分布图,见图1。S12, according to the geographical coordinates, available phosphorus content, pH value and other physical and chemical properties of 96 soil sample points, the soil sample point database was established in ArcGIS10.2 software, and the spatial distribution map of soil sample points under the projection of Xi’an 1980 coordinate system was obtained, see figure 1.
S2,环境数据的获取。获取与土壤采样点同期该地区的环境数据,包括中高分辨率的遥感变量、中高分辨率的地形数据以及气象数据。其中,地形数据包括2017年福建省建瓯市高分辨率(12.5米)的DEM和常用于数字土壤制图的空间分辨率为30m的DEM。遥感数据为与土壤样点采样时间同期的高分辨率(10m)的4景Sentinel-2A遥感影像和常用于数字土壤制图的空间分辨率为30m的Landsat-8OLI影像。气象数据为1970—2000年月平均降水量(1-12月)和月平均气温(1-12月)的1km×1km的栅格数据。S2, acquisition of environmental data. Obtain the environmental data of the area at the same time as the soil sampling point, including medium and high resolution remote sensing variables, medium and high resolution terrain data and meteorological data. Among them, topographic data include the high-resolution (12.5m) DEM of Jian'ou City, Fujian Province in 2017 and the DEM with a spatial resolution of 30m commonly used in digital soil mapping. The remote sensing data are 4-scene Sentinel-2A remote sensing images with high resolution (10m) at the same time as the sampling time of soil samples and Landsat-8OLI images with a spatial resolution of 30m commonly used in digital soil mapping. Meteorological data are 1km×1km raster data of monthly average precipitation (January-December) and monthly average temperature (January-December) from 1970 to 2000.
S3,数据预处理。将遥感数据、地形数据和气象数据统一投影到西安1980坐标系,并检查所有的环境数据与土壤样点属性空间匹配的一致性,无误后进行空间属性连接,通过建瓯市的行政区划图,利用ArcGIS10.2软件中的矢量裁剪功能,得到建瓯市耕地表层(0-15cm)土壤。具体包括如下步骤:S3, data preprocessing. Project the remote sensing data, terrain data and meteorological data to the Xi'an 1980 coordinate system, and check the consistency of the spatial matching between all the environmental data and the soil sample point attributes, and connect the spatial attributes after correctness. Through the administrative division map of Jian'ou City, Using the vector clipping function in ArcGIS10.2 software, the surface (0-15cm) soil of cultivated land in Jian'ou City was obtained. Specifically include the following steps:
S31,遥感数据预处理。主要包括辐射定标、大气校正、几何精校正、图像镶嵌。对于研究区4景Sentinel-2A遥感影像,在SNAP软件分别对每景影像辐射定标再进行大气校正,将遥感图像的DN值转化为地表反射率,然后进行图像镶嵌拼接,接着在ENVI5.3软件中利用精纠正的SPOT5参考影像对其进行几何精纠正,并按照福建省建瓯市的行政区划图进行裁剪。Landsat-8OLI遥感影像为几何精纠正的影像,在ENVI5.3软件中进行大气校正。所有遥感影像几何精纠正的误差控制在1个像素以内,并统一投影到西安1980坐标系。S31, remote sensing data preprocessing. It mainly includes radiometric calibration, atmospheric correction, geometric precision correction, and image mosaic. For the Sentinel-2A remote sensing images of the four scenes in the study area, the SNAP software was used to perform radiometric calibration on each scene and then perform atmospheric correction. The DN value of the remote sensing image was converted into the surface reflectance, and then the images were mosaiced and stitched, and then in ENVI5.3 The software uses the finely corrected SPOT5 reference image to perform geometric fine correction, and cuts it according to the administrative division map of Jian'ou City, Fujian Province. The Landsat-8OLI remote sensing image is a geometrically corrected image, and the atmospheric correction is carried out in the ENVI5.3 software. The error of geometric fine correction of all remote sensing images is controlled within 1 pixel, and they are uniformly projected into the Xi'an 1980 coordinate system.
S32,地形变量的预处理。首先检查空间分辨率为12.5米和30米的DEM数据无误后,在ArcGIS10.2软件中,通过投影变换统一转换到西安1980坐标系,并利用最邻近元法将12.5米的DEM生产为空间分辨率为10米的DEM数据。S32, preprocessing of terrain variables. First, after checking that the DEM data with a spatial resolution of 12.5 meters and 30 meters are correct, in the ArcGIS10.2 software, they are uniformly converted to the Xi’an 1980 coordinate system through projection transformation, and the 12.5-meter DEM is produced as a spatial resolution using the nearest neighbor method. DEM data with a rate of 10 m.
S33,气象变量的预处理:利用arcgis10.2软件将气象数据统一投影到西安1980坐标系。S33, preprocessing of meteorological variables: use arcgis10.2 software to uniformly project the meteorological data to the Xi'an 1980 coordinate system.
S34,在ArcGIS10.2软件中检查以上的遥感数据、地形数据、气象数据和土壤样点数据属性空间匹配的一致性,检查无误后进行空间数据与土壤属性建立地理连接。S34. In the ArcGIS10.2 software, check the consistency of the attribute spatial matching of the above remote sensing data, terrain data, meteorological data and soil sample point data, and establish a geographical connection between the spatial data and the soil attribute after the check is correct.
S4,环境变量的提取及其筛选。本研究选取的预测变量包括遥感变量、地形变量、气象变量和土壤pH值。为进一步简化模型输入,建模前需对辅助预测因子进行优选。通过皮尔逊相关分析保留与土壤有效磷相关性显著的变量,进一步通过后向剔除法,依据建模时将每个变量依次排除模型后均方根误差(RMSE)的增减对变量进行筛选,RMSE增加则保留变量,反之剔除,保留最终参与建模的辅助变量。具体包括如下步骤:S4, extraction and screening of environment variables. The predictor variables selected in this study include remote sensing variables, topographical variables, meteorological variables and soil pH. In order to further simplify the model input, the auxiliary predictors need to be optimized before modeling. The variables with significant correlation with soil available phosphorus were retained by Pearson correlation analysis, and the variables were screened by the backward elimination method according to the increase or decrease of the root mean square error (RMSE) after each variable was excluded from the model in turn during modeling. If the RMSE increases, the variables will be retained; otherwise, the auxiliary variables that will eventually participate in the modeling will be retained. Specifically include the following steps:
S41,遥感变量的提取。在ArcGIS10.2软件中分别从Sentinel-2A和Landsat-8OLI提取遥感变量计算用于数字土壤制图的46个光谱指数。S41, extraction of remote sensing variables. In ArcGIS10.2 software, remote sensing variables were extracted from Sentinel-2A and Landsat-8OLI to calculate 46 spectral indices for digital soil mapping.
S42,地形变量的提取。在ArcGIS10.2软件分别从重采样后的10米分辨率的DEM和30米的DEM提取常用于数字土壤制图的8个地形变量。S42, extraction of terrain variables. Eight topographic variables commonly used in digital soil mapping were extracted from the resampled 10-meter resolution DEM and 30-meter DEM in ArcGIS10.2 software.
S43,气象变量的提取。在ArcGIS10.2软件计算多年月平均降水量和月平均气温数据,再使用栅格计算器计算研究区多年月平均降水量和多年月平均温度的平均值,得到研究区多年的年均温和年均降水量数据。最后采用最近邻域法对年均降水量和年均温数据重采样至10米、30米空间分辨率的栅格数据。S43, extraction of meteorological variables. Calculate the average monthly precipitation and temperature data for many years in ArcGIS10.2 software, and then use the grid calculator to calculate the average monthly average precipitation and average temperature for many years in the study area, and obtain the annual average temperature and temperature of the study area for many years precipitation data. Finally, the nearest neighbor method was used to resample the annual average precipitation and annual average temperature data to raster data with a spatial resolution of 10 meters and 30 meters.
S44,土壤pH空间分布图的获取。在ArcGIS10.2软件中利用克里格插值法得到研究区土壤pH进行内插,获得其空间分布图。S44, obtaining the spatial distribution map of soil pH. In the ArcGIS10.2 software, the Kriging interpolation method was used to interpolate the soil pH in the study area to obtain its spatial distribution map.
S45,在ArcGIS10.2软件中将土壤样点与以上的遥感变量、地形变量、气象变量和土壤pH等空间数据进行地理关联,得到研究区土壤样点对应的所有的环境变量。S45, in ArcGIS10.2 software, geographically correlate the soil sample points with the above remote sensing variables, terrain variables, meteorological variables and soil pH and other spatial data to obtain all the environmental variables corresponding to the soil sample points in the study area.
S46,参与建模的环境变量筛选。在Python平台执行皮尔逊相关分析,筛选与土壤有效磷相关性显著的变量,进一步通过后向剔除法,依据建模时将每个变量依次排除模型后均方根误差(RMSE)的增减对变量进行筛选,RMSE增加则保留变量,反之剔除,保留最终参与建模的辅助变量,最终得到参与建模的样本集,参与建模的辅助变量见表1。S46, screening of environmental variables involved in modeling. Perform Pearson correlation analysis on the Python platform to screen the variables that have a significant correlation with soil available phosphorus, and further use the backward elimination method to eliminate the root mean square error (RMSE) of each variable in turn after the modeling. Variables are screened, if the RMSE increases, the variables are retained, otherwise, the auxiliary variables that participate in the modeling are retained, and finally the sample set that participates in the modeling is obtained. The auxiliary variables that participate in the modeling are shown in Table 1.
表1土壤有效磷建模辅助变量Table 1 Auxiliary variables for soil available phosphorus modeling
S5,模型构建及精度评价。将建瓯市96个耕地表层(0-15cm)的土壤有效磷含量作为因变量,将以上筛选的中高分辨率的遥感变量、地形变量、气象变量和土壤pH值作为自变量,在Python平台基于中高分辨率的环境变量分别执行随机森林模型运算,自动筛选最佳预测精度的随机森林模型及其变量重要性分析,分别得到基于中高分辨率的环境变量的2017年建瓯市耕地表层土壤(0-15cm)有效磷含量预期模型;并对模型的预测精度进行评价。具体包括如下步骤:S5, model construction and accuracy evaluation. Taking the soil available phosphorus content of 96 cultivated land surfaces (0-15cm) in Jian'ou City as the dependent variable, and the medium and high resolution remote sensing variables, terrain variables, meteorological variables and soil pH value screened above as the independent variables, the Python platform based on The medium and high resolution environmental variables were respectively executed with the random forest model calculation, and the random forest model with the best prediction accuracy and the importance analysis of the variables were automatically selected, and the surface soil of cultivated land in Jian'ou City in 2017 (0 -15cm) prediction model of available phosphorus content; and evaluate the prediction accuracy of the model. Specifically include the following steps:
S51,构建训练样本集与验证样本集。将上述的样本集,按照9:1将样本集随机分成训练集和验证集,其中训练集的样本用于建模,验证集的样本用于检验模型预测精度。S51. Construct a training sample set and a verification sample set. The above sample set is randomly divided into a training set and a verification set according to 9:1, wherein the samples of the training set are used for modeling, and the samples of the verification set are used to test the prediction accuracy of the model.
S52,确定模型的因变量和自变量。将建瓯市96个耕地表层(0-15cm)的土壤有效磷含量作为因变量,将前面筛选过的显著性相关的高分辨率(10米)的遥感变量、高分辨率(10米)的地形变量、气象变量和土壤pH值作为自变量,在Python平台执行随机森林算法,自动筛选基于高精度的环境变量的2017年建瓯市耕地表层土壤(0-15cm)有效磷含量的最佳随机森林预测模型。S52, determining dependent variables and independent variables of the model. Taking the soil available phosphorus content of 96 cultivated land surfaces (0-15cm) in Jian’ou City as the dependent variable, the high-resolution (10m) remote sensing variables and high-resolution (10m) Terrain variables, meteorological variables, and soil pH were used as independent variables, and the random forest algorithm was executed on the Python platform to automatically screen the best random value of the available phosphorus content in the surface soil (0-15cm) of cultivated land in Jian'ou City in 2017 based on high-precision environmental variables. Forest Prediction Model.
S53,模型预测精度评价。通过R2(决定系数)、MAE(平均绝对误差)和RMSE(均方根误差)来筛选最佳预测模型,R2越接近1,MAE和RMSE越小,表明预测模型精度越高。其精度评价公式(1-3)为:S53, evaluation of model prediction accuracy. The best prediction model is screened by R 2 (coefficient of determination), MAE (mean absolute error) and RMSE (root mean square error). The closer R 2 is to 1, the smaller MAE and RMSE indicate the higher accuracy of the prediction model. The accuracy evaluation formula (1-3) is:
式中,n为采样点数量,Oi、Pi为采样点i的实测值、预测值,为实测值平均值。In the formula, n is the number of sampling points, O i and P i are the measured and predicted values of sampling point i, is the average value of the measured values.
基于高精度(10米)的环境变量预测建瓯市耕地表层(0-15cm)土壤有效磷含量的预测精度指标R2为0.59,MAE为19.04mg·kg-1,RMSE误差为25.26mg·kg-1。Based on high-precision (10 meters) environmental variables, the prediction accuracy index R 2 of the soil available phosphorus content in the surface layer (0-15cm) of cultivated land in Jian'ou City is 0.59, the MAE is 19.04mg·kg -1 , and the RMSE error is 25.26mg·kg -1 .
S54,将建瓯市96个耕地表层(0-15cm)的土壤有效磷含量作为因变量,将筛选的显著性相关的中分辨率(30米)的遥感变量、中分辨率(30米)的地形变量、气象变量和土壤pH值作为自变量,重复执行S51一样的步骤,得到常用的基于中分辨率的环境变量的2017年建瓯市耕地表层土壤(0-15cm)有效磷含量的最佳随机森林预测模型。S54, taking the soil available phosphorus content of 96 cultivated land surfaces (0-15cm) in Jian’ou City as the dependent variable, and the remote sensing variables of the moderate resolution (30 meters) and the remote sensing variables of the medium resolution (30 meters) Topographic variables, meteorological variables, and soil pH values are used as independent variables, and the same steps as S51 are repeated to obtain the best available phosphorus content in surface soil (0-15cm) of cultivated land in Jian'ou City in 2017 based on commonly used medium-resolution environmental variables. Random Forest Prediction Model.
基于常用的中分辨率(30米)的环境变量的建瓯市耕地表层(0-15cm)土壤有效磷含量的预测精度指标R2为0.42,MAE误差为21.84mg·kg-1,RMSE误差为30.18mg·kg-1。Based on the commonly used environmental variables with medium resolution (30 meters), the prediction accuracy index R 2 of soil available phosphorus content in the surface layer (0-15cm) of cultivated land in Jian'ou City is 0.42, the MAE error is 21.84 mg·kg -1 , and the RMSE error is 30.18 mg·kg -1 .
S55,通过多次迭代RF模型取平均值得到预测土壤有效磷含量的过程中变量的相对重要性分值。基于分辨率为10m的环境变量建模的变量相对重要性显示,气象变量、地形变量、遥感变量和土壤pH值的相对重要性分值依次为30.64%、30.38%、22.87%和16.11%。按照单个变量重要程度从大到小依次为年均温、pH值、地形湿度指数、DEM值、年均降雨量、增强型植被指数、第一主成分和红边波段6。S55, obtaining the relative importance scores of variables in the process of predicting soil available phosphorus content by taking the average value of the RF model for multiple iterations. The relative importance of variables based on environmental variable modeling with a resolution of 10m showed that the relative importance scores of meteorological variables, topographical variables, remote sensing variables, and soil pH were 30.64%, 30.38%, 22.87%, and 16.11%, respectively. According to the importance of individual variables, they are annual average temperature, pH value, topographic humidity index, DEM value, annual average rainfall, enhanced vegetation index, first principal component and red edge band6.
基于空间分辨率为30米的环境变量的变量相对重要性显示,气象变量、地形变量、遥感变量和土壤pH值的相对重要性分值依次为25.86%、32.44%、21.31%和20.39%。按照重要程度从大到小依次为高程(DEM值)、pH值、年均降雨量、增强型植被指数、B5和第一主成分。The relative importance of variables based on environmental variables with a spatial resolution of 30 meters showed that the relative importance scores of meteorological variables, topographical variables, remote sensing variables, and soil pH were 25.86%, 32.44%, 21.31%, and 20.39%, respectively. In descending order of importance, they are elevation (DEM value), pH value, average annual rainfall, enhanced vegetation index, B5 and the first principal component.
S6,方法效果评价。基于空间分辨率为30米的环境变量预测2017年建瓯市耕地表层(0-15cm)土壤有效磷的建模为参考标准,分别从R2、MAE、RMSE三项指标,对本专利提出的基于高分辨率(10米)的环境变量预测2017年建瓯市耕地表层(0-15cm)土壤有效磷含量的精度进行效果评价。结果表明,基于10米的遥感数据、12.5米的地形数据联合其他环境变量比常用的基于30米的遥感数据、30米的地形数据联合其他变量的精度显著提升,建模精度R2提升40.5%(从0.42提升到0.59),MAE误差下降12.8%(从21.84mg·kg-1下降到19.04mg·kg-1),RMSE误差下降16.3%(从30.18mg·kg-1下降到25.26mg·kg-1)。具体步骤包括:S6, method effect evaluation. Based on the environmental variables with a spatial resolution of 30 meters, the modeling of soil available phosphorus in the top layer of cultivated land (0-15cm) in Jian'ou City in 2017 is used as a reference standard, and the three indicators of R 2 , MAE and RMSE are used to analyze the proposed method based on this patent. The accuracy of high-resolution (10m) environmental variables to predict the soil available phosphorus content in the surface layer (0-15cm) of cultivated land in Jian'ou City in 2017 was evaluated. The results show that the precision based on 10-meter remote sensing data, 12.5-meter topographic data combined with other environmental variables is significantly improved compared with the commonly used 30-meter remote sensing data, 30-meter topographic data combined with other variables, and the modeling accuracy R2 is increased by 40.5% ( From 0.42 to 0.59), the MAE error decreased by 12.8% (from 21.84mg·kg -1 to 19.04mg·kg -1 ), the RMSE error decreased by 16.3% (from 30.18mg·kg -1 to 25.26mg·kg - 1 ). Specific steps include:
S61,构建方法改进效果指标。基于空间分辨率为30米的环境变量预测2017年建瓯市耕地表层(0-15cm)土壤有效磷的建模为参考,分别从R2、MAE、RMSE三项指标,对本专利提出的基于高分辨率(10米)的环境变量预测2017年建瓯市耕地表层(0-15cm)土壤有效磷含量的精度进行效果评价。方法改进效果P的公式为:S61, constructing method improvement effect indicators. Based on the environmental variables with a spatial resolution of 30 meters, the modeling of soil available phosphorus in the cultivated land surface (0-15cm) of Jian'ou City in 2017 is used as a reference. From the three indicators of R 2 , MAE and RMSE, the high Environmental variables with a resolution (10 meters) can predict the accuracy of soil available phosphorus content in the surface layer (0-15cm) of cultivated land in Jian'ou City in 2017 to evaluate the effect. The formula of method improvement effect P is:
P=(X10米-X30米)/X30米*100%P=(X 10m- X 30m )/X 30m *100%
其中,P表示R2改进的比例或MAE、RMSE误差下降的比例,X10米表示基于高分辨率(10m)的环境变量预测南方丘陵地区耕地表层(0-15cm)土壤有效磷含量的精度,X30米表示基于常规使用的30米的环境变量预测南方丘陵地区耕地表层(0-15cm)土壤有效磷含量的精度。Among them, P represents the ratio of R2 improvement or the ratio of MAE and RMSE error reduction, X10m represents the accuracy of predicting soil available phosphorus content in the surface layer (0-15cm) of cultivated land in southern hilly areas based on high-resolution (10m) environmental variables, X30 M represents the accuracy of predicting soil available phosphorus content in the surface layer (0-15cm) of cultivated land in southern hilly areas based on the routinely used 30-meter environmental variable.
S62,计算R2提升效果。根据上述公式,在excel表格中计算建模精度R2提升40.5%(从0.42提升到0.59)。S62. Calculate the R 2 improvement effect. According to the above formula, the modeling accuracy R 2 calculated in the excel table is increased by 40.5% (from 0.42 to 0.59).
S63,计算误差降低的比例。根据上述公式,在excel表格中计算MAE误差和RMSE误差的降低比例。MAE误差下降12.8%(从21.84mg·kg-1下降到19.04mg·kg-1),RMSE误差下降16.3%(从30.18mg·kg-1下降到25.26mg·kg-1)。S63. Calculate the ratio of error reduction. According to the above formula, calculate the reduction ratio of MAE error and RMSE error in the excel table. The MAE error decreased by 12.8% (from 21.84 mg·kg -1 to 19.04 mg·kg -1 ), and the RMSE error decreased by 16.3% (from 30.18 mg·kg -1 to 25.26 mg·kg -1 ).
S64,结果表明见表2,在三项测评指标方面,基于10米的遥感数据、12.5米的地形数据联合其他环境变量比常用的基于30米的遥感数据、30米的地形数据联合其他变量的预测精度显著提升。S64, the results are shown in Table 2. In terms of the three evaluation indicators, the combination of 10-meter remote sensing data and 12.5-meter topographic data combined with other environmental variables is better than the commonly used 30-meter remote sensing data and 30-meter topographic data combined with other variables. The prediction accuracy is significantly improved.
表2本方法改进测评结果Table 2 Improvement evaluation results of this method
S7,预测2017年建瓯市所有耕地的表层土壤(0-15cm)有效磷含量。根据高分辨率(10米)的环境变量构建的最佳模型及输入高精度(10米)的环境变量,在Python软件执行随机森林算法,预测2017年建瓯市所有的耕地表层土壤(0-15cm)有效磷含量,得到2017年建瓯市所有的耕地表层土壤(0-15cm)有效磷空间分布图,见图2。S7, predicting the available phosphorus content in the surface soil (0-15cm) of all cultivated land in Jian'ou City in 2017. Based on the best model constructed with high-resolution (10 meters) environmental variables and the input of high-precision (10 meters) environmental variables, the random forest algorithm is executed in Python software to predict the surface soil of all cultivated land in Jian'ou City in 2017 (0- 15cm) available phosphorus content, and the spatial distribution map of available phosphorus in all cultivated land surface soil (0-15cm) in Jian'ou City in 2017 was obtained, as shown in Figure 2.
以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。The above embodiments are only used to illustrate the technical solutions of the present invention, rather than to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still be described in the foregoing embodiments Modifications are made to the recorded technical solutions, or equivalent replacements are made to some of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210889387.5A CN115266612A (en) | 2022-07-27 | 2022-07-27 | A method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210889387.5A CN115266612A (en) | 2022-07-27 | 2022-07-27 | A method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115266612A true CN115266612A (en) | 2022-11-01 |
Family
ID=83769631
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210889387.5A Pending CN115266612A (en) | 2022-07-27 | 2022-07-27 | A method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115266612A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118429564A (en) * | 2024-07-03 | 2024-08-02 | 江西省地质局地理信息工程大队 | Machine learning-based three-dimensional modeling method for soil of south hills |
CN118537441A (en) * | 2024-05-27 | 2024-08-23 | 浙江大学 | Digital soil mapping method and device based on soil space neighborhood information |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3340130A1 (en) * | 2016-12-23 | 2018-06-27 | Hexagon Technology Center GmbH | Method for prediction of soil and/or plant condition |
CN109342697A (en) * | 2018-12-07 | 2019-02-15 | 福建农林大学 | Prediction method of soil organic carbon content based on random forest-ordinary kriging method |
CN110413666A (en) * | 2019-05-31 | 2019-11-05 | 河南省科学院地理研究所 | A kind of multi-source heterogeneous data integration method of farmland quality |
CN111879915A (en) * | 2020-08-04 | 2020-11-03 | 北京师范大学 | A high-resolution monthly soil salinity monitoring method and system for coastal wetlands |
CN113671130A (en) * | 2021-08-24 | 2021-11-19 | 上海交通大学 | A research method of interface barrier to control mechanism of weeds in paddy field |
CN113902580A (en) * | 2021-10-18 | 2022-01-07 | 四川农业大学 | A Reconstruction Method of Historical Farmland Distribution Based on Random Forest Model |
CN114324823A (en) * | 2021-12-30 | 2022-04-12 | 河南农业大学 | Time-space prediction mapping method for soil organic carbon content |
-
2022
- 2022-07-27 CN CN202210889387.5A patent/CN115266612A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3340130A1 (en) * | 2016-12-23 | 2018-06-27 | Hexagon Technology Center GmbH | Method for prediction of soil and/or plant condition |
CN109342697A (en) * | 2018-12-07 | 2019-02-15 | 福建农林大学 | Prediction method of soil organic carbon content based on random forest-ordinary kriging method |
CN110413666A (en) * | 2019-05-31 | 2019-11-05 | 河南省科学院地理研究所 | A kind of multi-source heterogeneous data integration method of farmland quality |
CN111879915A (en) * | 2020-08-04 | 2020-11-03 | 北京师范大学 | A high-resolution monthly soil salinity monitoring method and system for coastal wetlands |
CN113671130A (en) * | 2021-08-24 | 2021-11-19 | 上海交通大学 | A research method of interface barrier to control mechanism of weeds in paddy field |
CN113902580A (en) * | 2021-10-18 | 2022-01-07 | 四川农业大学 | A Reconstruction Method of Historical Farmland Distribution Based on Random Forest Model |
CN114324823A (en) * | 2021-12-30 | 2022-04-12 | 河南农业大学 | Time-space prediction mapping method for soil organic carbon content |
Non-Patent Citations (3)
Title |
---|
张钟亿: "《余庆耕地》", vol. 1, 31 December 2020, 贵州大学出版社, pages: 21 - 24 * |
曹佳萍等: "基于稀疏样点的南方丘陵地区耕地土壤有效磷制图", 《中国生态农业学报(中英文)》, vol. 30, no. 2, 28 February 2022 (2022-02-28), pages 290 - 301 * |
石云: "《GIS在宁夏水土保持生态中的实践ArcGIS专题应用》", vol. 1, 31 March 2020, 阳光出版社, pages: 48 - 62 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118537441A (en) * | 2024-05-27 | 2024-08-23 | 浙江大学 | Digital soil mapping method and device based on soil space neighborhood information |
CN118429564A (en) * | 2024-07-03 | 2024-08-02 | 江西省地质局地理信息工程大队 | Machine learning-based three-dimensional modeling method for soil of south hills |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108764255B (en) | Method for extracting winter wheat planting information | |
CN101963664B (en) | Microwave remote sensing pixel element decomposing method based on land and water living beings classifying information | |
CN104462660B (en) | A kind of field Transmission Line in Winter ice covering thickness distribution core method | |
CN107063197B (en) | Reservoir characteristic curve extraction method based on spatial information technology | |
CN110427857A (en) | A kind of transmission line of electricity geological disasters analysis method based on Remote Sensing Data Fusion Algorithm | |
CN110335355B (en) | Automatic calculation method for water level height of large shallow lake | |
CN108241779A (en) | Modeling Method of Spatial Filter Value of Ground PM2.5 Concentration Feature Vector Based on Remote Sensing Data | |
CN110927120B (en) | Early warning method for vegetation coverage | |
CN110118982A (en) | A kind of satellite precipitation data bearing calibration based on space optimization interpolation | |
CN111337434A (en) | A method and system for estimating biomass of reclaimed vegetation in mining area | |
CN112013822A (en) | Multispectral remote sensing water depth inversion method based on improved GWR model | |
Addink et al. | A comparison of conventional and geostatistical methods to replace clouded pixels in NOAA-AVHRR images | |
CN115266612A (en) | A method for mapping soil available phosphorus in cultivated land in southern hilly areas based on high-resolution environmental variables | |
WO2024036739A1 (en) | Reservoir water reserve inversion method and apparatus | |
CN110988909A (en) | TLS-based vegetation coverage determination method for sandy land vegetation in alpine and fragile areas | |
CN112329265A (en) | Refinement spatial estimation method and system of satellite remote sensing precipitation | |
CN112882032B (en) | Method and device for dynamically monitoring geological disaster SAR in key area of gas pipeline | |
Hutchinson | Adding the Z-dimension | |
NL2027951B1 (en) | Fine-scale delineation method for assessing land sensitivity to desertification | |
CN113238228A (en) | Level constraint-based InSAR three-dimensional surface deformation acquisition method, system and device | |
CN113538388B (en) | Arable land loss assessment method based on MODIS NDVI time sequence data | |
CN114461983A (en) | A spatial downscaling method for satellite precipitation products based on the principle of water balance | |
CN117523388A (en) | Urban different-stage identification method based on sub-pixel-level impervious surface change mode | |
CN115830476A (en) | Terrain factor space downscaling method | |
CN116977401A (en) | Lake area remote sensing extraction precision assessment method, system and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |