CN113609783B - Upward forecasting system and method of salt tide coupled with large-scale circulation climate information - Google Patents
Upward forecasting system and method of salt tide coupled with large-scale circulation climate information Download PDFInfo
- Publication number
- CN113609783B CN113609783B CN202110944776.9A CN202110944776A CN113609783B CN 113609783 B CN113609783 B CN 113609783B CN 202110944776 A CN202110944776 A CN 202110944776A CN 113609783 B CN113609783 B CN 113609783B
- Authority
- CN
- China
- Prior art keywords
- upward
- tide
- salt tide
- data
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 150000003839 salts Chemical class 0.000 title claims abstract description 222
- 238000000034 method Methods 0.000 title abstract description 24
- 238000012795 verification Methods 0.000 claims abstract description 24
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 claims abstract description 13
- 238000007637 random forest analysis Methods 0.000 claims description 36
- 238000012549 training Methods 0.000 claims description 25
- ZAMOUSCENKQFHK-UHFFFAOYSA-N Chlorine atom Chemical compound [Cl] ZAMOUSCENKQFHK-UHFFFAOYSA-N 0.000 claims description 20
- 229910052801 chlorine Inorganic materials 0.000 claims description 20
- 239000000460 chlorine Substances 0.000 claims description 20
- 238000004422 calculation algorithm Methods 0.000 claims description 19
- 238000004458 analytical method Methods 0.000 claims description 12
- 238000011144 upstream manufacturing Methods 0.000 claims description 11
- 238000012216 screening Methods 0.000 claims description 10
- 239000012535 impurity Substances 0.000 claims description 9
- 230000010355 oscillation Effects 0.000 claims description 9
- 238000010200 validation analysis Methods 0.000 claims description 9
- 238000007405 data analysis Methods 0.000 claims description 7
- 238000013277 forecasting method Methods 0.000 claims description 3
- 238000010586 diagram Methods 0.000 description 11
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- 230000000694 effects Effects 0.000 description 5
- 230000007774 longterm Effects 0.000 description 5
- 238000004088 simulation Methods 0.000 description 5
- 238000003066 decision tree Methods 0.000 description 4
- 230000002950 deficient Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000013499 data model Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000013517 stratification Methods 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012876 topography Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
- G06F30/27—Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Computer Hardware Design (AREA)
- Geometry (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明涉及一种耦合大尺度环流气候信息的咸潮上溯预报系统和方法,该系统包括数据输入模块、模型预报模块和数据输出模块。该耦合大尺度环流气候信息的咸潮上溯预报系统通过对需要预测受咸潮上溯影响地区的地区获取历史影响咸潮上溯的水文气象数据,依据从数据中筛选出关键影响因子建立咸潮上溯预报模型,并对咸潮上溯预报模型进行验证,验证通过的咸潮上溯预报模型输出的含氯度作为某个地区咸潮上溯的预报数据;该耦合大尺度环流气候信息的咸潮上溯预报系统得到的预报数据精度高,解决了现有咸潮上溯预报模型对数据要求高、信息获取成本大,且存在预报因子互相关的干扰,使得预报精度低的技术问题。
The invention relates to a salt tide upward forecasting system and method coupled with large-scale circulation climate information. The system includes a data input module, a model forecast module and a data output module. The salt tide upward forecasting system coupled with large-scale circulation climate information obtains the hydrometeorological data of the historically affected salt tide upward for the areas that need to predict the area affected by the salt tide upward movement, and establishes the salt tide upward forecast based on the key influencing factors selected from the data. model, and verify the salt tide upward forecast model, and the chloride content output by the salt tide upward forecast model that has passed the verification is used as the forecast data of the salt tide upward forecast in a certain area; the salt tide upward forecast system coupled with the large-scale circulation climate information obtains The forecast data has high precision, which solves the technical problems that the existing salt tide upward forecast model has high data requirements, high cost of information acquisition, and the interference of the cross-correlation of forecast factors, resulting in low forecast precision.
Description
技术领域technical field
本发明涉及环境数据模型技术领域,尤其涉及一种耦合大尺度环流气候信息的咸潮上溯预报系统和方法。The invention relates to the technical field of environmental data models, in particular to a salt tide upward forecasting system and method coupled with large-scale circulation climate information.
背景技术Background technique
咸潮上溯作为河口地区典型的水环境问题,在沿海或沿河的国家或地区均有出现。严重的咸潮上溯不仅会改变河道内的水质状况,破坏水生态平衡,还会扰乱周边城市的供水秩序,制约当地的社会经济发展。因此,研究咸潮上溯的关键影响要素及其时滞,建立咸潮上溯中长期预报机制,对区域水资源管理具有重要的现实意义。As a typical water environment problem in estuary areas, salt tide uptrend occurs in countries or regions along the coast or along the river. Severe upward salt tide will not only change the water quality in the river, destroy the ecological balance of water, but also disturb the water supply order of surrounding cities and restrict local social and economic development. Therefore, it is of great practical significance for regional water resources management to study the key influencing factors and time lag of the uptake of salt tide, and to establish a medium and long-term forecast mechanism for the uptake of salt tide.
现有对河口的咸潮上溯预报方法包括经验公式法、物理实验法、数值模拟法这三种方法。其中:经验公式法主要是建立咸潮与其影响要素间的经验公式,基于对大量实测资料的统计与分析,推导出两者之间的数理关系。例如,在《河口动力及形态学》中的戴维·普朗特尔总结了适用于部分混合河口的含氯度模拟方程,方程中某点的含氯度大小与河口的横断面面积、衰减系数、纵向混合系数、径流量以及该点到河口的距离有关,但是此经验公式法忽略了水平对流扩散方程的时间因子。物理实验法主要是物理模型或现场试验的方法,对物理过程进行还原与模拟。例如,通过在英国特威特河口的现场试验,分析了含氯度对径流与潮汐的响应规律,发现河口的理查德森数在落潮期达到经验最大值,而在涨潮期出现最小值。在《磨刀门咸潮入侵与抑咸技术》的卢陈、何用等开展了磨刀门水道咸潮上溯的物理模型试验,通过变化上游来水、海平面、潮差等影响要素,观测磨刀门水道咸潮上溯的变化特点。数值模拟法主要是通过计算机方法建立数学模型并求解。例如,采用非结构化有限体积近岸海洋三维模型FVCOM建立磨刀门水道至伶仃洋咸潮上溯的数值模型,研究并分析了磨刀门水道含氯度的时空变化规律及其垂向分层现象。Existing methods for upcasting of salt tide in estuaries include empirical formula method, physical experiment method, and numerical simulation method. Among them: the empirical formula method mainly establishes the empirical formula between the salt tide and its influencing factors, and derives the mathematical relationship between the two based on the statistics and analysis of a large number of measured data. For example, in Estuarine Dynamics and Morphology, David Plantell summarizes a chloride simulation equation suitable for partially mixed estuaries. coefficient, longitudinal mixing coefficient, runoff, and the distance from the point to the estuary, but this empirical formula method ignores the time factor of the horizontal convection-diffusion equation. The physical experiment method is mainly the method of physical model or field test, which restores and simulates the physical process. For example, through a field test in the Twitt Estuary in the United Kingdom, the response law of chloride content to runoff and tide is analyzed, and it is found that the Richardson number of the estuary reaches the empirical maximum value during the ebb tide period, and the minimum value during the high tide period. In "Modaomen Salt Tide Intrusion and Salt Suppression Technology", Lu Chen, He Yong, etc. carried out the physical model test of the salt tide upstream of the Modaomen waterway. The changing characteristics of the salt tide in the Modaomen waterway. Numerical simulation method mainly establishes and solves mathematical models through computer methods. For example, the unstructured finite volume nearshore ocean three-dimensional model FVCOM was used to establish a numerical model for the uptake of the salt tide from the Modaomen channel to Lingdingyang, and the temporal and spatial variation of the chloride content in the Modaomen channel and its vertical stratification were studied and analyzed. Phenomenon.
然而,上述三种方法大多对运用条件或数据要求较高,且操作较复杂,从而限制了缺资料地区的咸潮上溯预报研究。并且,影响咸潮上溯的主要因素包括上游来水、潮汐、海平面、风以及河口地形等。目前,上述三种方法对咸潮上溯的研究并没有考虑将大尺度环流气候因子应用于咸潮上溯的中长期预报,且大多通过日尺度的径流与潮汐变化构建咸潮上溯预报模型,对月及其他时间尺度的含氯度模拟研究普遍偏少,从而导致得到模拟咸潮上溯的精度低。However, most of the above three methods have higher requirements on application conditions or data, and the operation is more complicated, which limits the research on the upcasting of salt tide in data-deficient areas. In addition, the main factors affecting the uptake of salt tide include upstream water, tides, sea level, wind, and estuary topography. At present, the above-mentioned three methods have not considered the application of large-scale circulation climatic factors to the mid- and long-term forecasting of salt tide upward, and most of them have constructed salt tide upward forecast models based on daily-scale runoff and tidal changes. In general, there are few studies on the simulation of chloride content on other time scales, which leads to the low accuracy of the simulated salt tide.
发明内容SUMMARY OF THE INVENTION
本发明实施例提供了一种耦合大尺度环流气候信息的咸潮上溯预报系统和方法,用于解决现有咸潮上溯预报模型对数据要求高、信息获取成本大,且存在预报因子互相关的干扰,使得预报精度低的技术问题。The embodiments of the present invention provide a salt tide upward forecasting system and method coupled with large-scale circulation climate information, which are used to solve the problem that the existing salt tide upward forecasting model has high data requirements, high information acquisition costs, and the existence of cross-correlation of forecasting factors. interference, which makes the technical problem of low forecast accuracy.
为了实现上述目的,本发明实施例提供如下技术方案:In order to achieve the above purpose, the embodiments of the present invention provide the following technical solutions:
一种耦合大尺度环流气候信息的咸潮上溯预报系统,包括数据输入模块、模型预报模块和数据输出模块;A salt tide upward forecasting system coupled with large-scale circulation climate information, comprising a data input module, a model forecast module and a data output module;
所述数据输入模块,用于获取影响咸潮上溯的水文气象数据,并对获取的水文气象数据通过随机森林重要度分析,得到影响咸潮上溯的关键影响因子;The data input module is used to obtain the hydrometeorological data that affects the uptake of the salt tide, and obtains the key influencing factors that affect the uptake of the salt tide through the random forest importance analysis of the obtained hydrometeorological data;
所述模型预报模块,用于依据所述关键影响因子采用随机森林算法建立咸潮上溯预报模型,并对所述咸潮上溯预报模型进行验证,得到咸潮上溯预报最优模型;The model forecasting module is used for establishing a salt tide upward forecasting model by using a random forest algorithm according to the key influencing factors, and verifying the salt tide upward forecasting model to obtain an optimal salt tide upward forecasting model;
所述数据输出模块,用于输出所述咸潮上溯预报最优模型预测受咸潮上溯影响地区的含氯度。The data output module is used for outputting the optimal model for the upward forecasting of the salt tide to predict the chlorine content of the area affected by the upward salt tide.
优选地,所述数据输入模块包括数据获取子模块、数据分析子模块和筛选子模块;Preferably, the data input module includes a data acquisition sub-module, a data analysis sub-module and a screening sub-module;
所述数据采集子模块,用于获取某地河口地区历史的水文因子、大尺度环流气候因子和含氯度,得到咸潮上溯的水文气象数据;The data acquisition sub-module is used to obtain the historical hydrological factors, large-scale circulation climate factors and chlorine content of the estuary area of a certain place, and obtain the hydrometeorological data of the salt tide upstream;
所述数据分析子模块,用于通过袋外样本集的估计误差和基尼不纯度分别对所述影响咸潮上溯的水文气象数据进行随机重要度分析,得到不同时间尺度下各因子与含氯度之间的第一重要度数集和第二重要度数集;The data analysis sub-module is used to perform random importance analysis on the hydrometeorological data affecting the upward salt tide through the estimation error and Gini impurity of the sample set outside the bag, and obtain each factor and the chlorine content at different time scales. between the first important degree set and the second important degree set;
所述筛选子模块,用于分别对所述第一重要度数集和所述第二重要度数中的各个因子按重要度从大到小排序后进行筛选,得到对应的第一因子集和第二因子集;并从所述第一因子集和所述第二因子集中选择相同的因子作为影响咸潮上溯的关键影响因子。The screening sub-module is used to screen each factor in the first importance degree set and the second importance degree in descending order of importance, to obtain the corresponding first factor set and second importance degree. factor set; and select the same factor from the first factor set and the second factor set as the key influencing factor affecting the uptake of salt tide.
优选地,所述水文因子包括流量和潮位,所述大尺度环流气候因子包括太平洋年代际振荡和南方涛动指数。Preferably, the hydrological factors include flow and tide level, and the large-scale circulation climate factors include the Pacific Interdecadal Oscillation and the Southern Oscillation Index.
优选地,所述筛选子模块用于分别对所述第一重要度数集和所述第二重要度数中的各个因子按重要度从大到小排序后,筛选出重要度排在前三分之二的因子对应构建成第一因子集和第二因子集。Preferably, the screening sub-module is configured to sort each factor in the first importance degree set and the second importance degree in descending order of importance, and filter out the top third of the importance degree. Two factors are correspondingly constructed into a first factor set and a second factor set.
优选地,所述模型预报模块包括数集分类子模块、建立模型子模块和验证模型子模块;Preferably, the model prediction module includes a data set classification submodule, a model establishment submodule and a model verification submodule;
所述数集分类子模块,用于将得到所有所述关键影响因子构建数集,并将数集分为训练集和验证集;The data set classification submodule is used to obtain all the key influencing factors to construct a data set, and divide the data set into a training set and a verification set;
所述建立模型子模块,用于采用随机森林算法对所述训练集进行模型训练,建立咸潮上溯预报模型;The model building submodule is used to perform model training on the training set by using the random forest algorithm, and establish a salt tide upward forecasting model;
所述验证模型子模块,用于将所述验证集输入所述咸潮上溯预报模型,输出与所述验证集对应的纳什效率系数和决定系数,若所述纳什效率系数和所述决定系数均大于系数阈值,得到咸潮上溯预报最优模型。The verification model sub-module is used to input the verification set into the salt tide upward forecasting model, and output the Nash efficiency coefficient and determination coefficient corresponding to the verification set, if both the Nash efficiency coefficient and the determination coefficient are If it is greater than the coefficient threshold, the optimal model for upcasting of salt tide is obtained.
优选地,所述数集分类子模块用于将80%的数集作为训练集,20%的数集作为验证集。Preferably, the data set classification sub-module is used to use 80% of the data set as a training set and 20% of the data set as a validation set.
优选地,所述系数阈值为0.5。Preferably, the coefficient threshold is 0.5.
本发明还提供一种耦合大尺度环流气候信息的咸潮上溯预报方法,包括以下步骤:The present invention also provides a salt tide upward forecasting method coupled with large-scale circulation climate information, comprising the following steps:
获取影响咸潮上溯的水文气象数据,并对获取的水文气象数据通过随机森林重要度分析,得到影响咸潮上溯的关键影响因子;Obtain the hydrometeorological data affecting the uptake of the salty tide, and analyze the importance of the acquired hydrometeorological data through random forests to obtain the key influencing factors that affect the uptake of the salty tide;
依据所述关键影响因子采用随机森林算法建立咸潮上溯预报模型,并对所述咸潮上溯预报模型进行验证,得到咸潮上溯预报最优模型;According to the key influencing factors, a random forest algorithm is used to establish a salt tide upward forecast model, and the salt tide upward forecast model is verified to obtain an optimal salt tide upward forecast model;
输出所述咸潮上溯预报最优模型预测受咸潮上溯影响地区的含氯度。Outputting the optimal model for the upward forecasting of the salt tide to predict the chlorine content of the area affected by the upward salt tide.
优选地,对获取的水文气象数据通过随机森林重要度分析,得到影响咸潮上溯的关键影响因子的步骤包括:Preferably, the steps of obtaining the key influencing factors affecting the upward trace of the salt tide through random forest importance analysis on the obtained hydrometeorological data include:
获取某地河口地区历史的水文因子、大尺度环流气候因子和含氯度,得到咸潮上溯的水文气象数据:Obtain the historical hydrological factors, large-scale circulation climatic factors and chlorine content of a certain estuary area, and obtain the hydrometeorological data of the salt tide upstream:
通过袋外样本集的估计误差和基尼不纯度分别对所述影响咸潮上溯的水文气象数据进行随机重要度分析,得到不同时间尺度下各因子与含氯度之间的第一重要度数集和第二重要度数集;Through the estimation error and Gini impurity of the out-of-bag sample set, the random importance analysis was carried out on the hydrometeorological data affecting the uptake of the salt tide, respectively, and the first importance set and the chloride content at different time scales were obtained. The second important degree set;
对所述第一重要度数集和所述第二重要度数中的各个因子按重要度从大到小排序后进行筛选,得到对应的第一因子集和第二因子集;并从所述第一因子集和所述第二因子集中选择相同的因子作为影响咸潮上溯的关键影响因子。After sorting each factor in the first importance degree set and the second importance degree degree according to the importance degree from large to small, the corresponding first factor set and the second factor set are obtained; and from the first factor set and the second factor set; The factor set and the second factor set select the same factor as the key influencing factor affecting the uptake of the salt tide.
优选地,得到咸潮上溯预报最优模型的步骤包括:Preferably, the step of obtaining the optimal model for upcasting of salt tide includes:
将得到所有所述关键影响因子构建数集,并将数集分为训练集和验证集;All the key impact factors will be obtained to construct a data set, and the data set will be divided into a training set and a validation set;
采用随机森林算法对所述训练集进行模型训练,建立咸潮上溯预报模型;The random forest algorithm is used to perform model training on the training set, and a salt tide upward forecasting model is established;
将所述验证集输入所述咸潮上溯预报模型,输出与所述验证集对应的纳什效率系数和决定系数,若所述纳什效率系数和所述决定系数均大于系数阈值,得到咸潮上溯预报最优模型。Input the verification set into the salt tide upward forecast model, and output the Nash efficiency coefficient and determination coefficient corresponding to the verification set. If both the Nash efficiency coefficient and the determination coefficient are greater than the coefficient threshold, the salt tide upward forecast is obtained. optimal model.
从以上技术方案可以看出,本发明实施例具有以下优点:该耦合大尺度环流气候信息的咸潮上溯预报系统和方法,包括数据输入模块、模型预报模块和数据输出模块,数据输入模块用于获取影响咸潮上溯的水文气象数据,并对获取的水文气象数据通过随机森林重要度分析,得到影响咸潮上溯的关键影响因子;模型预报模块用于依据关键影响因子采用随机森林算法建立咸潮上溯预报模型,并对咸潮上溯预报模型进行验证,得到咸潮上溯预报最优模型;数据输出模块用于输出咸潮上溯预报最优模型预测受咸潮上溯影响地区的含氯度。该耦合大尺度环流气候信息的咸潮上溯预报系统通过对需要预测受咸潮上溯影响地区的地区获取历史影响咸潮上溯的水文气象数据,依据从数据中筛选出关键影响因子建立咸潮上溯预报模型,并对咸潮上溯预报模型进行验证,验证通过的咸潮上溯预报模型输出的含氯度作为某个地区咸潮上溯的预报数据;该耦合大尺度环流气候信息的咸潮上溯预报系统得到的预报数据精度高,解决了现有咸潮上溯预报模型对数据要求高、信息获取成本大,且存在预报因子互相关的干扰,使得预报精度低的技术问题。It can be seen from the above technical solutions that the embodiments of the present invention have the following advantages: the salt tide upward forecasting system and method coupled with large-scale circulation climate information includes a data input module, a model forecast module and a data output module, and the data input module is used for Obtain the hydrometeorological data that affects the uptake of the salty tide, and analyze the importance of the obtained hydrometeorological data through random forests to obtain the key influencing factors affecting the uptake of the salty tide; The data output module is used to output the optimal model of salt tide upward forecast to predict the chlorine content in the area affected by the salt tide upward forecast. The salt tide upward forecasting system coupled with large-scale circulation climate information obtains the hydrometeorological data of the historically affected salt tide upward for the areas that need to predict the area affected by the salt tide upward movement, and establishes the salt tide upward forecast based on the key influencing factors selected from the data. model, and verify the salt tide upward forecast model. The chloride content output by the salt tide upward forecast model that has passed the verification is used as the forecast data of the salt tide upward forecast in a certain area; the salt tide upward forecast system coupled with the large-scale circulation climate information obtains The forecast data has high precision, which solves the technical problems that the existing salt tide upward forecast model has high data requirements, high information acquisition cost, and the interference of the cross-correlation of forecast factors, resulting in low forecast precision.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其它的附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained based on these drawings without any creative effort.
图1为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统的框架图。FIG. 1 is a frame diagram of a salt tide upward forecasting system coupled with large-scale circulation climate information according to an embodiment of the present invention.
图2为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统采用的随机森林算法框架图。FIG. 2 is a frame diagram of a random forest algorithm adopted by the salt tide upward forecasting system coupled with large-scale circulation climate information according to an embodiment of the present invention.
图3为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统数据输入模块的框架图。FIG. 3 is a frame diagram of a data input module of a salt tide upward forecasting system coupled with large-scale circulation climate information according to an embodiment of the present invention.
图4为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统的关键影响因子。FIG. 4 shows the key influencing factors of the salt tide upward forecasting system coupled with the large-scale circulation climate information according to the embodiment of the present invention.
图5为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统模型预报模块的框架图。FIG. 5 is a frame diagram of a model forecasting module of a salt tide upward forecasting system coupled with large-scale circulation climate information according to an embodiment of the present invention.
图6为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统模型预报模块的月预报流程图。FIG. 6 is a monthly forecast flow chart of the model forecast module of the salt tide upward forecast system coupled with the large-scale circulation climate information according to the embodiment of the present invention.
图7为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统数据输出模块的基于水文影响因子预报月尺度咸潮预报图。FIG. 7 is a monthly-scale salt tide forecast diagram based on the hydrological influence factor forecast of the data output module of the salt tide upward forecasting system coupled with the large-scale circulation climate information according to the embodiment of the present invention.
图8为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统数据输出模块的基于水文与大尺度环流气候影响因子相结合预报月尺度咸潮预报图。FIG. 8 is a forecast diagram of monthly-scale salt tide forecast based on the combination of hydrology and large-scale circulation climate influence factors of the data output module of the salt tide upward forecasting system coupled with large-scale circulation climate information according to an embodiment of the present invention.
具体实施方式Detailed ways
为使得本发明的发明目的、特征、优点能够更加的明显和易懂,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,下面所描述的实施例仅仅是本发明一部分实施例,而非全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其它实施例,都属于本发明保护的范围。In order to make the purpose, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the following The described embodiments are only some, but not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
本申请实施例提供了一种耦合大尺度环流气候信息的咸潮上溯预报系统和方法,用于解决了现有咸潮上溯预报模型对数据要求高、信息获取成本大,且存在预报因子互相关的干扰,使得预报精度低的技术问题。The embodiments of the present application provide a salt tide upward forecasting system and method coupled with large-scale circulation climate information, which are used to solve the problem that the existing salt tide upward forecasting model has high data requirements, high information acquisition costs, and the existence of cross-correlation of forecasting factors. interference, which makes the technical problem of low forecast accuracy.
实施例一:Example 1:
图1为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统的框架图,图2为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统采用的随机森林算法框架图。FIG. 1 is a frame diagram of the salt tide upward forecasting system coupled with large-scale circulation climate information according to the embodiment of the present invention, and FIG. 2 is the salt tide upward forecast system used in the coupled large-scale circulation climate information according to the embodiment of the present invention. Random forest algorithm framework diagram.
如图1和图2所示,本发明实施例提供了一种耦合大尺度环流气候信息的咸潮上溯预报系统,包括数据输入模块10、模型预报模块20和数据输出模块30;As shown in FIG. 1 and FIG. 2 , an embodiment of the present invention provides a salt tide upward forecasting system coupled with large-scale circulation climate information, including a data input module 10, a model forecast module 20 and a data output module 30;
数据输入模块10,用于获取影响咸潮上溯的水文气象数据,并对获取的水文气象数据通过随机森林重要度分析,得到影响咸潮上溯的关键影响因子;The data input module 10 is used to obtain the hydrometeorological data affecting the uptake of the salt tide, and obtain the key influencing factors affecting the uptake of the salt tide by analyzing the acquired hydrometeorological data through random forest importance;
模型预报模块20,用于依据关键影响因子采用随机森林算法建立咸潮上溯预报模型,并对咸潮上溯预报模型进行验证,得到咸潮上溯预报最优模型;The model forecasting module 20 is used for establishing a salt tide upward forecasting model by using a random forest algorithm according to key influencing factors, and verifying the salt tide upward forecasting model to obtain an optimal salt tide upward forecasting model;
数据输出模块30,用于输出咸潮上溯预报最优模型预测受咸潮上溯影响地区的含氯度。The data output module 30 is used for outputting the optimal model for the upward forecasting of the salt tide to predict the chlorine content of the area affected by the upward salt tide.
在本发明实施例中,数据输入模块10主要是获取历史以来某个地区影响咸潮上溯的水文气象数据,对获取的水文气象数据采用随机森林重要度分析筛选出影响咸潮上溯的关键影响因子。In the embodiment of the present invention, the data input module 10 mainly acquires the hydrometeorological data affecting the uptake of the salt tide in a certain area in history, and uses random forest importance analysis on the acquired hydrometeorological data to screen out the key influencing factors that affect the uptake of the salt tide .
在本发明实施例中,模型预报模块20主要是将筛选出的关键影响因子通过随机森林算法建立咸潮上溯预报模型,之后将得到的咸潮上溯预报模型进行验证,验证通过的咸潮上溯预报模型作为咸潮上溯预报最优模型。In the embodiment of the present invention, the model forecast module 20 mainly uses the selected key influencing factors to establish a salt tide upward forecast model through random forest algorithm, and then verifies the obtained salt tide upward forecast model, and the verified salt tide upward forecast model is verified. The model is regarded as the optimal model for upcasting of salt tide.
在本发明实施例中,数据输出模块30主要是将得到的咸潮上溯预报最优模型输出的含氯度进行输出,咸潮上溯预报最优模型输出的含氯度为预测某个地区咸潮上溯的预报数据。In the embodiment of the present invention, the data output module 30 mainly outputs the obtained chlorine content of the salt tide upward forecast optimal model output, and the output chlorine content of the salt tide upward forecast optimal model is to predict the salt tide in a certain area. Backward forecast data.
本发明提供的一种耦合大尺度环流气候信息的咸潮上溯预报系统,包括数据输入模块、模型预报模块和数据输出模块,数据输入模块用于获取影响咸潮上溯的水文气象数据,并对获取的水文气象数据通过随机森林重要度分析,得到影响咸潮上溯的关键影响因子;模型预报模块用于依据关键影响因子采用随机森林算法建立咸潮上溯预报模型,并对咸潮上溯预报模型进行验证,得到咸潮上溯预报最优模型;数据输出模块用于输出咸潮上溯预报最优模型预测受咸潮上溯影响地区的含氯度。该耦合大尺度环流气候信息的咸潮上溯预报系统通过对需要预测受咸潮上溯影响地区的地区获取历史影响咸潮上溯的水文气象数据,依据从数据中筛选出关键影响因子建立咸潮上溯预报模型,并对咸潮上溯预报模型进行验证,验证通过的咸潮上溯预报模型输出的含氯度作为某个地区咸潮上溯的预报数据;该耦合大尺度环流气候信息的咸潮上溯预报系统得到的预报数据精度高,解决了现有咸潮上溯预报模型对数据要求高、信息获取成本大,且存在预报因子互相关的干扰,使得预报精度低的技术问题。The present invention provides a salt tide upward forecasting system coupled with large-scale circulation climate information, comprising a data input module, a model forecast module and a data output module. The data input module is used to obtain hydrometeorological data affecting the salt tide upward movement, and to obtain The hydrometeorological data obtained through the random forest importance analysis obtains the key influencing factors that affect the upcasting of the salt tide; the model forecast module is used to establish the upcasting forecast model of the salty tide by using the random forest algorithm according to the key influencing factors, and to verify the upward forecasting model of the salty tide. , to obtain the optimal model for the upward forecast of the salt tide; the data output module is used to output the optimal model of the upward forecast of the salt tide to predict the chlorine content in the area affected by the upward salt tide. The salt tide upward forecasting system coupled with large-scale circulation climate information obtains the hydrometeorological data of the historically affected salt tide upward for the areas that need to predict the area affected by the salt tide upward movement, and establishes the salt tide upward forecast based on the key influencing factors selected from the data. model, and verify the salt tide upward forecast model. The chloride content output by the salt tide upward forecast model that has passed the verification is used as the forecast data of the salt tide upward forecast in a certain area; the salt tide upward forecast system coupled with the large-scale circulation climate information obtains The forecast data has high precision, which solves the technical problems that the existing salt tide upward forecast model has high data requirements, high information acquisition cost, and the interference of the cross-correlation of forecast factors, resulting in low forecast precision.
图3为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统数据输入模块的框架图,图4为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统的关键影响因子。Fig. 3 is a frame diagram of the data input module of the salt tide upward forecasting system coupled with large-scale circulation climate information according to the embodiment of the present invention, and Fig. 4 is the salt tide upward forecast coupled with large-scale circulation climate information according to the embodiment of the present invention key factors of the system.
如图3和图4所示,本发明的一个实施例中,数据输入模块10包括数据获取子模块11、数据分析子模块12和筛选子模块13;As shown in FIG. 3 and FIG. 4 , in an embodiment of the present invention, the data input module 10 includes a data acquisition sub-module 11, a
数据采集子模块11,用于获取某地河口地区历史的水文因子、大尺度环流气候因子和含氯度,得到咸潮上溯的水文气象数据;The data collection sub-module 11 is used to obtain the historical hydrological factors, large-scale circulation climate factors and chlorine content of the estuary area of a certain place, and obtain the hydrometeorological data of the salt tide upstream;
数据分析子模块12,用于通过袋外样本集的估计误差和基尼不纯度分别对影响咸潮上溯的水文气象数据进行随机重要度分析,得到不同时间尺度下各因子与含氯度之间的第一重要度数集和第二重要度数集;The
筛选子模块13,用于分别对第一重要度数集和第二重要度数中的各个因子按重要度从大到小排序后进行筛选,得到对应的第一因子集和第二因子集;并从第一因子集和第二因子集中选择相同的因子作为影响咸潮上溯的关键影响因子;The screening sub-module 13 is used to screen each factor in the first importance degree set and the second importance degree degree in descending order of importance, to obtain the corresponding first factor set and second factor set; and from The first factor set and the second factor set select the same factor as the key influencing factor affecting the uptrend of salt tide;
其中,水文因子包括流量和潮位等,大尺度环流气候因子包括太平洋年代际振荡和南方涛动指数等。Among them, the hydrological factors include flow and tidal level, etc., and the large-scale circulation climate factors include the Pacific Interdecadal Oscillation and the Southern Oscillation Index.
需要说明的是,筛选子模块13主要用于分别对第一重要度数集和第二重要度数中的各个因子按重要度从大到小排序后,筛选出重要度排在前三分之二的因子对应构建成第一因子集和第二因子集。其中,数据输入模块10基于随机森林重要度评估全面细致地探讨了14个大尺度环流气候因子与某地区咸潮上溯活动之间的遥相关关系及时滞,确定了不同时间尺度下咸潮上溯的预报因子组合。It should be noted that the screening sub-module 13 is mainly used to sort each factor in the first importance degree set and the second importance degree in descending order of importance, and then filter out the top two-thirds of the importance degree. The factors are correspondingly constructed into a first factor set and a second factor set. Among them, the data input module 10 comprehensively and meticulously discussed the telecorrelation relationship and time lag between 14 large-scale circulation climate factors and the upward movement of the salt tide in a certain area based on the random forest importance assessment, and determined the upward movement of the salt tide at different time scales. combination of predictors.
在本发明实施例中,数据输入模块10主要是通过数据采集子模块11获取的流量、潮位等水文因子、太平洋年代际振荡、南方涛动指数等大尺度环流因子和含氯度等数据组成影响咸潮上溯的水文气象数据;之后通过数据分析子模块12的随机森林重要度分析,探究数据中日、月、年等不同时间尺度下各因子与含氯度之间的相关关系,即是重要度;最后通过筛选子模块13从第一重要度数集和第二重要度数集中筛选出相同且重要度排在前2/3的因子作为影响咸潮上溯的关键影响因子。In the embodiment of the present invention, the data input module 10 is mainly composed of data such as flow, tidal level and other hydrological factors, large-scale circulation factors such as the Pacific decadal oscillation, the Southern Oscillation Index, and chlorine content obtained through the data acquisition sub-module 11. The hydrometeorological data of the salt tide upstream; then, through the random forest importance analysis of the
需要说明的是,重要度越高,表示因子与含氯度之间的关联性越密切,因此,从中可以得到影响某地河口地区咸潮上溯活动强弱的关键影响因子。将得到的关键影响因子作为咸潮上溯预报模型的输入数据,使得得到的咸潮上溯预报最优模型输出的预报数据精度高。咸潮上溯预报最优模型考虑了大尺度环流气候因子与咸潮上溯活动之间的相关关系,增强了咸潮上溯预报模型对咸潮上溯特征的捕捉与提取,且由于模型输入数据的易获取性,使得得到的咸潮上溯预报最优模型也能有效应用于缺资料地区的咸潮中长期预报。It should be noted that the higher the importance, the closer the correlation between the factor and the chlorine content. Therefore, the key influencing factors that affect the strength of the salt tide upstream activity in a certain estuary area can be obtained. The obtained key influencing factors are used as the input data of the salt tide upward forecasting model, so that the forecast data output by the obtained salt tide upward forecasting optimal model has high precision. The optimal model for the upward forecast of salt tide takes into account the correlation between the large-scale circulation climatic factors and the upward movement of salt tide, which enhances the capture and extraction of the upward forecast of salt tide by the upward forecast model of salt tide. Therefore, the obtained optimal model for upcasting of salt tide can also be effectively applied to mid- and long-term forecasting of salt tide in data-deficient areas.
在本发明的实施例中,袋外样本集的估计误差进行重要性评估也称置换重要性评估,主要通过比较随机置换解释变量值前后模型效果的变化来评估变量的重要性,变化越大,说明解释变量在模型模拟过程中所发挥的作用越重要。其计算公式如下:In the embodiment of the present invention, the importance evaluation of the estimation error of the out-of-bag sample set is also called the replacement importance evaluation. Explain the more important the role of explanatory variables in the model simulation process. Its calculation formula is as follows:
式中,VIM(xi)OOB为根据袋外样本集估计误差进行评估的第i个解释变量的重要性指数;n为决策树的数量;OOBerrj和OOBerr′j分别为置换前后第j棵决策树的袋外样本集估计误差。解释变量指的是用于评价重要度的各影响因子。In the formula, VIM(x i ) OOB is the importance index of the ith explanatory variable evaluated according to the estimation error of the out-of-bag sample set; n is the number of decision trees; OOBerr j and OOBerr′ j are the jth tree before and after replacement, respectively. Out-of-bag sample set estimation error for decision trees. The explanatory variables refer to each influencing factor used to evaluate the importance.
在本发明的实施例中,基尼不纯度是指从一个数据集中随机抽取子项,该子项被错误标记的频率。对于随机抽取的子项X1,X2,...,Xm,基尼不纯度(即某一子项被错误标记的频率)为:In the embodiment of the present invention, Gini impurity refers to the frequency of randomly extracting sub-items from a data set, and the sub-items are incorrectly marked. For randomly drawn sub-items X 1 , X 2 ,...,X m , the Gini impurity (ie, how often a sub-item is mislabeled) is:
式中,fi为随机样本Xi被正确标记的概率。In the formula, f i is the probability that the random sample X i is correctly marked.
式中,IGt(f)和IGr(f)分别为节点分裂后两个新节点所对应的基尼不纯度;n为决策树的数量;k是解释变量xi在第j棵决策树中出现m节点的次数。In the formula, IG t (f) and IG r (f) are the Gini impurity corresponding to the two new nodes after node splitting; n is the number of decision trees; k is the explanatory variable x i in the jth decision tree. The number of times m nodes appear.
图5为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统模型预报模块的框架图,图6为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统模型预报模块的月预报流程图。Fig. 5 is a frame diagram of the model forecast module of the salt tide upward forecasting system coupled with large-scale circulation climate information according to the embodiment of the present invention, and Fig. 6 is the salt tide upward forecast coupled with large-scale circulation climate information according to the embodiment of the present invention The monthly forecast flow chart of the system model forecast module.
如图5所示,在本发明的一个实施例中,模型预报模块20包括数集分类子模块21、建立模型子模块22和验证模型子模块23;As shown in FIG. 5, in one embodiment of the present invention, the model prediction module 20 includes a data set classification submodule 21, a model establishment submodule 22 and a model verification submodule 23;
数集分类子模块21,用于将得到所有关键影响因子构建数集,并将数集分为训练集和验证集;The data set classification submodule 21 is used to construct a data set by obtaining all the key influencing factors, and divide the data set into a training set and a verification set;
建立模型子模块22,用于采用随机森林算法对训练集进行模型训练,建立咸潮上溯预报模型;establishing a model sub-module 22, which is used to perform model training on the training set by using the random forest algorithm, and establish a salt tide upward forecasting model;
验证模型子模块23,用于将验证集输入咸潮上溯预报模型,输出与验证集对应的纳什效率系数和决定系数,若纳什效率系数和决定系数均大于系数阈值,得到咸潮上溯预报最优模型。The verification model sub-module 23 is used to input the verification set into the salt tide upward forecast model, and output the Nash efficiency coefficient and determination coefficient corresponding to the verification set. If the Nash efficiency coefficient and the determination coefficient are both greater than the coefficient threshold, the optimal salt tide upward forecast is obtained. Model.
需要说明的是,系数阈值优先选为0.5。数集分类子模块用于将80%的咸潮上溯数据和数集作为训练集,20%的咸潮上溯数据和数集作为验证集。其中,模型预报模块20采用随机森林算法建立了耦合大尺度环流气候信息的咸潮上溯预报模型。It should be noted that the coefficient threshold is preferably 0.5. The data set classification submodule is used to take 80% of the data and data sets from the salt tide as the training set, and 20% of the data and data sets from the salt tide as the validation set. Among them, the model forecast module 20 uses the random forest algorithm to establish a salt tide upward forecast model coupled with large-scale circulation climate information.
如图6所示,在本发明实施例中,模型预报模块20采用随机森林算法构建不同时间尺度下某地河口地区的咸潮上溯预报模型,将训练集中的关键影响因子按照重要度大小逐步输入至基于随机森林算法的咸潮上溯预报模型中,结合已有的含氯度数据,对咸潮上溯预报模型进行训练与验证,最终选择拟合度最优的输入变量组合作为咸潮上溯预报最优模型的预报因子进行咸潮上溯预报。在本实施例中,以咸潮上溯的月尺度预报为例,咸潮上溯预报模型的建立与应用分别为:咸潮上溯预报模型设置不同的输入变量情景,其中:情景1输入水文因子,情景2输入水文耦合大尺度环流气候因子,情景3输入大尺度环流气候因子。将三种情景分别输入至咸潮上溯预报模型,通过训练与验证,使咸潮上溯预报模型达到三种情景下的最优拟合效果,模型的拟合效果采用纳什效率系数(NSE)和决定系数(R2)进行评估。将训练得到的咸潮上溯预报最优模型应用于咸潮上溯预报,比较三种情景下的预报精度。As shown in FIG. 6 , in the embodiment of the present invention, the model forecasting module 20 adopts the random forest algorithm to construct a salt tide upward forecasting model in a certain estuary area under different time scales, and gradually inputs the key influencing factors in the training set according to their importance. In the salt tide upward forecast model based on random forest algorithm, combined with the existing chlorine content data, the salt tide upward forecast model is trained and verified, and the input variable combination with the best fitting degree is finally selected as the most suitable salt tide upward forecast model. The forecast factors of the excellent model are used to forecast the salt tide upward. In this embodiment, taking the monthly-scale forecast of the upward salt tide as an example, the establishment and application of the upward forecast model of the salt tide are as follows: different input variable scenarios are set for the upward forecast model of the salt tide, wherein:
图7为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统数据输出模块的基于水文影响因子预报月尺度咸潮预报图,图8为本发明实施例所述的耦合大尺度环流气候信息的咸潮上溯预报系统数据输出模块的基于水文与大尺度环流气候影响因子相结合预报月尺度咸潮预报图。FIG. 7 is a monthly-scale salt tide forecast diagram based on the hydrological influence factor forecast of the data output module of the salt tide upward forecasting system coupled with large-scale circulation climate information according to the embodiment of the present invention, and FIG. The monthly-scale salt tide forecast map based on the combination of hydrology and large-scale circulation climate influence factors of the data output module of the salt tide upward forecasting system of the scale circulation climate information.
在本发明的实施例中,该耦合大尺度环流气候信息的咸潮上溯预报系统若以月尺度预报为例,选取2005-2015年磨刀门水道的流量、潮位等水文影响要素及太平洋年代际振荡、南方涛动指数等大尺度环流因子的数据,通过随机森林重要度分析,探究不同时间尺度下磨刀门水道咸潮上溯及其影响因素之间的关联程度,从而识别影响该地区咸潮上溯的关键影响因子。采用咸潮上溯数据和数集对咸潮上溯预报模型进行训练与验证,其中80%的咸潮上溯数据和数集作为咸潮上溯预报模型的训练集,20%的咸潮上溯数据和数集作为咸潮上溯预报模型的验证集。如图7和图8所示,在验证过程中,基于水文与大尺度环流气候影响因子相结合预报的咸潮上溯预报模型的拟合效果最优,得到的NSE和R2值分别为0.80和0.82,因此耦合大尺度环流气候信息可显著提高模型的预报效果,对指导缺资料地区的咸潮上溯预报具有重要的现实意义。In the embodiment of the present invention, if the salt tide upward forecasting system coupled with large-scale circulation climate information takes the monthly scale forecast as an example, the hydrological influencing factors such as flow and tidal level of the Modaomen waterway from 2005 to 2015 and the interdecadal hydrological factors of the Pacific Ocean are selected. Based on the data of large-scale circulation factors such as oscillation and Southern Oscillation Index, through random forest importance analysis, the degree of correlation between the uptake of the salt tide in the Modaomen Channel and its influencing factors at different time scales was explored, so as to identify the influence of the salt tide in the region. The key influencing factors of the backtracking. The salt tide upward forecast model is trained and verified by using the salt tide upward data and data sets, of which 80% of the salt tide upward data and data sets are used as the training set of the salt tide upward forecast model, and 20% of the salt tide upward data and data sets are used as the training set of the salt tide upward forecast model. As the validation set of the salt tide upward forecasting model. As shown in Figures 7 and 8, during the verification process, the salt tide upward forecast model based on the combined forecast of hydrology and large-scale circulation climate factors has the best fitting effect, and the obtained NSE and R2 values are 0.80 and 0.82, respectively. Therefore, coupling the large-scale circulation climate information can significantly improve the forecasting effect of the model, which is of great practical significance for guiding the upcasting of salt tides in data-deficient areas.
本发明提供的耦合大尺度环流气候信息的咸潮上溯预报系统能够将大尺度环流气候因子纳入咸潮上溯中长期预报,表明遥相关影响下大尺度环流气候因子在咸潮中长期预报中应用具有可行性与有效性;为了有效避免了预报信息的冗余以及预报因子之间的互相关性,该咸潮上溯预报系统通过比选的方法确定不同时间尺度下咸潮上溯的预报要素及滞时,且该咸潮上溯预报系统实现对数据资料要求低且操作简便。The salt tide upward forecasting system coupled with the large-scale circulation climate information provided by the invention can incorporate the large-scale circulation climate factors into the salt tide upward mid- and long-term forecast, indicating that the application of the large-scale circulation climate factors in the salt tide medium and long-term forecast under the influence of teleconnection has the advantages of Feasibility and effectiveness; in order to effectively avoid the redundancy of forecast information and the cross-correlation between forecast factors, the salt tide upward forecast system determines the forecast elements and delay time of salt tide upward at different time scales through comparison and selection. , and the implementation of the salt tide upward forecasting system has low data requirements and is easy to operate.
实施例二:Embodiment 2:
本发明还提供了一种耦合大尺度环流气候信息的咸潮上溯预报方法,包括以下步骤:The present invention also provides a salt tide upward forecasting method coupled with large-scale circulation climate information, comprising the following steps:
获取影响咸潮上溯的水文气象数据,并对获取的水文气象数据通过随机森林重要度分析,得到影响咸潮上溯的关键影响因子;Obtain the hydrometeorological data that affects the uptake of the salty tide, and analyze the importance of the obtained hydrometeorological data through random forests to obtain the key influencing factors that affect the uptake of the salty tide;
依据关键影响因子采用随机森林算法建立咸潮上溯预报模型,并对咸潮上溯预报模型进行验证,得到咸潮上溯预报最优模型;According to the key influencing factors, the random forest algorithm is used to establish the salt tide upward forecast model, and the salt tide upward forecast model is verified, and the optimal model of the salt tide upward forecast is obtained;
输出咸潮上溯预报最优模型预测受咸潮上溯影响地区的含氯度。Output the optimal model for upcasting of salt tide to predict the chloride content in the area affected by the upcast salt tide.
在本发明实施例中,对获取的水文气象数据通过随机森林重要度分析,得到影响咸潮上溯的关键影响因子的步骤包括:In the embodiment of the present invention, the steps of obtaining the key influencing factors affecting the upward trace of the salt tide by analyzing the acquired hydrometeorological data through random forest importance include:
获取某地河口地区历史的水文因子、大尺度环流气候因子和含氯度,得到咸潮上溯的水文气象数据:Obtain the historical hydrological factors, large-scale circulation climatic factors and chlorine content of a certain estuary area, and obtain the hydrometeorological data of the salt tide upstream:
通过袋外样本集的估计误差和基尼不纯度分别对影响咸潮上溯的水文气象数据进行随机重要度分析,得到不同时间尺度下各因子与含氯度之间的第一重要度数集和第二重要度数集;Through the estimation error and Gini impurity of the out-of-bag sample set, the random importance analysis was carried out on the hydrometeorological data affecting the uptrend of the salt tide, and the first and second importance degrees between each factor and the chloride content at different time scales were obtained. important degree set;
对第一重要度数集和第二重要度数中的各个因子按重要度从大到小排序后进行筛选,得到对应的第一因子集和第二因子集;并从第一因子集和第二因子集中选择相同的因子作为影响咸潮上溯的关键影响因子。Sort each factor in the first and second importance degree sets according to their importance in descending order to obtain the corresponding first and second factor sets; and from the first factor set and the second factor set The same factors are selected as the key influencing factors affecting the uptake of salt tide.
在本发明实施例中,得到咸潮上溯预报最优模型的步骤包括:In the embodiment of the present invention, the steps of obtaining the optimal model for the upcasting of salt tide include:
将得到所有关键影响因子构建数集,并将数集分为训练集和验证集;All key impact factors will be obtained to construct a dataset, and the dataset will be divided into training set and validation set;
采用随机森林算法对训练集进行模型训练,建立咸潮上溯预报模型;The random forest algorithm is used to train the model on the training set, and the salt tide upward forecasting model is established;
将验证集输入咸潮上溯预报模型,输出与验证集对应的纳什效率系数和决定系数,若纳什效率系数和决定系数均大于系数阈值,得到咸潮上溯预报最优模型。Input the validation set into the salt tide upward forecast model, and output the Nash efficiency coefficient and determination coefficient corresponding to the validation set. If the Nash efficiency coefficient and the determination coefficient are both greater than the coefficient threshold, the optimal model of salt tide upward forecast is obtained.
需要说明的是,实施例二方法中的步骤对应于实施例一系统中的模块设置,在实施例一系统中已对各个模块的内容进行了详细描述,再次不再对实施例二方法中的步骤内容进行详细的阐述。It should be noted that the steps in the method of the second embodiment correspond to the module settings in the system of the first embodiment. The content of each module has been described in detail in the system of the first embodiment, and the details of the method of the second embodiment will not be described again. The content of the steps is explained in detail.
以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。The above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: The recorded technical solutions are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110944776.9A CN113609783B (en) | 2021-08-17 | 2021-08-17 | Upward forecasting system and method of salt tide coupled with large-scale circulation climate information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110944776.9A CN113609783B (en) | 2021-08-17 | 2021-08-17 | Upward forecasting system and method of salt tide coupled with large-scale circulation climate information |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113609783A CN113609783A (en) | 2021-11-05 |
CN113609783B true CN113609783B (en) | 2022-05-03 |
Family
ID=78341010
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110944776.9A Active CN113609783B (en) | 2021-08-17 | 2021-08-17 | Upward forecasting system and method of salt tide coupled with large-scale circulation climate information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113609783B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117709242B (en) * | 2024-02-05 | 2024-04-09 | 珠江水利委员会珠江水利科学研究院 | Estuary salty tide tracing strength evaluation method |
CN117709603B (en) * | 2024-02-06 | 2024-05-17 | 珠江水利委员会珠江水利科学研究院 | Estuary salty tide early warning grade classification method |
CN118468689A (en) * | 2024-04-15 | 2024-08-09 | 中山大学 | A saltwater tide upstream ensemble forecasting method and system based on classification regression analysis |
CN118674101A (en) * | 2024-06-06 | 2024-09-20 | 南方海洋科学与工程广东省实验室(珠海) | Method for carrying out upstream prediction on salt tide in estuary area by utilizing machine learning algorithm |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107992960A (en) * | 2017-11-03 | 2018-05-04 | 中山大学 | Hekou District salty tide traces back chlorosity Simulation prediction method |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9579700B2 (en) * | 2014-05-30 | 2017-02-28 | Iteris, Inc. | Measurement and modeling of salinity contamination of soil and soil-water systems from oil and gas production activities |
-
2021
- 2021-08-17 CN CN202110944776.9A patent/CN113609783B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107992960A (en) * | 2017-11-03 | 2018-05-04 | 中山大学 | Hekou District salty tide traces back chlorosity Simulation prediction method |
Non-Patent Citations (4)
Title |
---|
Modeling saltwater intrusion using an integrated Bayesian model averaging method in the Pearl River Delta;Lin, Kairong 等;《JOURNAL OF HYDROINFORMATICS》;20191101;第21卷(第6期);全文 * |
基于Delft3D-flow模型的长江口盐度扩散规律模拟;李林娟等;《人民长江》;20161214(第23期);全文 * |
基于随机森林模型的咸潮预报;苏程佳等;《热带地理》;20180502(第03期);全文 * |
闽江咸潮上溯与保障供水安全综合措施的现状与思考;陈奔月;《水利科技》;20200301(第01期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN113609783A (en) | 2021-11-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113609783B (en) | Upward forecasting system and method of salt tide coupled with large-scale circulation climate information | |
CN111310968B (en) | LSTM neural network circulating hydrologic forecasting method based on mutual information | |
Nourani et al. | ANN-based statistical downscaling of climatic parameters using decision tree predictor screening method | |
AU2018101946A4 (en) | Geographical multivariate flow data spatio-temporal autocorrelation analysis method based on cellular automaton | |
Lin et al. | Storm surge return levels induced by mid-to-late-twenty-first-century extratropical cyclones in the Northeastern United States | |
Asadi et al. | Extremes on river networks | |
Ghil et al. | Extreme events: dynamics, statistics and prediction | |
CN112506990B (en) | An anomaly detection method for hydrological data based on spatiotemporal information | |
CN108897977A (en) | A kind of uncertain attribution method of Runoff Evolution based on the simulation of big Regional Hydrologic | |
Tang et al. | Flood forecasting based on machine learning pattern recognition and dynamic migration of parameters | |
Thakur et al. | Bringing statistical learning machines together for hydro-climatological predictions-Case study for Sacramento San joaquin River Basin, California | |
CN112149871B (en) | A Pollutant Point Source Analysis Method Based on GIS Spatial Statistics and Stochastic Simulation | |
Apurv et al. | Evaluation of the stationarity assumption for meteorological drought risk estimation at the multidecadal scale in contiguous United States | |
Yan et al. | Design flood estimation with varying record lengths in Norway under stationarity and nonstationarity scenarios | |
CN116502775A (en) | Hydrologic sequence enhancement and prediction method | |
Samiran Das et al. | Assessment of uncertainty in flood flows under climate change impacts in the Upper Thames River basin, Canada. | |
Gao et al. | A Graph Convolutional Encoder-Decoder Model for Methane Concentration Forecasting in Coal Mines | |
Kim et al. | Probabilistic coastal storm surge analyses using synthesized tracks based on historical typhoon parameters | |
Marshall | The influence of restoration efforts in the freshwater Everglades on the salinity regime of Florida Bay | |
CN116701974A (en) | Spatial-temporal variation analysis and attribution identification method of multi-element precipitation under climate change | |
Szolgayová et al. | Hybrid forecasting of daily river discharges considering autoregressive heteroscedasticity | |
Goodarzi et al. | Assessment of climate change using SDSM downscaling Model (A case study: West of Iran) | |
Zhang | Utilizing Hybrid LSTM and GRU Models for Urban Hydrological Prediction | |
Moghim et al. | Downscaling of the flood discharge in a probabilistic framework | |
Kyojo et al. | Frequentist and Bayesian approaches in modeling and prediction of extreme rainfall series: A case study from Southern Highlands Region of tanzania |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
OL01 | Intention to license declared |