CN114386677A

CN114386677A - Flood forecasting method based on novel universal input/output structure and long-and-short time memory network

Info

Publication number: CN114386677A
Application number: CN202111594179.4A
Authority: CN
Inventors: 吴剑; 丁伟; 王浅宁; 陈思文; 刘海星; 彭勇; 张弛; 周惠成
Original assignee: Dalian University of Technology
Current assignee: Dalian University of Technology
Priority date: 2021-12-24
Filing date: 2021-12-24
Publication date: 2022-04-22
Anticipated expiration: 2041-12-24
Also published as: CN114386677B

Abstract

A flood forecasting method based on a novel universal input and output structure and a long-time and short-time memory network is disclosed. Firstly, analyzing the confluence characteristic according to the historical flood data of the research basin, and calculating the average confluence time of the research basin; and secondly, determining the number of output time segments of the LSTM flood forecasting model according to the average convergence time of the drainage basin, and giving the number of hidden layer layers and the number of hidden layer neuron nodes of the LSTM flood forecasting model. And then, designing an input/output structure of the novel universal LSTM flood forecasting model, inputting a training set and a verification set sample training model, and obtaining the flood forecasting models with different structures and parameters. And finally, comparing and analyzing the performances of the LSTM flood forecasting models under different input lengths in a training set and a verification set, determining the final superior LSTM flood forecasting model, and evaluating and analyzing the forecasting effect of the LSTM flood forecasting model in a testing set. The method has strong universality, and the established LSTM flood forecasting model can obtain a good forecasting effect and provide a new technical support for the flood disaster defense work of the drainage basin.

Description

A Flood Forecasting Method Based on Novel Universal Input-Output Structure and Long Short-Term Memory Network

技术领域technical field

本发明属于流域洪水预报技术领域，涉及一种基于新型通用输入输出结构与长短时记忆网络的洪水预报方法。The invention belongs to the technical field of flood forecasting in river basins, and relates to a flood forecasting method based on a novel general input-output structure and a long-short-term memory network.

背景技术Background technique

长短时记忆网络(LSTM)是目前机器学习技术与流域洪水预报交叉应用研究中最常用的模型之一。LSTM洪水预报模型以样本为输入输出数据，样本是输入特征变量与相应输出目标值的组合，且样本的生成只取决于模型的输入与输出结构。确定模型输入与输出结构是构建机器学习洪水预报模型的关键步骤，更直接关系到LSTM洪水预报模型的可靠性与合理性。在以往基于机器学习技术的洪水预报研究与应用中，前期降雨和流量是机器学习洪水预报模型中最为常用的两个输入特征变量，模型输出为流域出口预见期内的流量值。然而，实际应用结果表明，当前期实测降雨、流量均作为模型输入特征因子时，由于不同时间步的流量之间存在显著的相关性，输出流量很容易受输入特征变量中前期流量的驱动，模型难以识别出前期降雨、流量的作用，导致模拟或预报流量过程出现峰现时间滞后或输出两个洪峰值的现象，且预见期越长洪峰滞后越明显。因此，以降雨、前期流量为输入和以预报流量目标值为输出的结构建立的洪水预报模型可能与流域内径流形成的机制不相符，有必要重新设计 LSTM洪水预报模型的输入输出结构，增强机器学习洪水预报模型的可解释性。Long short-term memory network (LSTM) is one of the most commonly used models in the cross-application research of machine learning technology and watershed flood forecasting. LSTM flood forecasting model takes samples as input and output data, samples are the combination of input feature variables and corresponding output target values, and the generation of samples only depends on the input and output structure of the model. Determining the input and output structure of the model is a key step in building a machine learning flood forecasting model, and it is more directly related to the reliability and rationality of the LSTM flood forecasting model. In the past research and application of flood forecasting based on machine learning technology, precipitation and flow are the two most commonly used input feature variables in machine learning flood forecasting models. However, the actual application results show that when the current measured rainfall and flow are both used as the model input characteristic factors, due to the significant correlation between flows at different time steps, the output flow is easily driven by the previous flow in the input characteristic variables. It is difficult to identify the role of precipitation and flow in the early stage, which leads to the phenomenon of time lag between peak occurrence time or output of two flood peaks in the process of simulating or forecasting flow, and the longer the forecast period, the more obvious the delay of flood peak. Therefore, the flood forecasting model established with the structure of rainfall and pre-flow flow as input and forecast flow target as output may not be consistent with the mechanism of runoff formation in the basin. It is necessary to redesign the input and output structure of the LSTM flood forecasting model to enhance the machine Learn about the interpretability of flood forecasting models.

为此，本发明立足于提升机器学习洪水预报模型的水文学可解释性，提出一种基于新型通用输入输出结构与长短时记忆网络的洪水预报方法。结合流域水文模拟的理论与方法以及流域暴雨洪水响应机制，选择降雨作为LSTM洪水预报模型的唯一输入特征变量，以流域内各雨量站的长序列降雨信息作为模型输入变量(特征因子数为流域内雨量站点数)，依托于 LSTM的递归连接以及特殊控制门和细胞单元状态，充分发挥LSTM潜在的长期学习记忆能力，通过对长序列输入数据进行反复学习，将流域降雨的时空分布、流域暴雨洪水响应时间以及前期土壤蓄水状态等水文要素信息融入至模型的输入与输出结构设计中，在此基础上构建基于长短时记忆网络的洪水预报模型，丰富有资料地区的洪水预报模型，提高流域洪水预报精度。To this end, the present invention is based on improving the hydrological interpretability of the machine learning flood forecasting model, and proposes a flood forecasting method based on a novel general input-output structure and a long-short-term memory network. Combined with the theory and method of watershed hydrological simulation and the response mechanism of rainstorm and flood in the watershed, rainfall is selected as the only input characteristic variable of the LSTM flood forecasting model, and the long-sequence rainfall information of each rainfall station in the watershed is used as the model input variable (the number of characteristic factors is the number of characteristic factors in the watershed). The number of rainfall stations), relying on the recursive connection of LSTM and the state of special control gates and cell units, give full play to the potential long-term learning and memory ability of LSTM, through repeated learning of long-sequence input data, the spatial and temporal distribution of rainfall in the basin, the rainstorm flood in the basin The information of hydrological elements such as response time and previous soil water storage status is integrated into the input and output structure design of the model. On this basis, a flood forecasting model based on long and short-term memory network is constructed, which enriches the flood forecasting model in areas with data and improves the flood forecasting in the basin. Forecast accuracy.

发明内容SUMMARY OF THE INVENTION

本发明针对现有技术存在的问题，提供了一种基于新型通用输入输出结构与长短时记忆网络的洪水预报方法。Aiming at the problems existing in the prior art, the present invention provides a flood forecasting method based on a novel general-purpose input-output structure and a long-short-term memory network.

为了达到上述目的，本发明采用的技术方案如下：In order to achieve the above object, the technical scheme adopted in the present invention is as follows:

一种基于新型通用输入输出结构与长短时记忆网络的洪水预报方法，包括以下步骤：A flood forecasting method based on a novel general input-output structure and long-short-term memory network, including the following steps:

第一步，收集与整理流域历史场次洪水数据。The first step is to collect and organize the historical flood data of the basin.

收集与整理研究流域场次洪水资料，将所有场次洪水划分为训练集、验证集、测试集场次。其中，训练集场次洪水用于优化LSTM洪水预报模型的内部权重矩阵和偏置向量参数；验证集场次洪水用于辅助确定超参数、损失函数等模型外部设置；测试集场次洪水则用于检验训练后模型的外推(预测)能力。Collect and organize the flood data of the research watershed events, and divide all flood events into training set, validation set, and test set. Among them, the training set floods are used to optimize the internal weight matrix and bias vector parameters of the LSTM flood forecasting model; the validation set floods are used to assist in determining the external settings of the model such as hyperparameters and loss functions; the test set floods are used to test the training The extrapolation (prediction) ability of the post model.

第二步，计算流域平均汇流时间。The second step is to calculate the average confluence time of the watershed.

某一研究流域的平均汇流时间是确定的，其时间长短等于流域洪水预报预见期，综合反映了流域对降雨水流汇集过程的调蓄作用大小。根据流域收集与整理的历史场次洪水过程数据，统计场次洪水中主降雨与相应洪峰流量间的时间差，即洪峰滞时。计算所有场次洪水的洪峰滞时平均值，即为流域平均汇流时间。The average confluence time of a certain research basin is determined, and its duration is equal to the forecast period of flood forecasting in the basin, which comprehensively reflects the regulation and storage effect of the basin on the convergence process of rainfall and water flow. According to the historical flood process data collected and organized in the watershed, the time difference between the main rainfall and the corresponding flood peak flow in the flood events is counted, that is, the flood peak delay time. Calculate the average peak delay time of all floods, which is the average confluence time of the basin.

第三步，给定长短时记忆(LSTM)网络洪水预报模型的隐藏层层数和隐藏层神经元节点数量。In the third step, the number of hidden layer layers and the number of hidden layer neurons in the flood prediction model of the Long Short Term Memory (LSTM) network are given.

第四步，设计新型通用LSTM洪水预报模型的输入输出结构。The fourth step is to design the input and output structure of the new general LSTM flood forecasting model.

LSTM细胞单元状态与流域土壤蓄水状态较为相似，三个控制门(遗忘门、输入门和输出门)与细胞单元状态间的作用可视为流域土壤蓄水状态的消耗、增加和释放。依据传统的流域水文模拟理论与方法，选取降雨作为LSTM洪水预报模型的唯一输入因子，LSTM洪水预报模型输入为流域内各雨量站的长序列降雨信息，输入长度为n+l个时段。其中，n表示前期降雨的输入长度，且前期降雨可视为反映流域前期土壤蓄水状态、短临降雨等信息对模型输出流量值的影响，可选取多个n并根据后续模型性能确定较优模型；LSTM洪水预报模型输出为与研究流域汇流时间(第二步计算结果)相等的流量值序列，即输出长度为l个时段，l等于流域汇流时间且l≤n。The LSTM cell unit state is similar to the watershed soil water storage state. The interaction between the three control gates (forgetting gate, input gate and output gate) and the cell unit state can be regarded as the consumption, increase and release of the watershed soil water storage state. According to the traditional watershed hydrological simulation theory and method, rainfall is selected as the only input factor of the LSTM flood forecast model. The input of the LSTM flood forecast model is the long-sequence rainfall information of each rainfall station in the watershed, and the input length is n+l time periods. Among them, n represents the input length of the previous rainfall, and the previous rainfall can be regarded as reflecting the influence of information such as the soil water storage state and short-term rainfall in the basin on the output flow value of the model. Multiple n can be selected and the better one can be determined according to the subsequent model performance. Model; the output of the LSTM flood forecast model is a sequence of flow values equal to the confluence time of the study watershed (the calculation result of the second step), that is, the output length is l time periods, where l is equal to the watershed confluence time and l≤n.

第五步，生成训练、验证和测试样本集。The fifth step is to generate training, validation and test sample sets.

根据第四步设计的新型通用LSTM洪水预报模型的输入输出结构(n、l的取值)确定样本长度，每个样本的输入降雨序列长度等于n+l个时段，输出目标流量序列长度等于l个时段。根据第一步划分的训练集、验证集和测试集场次，每次洪水均按照逐时段滑动截取的方式生成相应的训练、验证和测试样本集，且每一次洪水生成多个样本，每个样本由输入降雨序列P＝[P_t-n+1,P_t-n+2,…,P_t,…,P_t+l]和输出目标流量序列Q＝[Q_t+1,Q_t+2,…,Q_t,…,Q_t+l]组成输入输出数据对。其中，n为LSTM洪水预报模型的输入前期降雨序列的时段数，l为模型输出流量的时段数，第t个时段的输入降雨P_t包含流域内各雨量站的实测降雨值。The sample length is determined according to the input and output structure (values of n and l) of the new general LSTM flood forecasting model designed in the fourth step. The length of the input rainfall sequence of each sample is equal to n+l periods, and the length of the output target flow sequence is equal to l period. According to the training set, validation set and test set divided in the first step, the corresponding training, validation and test sample sets are generated for each flood according to the method of sliding interception by time period, and each flood generates multiple samples, each sample From the input rainfall sequence P=[P _t-n+1 ,P _t-n+2 ,…,P _t ,…,P _t+1 ] and the output target flow sequence Q=[Q _t+1 ,Q _t+2 ,…,Q _t ,…,Q _t+l ] constitute the input and output data pairs. Among them, n is the number of periods of the input precipitation sequence of the LSTM flood forecast model, l is the number of periods of the model output flow, and the input rainfall Pt of the _t -th period includes the measured rainfall value of each rainfall station in the basin.

第六步，模型构建与训练。The sixth step is model construction and training.

LSTM洪水预报模型采用随时间反向传播算法(BPTT)进行有监督学习方式的训练，模型构建和训练均基于开源的Keras与TensorFlow实现。运行在TensorFlow平台的Keras框架集成了较为成熟的机器学习算法包，可直接调用对应的算法完成LSTM洪水预报模型构建和训练。The LSTM flood forecasting model uses the back-propagation over time (BPTT) algorithm for supervised learning training. The model construction and training are based on the open source Keras and TensorFlow implementations. The Keras framework running on the TensorFlow platform integrates a relatively mature machine learning algorithm package, and can directly call the corresponding algorithm to complete the construction and training of the LSTM flood forecasting model.

6.1)LSTM洪水预报模型构建：依据第三步给定的隐藏层层数、隐藏层神经元节点数量以及第四步设计的模型输入输出结构，调用Keras的层包(layers)定义LSTM的输入层、隐藏层和输出层，构建LSTM洪水预报模型；6.1) LSTM flood forecasting model construction: According to the number of hidden layers given in the third step, the number of hidden layer neurons and the model input and output structure designed in the fourth step, call Keras's layer package (layers) to define the input layer of LSTM , hidden layer and output layer to build an LSTM flood forecasting model;

6.2)LSTM洪水预报模型训练：输入第五步生成的训练、验证样本集，设置模型训练过程涉及的超参数、激活函数、损失函数、优化算法等，在TensorFlow平台运行基于Keras框架构建的LSTM洪水预报模型，得到训练后的LSTM洪水预报模型。6.2) LSTM flood forecasting model training: input the training and verification sample sets generated in the fifth step, set the hyperparameters, activation functions, loss functions, optimization algorithms, etc. involved in the model training process, and run the LSTM flood based on the Keras framework on the TensorFlow platform Forecast model, get the trained LSTM flood forecast model.

第七步，确定较优模型，提取分析流域场次洪水模拟结果。The seventh step is to determine the optimal model, and extract and analyze the flood simulation results of the watershed.

比较分析不同输入长度下LSTM洪水预报模型的性能，确定最终较优的LSTM洪水预报模型，进一步提取研究流域场次洪水模拟、预报流量过程结果，分析LSTM洪水预报模型的预报性能。Compare and analyze the performance of the LSTM flood forecasting model under different input lengths, determine the final optimal LSTM flood forecasting model, further extract the results of flood simulation and forecast flow process in the basin, and analyze the forecasting performance of the LSTM flood forecasting model.

本发明引入长短时记忆网络作为流域洪水预报建模工具，创新性地设计出以长序列降雨为输入和以多步流量为输出的输入输出结构，以细胞单元状态学习记住长序列降雨隐含的前期流域土壤蓄水状态信息、控制门调节流域土壤蓄水状态的动态变化提供理论与方法支撑，夯实机器学习洪水预报模型的可解释性基础，在此基础上构建基于LSTM的洪水预报模型，以提高流域洪水预报精度。The invention introduces a long-short-term memory network as a modeling tool for flood forecasting in the basin, innovatively designs an input-output structure with long-sequence rainfall as input and multi-step flow as output, and learns and remembers long-sequence rainfall implicits in cell unit state. It provides theoretical and methodological support to provide theoretical and methodological support for early-stage soil water storage status information in the basin, and control gates to adjust the dynamic changes of the basin soil water storage status, consolidate the interpretability foundation of the machine learning flood forecasting model, and build a flood forecasting model based on LSTM on this basis. In order to improve the accuracy of flood forecasting in the basin.

上述一种基于新型通用输入输出结构与长短时记忆网络的洪水预报方法应用于流域洪水预报。The above-mentioned flood forecasting method based on a new general input-output structure and long-short-term memory network is applied to flood forecasting in the basin.

本发明的有益效果为：The beneficial effects of the present invention are:

传统机器学习洪水预报模型多以降雨与流量作为输入变量，模型无法真正地发挥出输入变量中降雨对预报流量的作用，造成了机器学习洪水预报模型的水文学解释较差。本发明深入解析了长短时记忆网络内部特殊的控制门与细胞单元状态的基本原理和计算机制，设计了一种新型通用的长序列降雨输入与多步流量输出的LSTM洪水预报模型输入输出结构，能够适用于不同空间尺度的流域洪水预报建模，充分发挥了LSTM长期学习记忆能力和内部细胞单元的信息遗忘、存储和释放作用机制，实现了LSTM洪水预报模型在山区流域的实例化应用，有效提高了流域场次洪水的模拟、预报精度，为流域洪水灾害预报预警工作提供了新的技术支撑。Traditional machine learning flood forecasting models mostly use rainfall and flow as input variables, and the model cannot really play the role of rainfall on forecast flow in the input variables, resulting in poor hydrological interpretation of machine learning flood forecasting models. The invention deeply analyzes the basic principle and computing mechanism of the special control gate and cell unit state inside the long-short-term memory network, and designs a new general input and output structure of the LSTM flood forecast model with long-sequence rainfall input and multi-step flow output. It can be applied to flood forecasting and modeling of watersheds at different spatial scales, fully utilizes the long-term learning and memory ability of LSTM and the information forgetting, storing and releasing mechanism of internal cell units, and realizes the instantiated application of the LSTM flood forecasting model in mountainous watersheds. The simulation and forecast accuracy of floods in the basin has been improved, and new technical support has been provided for the forecast and early warning of flood disasters in the basin.

附图说明Description of drawings

图1是本发明实例应用所采用的安和流域图；Fig. 1 is the Anhe watershed map that the example application of the present invention adopts;

图2是本发明LSTM内部构造的水文学含义示意图；Fig. 2 is the hydrological meaning schematic diagram of the internal structure of LSTM of the present invention;

图3是本发明设计的LSTM洪水预报模型的输入与输出结构示意图；Fig. 3 is the input and output structure schematic diagram of the LSTM flood forecasting model designed by the present invention;

图4a是本发明场次洪水第t时刻生成样本示意图；Figure 4a is a schematic diagram of a sample generated at the t-th moment of flood in the present invention;

图4b是本发明场次洪水第t+1时刻生成样本示意图；Figure 4b is a schematic diagram of a sample generated at time t+1 of a flood in the present invention;

图5是本发明LSTM洪水预报模型的层次结构与矩阵特征维度示意图；5 is a schematic diagram of the hierarchical structure and matrix feature dimension of the LSTM flood forecasting model of the present invention;

图6是本发明LSTM洪水预报模型训练过程示意图；6 is a schematic diagram of the training process of the LSTM flood forecasting model of the present invention;

图7(a)是本发明LSTM洪水预报模型不同输入降雨长度训练集场次NSE均值图；Fig. 7 (a) is the NSE mean value diagram of different input rainfall length training set sessions of the LSTM flood forecasting model of the present invention;

图7(b)是本发明LSTM洪水预报模型不同输入降雨长度训练集场次MAE均值图；Fig. 7(b) is the MAE mean value diagram of the training set sessions of different input rainfall lengths of the LSTM flood forecasting model of the present invention;

图7(c)是本发明LSTM洪水预报模型不同输入降雨长度训练集场次RMSE均值图；Fig. 7 (c) is the RMSE mean value diagram of different input rainfall length training set sessions of the LSTM flood forecasting model of the present invention;

图7(d)是本发明LSTM洪水预报模型不同输入降雨长度测试集场次NSE均值图；Fig. 7 (d) is the NSE mean value diagram of different input rainfall length test sets of the LSTM flood forecasting model of the present invention;

图7(e)是本发明LSTM洪水预报模型不同输入降雨长度测试集场次MAE均值图；Fig. 7(e) is the MAE mean value diagram of different input rainfall length test sets of the LSTM flood forecasting model of the present invention;

图7(f)是本发明LSTM洪水预报模型不同输入降雨长度测试集场次RMSE均值图；Fig. 7(f) is the RMSE mean value diagram of different input rainfall length test sets of the LSTM flood forecasting model of the present invention;

图8是本发明实例应用中较优LSTM洪水预报模型的输入与输出结构示意图；8 is a schematic diagram of the input and output structure of the preferred LSTM flood forecasting model in the application of the example of the present invention;

图9(a)是本发明LSTM洪水预报模型的训练集场次19840501模拟流量过程对比图；Figure 9(a) is a comparison diagram of the simulation flow process of the training set session 19840501 of the LSTM flood forecasting model of the present invention;

图9(b)是本发明LSTM洪水预报模型的训练集场次19900730模拟流量过程对比图；Figure 9(b) is a comparison diagram of the simulation flow process of training set 19900730 of the LSTM flood forecasting model of the present invention;

图9(c)是本发明LSTM洪水预报模型的训练集场次19940613模拟流量过程对比图；Figure 9 (c) is a comparison diagram of the simulation flow process of the training set session 19940613 of the LSTM flood forecasting model of the present invention;

图9(d)是本发明LSTM洪水预报模型的训练集场次19970620模拟流量过程对比图；Figure 9(d) is a comparison diagram of the simulation flow process of the training set 19970620 of the LSTM flood forecasting model of the present invention;

图9(e)是本发明LSTM洪水预报模型的训练集场次20020805模拟流量过程对比图；Figure 9(e) is a comparison diagram of the simulation flow process of the training set session 20020805 of the LSTM flood forecasting model of the present invention;

图9(f)是本发明LSTM洪水预报模型的测试集场次20060505模拟流量过程对比图；Figure 9(f) is a comparison diagram of the simulated flow process of the test set field 20060505 of the LSTM flood forecasting model of the present invention;

图9(g)是本发明LSTM洪水预报模型的测试集场次20060725模拟流量过程对比图；Fig. 9 (g) is the test set field 20060725 simulation flow process comparison diagram of the LSTM flood forecasting model of the present invention;

图9(h)是本发明LSTM洪水预报模型的测试集场次20080524模拟流量过程对比图；Figure 9 (h) is a comparison diagram of the simulated flow process of the test set field 20080524 of the LSTM flood forecasting model of the present invention;

图9(i)是本发明LSTM洪水预报模型的测试集场次20080608模拟流量过程对比图；Fig. 9 (i) is the test set field 20080608 simulation flow process comparison diagram of the LSTM flood forecasting model of the present invention;

图9(j)是本发明LSTM洪水预报模型的测试集场次20120621模拟流量过程对比图。Figure 9(j) is a comparison diagram of the simulated flow process of the test set 20120621 of the LSTM flood forecasting model of the present invention.

具体实施方式Detailed ways

以下结合具体实施例对本发明作进一步说明。The present invention will be further described below in conjunction with specific embodiments.

本发明提出了一种基于新型通用输入输出结构与长短时记忆网络的洪水预报方法。研究流域实例应用中训练集、测试集场次洪水的结果分别代表了LSTM洪水预报模型的模拟、预报性能。下面通过实施案例，并结合附图，对本发明做进一步说明。The invention proposes a flood forecasting method based on a novel general input-output structure and a long-short-term memory network. The flood results of the training set and test set in the application of the watershed example represent the simulation and prediction performance of the LSTM flood forecasting model, respectively. The present invention will be further described below through implementation cases and in conjunction with the accompanying drawings.

安和流域位于江西省赣州市上犹县，地处东经114°-114°40′，北纬25°42′-26°01′，流域面积为251km²。安和流域水文观测资料条件较好，能够为机器学习洪水预报模型构建提供较好的数据支撑。此外，该流域内植被发育良好，属于亚热带丘陵山区湿润季风气候区，雨量充沛，年均降雨量1497mm，平均气温18.8℃，安和流域地形及水文测站分布见图1。安和流域内雨量站数量为8个，且空间分布较为均匀。选取该山丘区流域作为研究实例进行洪水预报，实现基于长短时记忆网络的流域洪水预报。主要步骤如下：The Anhe River Basin is located in Shangyou County, Ganzhou City, Jiangxi Province, at 114°-114°40'E, 25°42'-26°01'N, with a drainage area of 251km ² . The condition of hydrological observation data in Anhe Basin is good, which can provide better data support for the construction of machine learning flood forecasting model. In addition, the vegetation in the basin is well-developed and belongs to the subtropical hilly and mountainous humid monsoon climate zone, with abundant rainfall, with an average annual rainfall of 1497 mm and an average temperature of 18.8 °C. The number of rainfall stations in the Anhe watershed is 8, and the spatial distribution is relatively uniform. The hill basin is selected as a research example for flood forecasting, and flood forecasting based on long and short-term memory network is realized. The main steps are as follows:

收集了安和流域1984～2012年的实测水文资料(8个雨量站、1个水文站)，通过对历年实测降雨、流量数据序列的对照检查，剔除了少数水量不平衡的场次洪水，以保证水文资料的可靠性。同时，收集的降雨、流量资料中存在起止时间逻辑关系错误、观测记录重复等问题，需人为进行错误识别与修正。此外，降雨资料为逐时段数据，起止时间间隔不统一，而流量资料为瞬时数据，需对降雨与流量资料进行合理性检验、时段插值等前处理工作，将降雨、流量资料整理成逐小时累积降雨、平均流量时间序列。在此基础上按场次洪水发生时间先后顺序，选取全部场次洪水前50％为训练集、中间20％为验证集，训练、验证和测试集场次洪水的比例为5:2:3。The measured hydrological data (8 rainfall stations, 1 hydrological station) in the Anhe River Basin from 1984 to 2012 were collected, and a few floods with unbalanced water volume were eliminated by comparing the measured rainfall and flow data sequences over the years to ensure Reliability of hydrological data. At the same time, there are problems in the collected rainfall and flow data, such as errors in the logical relationship between the start and end times, and duplication of observation records, which need to be manually identified and corrected. In addition, the rainfall data is time-by-period data, and the start and end time intervals are not uniform, while the flow data is instantaneous data. It is necessary to carry out pre-processing such as rationality inspection and time-period interpolation of the rainfall and flow data, and organize the rainfall and flow data into hourly accumulation. Rainfall, mean flow time series. On this basis, according to the sequence of flood occurrences, the first 50% of all floods are selected as the training set, the middle 20% as the validation set, and the ratio of floods in training, validation and test sets is 5:2:3.

第二步，计算安和流域平均汇流时间。The second step is to calculate the average confluence time in the Anhe watershed.

根据安和流域整理得到的场次洪水降雨流量过程资料，安和流域大多数场次洪水洪峰流量的滞后时间为4～6h，平均滞后时间为5.21h。据此确定安和流域平均汇流时间为6h，即洪水预报预见期为6h。According to the precipitation flow process data of flood events in Anhe watershed, the lag time of flood peak flow in most flood events in Anhe watershed is 4-6 hours, and the average lag time is 5.21 hours. Accordingly, the average confluence time in the Anhe River Basin is determined to be 6 hours, that is, the forecast period of flood forecasting is 6 hours.

第三步，给定LSTM洪水预报模型的隐藏层为单层，隐藏层神经元节点数量为5个。In the third step, the hidden layer of the given LSTM flood forecasting model is a single layer, and the number of neurons in the hidden layer is 5.

LSTM细胞单元状态与流域土壤蓄水状态较为相似，三个控制门(遗忘门、输入门和输出门)与细胞单元状态间的作用可视为流域土壤蓄水状态的消耗、增加和释放。图2是LSTM 模型在第t个(当前)时段的输入输出计算流程及模型输入变量的水文学含义。设计LSTM 洪水预报模型的输入与输出结构如图3所示。其中，模型输出为安和流域洪水预报预见期内不同时段的流量值Q＝[Q_t+1,Q_t+2,…,Q_t+6]，输入为流域内各雨量站的长序列实测降雨(前期降雨+短期降雨)。前期降雨序列反映了流域前期土壤蓄水量等流域状态的变化对输出流量的影响，具体选取多少时段的前期降雨应结合流域特性采用试算法确定。可通过设置系列前期降雨的输入方案(24h、48h、120h、240h、480h、720h、960h和1440h)构建LSTM洪水预报模型，比较模型的模拟、预测性能确定前期降雨输入的最佳阶数。The LSTM cell unit state is similar to the watershed soil water storage state. The interaction between the three control gates (forgetting gate, input gate and output gate) and the cell unit state can be regarded as the consumption, increase and release of the watershed soil water storage state. Figure 2 shows the input and output calculation process of the LSTM model in the t-th (current) period and the hydrological meaning of the model input variables. The input and output structure of designing the LSTM flood forecasting model is shown in Figure 3. Among them, the output of the model is the flow value Q=[Q _t+1 ,Q _t+2 ,…,Q _t+6 ] in different time periods during the forecast period of flood forecasting in the Anhe Basin, and the input is the long-sequence actual measurement of each rainfall station in the basin Rainfall (pre-rain + short-term rainfall). The previous rainfall sequence reflects the impact of changes in the basin state, such as soil water storage capacity, on the output flow. The specific selection period of the previous rainfall should be determined by a trial algorithm based on the characteristics of the basin. The LSTM flood forecast model can be constructed by setting the input scheme of the series of early rainfall (24h, 48h, 120h, 240h, 480h, 720h, 960h and 1440h), and the simulation and prediction performance of the model can be compared to determine the optimal order of the previous rainfall input.

按照第一步划分的研究流域训练、验证和测试集场次，根据第四步不同前期降雨输入方案对应的模型输入与输出结构，每次洪水均按照逐时段滑动截取的方式生成样本。以模型输入降雨量16h、输出目标流量6h为例，图4为由场次洪水过程相邻时段t～t+1时刻生成机器学习模型样本的示例图。安和流域训练集、验证集和测试集的样本数量如表1所示。According to the training, verification and test set sessions of the research watershed divided in the first step, and according to the model input and output structure corresponding to the different previous rainfall input schemes in the fourth step, each flood is generated according to the method of sliding interception period by period. Taking the model input rainfall of 16h and output target flow of 6h as an example, Figure 4 is an example diagram of generating machine learning model samples from the adjacent time period t to t+1 of the flood process. The number of samples in the Anhe watershed training set, validation set and test set is shown in Table 1.

表1安和流域训练集、验证集和测试集样本数量Table 1 Number of samples in Anhe watershed training set, validation set and test set

第六步，LSTM洪水预报模型构建与训练。The sixth step is to build and train the LSTM flood forecasting model.

LSTM模型采用随时间反向传播算法(BPTT)进行有监督学习方式的训练，模型构建和训练均基于开源的Keras与TensorFlow实现。根据第三步、第四步确定的模型结构设置，以前期降雨输入480h为例，图5为基于TensorFlow和Keras构建的LSTM洪水预报模型的层次结构与输入、输出变量的特征维度可视化图。其中，模型输入为安和流域8个雨量站点长序列(486h)实测降雨；模型输出为长序列最后6个时段对应的流量值；TimeDistributed层的作用是使每个时刻的隐藏状态h_t都连接至神经元节点数量为1的输出层(Dense)，每个时刻均输出流量值；Lambda层为自定义层，实现对输出流量值序列数据的切片处理，从而提取出每个样本最后6个时段的输出流量值作为模型最终输出序列。The LSTM model uses the back-propagation over time (BPTT) algorithm for supervised learning training. The model construction and training are based on the open source Keras and TensorFlow implementations. According to the model structure settings determined in the third and fourth steps, taking the previous rainfall input of 480h as an example, Figure 5 is a visualization of the hierarchical structure of the LSTM flood forecasting model based on TensorFlow and Keras and the feature dimensions of input and output variables. Among them, the model input is the long sequence (486h) measured rainfall of 8 rainfall stations in the Anhe Basin; the model output is the flow value corresponding to the last 6 periods of the long sequence; the function of the TimeDistributed layer is to connect the hidden states h _t at each moment. To the output layer (Dense) where the number of neuron nodes is 1, the flow value is output at each moment; the Lambda layer is a custom layer, which realizes the slice processing of the output flow value sequence data, so as to extract the last 6 time periods of each sample The output flow value is used as the final output sequence of the model.

图6为TensorFlow深度学习框架下LSTM洪水预报模型的训练过程示意图。其中，超参数epoch＝300、batchsize＝64，损失函数设置为均方误差(MSE)，输出层激活函数采用线性 ReLU函数，优化算法采用Adam算法，学习速率为0.0006。MSE公式见式(1)。Figure 6 is a schematic diagram of the training process of the LSTM flood forecasting model under the TensorFlow deep learning framework. Among them, the hyperparameters epoch=300, batchsize=64, the loss function is set to mean square error (MSE), the activation function of the output layer adopts the linear ReLU function, the optimization algorithm adopts the Adam algorithm, and the learning rate is 0.0006. The MSE formula is shown in formula (1).

式中：y_k,obs、y_k,out分别为某场洪水第k时段的实测流量值和预报流量值，单位为m³/s；l为模拟流量的输出长度。ReLU激活函数具有收敛快、计算简单等优点，且能够保证LSTM洪水预报模型的输出流量值均为非负值，计算表达式见式(2)。In the formula: y _k,obs and y _k,out are the measured flow value and forecast flow value of a flood in the kth period, respectively, in m ³ /s; l is the output length of the simulated flow. The ReLU activation function has the advantages of fast convergence and simple calculation, and can ensure that the output flow values of the LSTM flood forecast model are all non-negative values. The calculation expression is shown in Equation (2).

ReLU(x)＝max(x,0)(2)ReLU(x)=max(x,0)(2)

构建不同前期降雨输入方案下对应的LSTM洪水预报模型，比较分析不同前期降雨输入方案(24h、48h、120h、240h、480h、720h、960h和1440h)下LSTM模型在安和流域场次洪水模拟、预测流量过程的纳什效率系数(NSE)、平均绝对误差(MAE)和均方根误差(RMSE) 计算结果，如图7所示。NSE、MAE和RMSE计算表达式分别如下：Construct the corresponding LSTM flood forecasting models under different previous rainfall input schemes, and compare and analyze the flood simulation and prediction of LSTM models under different previous rainfall input schemes (24h, 48h, 120h, 240h, 480h, 720h, 960h and 1440h) in the Anhe watershed The Nash efficiency coefficient (NSE), mean absolute error (MAE), and root mean square error (RMSE) calculations for the flow process are shown in Figure 7. The calculation expressions of NSE, MAE and RMSE are as follows:

式中：y_k,obs、y_k,out分别为某场洪水第k时段的实测流量值和预报流量值，单位为m³/s；l为模拟流量的输出长度。In the formula: y _k,obs and y _k,out are the measured flow value and forecast flow value of a flood in the kth period, respectively, in m ³ /s; l is the output length of the simulated flow.

根据图7，当前期降雨输入≥480h时，训练集、验证集的场次洪水模拟、预测流量过程的NSE、MAE、RMSE均值明显优于前期降雨输入≤240h的方案，且进一步延长前期降雨长度(720h、960h和1440h)未能改善模型的结果。由此确定安和流域最终较优的LSTM洪水预报模型的输入输出结构如图8所示。According to Figure 7, when the current rainfall input is ≥480h, the mean values of NSE, MAE, and RMSE of the training set and validation set of flood simulation and predicted flow process are significantly better than those of the previous rainfall input ≤240h, and the length of the previous rainfall is further extended ( 720h, 960h and 1440h) failed to improve the results of the model. The input and output structure of the final optimal LSTM flood forecasting model in the Anhe watershed is thus determined as shown in Figure 8.

将最终建立的LSTM洪水预报模型应用于安和流域洪水预报，提取分析训练集、测试集场次洪水计算结果。除NSE、MAE和RMSE外，采用峰值合格率(QRP)和峰现时间合格率(QRT)指标用于评价模型对场次洪水洪峰流量预报结果的好坏。QRP和QRT的计算如公式(6)、公式(7)所示：The finally established LSTM flood forecasting model was applied to flood forecasting in the Anhe watershed, and the flood calculation results of the training set and test set were extracted and analyzed. In addition to NSE, MAE and RMSE, the peak pass rate (QRP) and peak time pass rate (QRT) indicators are used to evaluate the model's quality of forecast results for flood peak flow. The calculation of QRP and QRT is shown in formula (6) and formula (7):

式中：N_P表示场次洪水洪峰流量合格数量，N_T表示场次洪水峰现时间合格数量，N表示场次洪水总数。以模型输出第6个时段对应的场次洪水模拟、预测结果为例，表2给出了LSTM 洪水预报模型场次洪水模拟结果，并以传统新安江概念性水文模型作为比较基准。由表2可知，LSTM模型的峰值模拟、预测结果均明显优于XAJ模型。与XAJ模型相比，LSTM模型训练集场次洪水的峰值合格率QRP由77％提高至82％，测试集场次洪水的峰值合格率QRP由72％提高至80％；训练集场次洪水NSE均值由0.825提高至0.871，测试集场次洪水NSE 均值由0.815提高至0.821；LSTM模型训练集、测试集场次洪水的评价指标MAE、RMSE 值也更低。上述结果分析说明构建的LSTM洪水预报模型成功建立了流域内降雨与出口流量间的复杂非线性关系，模型在安和流域的洪水预报精度较高。In the formula: NP represents the qualified quantity of flood peak flow of the event, N _T represents the qualified number of flood peak current time of the event, and N represents the total number of floods of the event _. Taking the flood simulation and prediction results corresponding to the sixth period of model output as an example, Table 2 shows the flood simulation results of the LSTM flood forecast model, and takes the traditional Xin'anjiang conceptual hydrological model as a comparison benchmark. It can be seen from Table 2 that the peak simulation and prediction results of the LSTM model are significantly better than those of the XAJ model. Compared with the XAJ model, the peak pass rate QRP of the floods in the training set of the LSTM model was increased from 77% to 82%, and the peak pass rate QRP of the floods in the test set was increased from 72% to 80%; the average NSE of the floods in the training set was increased from 0.825 When it is increased to 0.871, the mean NSE of the floods in the test set is increased from 0.815 to 0.821; the evaluation indicators MAE and RMSE of the floods in the training set and test set of the LSTM model are also lower. The analysis of the above results shows that the constructed LSTM flood forecast model has successfully established the complex nonlinear relationship between rainfall and outlet flow in the basin, and the model has high flood forecast accuracy in the Anhe basin.

表2安和流域LSTM洪水预报模型场次洪水模拟统计结果Table 2 Statistical results of flood simulations in the LSTM flood forecasting model in Anhe Basin

以模型输出第6个时段对应的场次洪水模拟、预测流量过程为例，分别选取5场训练集、测试集场次洪水，绘制LSTM与XAJ两种洪水预报模型的场次洪水模拟、预测与实测流量过程对比图，如图9所示。由图9可知，与XAJ模型的结果相比，LSTM洪水预报模型的模拟、预测流量过程与实测流量更为吻合，场次洪水的起涨、退水阶段与XAJ模型的结果也基本一致，进一步说明了构建的LSTM洪水预报模型充分学习到了流域内降雨径流间的转换机制，较好地建立了降雨与出口流量间的非线性映射关系。Taking the flood simulation and predicted flow process corresponding to the sixth period of model output as an example, five training sets and test sets of floods were selected respectively, and the flood simulation, prediction and measured flow processes of the two flood forecast models, LSTM and XAJ, were drawn. The comparison chart is shown in Figure 9. It can be seen from Figure 9 that compared with the results of the XAJ model, the simulated and predicted flow processes of the LSTM flood forecast model are more consistent with the measured flow, and the rise and fall stages of floods are basically consistent with the results of the XAJ model. The constructed LSTM flood forecast model fully learned the conversion mechanism between rainfall and runoff in the basin, and established the nonlinear mapping relationship between rainfall and outlet flow.

以上结果表明本发明提出的基于新型通用输入输出结构与长短时记忆网络的洪水预报方法，新型通用输入输出结构能够成功地指导LSTM洪水预报模型挖掘学习流域降雨与出口流量间的复杂非线性映射关系，模型对降雨流量关系的拟合能力、场次洪水洪峰值的预报能力均较优，在山区流域的场次洪水峰值和峰现时间模拟、预报精度较高。此外，设计的新型通用输入输出结构在不同空间尺度流域的洪水预报建模中实用性较强，根据此输入输出结构建立的机器学习洪水预报模型，可以充分发挥流域内降雨和流域蓄水状态对洪水起涨消退的贡献作用，其水文学的可解释性基础较强。The above results show that the flood forecasting method based on the novel general-purpose input-output structure and long-short-term memory network proposed by the present invention can successfully guide the LSTM flood forecasting model to mine and learn the complex nonlinear mapping relationship between rainfall and outlet flow in the basin , the fitting ability of the model to the relationship between rainfall and flow, and the forecasting ability of the flood peak value of the flood events are better. In addition, the designed new general input and output structure has strong practicability in flood forecasting and modeling of watersheds at different spatial scales. The machine learning flood forecasting model established based on this input and output structure can give full play to the influence of rainfall and water storage status in the watershed. Floods play a contributing role in rising and retreating, and their hydrological interpretability is based on a strong foundation.

以上所述实施例仅表达本发明的实施方式，但并不能因此而理解为对本发明专利的范围的限制，应当指出，对于本领域的技术人员来说，在不脱离本发明构思的前提下，还可以做出若干变形和改进，这些均属于本发明的保护范围。The above-mentioned embodiments only represent the embodiments of the present invention, but should not be construed as a limitation on the scope of the present invention. It should be pointed out that for those skilled in the art, without departing from the concept of the present invention, Several modifications and improvements can also be made, which all belong to the protection scope of the present invention.

Claims

1. a flood forecasting method based on a novel general input-output structure and a long-short-term memory network, characterized in that the hydrological implication of the flood forecasting method is significant, and the model input-output structure can be applied to the flood forecasting of river basins of different spatial scales, Include the following steps:

The first step is to collect and organize the historical flood data of the basin;

Collect and organize the flood data of the research watershed, and divide all the floods into training set, validation set, and test set. The training set flood is used to optimize the internal weight matrix and bias vector parameters of the LSTM flood forecasting model; The floods are used to assist in determining the external settings of the model such as hyperparameters and loss functions; the floods in the test set are used to test the extrapolation ability of the model after training;

The second step is to calculate the average confluence time of the watershed;

The average confluence time of a certain research basin is determined, and its duration is equal to the forecast period of flood forecasting in the basin, which comprehensively reflects the regulation and storage effect of the basin on the convergence process of rainfall and water flow; The time difference between the main rainfall and the corresponding flood peak flow in a flood event is the peak delay time; the average value of the flood peak delay time of all flood events is calculated, which is the average confluence time of the basin;

The third step is to give the number of hidden layers and the number of hidden layer neurons in the flood prediction model of the long-short-term memory LSTM network;

The fourth step is to design the input and output structure of the new general LSTM flood forecasting model;

The LSTM cell unit state is similar to the watershed soil water storage state. The interaction between the three control gates and the cell unit state can be regarded as the consumption, increase and release of the watershed soil water storage state. The three control gates include the forget gate and the input gate. and output gate; according to the traditional watershed hydrological simulation theory and method, rainfall is selected as the only input factor of the LSTM flood forecast model. The input of the LSTM flood forecast model is the long-sequence rainfall information of each rainfall station in the watershed, and the input length is n+l time period; among them, n represents the input length of the previous rainfall, and the previous rainfall can be regarded as reflecting the influence of information such as soil water storage status and short-term rainfall in the basin on the output flow value of the model. Multiple n can be selected and determined according to the performance of the subsequent model. The optimal model; the output of the LSTM flood forecasting model is a sequence of discharge values equal to the confluence time of the study basin, that is, the output length is l time periods, where l is equal to the basin confluence time and l≤n;

The fifth step is to generate training, validation and test sample sets;

The sample length is determined according to the input and output structure (values of n and l) of the new general LSTM flood forecasting model designed in the fourth step. The length of the input rainfall sequence of each sample is equal to n+l periods, and the length of the output target flow sequence is equal to l each time period; according to the training set, validation set and test set divided in the first step, each flood will generate corresponding training, validation and test sample sets according to the method of sliding interception by time period, and each flood will generate multiple samples, Each sample consists of the input rainfall sequence P=[P _t-n+1 ,P _t-n+2 ,...,P _t ,...,P _t+l ] and the output target flow sequence Q=[Q _t+1 ,Q _t+2 ,…,Q _t ,…,Q _t+l ] constitute the input and output data pairs; among them, n is the number of periods of the input precipitation sequence of the LSTM flood forecast model, l is the number of periods of the output flow of the model, and the t th The input rainfall P _t of each time period includes the measured rainfall value of each rainfall station in the basin;

The sixth step, model construction and training;

The LSTM flood forecasting model uses the back-propagation over time algorithm BPTT for supervised learning training. Model construction and training are based on open source Keras and TensorFlow. The Keras framework running on the TensorFlow platform integrates a relatively mature machine learning algorithm package. The corresponding algorithm can be directly called to complete the construction and training of the LSTM flood forecasting model;

6.1) Construction of LSTM flood forecasting model: According to the number of hidden layers given in the third step, the number of hidden layer neurons and the model input and output structure designed in the fourth step, the Keras layer package is called to define the input layer and hidden layer of LSTM and the output layer to build an LSTM flood forecasting model;

6.2) LSTM flood forecasting model training: input the training and verification sample sets generated in the fifth step, set the hyperparameters, activation functions, loss functions, optimization algorithms, etc. involved in the model training process, and run the LSTM flood based on the Keras framework on the TensorFlow platform forecasting model to obtain the trained LSTM flood forecasting model;

The seventh step is to determine the optimal model, and extract and analyze the flood simulation results of the basin floods;

Compare and analyze the performance of the LSTM flood forecasting model under different input lengths, determine the final optimal LSTM flood forecasting model, further extract the results of flood simulation and forecast flow process in the basin, and analyze the forecasting performance of the LSTM flood forecasting model.