CN116340868A - A method and device for ensemble forecasting of wave features based on machine learning - Google Patents
A method and device for ensemble forecasting of wave features based on machine learning Download PDFInfo
- Publication number
- CN116340868A CN116340868A CN202310312853.8A CN202310312853A CN116340868A CN 116340868 A CN116340868 A CN 116340868A CN 202310312853 A CN202310312853 A CN 202310312853A CN 116340868 A CN116340868 A CN 116340868A
- Authority
- CN
- China
- Prior art keywords
- wave
- machine learning
- model
- forecasting
- period
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000010801 machine learning Methods 0.000 title claims abstract description 54
- 238000000034 method Methods 0.000 title claims abstract description 23
- 238000003066 decision tree Methods 0.000 claims abstract description 17
- 238000013277 forecasting method Methods 0.000 claims abstract description 10
- 238000004458 analytical method Methods 0.000 claims abstract description 5
- 238000012549 training Methods 0.000 claims description 24
- 101001095088 Homo sapiens Melanoma antigen preferentially expressed in tumors Proteins 0.000 claims description 17
- 102100037020 Melanoma antigen preferentially expressed in tumors Human genes 0.000 claims description 17
- 238000004590 computer program Methods 0.000 claims description 16
- 238000012950 reanalysis Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 8
- 230000008901 benefit Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 230000004913 activation Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000012886 linear function Methods 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Operations Research (AREA)
- Data Mining & Analysis (AREA)
- Tourism & Hospitality (AREA)
- Game Theory and Decision Science (AREA)
- General Business, Economics & Management (AREA)
- Marketing (AREA)
- Development Economics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Quality & Reliability (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Entrepreneurship & Innovation (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
技术领域technical field
本发明涉及海洋波浪预测技术,具体涉及一种基于机器学习的波浪特征集合预报方法及装置。The invention relates to ocean wave prediction technology, in particular to a machine learning-based wave feature set prediction method and device.
背景技术Background technique
目前很多基于人工智能预测海洋波浪的方法大部分只针对波高一个要素,波周期和波向的预测结果无法获取,而且仅通过学习波高时间序列数据集达到预报波高的目的,这样无法通过波浪的生成物理机制来解释预测结果。At present, most of the methods for predicting ocean waves based on artificial intelligence only focus on one element of wave height, and the prediction results of wave period and wave direction cannot be obtained, and the purpose of predicting wave height is only achieved by learning wave height time series data sets, which cannot be generated through wave generation. physical mechanisms to explain the predicted results.
海洋波浪生成的物理机制目前研究比较成熟,基于波浪能平衡方程建立的第三代波浪数学模型也被广泛使用,目前主流的模型主要包括WAM,WWIII和SWAN三种,它们基于波能量平衡方程:The physical mechanism of ocean wave generation is relatively mature at present, and the third-generation wave mathematical model based on the wave energy balance equation is also widely used. The current mainstream models mainly include WAM, WWIII and SWAN, which are based on the wave energy balance equation:
式中,N是N(x,y,t;σ,θ)缩写,其中σ是频率,θ是方向;Cgx和Cgy是波能在x和y方向上的传播速度;Cgσ和Cgθ是在频率和方向上的传播速度;S是包含不同物理过程的源项,主要包括深水和浅水过程,有风浪成长过程、波波相互作用、海底摩擦、白帽过程和波浪破碎等物理过程。In the formula, N is the abbreviation of N(x,y,t; σ,θ), where σ is the frequency and θ is the direction; C gx and C gy are the propagation speeds of wave energy in the x and y directions; C gσ and C gθ is the propagation velocity in frequency and direction; S is the source term that includes different physical processes, mainly including deep water and shallow water processes, including physical processes such as wind and wave growth process, wave interaction, seabed friction, white hat process, and wave breaking .
目前构建的预报模型只考虑了风和海浪间的复杂非线性关系,包括波波相互作、风浪成长等机制,但是波流相互作用、海底摩阻、地形对波浪的影响等因素仍然没有考虑The current forecast model only considers the complex nonlinear relationship between wind and waves, including wave-wave interaction, wind-wave growth and other mechanisms, but factors such as wave-current interaction, seabed friction, and the influence of terrain on waves are still not considered.
发明内容Contents of the invention
针对现有预报模型只考虑了风和海浪间的复杂非线性关系的问题,本发明提供一种基于机器学习的波浪特征集合预报方法及装置。Aiming at the problem that the existing forecasting model only considers the complex nonlinear relationship between wind and sea waves, the present invention provides a wave feature set forecasting method and device based on machine learning.
为实现上述目的,本发明的技术方案是:For realizing the above object, technical scheme of the present invention is:
第一方面,本发明提供一种基于机器学习的波浪特征集合预报方法,包括:In the first aspect, the present invention provides a method for forecasting wave feature sets based on machine learning, including:
通过多层感知器MLP和决策树DT机器学习模型来构建波高、波周期和波向的集合预报模型;Construct an ensemble prediction model of wave height, wave period, and wave direction through multi-layer perceptron MLP and decision tree DT machine learning model;
将历史波高、波周期和波向数据和未来几天风的参数作为所述集合预报模型的输入;Using historical wave height, wave period and wave direction data and the parameters of the wind in the next few days as the input of the ensemble forecast model;
所述集合预报模型输出未来的波高,波周期和波向预报结果。The ensemble forecasting model outputs forecast results of future wave height, wave period and wave direction.
进一步地,所述集合预报模型通过如下方式构建:Further, the ensemble prediction model is constructed in the following way:
获取风和波浪要素的再分析历史数据集,以生成机器学习模型学习所需的训练数据集样本;所述波浪要素包括波高、波周期和波向Obtain reanalysis historical data sets of wind and wave elements to generate training data set samples required for machine learning model learning; said wave elements include wave height, wave period, and wave direction
基于训练数据集样本来训练MLP模型和DT模型;Train the MLP model and DT model based on the training data set samples;
根据判断因子,选取不同预报提前时间的最优预报样本和模型的组合,得到所述集合预报模型。According to the judgment factor, the combination of optimal forecast samples and models with different forecast lead times is selected to obtain the ensemble forecast model.
进一步地,所述判断因子包括决定系数R2,平均绝对误差MAE和平均绝对百分数误差MAPE。Further, the judgment factors include coefficient of determination R 2 , mean absolute error MAE and mean absolute percentage error MAPE.
进一步地,所述决定系数其中 Further, the determination coefficient in
进一步地,所述平均绝对误差 Further, the mean absolute error
进一步地,所述平均相对误差 Further, the average relative error
进一步地,所述风和波浪要素的再分析历史数据集采用模式再分析数据,或者采用以公开的再分析数据集。Further, the reanalysis historical data set of the wind and wave elements adopts model reanalysis data, or adopts a public reanalysis data set.
进一步地,预报提前时长为训练数据集样本时间间隔的整数倍。Further, the forecast lead time is an integer multiple of the training data set sample time interval.
第二方面,本发明提供一种基于机器学习的波浪特征集合预报装置,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如上任一所述方法的步骤。In a second aspect, the present invention provides a machine learning-based wave feature set forecasting device, including a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor executes the When the above-mentioned computer program realizes the steps of any one of the above-mentioned methods.
第三方面,本发明提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现如上任一所述方法的步骤In a third aspect, the present invention provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of any one of the methods described above are implemented
本发明与现有技术相比,其有益效果在于:Compared with the prior art, the present invention has the beneficial effects of:
本发明过多层感知器(MLP)和决策树(DT)机器学习模型构建波高、波周期和波向的集合预报模型,该模型是通过学习风和波浪要素(包括波高、波周期和波向)已经公开的再分析历史数据集之间的非线性关系,建立机器学习预报模型,通过决定系数R2,平均绝对误差MAE和平均绝对百分数误差MAPE进行比较模型在不同预报提前时间的优劣,选择精度相对较高的预报结果,从而能够准确地预报波浪特征。The present invention constructs the ensemble prediction model of wave height, wave period and wave direction through multi-layer perceptron (MLP) and decision tree (DT) machine learning model, and this model is by learning wind and wave element (comprising wave height, wave period and wave direction) ) has been published to reanalyze the nonlinear relationship between historical data sets, establish a machine learning forecast model, and compare the pros and cons of the model at different forecast lead times through the coefficient of determination R 2 , the average absolute error MAE and the average absolute percentage error MAPE, The forecast results with relatively high accuracy are selected so that the wave characteristics can be accurately forecasted.
附图说明Description of drawings
图1为本发明实施例1提供的基于机器学习的波浪特征集合预报方法的流程图;Fig. 1 is the flow chart of the method for forecasting wave feature sets based on machine learning provided by Embodiment 1 of the present invention;
图2为有效波高的预报模型的训练集组成;Fig. 2 is the training set composition of the prediction model of significant wave height;
图3为有效波高预报机器学习模型得分和平均绝对误差统计图;Figure 3 is a statistical chart of the machine learning model score and mean absolute error for significant wave height prediction;
图4为有效波高预报机器学习模型平均相对误差统计图;Figure 4 is a statistical diagram of the average relative error of the significant wave height forecasting machine learning model;
图5为平均波周期预报机器学习模型得分和平均绝对误差统计图;Fig. 5 is the average wave cycle forecasting machine learning model score and the average absolute error statistics;
图6为平均周期预报机器学习模型平均相对误差统计图;Fig. 6 is the average relative error statistical diagram of the average period forecast machine learning model;
图7为平均波向预报机器学习模型得分和平均绝对误差统计图;Fig. 7 is the average wave direction prediction machine learning model score and the statistical graph of the average absolute error;
图8为平均波向预报机器学习模型平均相对误差统计图;Fig. 8 is the average relative error statistical diagram of the average wave direction forecasting machine learning model;
图9为预测结果精度统计图;Fig. 9 is a statistical diagram of prediction result accuracy;
图10为发明实施例2提供的基于机器学习的波浪特征集合预报装置的组成示意图。Fig. 10 is a schematic diagram of the composition of the machine learning-based wave feature set forecasting device provided by
具体实施方式Detailed ways
下面结合附图和具体实施方式对本发明的内容做进一步详细说明。The content of the present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.
实施例1:Example 1:
相比于传统的基于物理关系的数值模型,机器学习预报波浪一般有两大优势:1.预报所需计算资源少,预报所需时间短;2.预报精度高,有效解决海洋波浪的预报时效的难题。Compared with traditional numerical models based on physical relationships, machine learning generally has two advantages in forecasting waves: 1. Less computing resources are required for forecasting, and the time required for forecasting is short; 2. High forecasting accuracy effectively solves the time limit for ocean wave forecasting problem.
目前构建的预报模型只考虑了风和海浪间的复杂非线性关系,包括波波相互作、风浪成长等机制,但是波流相互作用、海底摩阻、地形对波浪的影响等因素仍然没有考虑,其中地形因素和底摩阻一般不随时间变化,其影响在历史数据集中有一定体现,对预报结果影响较小;海流随时间变化,应该和风一样作为输入条件对模型进行训练,这样训练数据集大小增加,对机器学习模型的训练过程所需计算机内存大大增加。The current forecast model only considers the complex nonlinear relationship between wind and waves, including wave-wave interaction, wind-wave growth and other mechanisms, but factors such as wave-current interaction, seabed friction, and the influence of terrain on waves are still not considered. Among them, terrain factors and bottom friction generally do not change with time, and their influence is reflected in the historical data set to a certain extent, and has little impact on the forecast results; ocean currents change with time, and should be used as input conditions to train the model like wind, so the size of the training data set Increase, the computer memory required for the training process of the machine learning model is greatly increased.
为此,本发明过多层感知器(MLP)和决策树(DT)机器学习模型构建波高、波周期和波向的集合预报模型,该集合预报模型是通过学习风和波浪要素(包括波高、波周期和波向)已经公开的再分析历史数据集之间的非线性关系,建立机器学习预报模型,通过决定系数R2,平均绝对误差MAE和平均绝对百分数误差MAPE进行比较模型在不同预报提前时间的优劣,选择精度相对较高的预报结果。For this reason, the present invention constructs the ensemble prediction model of wave height, wave period and wave direction through multi-layer perceptron (MLP) and decision tree (DT) machine learning model, and this ensemble prediction model is by learning wind and wave element (comprising wave height, Wave period and wave direction) have been published to reanalyze the nonlinear relationship between historical data sets, establish a machine learning forecasting model, and compare the models in different forecasting advances through the coefficient of determination R 2 , the mean absolute error MAE and the mean absolute percentage error MAPE According to the advantages and disadvantages of time, choose the forecast result with relatively high accuracy.
多层感知器模型是基于人脑神经模式发展的,是一种前向结构的人工神经网络,映射一组输入向量到一组输出向量。MLP可以被看作是一个有向图,由多个的节点层所组成,每一层都全连接到下一层。除了输入节点,每个节点都是一个带有非线性激活函数的神经元(或称处理单元)。一种被称为反向传播算法的监督学习方法常被用来训练MLP。多层感知器遵循人类神经系统原理,学习并进行数据预测。它首先学习,然后使用权重存储数据,并使用算法来调整权重并减少训练过程中的偏差,即实际值和预测值之间的误差。主要优势在于其快速解决复杂问题的能力。多层感知的基本结构由三层组成:第一输入层,中间隐藏层和最后输出层,输入元素和权重的乘积被馈给具有神经元偏差的求和结点,主要优势在于其快速解决复杂问题的能力。它可以学习非线性函数样本来识别有n个特征向量的事务。The multi-layer perceptron model is developed based on the neural model of the human brain. It is a forward-structured artificial neural network that maps a set of input vectors to a set of output vectors. An MLP can be viewed as a directed graph consisting of multiple layers of nodes, each fully connected to the next layer. Except for the input node, each node is a neuron (or processing unit) with a nonlinear activation function. A supervised learning method known as the backpropagation algorithm is often used to train MLPs. A multilayer perceptron follows the principles of the human nervous system, learning and making predictions from data. It first learns, then stores the data with weights, and uses algorithms to adjust the weights and reduce bias, the error between actual and predicted values, during training. The main strength is its ability to quickly solve complex problems. The basic structure of multi-layer perception consists of three layers: the first input layer, the middle hidden layer and the final output layer. The product of input elements and weights is fed to the summation node with neuron bias. The main advantage lies in its fast resolution of complex ability to problem. It can learn non-linear function samples to identify transactions with n feature vectors.
决策树回归模型也是本预报模型中集合的一类机器学习方法,回归决策树主要指CART(classification and regression tree)算法,内部结点特征的取值为“是”和“否”,为二叉树结构。所谓回归,就是根据特征向量来决定对应的输出值。回归树就是将特征空间划分成若干单元,每一个划分单元有一个特定的输出。因为每个结点都是“是”和“否”的判断,所以划分的边界是平行于坐标轴的。The decision tree regression model is also a kind of machine learning method integrated in this forecast model. The regression decision tree mainly refers to the CART (classification and regression tree) algorithm. The values of internal node features are "yes" and "no", which is a binary tree structure . The so-called regression is to determine the corresponding output value according to the feature vector. The regression tree is to divide the feature space into several units, and each division unit has a specific output. Because each node is a "yes" and "no" judgment, the boundaries of the division are parallel to the coordinate axes.
参阅图1所示,本实施例提供的基于机器学习的波浪特征集合预报方法主要包括如下步骤:Referring to Fig. 1, the machine learning-based wave feature set prediction method provided in this embodiment mainly includes the following steps:
101、通过多层感知器MLP和决策树DT机器学习模型来构建波高、波周期和波向的集合预报模型;101. Construct an ensemble prediction model of wave height, wave period, and wave direction through multi-layer perceptron MLP and decision tree DT machine learning model;
102、将历史波高、波周期和波向数据和未来几天风的参数作为所述集合预报模型的输入;102. Using historical wave height, wave period and wave direction data and wind parameters in the next few days as the input of the ensemble forecast model;
103、所述集合预报模型输出未来的波高,波周期和波向预报结果。103. The ensemble forecast model outputs future forecast results of wave height, wave period and wave direction.
在一具体实施例中,上述的集合预报模型通过如下方式构建:In a specific embodiment, the above-mentioned ensemble prediction model is constructed in the following way:
获取风和波浪要素的再分析历史数据集(例如ERA5数据集,见https://www.ecmwf.int/en/forecasts/datasets/reanalysis-datasets/era5),以生成机器学习模型学习所需的训练数据集;该波浪要素包括波高、波周期和波向。Obtain reanalysis historical datasets of wind and wave elements (e.g. ERA5 dataset, see https://www.ecmwf.int/en/forecasts/datasets/reanalysis-datasets/era5) to generate the data needed for machine learning model learning Training dataset; the wave features include wave height, wave period, and wave direction.
基于训练数据集样本来训练MLP模型和DT模型;以有效波高的预报模型为例,训练数据集的组成如图2所示:The MLP model and DT model are trained based on the samples of the training data set; taking the forecast model of significant wave height as an example, the composition of the training data set is shown in Figure 2:
图2中包括波浪初始条件:T时刻的有效波高(SWH)、平均周期(MWP)和平均波向(MWD);初始风场和预报步长(n-1)的风场;输出端为1+n时刻的波浪特征变量,如果训练的是有效波高,则是对应的有效波高值。Figure 2 includes initial wave conditions: significant wave height (SWH), mean period (MWP) and mean wave direction (MWD) at time T; initial wind field and wind field of forecast step (n-1); the output is 1 The wave characteristic variable at +n time, if the effective wave height is trained, it is the corresponding effective wave height value.
根据判断因子,选取不同预报提前时间的最优预报样本和模型的组合,得到所述集合预报模型。According to the judgment factor, the combination of optimal forecast samples and models with different forecast lead times is selected to obtain the ensemble forecast model.
具体第,上述判断因子包括决定系数R2,平均绝对误差MAE和平均绝对百分数误差MAPE。Specifically, the above judgment factors include coefficient of determination R 2 , mean absolute error MAE and mean absolute percentage error MAPE.
该决定系数其中/> The coefficient of determination where />
该平均绝对误差 The mean absolute error
该平均相对误差 The mean relative error
此外,为了提高预测的计算时效和精度,可供学习的再分析历史数据集可利用模式再分析数据,或者采用以公开的再分析数据集,本发明利用ERA5的小时数据进行验证;用于训练的再分析数据集的时间间隔对预报提前时间有决定作用,本发明预报提前时长为训练样本时间间隔的整数倍;预报不同波浪要素,需要改变训练数据集的输出变量,不同的输出变量对应不同的波浪要素预报。In addition, in order to improve the calculation timeliness and accuracy of prediction, the reanalysis historical data set available for learning can use the model reanalysis data, or use the public reanalysis data set, the present invention uses the hourly data of ERA5 for verification; for training The time interval of the reanalysis data set has a decisive effect on the forecast lead time, and the forecast lead time of the present invention is an integer multiple of the time interval of the training samples; to predict different wave elements, it is necessary to change the output variable of the training data set, and different output variables correspond to different Forecast of wave elements.
故而,本发明与现有技术相比,具有如下技术优势:Therefore, compared with the prior art, the present invention has the following technical advantages:
1、占用计算资源少和所需时间短:1. Occupies less computing resources and takes less time:
本发明主要技术内容可分为两步:1训练模型;2模型预报The main technical content of the present invention can be divided into two steps: 1 training model; 2 model prediction
使用计算机cpu类型:Intel(R)Xeon(R)Gold 6226R CPU@2.90GHzComputer cpu type: Intel(R) Xeon(R) Gold 6226R CPU@2.90GHz
训练模型所需时间根据样本数量不同有所不同,用时最常的预报8个步长的模型勇士约3600秒(一个小时),预测所需时间约5秒。用同样级别的cpu和同样预报区域可能需要4-5小时,大幅度提高计算时效。The time required to train the model varies depending on the number of samples. The most commonly used model for forecasting 8 steps is about 3600 seconds (one hour), and the time required for prediction is about 5 seconds. It may take 4-5 hours to use the same level of cpu and the same forecast area, greatly improving the calculation time.
2、预报精度高:2. High forecast accuracy:
本处采era5算例,如图3-8给出不同预报提前时长的预报精度:Here, era5 is used as an example, as shown in Figure 3-8, which shows the forecast accuracy of different forecast lead times:
对于有效波高SWH,当提前1小时预报时,MAE≈0.102m,MAPE≈0.04;当提前1天预报时,MAE≈0.26m,MAPE≈0.127;当提前4天预报时,MAE≈0.1037m,MAPE≈0.189;For the significant wave height SWH, when forecasting 1 hour in advance, MAE≈0.102m, MAPE≈0.04; when forecasting 1 day in advance, MAE≈0.26m, MAPE≈0.127; when forecasting 4 days in advance, MAE≈0.1037m, MAPE ≈0.189;
对于平均周期MWP,当提前1小时预报时,MAE≈0.188秒,MAPE≈0.02;当提前1天预报时,MAE≈0.487秒,MAPE≈0.071;当提前4天预报时,MAE≈0.848秒,MAPE≈0.108;For the average period MWP, when forecasting 1 hour in advance, MAE≈0.188 seconds, MAPE≈0.02; when forecasting 1 day in advance, MAE≈0.487 seconds, MAPE≈0.071; when forecasting 4 days in advance, MAE≈0.848 seconds, MAPE ≈0.108;
对于平均周期MWD,当提前1小时预报时,MAE≈6.185度,MAPE≈0.159;当提前1天预报时,MAE≈12.34度,MAPE≈0.178;当提前4天预报时,MAE≈17.06度,MAPE≈0.27;For the average period MWD, when forecasting 1 hour in advance, MAE≈6.185 degrees, MAPE≈0.159; when forecasting 1 day in advance, MAE≈12.34 degrees, MAPE≈0.178; when forecasting 4 days in advance, MAE≈17.06 degrees, MAPE ≈0.27;
下面结合一个实验例来对比本发明的技术效果进行进一步的说明:The technical effect of the present invention is further described in conjunction with an experimental example below:
通过设计实验对珠江口[112°E-117°E,18°N-23°N]区域海浪进行预报,训练集数据采用ERA5中2012-2021年11年的风和浪的再分析数据集,并分析本方法的可行性和优缺点。本算例选取2012–2021年,总共11年间87672个小时的风的径向和纬向分量,以及有效波高SWH、平均波周期MWP和平均波向MWD为训练数据集输入变量,将各项数据按照一定顺序排列;为了预报时长的增加,本发明按照1小时,3小时、6小时和12小时四种间隔从原始样本中抽取四种样本进行计算,每种样本预报未来8个步长(一个步长和样本间隔时间相等)的数据,不同样本和预报步长的组合,可以得到不同的训练样本尺寸大小和预报提前时长,详细情况可见下表。The sea waves in the Pearl River Estuary [112°E-117°E, 18°N-23°N] area are predicted by designing experiments. The training set data uses the reanalysis data set of wind and waves from 2012 to 2021 in ERA5 for 11 years. And analyze the feasibility, advantages and disadvantages of this method. This calculation example selects the radial and latitudinal components of the wind for a total of 87,672 hours in 11 years from 2012 to 2021, as well as the effective wave height SWH, average wave period MWP, and average wave direction MWD as the input variables of the training data set. Arranged in a certain order; in order to increase the forecast duration, the present invention extracts four kinds of samples from the original samples according to 1 hour, four intervals of 3 hours, 6 hours and 12 hours for calculation, and each kind of sample predicts 8 steps in the future (one Step size and sample interval time are equal), the combination of different samples and forecast step size can get different training sample size and forecast lead time, details can be seen in the table below.
表各时间间隔训练数据、预报提前时长和相应训练数据数组大小Table training data for each time interval, forecast lead time and corresponding training data array size
各种组合的训练模型的精度如下表:The accuracy of the training model for various combinations is as follows:
Table B.2:MAE for SWHTable B.2: MAE for SWH
TableB.3:MAPE for SWHTable B.3: MAPE for SWH
TableB.5:MAE for MWPTable B.5: MAE for MWP
TableB.6:MAPE for MWPTable B.6: MAPE for MWP
Table B.8:MAE for MWDTable B.8: MAE for MWD
Table B.9:MAPE for MWDTable B.9: MAPE for MWD
利用训练后的模型,输入历史数据和未来几天风的参数,就能用于预报未来的波高,波周期和波向。Using the trained model, input historical data and wind parameters in the next few days, it can be used to predict future wave height, wave period and wave direction.
台风期间波要素预报结果分析:Analysis of forecast results of wave elements during typhoons:
提取ERA52001-2011年间74场台风期间的风和波浪数据,利用前面训练好的模型进行预测,两种模型预测结果精度统计如图9所示,预报结果和实际的风和波浪数据基本保持一致。The wind and wave data during 74 typhoons during ERA52001-2011 were extracted, and the pre-trained models were used to make predictions. The accuracy statistics of the prediction results of the two models are shown in Figure 9, and the forecast results are basically consistent with the actual wind and wave data.
实施例2:Example 2:
参阅图3所示,本实施例提供的基于机器学习的波浪特征集合预报装置包括处理器101、存储器102以及存储在该存储器102中并可在所述处理器101上运行的计算机程序103,例如基于机器学习的波浪特征集合预报程序。该处理器101执行所述计算机程序103时实现上述实施例1步骤,例如图1所示的步骤。Referring to Fig. 3, the machine learning-based wave feature collection forecasting device provided in this embodiment includes a
示例性的,所述计算机程序103可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在所述存储器102中,并由所述处理器101执行,以完成本发明。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述所述计算机程序103在所述基于机器学习的波浪特征集合预报装置中的执行过程。Exemplarily, the
所述基于机器学习的波浪特征集合预报装置可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。所述基于机器学习的波浪特征集合预报装置可包括,但不仅限于,处理器101、存储器102。本领域技术人员可以理解,图5仅仅是基于机器学习的波浪特征集合预报装置的示例,并不构成基于机器学习的波浪特征集合预报装置的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述基于机器学习的波浪特征集合预报装置还可以包括输入输出设备、网络接入设备、总线等。The machine learning-based wave feature set forecasting device can be computing devices such as desktop computers, notebooks, palmtop computers, and cloud servers. The apparatus for predicting wave feature sets based on machine learning may include, but is not limited to, a
所称处理器101可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(FieldProgrammable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The so-called
所述存储器102可以是所述基于机器学习的波浪特征集合预报装置的内部存储元,例如基于机器学习的波浪特征集合预报装置的硬盘或内存。所述存储器102也可以是所述基于机器学习的波浪特征集合预报装置的外部存储设备,例如所述基于机器学习的波浪特征集合预报装置上配备的插接式硬盘,智能存储卡(SmartMedia Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储器102还可以既包括所述基于机器学习的波浪特征集合预报装置的内部存储单元也包括外部存储设备。所述存储器102用于存储所述计算机程序以及所述基于机器学习的波浪特征集合预报装置所需的其他程序和数据。所述存储器102还可以用于暂时地存储已经输出或者将要输出的数据。The
实施例3:Example 3:
本实施例提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现实施例1所述方法的步骤。This embodiment provides a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the method described in
所示计算机可读介质可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或多个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理再以电子方式获得所述程序,然后将其存储在计算机存储器中。The illustrated computer readable medium can be any means that can contain, store, communicate, propagate or transport the program for use by or in connection with an instruction execution system, apparatus or device. More specific examples (non-exhaustive list) of computer-readable media include the following: electrical connection with one or more wires (electronic device), portable computer disk case (magnetic device), random access memory (RAM), Read Only Memory (ROM), Erasable and Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer-readable medium may even be paper or other suitable medium on which the program can be printed, for example by optically scanning the paper or other medium, followed by editing, interpretation or other suitable means if necessary The process then obtains the program electronically and stores it in the computer memory.
上述实施例只是为了说明本发明的技术构思及特点,其目的是在于让本领域内的普通技术人员能够了解本发明的内容并据以实施,并不能以此限制本发明的保护范围。凡是根据本发明内容的实质所做出的等效的变化或修饰,都应涵盖在本发明的保护范围内。The above-mentioned embodiments are only to illustrate the technical concept and characteristics of the present invention, and its purpose is to enable those of ordinary skill in the art to understand the content of the present invention and implement it accordingly, and cannot limit the protection scope of the present invention. All equivalent changes or modifications made according to the essence of the content of the present invention shall fall within the protection scope of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310312853.8A CN116340868A (en) | 2023-03-27 | 2023-03-27 | A method and device for ensemble forecasting of wave features based on machine learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310312853.8A CN116340868A (en) | 2023-03-27 | 2023-03-27 | A method and device for ensemble forecasting of wave features based on machine learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116340868A true CN116340868A (en) | 2023-06-27 |
Family
ID=86887374
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310312853.8A Pending CN116340868A (en) | 2023-03-27 | 2023-03-27 | A method and device for ensemble forecasting of wave features based on machine learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116340868A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117909713A (en) * | 2024-01-03 | 2024-04-19 | 中国人民解放军国防科技大学 | Ocean wave data feature selection method, device and equipment based on imbalanced learning |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020209704A1 (en) * | 2019-04-10 | 2020-10-15 | 임효혁 | Method for generating high-resolution marine data, and device using same |
CN115169439A (en) * | 2022-06-16 | 2022-10-11 | 中国人民解放军国防科技大学 | A method and system for effective wave height prediction based on sequence-to-sequence network |
CN115222163A (en) * | 2022-09-20 | 2022-10-21 | 中国海洋大学 | Medium- and long-term real-time forecasting method, system and application of multi-element wave at the entrance of harbor basin |
-
2023
- 2023-03-27 CN CN202310312853.8A patent/CN116340868A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020209704A1 (en) * | 2019-04-10 | 2020-10-15 | 임효혁 | Method for generating high-resolution marine data, and device using same |
CN115169439A (en) * | 2022-06-16 | 2022-10-11 | 中国人民解放军国防科技大学 | A method and system for effective wave height prediction based on sequence-to-sequence network |
CN115222163A (en) * | 2022-09-20 | 2022-10-21 | 中国海洋大学 | Medium- and long-term real-time forecasting method, system and application of multi-element wave at the entrance of harbor basin |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117909713A (en) * | 2024-01-03 | 2024-04-19 | 中国人民解放军国防科技大学 | Ocean wave data feature selection method, device and equipment based on imbalanced learning |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Mukherjee et al. | Stock market prediction using deep learning algorithms | |
Hatami et al. | Classification of time-series images using deep convolutional neural networks | |
Reddy et al. | A deep neural networks based model for uninterrupted marine environment monitoring | |
WO2023284142A1 (en) | Signal processing method for neuron in spiking neural network and method for training said network | |
Ren et al. | Deep attention-based neural networks for explainable heart sound classification | |
Guo et al. | A stock market forecasting model combining two-directional two-dimensional principal component analysis and radial basis function neural network | |
Boutros et al. | Quantface: Towards lightweight face recognition by synthetic data low-bit quantization | |
Hu et al. | The MBPEP: a deep ensemble pruning algorithm providing high quality uncertainty prediction | |
Li et al. | Evolving deep gated recurrent unit using improved marine predator algorithm for profit prediction based on financial accounting information system | |
WO2021042857A1 (en) | Processing method and processing apparatus for image segmentation model | |
Hussain et al. | A dynamic neural network architecture with immunology inspired optimization for weather data forecasting | |
CN113609773B (en) | Method and system for predicting performance based on data reliability evaluation result of small sample | |
Faraz et al. | Multi-step-ahead stock market prediction based on least squares generative adversarial network | |
Jiang et al. | An intelligent recommendation approach for online advertising based on hybrid deep neural network and parallel computing | |
CN116340868A (en) | A method and device for ensemble forecasting of wave features based on machine learning | |
CN116681159A (en) | Short-term power load prediction method based on whale optimization algorithm and DRESN | |
CN108763418A (en) | A kind of sorting technique and device of text | |
Resmi et al. | Analogy-based approaches to improve software project effort estimation accuracy | |
Zeebaree et al. | The prediction process based on deep recurrent neural networks: a review | |
Nikoo et al. | Wave height prediction using artificial immune recognition systems (AIRS) and some other data mining techniques | |
Hou et al. | Adaptive learning cost‐sensitive convolutional neural network | |
CN117056694A (en) | Training method of error predictor, transaction data prediction method and device | |
CN111860556A (en) | A model processing method, device and storage medium | |
Kim et al. | COVID-19 outbreak prediction using Seq2Seq+ Attention and Word2Vec keyword time series data | |
CN116186388A (en) | User's interest recommendation method, electronic device and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |