CN109685249A - Air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network - Google Patents

Air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network Download PDF

Info

Publication number
CN109685249A
CN109685249A CN201811411059.4A CN201811411059A CN109685249A CN 109685249 A CN109685249 A CN 109685249A CN 201811411059 A CN201811411059 A CN 201811411059A CN 109685249 A CN109685249 A CN 109685249A
Authority
CN
China
Prior art keywords
data
prediction
model
bilstm
autoencoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811411059.4A
Other languages
Chinese (zh)
Inventor
张波
张旱文
李美子
赵勤
秦东明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Normal University
University of Shanghai for Science and Technology
Original Assignee
Shanghai Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Normal University filed Critical Shanghai Normal University
Priority to CN201811411059.4A priority Critical patent/CN109685249A/en
Publication of CN109685249A publication Critical patent/CN109685249A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N15/00Investigating characteristics of particles; Investigating permeability, pore-volume, or surface-area of porous materials
    • G01N15/06Investigating concentration of particle suspensions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Chemical & Material Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Primary Health Care (AREA)
  • Artificial Intelligence (AREA)
  • Pathology (AREA)
  • Immunology (AREA)
  • Biochemistry (AREA)
  • Analytical Chemistry (AREA)
  • Dispersion Chemistry (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Educational Administration (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network that the present invention relates to a kind of, it include: step S1: according to the data of environmental monitoring PM2.5 pollutant concentration and meteorological factor, take PM2.5 as the target contaminant of prediction, constructs the model of target cities PM2.5 concentration prediction;Step S2: selecting trained and test data from environmental monitoring data, completes initialization and training to prediction model;Step S3: air PM2.5 concentration is predicted using the model that training is completed.Compared with prior art, the present invention is able to carry out profound analysis to the feature of contamination data, so as to extract profound connection between data, effectively utilizes environment big data, realizes the promotion of environmental management level.

Description

Air PM2.5 concentration based on AutoEncoder and BiLSTM fused neural network is pre- Survey method
Technical field
The present invention relates to a kind of urban air pollution object concentration prediction methods, are based on more particularly, to one kind The air PM2.5 concentration prediction method of AutoEncoder and BiLSTM fused neural network.
Background technique
Nowadays, many developing countries still suffer from air pollution problems inherent.For example, in 2016, the north is big in China The air quality of urban is below the number of days that national health air quality standard is more than 40%.Air pollution will lead to many Serious health problem, such as respiratory disease, cardiovascular disease and decline in pulmonary function.Therefore, control air pollution causes public affairs Many extensive concerns.Environmental Protection Agency (EPA) is by particulate matter (PM), ground level ozone (O3), carbon monoxide (CO), titanium dioxide Sulphur (SO2) and nitrogen oxides (NOx) are classified as the key pollutants in National Ambient Air Quality Standards.In these pollutants, Particulate matter (PM) is most dangerous, because it is maximum to the negative effect of publilc health and environment, is sent out according to the World Health Organization The information of cloth, PM2.5 (aerodynamic diameter is less than 2.5um) even increase mortality risk.Therefore, quasi- under existing situation The concentration of true prediction pollutant PM2.5 has for providing timely, complete environmental quality information and then protecting public health It is of great importance.Accurate Prediction pollutant concentration needs to make full use of the mass data under current environment, to make more accurate Prediction.
Currently, many researchers study the prediction of pollutant concentration both at home and abroad, wherein most is used Conventional model prediction technique is not bound with deep learning method.The CMAQ such as issued by U.S.EPA traditional prediction model, The pollution level that air is evaluated by the simulation to the concentration of various pollutants matter in air, predicts following air regime. Zhang Kaimei etc. by predicting with statistical analysis and meteorological statistical method the PM2.5 in Nanchang, and establish summer and The statistics specialty of non-summer PM2.5 mass concentration and meteorological element.Giorgio Corani et al. devises multi-tag point The Air Pollution Forecasting model of class, the prediction using bayesian algorithm to multiple pollutant concentrations.Kuremoto et al. using by It limits the depth network that Boltzmann machine is constituted and carries out time series forecasting, introduce time series, there is precision of prediction compared with conventional model It is improved.BT Ong et al. carries out the prediction of pollutant concentration using Zhan Shi from coding and RNN model, and Zhan Shi encodes work certainly For the bottom of prediction model, the input as high level RNN network is exported, makes network with the time using the method for dynamic training Sequence gradually adjusts weight, is that model can reach an ideal state, has compared with other conventional machines learning models Preferable result.
The above conventional model prediction technique and machine learning prediction technique, all in air pollutant concentration prediction work In have the advantages that it is respective.This quadrat method does not all carry out the analysis of deeper to data, from without that can extract data depth The connection of level.On the other hand, there is also correlations with other weather variables for pollutant concentration, and based on traditional prediction side Method is usually to be predicted according to historical data and experience, is not able to verify that influence of other meteorological datas to pollutant concentration.
Summary of the invention
It is an object of the present invention to overcome the above-mentioned drawbacks of the prior art and provide one kind to be based on The air PM2.5 concentration prediction method of AutoEncoder and BiLSTM fused neural network.
The purpose of the present invention can be achieved through the following technical solutions:
A kind of air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network, comprising:
Step S1: taking PM2.5 as the mesh of prediction according to the data of environmental monitoring PM2.5 pollutant concentration and meteorological factor Pollutant is marked, the model of target cities PM2.5 concentration prediction is constructed;
Step S2: selecting trained and test data from environmental monitoring data, completes initialization and instruction to prediction model Practice;
Step S3: air PM2.5 concentration is predicted using the model that training is completed.
The model is using AutoEnocder as bottom, using BiLSTM as upper layer, the lower compression and extraction input data Feature, and with extraction time sequence signature and final prediction result is generated for result as the input on upper layer.
The step S2 is specifically included:
Step S21: the data for modeling are normalized, wherein the data for modeling include original Parent pollutant data and original meteorological data;
Step S22: by data set according to 80%, 10%, 10% ratio cut partition training set, verifying collection and test set;
Step S23: training set data training pattern is utilized;
Step S24: the generalization ability of verifying collection and the resulting prediction model of test set data verification training is utilized.
In the step S21, specifically it is normalized using min-max standardized method:
Wherein: NewVariable be data normalization treated value, max (variable) be data maximum value, Min (variable) is the minimum value of data, and variable is the original value of data.
The step S23 is specifically included:
Step S231: initial data is converted to two-dimensional matrix:
Wherein: X is the two-dimensional matrix converted by initial data;
Step S232: reconfiguring two-dimensional matrix, the data splitting needed:
X '=X* θ (β12,…,βn)
Wherein: X ' is the data after being reconfigured based on two-dimensional matrix X, θ is the two-dimensional matrix being made of 0 and 1, βiIt is 0 Vector or 1 vector;
Step S233: data characteristics is compressed and extracted to data splitting, and reconstructs data splitting:
X "=σ ' (W ' Y+b ')
Y=σ (WX '+b)
Wherein: σ is relu activation primitive, and W is weight matrix, and b is bias vector;X " (i, j) is the number of combinations after reconstruct According to σ ' is relu activation primitive, and W ' is the weight matrix of decoding stage, and Y is after encoding as a result, b ' is decoding stage Bias vector.
Step S234: being trained using AutoEncoder of the data splitting after weight to model, is set until error is less than Determine threshold value, executes step S235;
Step S235: entire model is trained.
Reconstructed error are as follows:
∈ (X ', X ")=| | X '-σ ' (W ' (σ (WX '+b))+b ') | |2
Wherein: ∈ (X ', X ") is reconstructed error, | | | | it is norm.
Step S24 is specifically included:
Step S241: the prediction result that test set data input model is obtained;
Step S242: related coefficient is calculated:
Wherein: ρ (O, P) is related coefficient, and Cov (O, P) is the covariance of observation and predicted value, and D (O) is observation Variance, D (P) are the variance of predicted value;
Step S243: root-mean-square error is calculated:
Wherein: RMSE is root-mean-square error, OiFor the value of truthful data, PiFor the value of model prediction, i is time serial number, n For the total duration of prediction.
Compared with prior art, the invention has the following advantages:
1) in view of the influence that other meteorological variables predict pollutant concentration, and determine any meteorological variables to pollution Object concentration influences maximum.
2) prediction work of pollutant the experience and historical experience that a large amount of historical datas sum up are not depended only on into The changing rule of pollutant is concluded, so as to fully consider atmospheric environment this problem complicated and changeable.
3) it is able to carry out profound analysis, to the feature of contamination data so as to extract profound connection between data System effectively utilizes environment big data, realizes the promotion of environmental management level.
4) the more traditional prediction technique of the accuracy predicted is high, under same operating time and operating condition, can produce Raw better result.
Detailed description of the invention
Fig. 1 is the key step flow diagram of the method for the present invention;
Fig. 2 is the flow diagram in the embodiment of the present invention;
Fig. 3 is the structural schematic diagram of prediction model constructed by the present invention.
Specific embodiment
The present invention is described in detail with specific embodiment below in conjunction with the accompanying drawings.The present embodiment is with technical solution of the present invention Premised on implemented, the detailed implementation method and specific operation process are given, but protection scope of the present invention is not limited to Following embodiments.
The present invention is first defined air pollutant concentration prediction:
Define the prediction of 1 air pollutant concentration: right mainly by Historical Pollution object and relevant weather data information Concentration of the PM2.5 air pollutants in the following certain time is predicted, is environmental science, Meteorological Science, computer science Deng all in one of project of primary study, thus there is certain subject crossing.
Define 2 Classical forecast methods: the air pollutant concentration prediction technique of non-deep learning is referred to as traditional prediction side Method, the prediction of the empirical model such as based on historical data and statistical method;Based on statistics and mathematical method or model foundation The prediction of probabilistic model;Utilize the prediction of integrated approach;And the prediction model etc. established based on conventional machines study, it belongs to Classical forecast method.
In the following, providing pre- using the prediction model of the invention based on AutoEncoder and BiLSTM fused neural network The method for surveying PM2.5 concentration, as depicted in figs. 1 and 2, comprising:
Step S1: taking PM2.5 as the mesh of prediction according to the data of environmental monitoring PM2.5 pollutant concentration and meteorological factor Pollutant is marked, the model of target cities PM2.5 concentration prediction is constructed;The model is using AutoEnocder as bottom, with BiLSTM For upper layer, the lower compression and extraction input data feature, and it is special with extraction time sequence using result as the input on upper layer It levies and generates final prediction result.
In Fig. 2, prior to the training of entire model, loss function uses MSE, and passes through anti-for the training of AutoEncoder Error propagation is carried out to propagation algorithm and is connected to the network the update of weight;Entire model is added in trained AutoEncoder It is trained.The two-dimensional matrix of input carries out feature extraction and compression by the Encoder of AutoEncoder, and by its result It is input in BiLSTM, carries out the extraction of time series feature, and export final prediction result.Wherein autoencoder network is sharp Function living uses relu, and loss function uses mse, and the activation primitive of BiLSTM uses tanh, and loss function uses mse, uses Adam optimization algorithm updates neural network weight.
Step S2: selecting trained and test data from environmental monitoring data, completes initialization and instruction to prediction model Practice, specifically include:
Step S21: since pollutant is different with the unit of meteorological data, place is normalized in the data for being accordingly used in modeling Reason improves the training speed and precision of prediction of model, chooses min-max standardized method in the present invention, i.e., specifically use min- Max standardized method is normalized:
Wherein: NewVariable be data normalization treated value, max (variable) be data maximum value, Min (variable) is the minimum value of data, and variable is the original value of data.Initial data mean value and standard deviation are given, It is indicated respectively with μ and σ, treated data all meet μ=0, the standardized normal distribution of σ=1.
Step S22: by data set according to 80%, 10%, 10% ratio cut partition training set, verifying collection and test set;
The rationally error threshold of setting model, value range is between 0.001-0.0001, and learning rate is in 0.01-0.3 Between value, maximum number of iterations be 1000 times, the self-loopa coefficient of BiLSTM takes 0.006, λ that 1e-6, ζ is taken to take 0.9.For AutoEncoder network, coding layer and decoding layer take bilayer, and wherein activation primitive uses relu, and loss function uses MSE, optimizer use gradient descent algorithm (sgd), and BiLSTM network is single layer, neuronal quantity 50.
Step S23: training set data training pattern is utilized;
Firstly, it is necessary to convert two-dimensional matrix for the pollutant data and meteorological data of the training set of input, matrix it is every One behavior contaminant information and various meteorological data information, it is each to be classified as a certain specific contaminant information or meteorological data Information,
Firstly, defining loss function:
1. the loss function for defining its training stage is as follows for AutoEncoder network:
Wherein, OiIt is the true value of target contaminant, PiIt is the predicted value of target contaminant, i is time serial number, and N is prediction Total duration.MSE is smaller, it was demonstrated that the accuracy of prediction is higher.
2. the loss function for defining its training stage is as follows for entire model:
Function setup is root-mean-square error.In formula, OiIt is the true value of target contaminant, PiIt is the prediction of target contaminant Value, i are time serial number, and N is the total duration of prediction.Loss function is smaller, it was demonstrated that the accuracy of prediction is higher.
Then training process specifically includes:
Step S231:
Wherein: X is the two-dimensional matrix converted by initial data;
Step S232: reconfiguring two-dimensional matrix, the data splitting needed:
X '=X* θ (β12,…,βn) (5)
Wherein: X ' is the data after being reconfigured based on two-dimensional matrix X, and θ is the two-dimensional matrix being made of 0 and 1, βiIt is 0 Vector or 1 vector;
Step S233: data characteristics is compressed and extracted to data splitting, and reconstructs data splitting:
X "=σ ' (W ' Y+b ') (6)
Y=σ (WX '+b) (7)
Wherein: σ is relu activation primitive, and W is weight matrix, and b is bias vector;X " (i, j) is the number of combinations after reconstruct According to σ ' is relu activation primitive, and W ' is the weight matrix of decoding stage, and Y is after encoding as a result, b ' is the inclined of decoding stage Set vector.
Step S234: being trained using AutoEncoder of the data splitting after weight to model, is set until error is less than Determine threshold value, executes step S235;
In order to which training is from encoding model, need to minimize reconstructed error, reconstructed error calculation formula is as follows:
∈ (X ', X ")=| | X '-σ ' (W ' (σ (WX '+b))+b ') | |2 (8)
Wherein: ∈ (X ', X ") is reconstructed error, | | | | it is norm.
It mainly include the following factor { PM2.5 concentration, temperature, wind speed, dew point temperature in the two-dimensional matrix of this stage input Degree, snowfall, precipitation, other meteorological variables }, the two-dimensional matrix of input is compressed, real data characteristics, net are obtained Input value accurately can be translated into pollutant concentration situation by network, establish the mapping for being input to output.With formula (7) weighing apparatus Amount is decoded the data characteristics after compression from coding from the accuracy of coding, make according to the obtained data of decoding with not The data of compression are compared, constantly training, keep error smaller and smaller, after network meets expected expectation, stop the first rank The training of section, into the training of second stage.
Step S235: entire model is trained.Two-dimentional input matrix is after AutoEncoder compression and feature extraction The one-dimensional vector with timing for highly concentratedization being converted into has time series pre- as BiLSTM layers of input, model The function of survey, input of the d hours values as entire model before t moment, the target of prediction is N hours after t moment The concentration value (d be all the time window set with N) of PM2.5.X is enabled to indicate input, t indicates dynamic time series, U, W table Showing that weight matrix, h indicate hidden layer information, b indicates biasing, indicates training process unidirectional in BiLSTM with following formula, Opposite direction process is similar with one-way process:
Certain data informations of the past PM2.5 of A.BiLSTM selective amnesia first and other meteorological variables,
ft=σ (Ufht-1+Wfxt+bf) (9)
B. it determines to store new information in location mode, which comes from two parts, the sigmoid layer of " input threshold " Determining the information updated, tanh layers create new candidate value vector,
it=σ (Uiht-1+Wixt+bi) (10)
C. the update of oldState is carried out,
D. final decision output information, the PM2.5 concentration as predicted,
ot=σ (Uoht-1+Woxt+bo) (12)
The predicted value of BiLSTM output exports final result by softmax.Adam optimization algorithm is used in entire model, Network weight is updated by the continuous iteration of training data, while in order to exclude neural metwork training when is also easy to produce overfitting problem It influences, the present invention is controlled using dropout the relevant technologies, avoids overfitting problem, training process continues to model Performance meets expectation.After model training, each connection weight and parameter also determine therewith.
Step S24: specific to wrap using the generalization ability of verifying collection and the resulting prediction model of test set data verification training It includes:
Step S241: the prediction result that test set data input model is obtained;
Step S242: related coefficient is calculated:
Wherein: ρ (O, P) is related coefficient, and Cov (O, P) is the covariance of observation and predicted value, and D (O) is observation Variance, D (P) are the variance of predicted value;
Step S243: root-mean-square error is calculated:
Wherein: RMSE is root-mean-square error, OiFor the value of truthful data, PiFor the value of model prediction, i is time serial number, n For the total duration of prediction.
If calculating the error of resulting true value and observation in preset threshold value, and it is better than traditional prediction method Prediction result under square one then illustrates that model meets and is expected, can be used for predicting the city in the following certain time The concentration of PM2.5.
Performance Evaluation is carried out to the resulting prediction model of training.Through experimental analysis and comparison, compared to other existing sides Method, in the same circumstances, it is higher as a result, and can make full use of the pollution of magnanimity that model of the invention can generate accuracy Object and meteorological data.
Step S3: air PM2.5 concentration is predicted using the model that training is completed.
In conclusion the prediction model based on AutoEncoder and BiLSTM fused neural network constructed by the present invention It is built upon in the research of already present two kinds of deep neural networks, using the characteristics of two kinds of neural networks and advantage, establishes A kind of model that can predict the PM2.5 concentration in the future certain time of target cities.Used loss function is also previous Research present in, and being proved to can weighing result accuracy well.So the present invention is polluted for previous prediction The deficiency of the method for object concentration takes full advantage of already present research achievement, proposes and is merged based on two kinds of deep neural networks Prediction model.The model carries out the extraction and compressed data of important feature to input data using AutoEncoder as bottom Model training efficiency can be improved, export input of the result as high level BiLSTM network, extract the time series of pollutant Feature can fully take into account the temporal associativity of pollutant, can adequately utilize the letter in the past with future using BiLSTM Breath can obtain more having accurate prediction result, thus have substantial application prospect.

Claims (7)

1. a kind of air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network, feature exist In, comprising:
Step S1: according to the data of environmental monitoring PM2.5 pollutant concentration and meteorological factor, with the target dirt that PM2.5 is prediction Object is contaminated, the model of target cities PM2.5 concentration prediction is constructed;
Step S2: selecting trained and test data from environmental monitoring data, completes initialization and training to prediction model;
Step S3: air PM2.5 concentration is predicted using the model that training is completed.
2. a kind of air PM2.5 based on AutoEncoder and BiLSTM fused neural network according to claim 1 is dense Spend prediction technique, which is characterized in that the model is using AutoEnocder as bottom, using BiLSTM as upper layer, the lower compression With extract input data feature, and with extraction time sequence signature and generate final prediction using result as the input on upper layer and tie Fruit.
3. a kind of air PM2.5 based on AutoEncoder and BiLSTM fused neural network according to claim 2 is dense Spend prediction technique, which is characterized in that the step S2 is specifically included:
Step S21: the data for modeling are normalized, wherein the data for modeling include original dirt Contaminate object data and original meteorological data;
Step S22: by data set according to 80%, 10%, 10% ratio cut partition training set, verifying collection and test set;
Step S23: training set data training pattern is utilized;
Step S24: the generalization ability of verifying collection and the resulting prediction model of test set data verification training is utilized.
4. a kind of air PM2.5 based on AutoEncoder and BiLSTM fused neural network according to claim 3 is dense Spend prediction technique, which is characterized in that in the step S21, be specifically normalized using min-max standardized method:
Wherein: NewVariable be data normalization treated value, max (variable) be data maximum value, min It (variable) is the minimum value of data, variable is the original value of data.
5. a kind of air PM2.5 based on AutoEncoder and BiLSTM fused neural network according to claim 3 is dense Spend prediction technique, which is characterized in that the step S23 is specifically included:
Step S231: initial data is converted to two-dimensional matrix:
Wherein: X is the two-dimensional matrix converted by initial data;
Step S232: reconfiguring two-dimensional matrix, the data splitting needed:
X '=X* θ (β12,…,βn)
Wherein: X ' is the data after being reconfigured based on two-dimensional matrix X, and θ is the two-dimensional matrix being made of 0 and 1, βiFor 0 vector or 1 vector;
Step S233: data characteristics is compressed and extracted to data splitting, and reconstructs data splitting:
X "=σ ' (W ' Y+b ')
Y=σ (WX '+b)
Wherein: σ is relu activation primitive, and W is weight matrix, and b is bias vector;X " (i, j) is the data splitting after reconstruct, σ ' For relu activation primitive, W ' is the weight matrix of decoding stage, and Y is after encoding as a result, b ' is being biased towards for decoding stage Amount.
Step S234: being trained using AutoEncoder of the data splitting after weight to model, until error is less than setting threshold Value executes step S235;
Step S235: entire model is trained.
6. a kind of air PM2.5 based on AutoEncoder and BiLSTM fused neural network according to claim 5 is dense Spend prediction technique, which is characterized in that reconstructed error are as follows:
∈ (X ', X ")=| | X '-σ ' (W ' (σ (WX '+b))+b ') | |2
Wherein: ∈ (X ', X ") is reconstructed error, | | | | it is norm.
7. a kind of air PM2.5 based on AutoEncoder and BiLSTM fused neural network according to claim 3 is dense Spend prediction technique, which is characterized in that step S24 is specifically included:
Step S241: the prediction result that test set data input model is obtained;
Step S242: related coefficient is calculated:
Wherein: ρ (O, P) is related coefficient, and Cov (O, P) is the covariance of observation and predicted value, and D (O) is the side of observation Difference, D (P) are the variance of predicted value;
Step S243: root-mean-square error is calculated:
Wherein: RMSE is root-mean-square error, OiFor the value of truthful data, PiFor the value of model prediction, i is time serial number, and n is pre- The total duration of survey.
CN201811411059.4A 2018-11-24 2018-11-24 Air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network Pending CN109685249A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811411059.4A CN109685249A (en) 2018-11-24 2018-11-24 Air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811411059.4A CN109685249A (en) 2018-11-24 2018-11-24 Air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network

Publications (1)

Publication Number Publication Date
CN109685249A true CN109685249A (en) 2019-04-26

Family

ID=66185545

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811411059.4A Pending CN109685249A (en) 2018-11-24 2018-11-24 Air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network

Country Status (1)

Country Link
CN (1) CN109685249A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110082283A (en) * 2019-05-23 2019-08-02 山东科技大学 A kind of Atmospheric particulates SEM image recognition methods and system
CN110147877A (en) * 2019-05-24 2019-08-20 福州大学 PM2.5 prediction technique based on the long network in short-term of convolution
CN110568127A (en) * 2019-09-09 2019-12-13 北京工业大学 air pollutant concentration monitoring method based on time domain weighting
CN111178438A (en) * 2019-12-31 2020-05-19 象辑知源(武汉)科技有限公司 ResNet 101-based weather type identification method
CN113206756A (en) * 2021-04-22 2021-08-03 大连大学 Network flow prediction method based on combined model
CN114330120A (en) * 2021-12-28 2022-04-12 武汉大学 24-hour PM prediction based on deep neural network2.5Method of concentration

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106485353A (en) * 2016-09-30 2017-03-08 中国科学院遥感与数字地球研究所 Air pollutant concentration forecasting procedure and system
CN108009674A (en) * 2017-11-27 2018-05-08 上海师范大学 Air PM2.5 concentration prediction methods based on CNN and LSTM fused neural networks
CN108133295A (en) * 2018-01-11 2018-06-08 安徽优思天成智能科技有限公司 A kind of motor-driven vehicle gas concentration continuous time Forecasting Methodology for target road section

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106485353A (en) * 2016-09-30 2017-03-08 中国科学院遥感与数字地球研究所 Air pollutant concentration forecasting procedure and system
CN108009674A (en) * 2017-11-27 2018-05-08 上海师范大学 Air PM2.5 concentration prediction methods based on CNN and LSTM fused neural networks
CN108133295A (en) * 2018-01-11 2018-06-08 安徽优思天成智能科技有限公司 A kind of motor-driven vehicle gas concentration continuous time Forecasting Methodology for target road section

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110082283A (en) * 2019-05-23 2019-08-02 山东科技大学 A kind of Atmospheric particulates SEM image recognition methods and system
CN110082283B (en) * 2019-05-23 2021-12-14 山东科技大学 Atmospheric particulate SEM image recognition method and system
CN110147877A (en) * 2019-05-24 2019-08-20 福州大学 PM2.5 prediction technique based on the long network in short-term of convolution
CN110568127A (en) * 2019-09-09 2019-12-13 北京工业大学 air pollutant concentration monitoring method based on time domain weighting
CN110568127B (en) * 2019-09-09 2021-07-30 北京工业大学 Air pollutant concentration monitoring method based on time domain weighting
CN111178438A (en) * 2019-12-31 2020-05-19 象辑知源(武汉)科技有限公司 ResNet 101-based weather type identification method
CN113206756A (en) * 2021-04-22 2021-08-03 大连大学 Network flow prediction method based on combined model
CN113206756B (en) * 2021-04-22 2023-10-03 大连大学 Network flow prediction method based on combined model
CN114330120A (en) * 2021-12-28 2022-04-12 武汉大学 24-hour PM prediction based on deep neural network2.5Method of concentration
CN114330120B (en) * 2021-12-28 2024-04-23 武汉大学 24-Hour PM prediction based on deep neural network2.5Concentration method

Similar Documents

Publication Publication Date Title
CN109685249A (en) Air PM2.5 concentration prediction method based on AutoEncoder and BiLSTM fused neural network
CN108009674A (en) Air PM2.5 concentration prediction methods based on CNN and LSTM fused neural networks
Eslamirad et al. Thermal comfort prediction by applying supervised machine learning in green sidewalks of Tehran
CN109142171B (en) Urban PM10 concentration prediction method based on feature expansion and fusing with neural network
CN110782093B (en) PM fusing SSAE deep feature learning and LSTM2.5Hourly concentration prediction method and system
US11874429B2 (en) High-temperature disaster forecast method based on directed graph neural network
CN106611090B (en) A kind of road side air pollutant concentration Forecasting Methodology based on reconstruct deep learning
CN109214592A (en) A kind of Air Quality Forecast method of the deep learning of multi-model fusion
CN112465243B (en) Air quality forecasting method and system
CN103679263A (en) Thunder and lightning approach forecasting method based on particle swarm support vector machine
CN109146161A (en) Merge PM2.5 concentration prediction method of the stack from coding and support vector regression
CN110766222B (en) PM2.5 concentration prediction method based on particle swarm parameter optimization and random forest
Dai et al. A recurrent neural network using historical data to predict time series indoor PM2. 5 concentrations for residential buildings
Abrahart et al. Using pruning algorithms and genetic algorithms to optimise network architectures and forecasting inputs in a neural network rainfall-runoff model
CN108399470B (en) Indoor PM2.5 prediction method based on multi-example genetic neural network
CN106127242A (en) Year of based on integrated study Extreme Precipitation prognoses system and Forecasting Methodology thereof
CN116187210B (en) Time-space multisource offshore area water quality time sequence prediction method of LSTM coupling mechanism model
CN109615147A (en) A kind of following 72 hours air pollution forecasting method for early warning
CN115860286B (en) Air quality prediction method and system based on time sequence gate mechanism
CN114372631A (en) Data-lacking area runoff prediction method based on small sample learning and LSTM
CN110260914A (en) A kind of project security monitoring system realm division methods based on measuring point space-time characteristic
Li et al. Assessing urban micro-climates with vertical and horizontal building morphological cutting deep transfer learning neural networks
Liu et al. Air Quality Index Forecasting via Genetic Algorithm-Based Improved Extreme Learning Machine
CN116384565A (en) Hierarchical atmospheric ozone concentration prediction method based on missing data filling
Putra et al. Prediction of PM2. 5 and PM10 parameters using artificial neural network: a case study in Kemayoran, Jakarta

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190426

RJ01 Rejection of invention patent application after publication