CN115879569A - IoT observation data online learning method and system - Google Patents

IoT observation data online learning method and system Download PDF

Info

Publication number
CN115879569A
CN115879569A CN202310214205.9A CN202310214205A CN115879569A CN 115879569 A CN115879569 A CN 115879569A CN 202310214205 A CN202310214205 A CN 202310214205A CN 115879569 A CN115879569 A CN 115879569A
Authority
CN
China
Prior art keywords
data
online
data stream
difference
observation data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310214205.9A
Other languages
Chinese (zh)
Other versions
CN115879569B (en
Inventor
张兆虔
李响
赵志刚
王春晓
耿丽婷
郭莹
吴晓明
王英龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qilu University of Technology
Shandong Computer Science Center National Super Computing Center in Jinan
Original Assignee
Qilu University of Technology
Shandong Computer Science Center National Super Computing Center in Jinan
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qilu University of Technology, Shandong Computer Science Center National Super Computing Center in Jinan filed Critical Qilu University of Technology
Priority to CN202310214205.9A priority Critical patent/CN115879569B/en
Publication of CN115879569A publication Critical patent/CN115879569A/en
Application granted granted Critical
Publication of CN115879569B publication Critical patent/CN115879569B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides an online learning method and system of IoT observation data, which relate to the technical field of data processing and are characterized in that an online deep learning model is initialized according to acquired initial time sequence observation data; acquiring time sequence observation data generated by a sensor in real time, and forming an input data stream according to the time sequence observation data; processing an input data stream by an online deep learning model to generate a final prediction result; in the process of processing the input data stream, the input data stream is immediately learned, and an online deep learning model is dynamically updated in real time; the instant learning is to learn data distribution based on the mean value and variance of data flow, construct quasi-normal distribution, reconstruct a new sample, realize a variation attention network, and dynamically adjust a model based on distribution difference, reconstruction difference and reasoning difference; the method learns the hidden information among different hidden layers, improves the accuracy of model reasoning, and dynamically adjusts the parameters among the different hidden layers through online learning.

Description

IoT observation data online learning method and system
Technical Field
The invention belongs to the technical field of data processing, and particularly relates to an IoT observation data online learning method and system.
Background
The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.
In recent years, with the high-speed development of the Internet of Things (IoT), various types of sensors of the Internet of Things are becoming more and more popular; correspondingly, time series observation data generated by the sensor of the internet of things are exponentially increased, and real-time intelligent analysis of the data is particularly urgent and necessary in the application fields of intelligent environment monitoring, industrial monitoring, data center monitoring and the like.
With the rapid development of artificial intelligence, in the application field, knowledge needs to be accurately mined from data; deep Learning (DL) has a strong nonlinear modeling capability and the ability to extract abstract features layer by layer, with great success in modeling various data modalities, such as images, natural languages, speech, biological signals, etc.; therefore, a large number of deep learning models are continuously proposed and verified in the data modeling of the internet of things.
It should be noted that the deep learning model is mainly a static and basic parameter set, and after the deep learning model is trained offline, the parameters of the deep learning model are not changed; however, the traditional offline batch training and online reasoning solution mainly aims at internet of things data with small distribution variation, such as license plate recognition tasks; the scheme cannot meet the real-time analysis requirement of the data stream of the Internet of things with high sampling rate and unknown change; most of observation data of the internet of things are non-stationary data, the probability distribution of the observation data changes along with the time, and therefore the problem of concept drift is caused, and the concept drift refers to the fact that the data distribution changes along with the time, and therefore an original predictor or model cannot continuously and accurately predict the trend or attribute of new data. In the field of data stream prediction, concept drift is an important challenge because it affects the accuracy and effectiveness of the model.
In the case of concept drift, it is very important how to obtain relatively stable reasoning performance, for which some studies related to the concept drift problem are continuously discussed and explored; although various artificial intelligence based time series prediction models have been widely studied and applied, these models are unreliable for prediction of internet of things data streams with concept drift and can lead to overall performance degradation; how to effectively learn and infer from non-stationary data streams is still a great challenge, and particularly, how to adjust a model to adapt to concept drift is a problem which is concerned and tried to be solved by many researchers, in recent years, a method combining deep learning has a good effect on solving the problem, but the researches have certain limitations, and particularly, the researches only make limited assumptions on solving some types of concept drift modes; therefore, there is a lack of suitable methods and theories to effectively deal with the concept drift problem without affecting the accuracy and reliability of the final result.
As described above, the deep learning model trained offline cannot adapt well to the data flow with evolution distribution, and even if the model is updated periodically and new knowledge is "injected" into the model, the data between two adjacent updates must still face the problem of performance degradation; the higher the sampling rate of the data of the internet of things is, the more serious the problem of performance reduction when the model updating strategy is adopted is; furthermore, the expressive power of these models is limited by their predefined fixed structure; thus, the model will not be able to learn the extremely complex patterns that future data may have; the above problems in the current method will lead to cumulative bias decisions, and since the flow data analysis follows a one-pass processing scheme, each data point is processed only once without further inspection, such errors may be amplified continuously and cannot be repaired, which will cause chain reaction, and potentially significant adverse effects are generated on the monitoring systems of the internet of things such as smart oceans, smart cities, smart data centers and the like, and even the operation of the whole system is endangered.
For time series prediction, the main methods are roughly divided into two categories, namely statistical-based models and deep learning-based models. The statistical model is interpretable and the prediction of smoothed data is very good, provided that the time series data trends are consistent and do not change; such as ARIMA models and statistical machine learning methods GBDT, GBRT, etc., which require extensive data preprocessing and labor-intensive feature engineering and therefore cannot capture complex patterns in time series.
The emphasis of deep learning-based models in the field of time series data prediction is of increasing interest, for example, RNN-based models, seq2seq and LSNET have been shown to be effective in capturing short-term or long-term patterns; compared with the model, the recently proposed Transformer-based model shows more excellent performance in a long sequence, however, the Transformer-based model still needs to rely on abundant data to optimize specific parameters thereof, and cannot meet the accurate inference and prediction of stream data with concept drift; the method cannot effectively solve the concept drift problem, and cannot solve the cold start problem of the model.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides the method and the system for online learning of the IoT observation data, so that the hidden information between different hidden layers is learned, the inference accuracy of the model is improved, and meanwhile, the parameters between different hidden layers are dynamically adjusted through online learning, the problems of concept drift and cold start are solved, so that the model can continuously and accurately learn and update the data along with the time, thereby better adapting to the continuously changing data characteristics and modes and ensuring that the performance and the accuracy of the model are always kept in the optimal state.
In order to achieve the above object, one or more embodiments of the present invention provide the following technical solutions:
the invention provides an online learning method of IoT observation data in a first aspect;
an online learning method of IoT observation data, comprising:
initializing an online deep learning model according to the obtained initial time sequence observation data;
acquiring time sequence observation data generated by a sensor in real time, and forming an input data stream according to the time sequence observation data;
the online deep learning model processes an input data stream to generate and output a final prediction result;
in the process of processing the input data stream, the input data stream is immediately learned, and an online deep learning model is dynamically updated in real time;
the instant learning is based on the mean value and the variance of the data flow, learning data distribution, constructing quasi-normal distribution, reconstructing a new sample, realizing a variation attention network, and dynamically adjusting the model based on distribution difference, reconstruction difference and reasoning difference.
Further, the online deep learning model includes an encoder, a decoder, and a result fusion module, and takes the data stream as input, and outputs the prediction of the data value at the next time.
Further, the encoder performs two operations:
(1) Extracting hidden information between hidden layers, inputting the hidden information into a result fusion module, and calculating a final predicted value;
(2) Targeting a standard normal distributionCoding hidden information, and outputting mean value satisfying quasi-normal distribution through reparameterization setting
Figure SMS_1
And standard deviation->
Figure SMS_2
Further, the decoder randomly selects samples from the quasi-normal distribution generated by the encoder, and outputs data consistent with the original characteristic structure of the data stream.
Further, the result fusion module performs weighted fusion on the plurality of features extracted by the encoder to obtain a current predicted value;
the weight is used for dynamically adjusting the contribution degree of each hidden layer, and the weight of each hidden layer is calculated in real time after each input sample is processed through the variation attention network.
Further, the variation attention network calculates the weight of each hidden layer according to the relationship between each hidden layer in the encoder and the necessary degree of each substructure.
Further, the loss function of the online deep learning model comprises:
reconstructing loss, measuring the difference between the true value and the reconstructed value;
KL loss, which measures the difference between the quasi-normal distribution and the standard normal distribution;
predicting loss, and measuring the difference between the true value and the current predicted value;
and measuring the difference between the final predicted value and the true value through the weighted fusion of the three losses.
A second aspect of the present invention provides an online learning system of IoT observation data.
An online learning system of IoT observation data comprises an initialization module, an acquisition module, a prediction module and a learning module:
an initialization module configured to: initializing an online deep learning model according to the acquired initial time sequence observation data;
an acquisition module configured to: acquiring time sequence observation data generated by a sensor in real time, and forming an input data stream according to the time sequence observation data;
a prediction module configured to: the online deep learning model processes an input data stream to generate and output a final prediction result;
a learning module configured to: in the process of processing the input data stream, the input data stream is immediately learned, and an online deep learning model is dynamically updated in real time;
the instant learning is based on the mean value and the variance of the data flow, learning data distribution, constructing quasi-normal distribution, reconstructing a new sample, realizing a variation attention network, and dynamically adjusting the model based on distribution difference, reconstruction difference and reasoning difference.
A third aspect of the present invention provides a computer-readable storage medium, on which a program is stored, which when executed by a processor, implements the steps in an online learning method of IoT observation data according to the first aspect of the present invention.
A fourth aspect of the present invention provides an electronic device, including a memory, a processor, and a program stored on the memory and executable on the processor, where the processor implements the steps in the method for online learning of IoT observation data according to the first aspect of the present invention when executing the program.
The above one or more technical solutions have the following beneficial effects:
the invention designs a self-adaptive online deep learning model which is used for learning a time series data stream of an Internet of things sensor in real time; the model can start from a shallow network, has higher convergence rate, has higher generalization capability on the Internet of things observation data with concept drift, can automatically evolve along with the change of observation time sequence data, can flexibly adjust the complexity and the representation capability of the model, effectively solves the problem of concept drift in the Internet of things observation data stream, and can be directly applied to process the data stream in a cold start mode; relevant verification is carried out on the data collected by the reference data set and the monitoring system, and the result shows that the real-time accuracy of the reasoning result can be obviously improved compared with the traditional method through the online deep learning model.
Advantages of additional aspects of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, are included to provide a further understanding of the invention, and are included to illustrate an exemplary embodiment of the invention and not to limit the invention.
FIG. 1 is a flow chart of the method of the first embodiment.
Fig. 2 is a structural diagram of an online deep learning model of the first embodiment.
Fig. 3 is a schematic diagram of an online learning operation mechanism according to a first embodiment.
Fig. 4 is a schematic diagram illustrating the evolution of weights of the layers of the inference network according to the first embodiment.
Fig. 5 is a system configuration diagram of the second embodiment.
Detailed Description
The invention is further described with reference to the following figures and examples.
In order to solve the concept drift problem without influencing the accuracy and reliability of a final result, a method for quickly and real-timely learning a data stream is needed, the method is generally called as 'online learning' or 'lifelong learning', the online learning is a learning mode of a time sequence data stream, most of the existing schemes have defects in training speed and reasoning speed, real-time performance cannot be really realized, and the effect of online learning is influenced. For deep learning models, its parameters can be simply updated periodically, however, this approach has the following disadvantages:
1. only the adjustment of different parameters among the online multiple tasks is focused on, the importance of the parameters of different hidden layers is ignored, and the model cannot be adjusted flexibly.
2. The complexity of the model cannot be adjusted, namely, the model selection cannot be carried out, and in an online learning environment, a model is needed to learn the data flow from shallow to deep, so that the problems of overfitting when the data volume is small (cold start stage) and underfitting when the data volume is large are solved.
Therefore, the invention designs an online learning method called as 'instant learning', which can learn hidden information between different hidden layers, meet the final reasoning accuracy of the model, simultaneously meet the dynamic adjustment of parameters between different hidden layers, and realize the real-time dynamic adjustment of the model.
The design concept of the invention follows the following existing online learning paradigm:
with respect to the time-series observation data,
Figure SMS_3
generated by a sensor in real time, wherein T represents the number of data,x t the data representing the time at the t-th instant,Xrepresenting stream data with unbounded T. In an online learning environment, this time series is not available all at once, but is processed one after the other in the form of a stream, for each time t only historical data
Figure SMS_4
Is accessible via a sensor, but can be determined by historical data +>
Figure SMS_5
To predict a value of ^ greater or greater at time t>
Figure SMS_6
And then observe and reveal the true valuesx t
Further, calculating the instantaneous loss of the current time t through the residual error between the real value and the predicted value; of course, for a particular application scenario, the time series
Figure SMS_7
The value at the current time t can also be predicted in the form of a window; according to the predicted value>
Figure SMS_8
Calculating the current instantaneous loss with the corresponding real value: />
Figure SMS_9
The algorithm can meet and realize the online training of mass data by training each input data by using the loss and gradient iterative update generated by each input training sample.
The existing online learning framework can only realize data improvement, but the framework provided by the invention can also realize data generation and learn the distribution of potential variables; and learning the mean value and the variance of the quasi-normal distribution by constructing the quasi-normal distribution, further generating a new sample, and carrying out the following inference prediction based on a fusion module.
Example one
In one or more embodiments, a method for online learning of IoT observation data is disclosed, and fig. 1 is a flow chart of the method, as shown in fig. 1, comprising the following steps:
step S1: and initializing an online deep learning model according to the acquired initial time sequence observation data.
Step S2: and acquiring time sequence observation data generated by the sensor in real time, and forming an input data stream according to the time sequence observation data.
And step S3: the online deep learning model processes an input data stream to generate and output a final prediction result.
And step S4: in the process of processing the input data stream, the input data stream is subjected to instant learning, and an online deep learning model is dynamically updated in real time, wherein the instant learning is to learn data distribution based on the mean value and the variance of the data stream, construct quasi-normal distribution, reconstruct a new sample, realize a variation attention network, and dynamically adjust the model based on distribution difference, reconstruction difference and reasoning difference.
The following describes in detail an implementation procedure of the online learning method of IoT observation data according to this embodiment.
FIG. 2 is a block diagram of an online deep learning model, as shown in FIG. 2, which includes an encoder, a decoder, and a result fusion module, with a data stream as an input, outputting a prediction of a data value at a next time.
The encoder is connected with the result fusion module and used for predicting the data value, and the decoder is introduced and matched with the encoder and the result fusion module and used for dynamically updating the online deep learning model.
Encoder for encoding a video signal
Based on deep neural network construction, the method is composed of a plurality of hidden layers, and the maximum network capacity can be k +1 hidden layers in consideration of available computing power
Figure SMS_10
The method carries out multi-scale learning in a data stream mode from sample to sample, and a learned hidden layer has two functions:
(1) Computing prediction vectors and attention weights for hidden layers
In order to fully utilize information in different hidden layers of an encoder, hidden information in each hidden layer is extracted to obtain k +1 prediction vectorsp i Inputting the prediction vector into a result fusion module, and obtaining a final prediction result through weighted fusion, wherein the prediction vectorp i The calculation formula of (2) is as follows:
Figure SMS_11
wherein ,pred()is a feed-forward network that translates the hidden layer representation into local prediction,h i representing the ith hidden layer.
In order to realize dynamic adjustment of the contribution weight of each hidden layer in the final prediction, a variation attention network capable of meeting the above requirements is designed to learn the attention weight of the hidden layer, that is:
Figure SMS_12
wherein ,attention()is a feed-forward network with a Softmax function, which is primarily responsible for computing the weights of each hidden layer, normalized to
Figure SMS_13
The objective is to determine the relationship between each hidden layer in the overall model and how much the substructures are necessary in the final prediction.
In order to realize the iterative update of the model, an important technology of the heavy weight operation is realized based on the above, that is, the real-time dynamic adjustment of the weight is realized, and through the heavy weight operation, the weights of all hidden layers are recalculated and normalized after each input sample processing, which is completed after each iteration.
(2) Constructing a quasi-normal distribution
The encoder attempts to transcode the distribution of all the characteristic information carried by the sample into a quasi-normal distribution, which is referred to herein as a potential distribution, although it is usually not coded into a standard normal distribution (target distribution), although it is coded with the standard normal distribution as a target; in this process, the encoder directly outputs the mean value satisfying the quasi-normal distribution
Figure SMS_14
And standard deviation->
Figure SMS_15
As a result of the encoding, i.e. the mean value generated in the encoding process +>
Figure SMS_16
And standard deviation->
Figure SMS_17
Not calculated by mean or standard deviation definition, but output directly from the encoder.
To ensure mean value of encoder output
Figure SMS_18
And standard deviation>
Figure SMS_19
And (3) satisfying the standard normal distribution, setting a penalty term in the loss function, and once the fed-back quasi normal distribution (actual distribution) based on the mean and the standard deviation has a difference with the standard normal distribution (target distribution), further adjusting the model of the embodiment under the influence of a penalty mechanism.
Therefore, in the actual operation flow of the algorithm, the encoder is responsible for outputting the mean value and the standard deviation, the loss function ensures that the mean value and the standard deviation conform to a certain quasi-normal distribution, which is equivalent to encoding the original data to the quasi-normal distribution direction, outputting the mean value and the standard deviation of the quasi-normal distribution, and randomly sampling to generate the data
Figure SMS_20
Subsequent decoder operations are performed.
At the same time, to guarantee sampling data
Figure SMS_21
Is matched to the input sample, it is first assumed that there is a->
Figure SMS_22
The posterior probability P (Z | X) for X, and further assume that this probability distribution is a normal distribution: />
Figure SMS_23
Then the sample data is pick>
Figure SMS_24
The probability distribution P (Z) of (A) is:
Figure SMS_25
thus, P (Z) is a priori distributed and
Figure SMS_26
the posterior distribution conforms to the standard normal distribution.
In addition, from a standard normal distribution
Figure SMS_27
In a randomly selected sample->
Figure SMS_28
And inputs the samples to a decoder. Since random sampling exists in the process, the gradient information flow in the framework is broken, and back propagation cannot be performed through the traditional chain rule, so that the back propagation of the model of the embodiment is designed to be completed by using a unique re-parameterization skill so as to meet the characteristics of a typical neural network. In reparameterization, this embodiment uses random sampling to generate a set of variables that satisfy a normal distribution, slave->
Figure SMS_29
Middle sample z, equivalent to slave->
Figure SMS_30
The middle of the samples is the epsilon, then make->
Figure SMS_31
Therefore, a gradient descent method may be applied; finally, the decoder decodes z and outputs reconstructed data for sample x.
Thus, in the encoding stage of the encoder, each input sample is calculated
Figure SMS_32
In (d) is based on the mean value>
Figure SMS_33
And variance>
Figure SMS_34
For constructing a quasi-normal distribution.
Decoder
Randomly selecting samples from the quasi-normal distribution generated by the encoder, and outputting data consistent with the original characteristic structure of the data stream, i.e. restoring the original input structure by the decoder to generate new data samples
Figure SMS_35
The data sampled from the potential variable space is expected to >>
Figure SMS_36
Follow the probability distribution of the original data X such that the sampled data->
Figure SMS_37
New data generated
Figure SMS_38
The probability distribution of the original data is also followed, and the larger the latent variable space of the embodiment is, the more information about the probability distribution can be kept relatively.
Result fusion module
The final prediction of the model is based on the prediction vector of each hidden layer
Figure SMS_39
Performing weighted fusion to obtain a final prediction result, namely a prediction value at the next moment, and performing weighted fusion based on a variational attention mechanism, wherein the weighted fusion specifically comprises the following steps:
Figure SMS_40
Figure SMS_41
wherein ,
Figure SMS_42
is predicted from the representation of the i-th hidden layer>
Figure SMS_43
Is the weight of the i-th hidden layer,
Figure SMS_44
is/>
Figure SMS_45
The parameter (c) of (c).
The above is the formation of the online deep learning model, and after the online deep learning model is constructed, the online deep learning model is divided into two flows: real-time prediction and online learning.
Fig. 3 is a schematic diagram of online learning, as shown in fig. 3, in the process of processing samples of a data stream input into a linear deep learning model one by one, characteristics of the samples are continuously learned, and parameters of the model are updated through changes of the parameters, so as to realize dynamic real-time adjustment of the model, as shown in fig. 4, three hidden layers are set as verification findings through the embodiment, weights of the hidden layer-1, the hidden layer-2 and the hidden layer-3 are continuously changed along with the increase of the number of the samples of the data stream, and weights among the layers are also changed at different stages, which can better explain that the embodiment can dynamically and accurately capture changes when dealing with a concept drift problem; in the learning process, the loss and gradient generated by each input sample are updated iteratively, and each input sample is trained to meet and realize the online training of mass data, wherein the loss is quantified by a loss function, and the final total loss is calculated by using three losses: loss of reconstruction, loss of KL, and loss of prediction.
(1) Predicting loss
Unlike traditional neural network models, the prediction of the model is not the final representation from the last layer, and the final prediction vector is obtained by weighting the prediction vector obtained from each hidden layer to ensure absolute high accuracy.
Thus, for the observation of streaming IoT data, the prediction penalty Lt for the current prediction phase time t is:
Figure SMS_46
wherein, y is the true value,
Figure SMS_47
is a predicted value, is greater than or equal to>
Figure SMS_48
Representing the predicted instantaneous loss function at time t.
The prediction loss function here may be any convex function, and the present embodiment uses a mean square loss function (MSE), where in the original backward propagation, the error derivative is backward propagated from the last output layer, unlike the original backward propagation, the error derivative is backward propagated from each predictor pj, such as:
Figure SMS_49
wherein ,
Figure SMS_50
parameters of the j-th layer network at time t and at time t +1, respectively>
Figure SMS_51
Indicates a renewal rate, is greater than or equal to>
Figure SMS_52
Represents a gradient, <' > based on>
Figure SMS_53
Indicates the predictor value of each layer, is>
Figure SMS_54
Indicating a loss at the current time.
Calculating the gradient of each layer of parameters according to the equation, updating the parameters of the jth layer of the embodiment model in time t, and calculating the final prediction according to the gradient of each layer of parameters; it is worth to be noted that when the data sample is small, the model can realize fast convergence by fast learning the shallow network of the hidden information; using a variable attention mechanism may allow dynamic adjustment of different depth networks as data increases
Figure SMS_55
Thus conforming to the scalability of the model capacity.
(2) Loss of KL
KL Loss (Kullback-Leibler Loss) is a measure of the difference between a quasi-normal distribution and a normal distribution, i.e. the mean
Figure SMS_56
And variance->
Figure SMS_57
The difference between them.
Given having a known mean value
Figure SMS_58
And variance->
Figure SMS_59
K, KL loss is defined as:
Figure SMS_60
wherein ,
Figure SMS_61
representing the mean and variance generated for each dimension.
(3) Loss of reconstruction
The reconstruction loss here uses the standard L2 loss, i.e., MSE, assuming that when there are n samples, there are the original value x and the reconstructed value
Figure SMS_62
The reconstruction loss is defined as:
Figure SMS_63
finally, to enhance the expressive power and prediction accuracy of the model, the total loss of this embodiment consists of the above three loss functions, with coefficients of
Figure SMS_64
The method specifically comprises the following steps:
Figure SMS_65
thus, at the end of each round, the algorithm will self-adjust according to the instantaneous loss of the next few rounds, which also solves the problem of model flexibility in the concept drift scenario.
And real-time prediction, namely inputting a data stream into an online deep learning model, obtaining a prediction vector corresponding to each hidden layer through the coding of the hidden layer of a coder, and performing weighted fusion on the prediction vectors to obtain a final prediction result, namely a prediction value at the next moment.
Example two
The embodiment discloses an online learning system of IoT observation data;
as shown in fig. 5, an online learning system of IoT observation data includes an initialization module, an acquisition module, a prediction module, and a learning module:
an initialization module configured to: initializing an online deep learning model according to the obtained initial time sequence observation data;
an acquisition module configured to: acquiring time sequence observation data generated by a sensor in real time, and forming an input data stream according to the time sequence observation data;
a prediction module configured to: the online deep learning model processes an input data stream to generate and output a final prediction result;
a learning module configured to: in the process of processing the input data stream, the input data stream is immediately learned, and an online deep learning model is dynamically updated in real time;
the instantaneous learning is based on the mean value and the variance of data flow, learning data distribution, constructing quasi-normal distribution, reconstructing a new sample, realizing a variation attention network, and dynamically adjusting a model based on distribution difference, reconstruction difference and reasoning difference.
EXAMPLE III
An object of the present embodiment is to provide a computer-readable storage medium.
A computer-readable storage medium, on which a computer program is stored, which when executed by a processor, implements the steps in an online learning method of IoT observation data according to one embodiment of the present disclosure.
Example four
An object of the present embodiment is to provide an electronic device.
An electronic device comprising a memory, a processor and a program stored on the memory and executable on the processor, wherein the processor when executing the program implements the steps of the method for online learning of IoT observation data according to the first embodiment of the present disclosure.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. A method for online learning of IoT observation data, comprising:
initializing an online deep learning model according to the obtained initial time sequence observation data;
acquiring time sequence observation data generated by a sensor in real time, and forming an input data stream according to the time sequence observation data;
the online deep learning model processes an input data stream to generate and output a final prediction result;
in the process of processing the input data stream, the input data stream is immediately learned, and an online deep learning model is dynamically updated in real time;
the instantaneous learning is based on the mean value and the variance of data flow, learning data distribution, constructing quasi-normal distribution, reconstructing a new sample, realizing a variation attention network, and dynamically adjusting a model based on distribution difference, reconstruction difference and reasoning difference.
2. The method of claim 1, wherein the online deep learning model comprises an encoder, a decoder, and a result fusion module, and the data stream is used as input to output a prediction of a next-time data value.
3. The method of online learning of IoT observation data in accordance with claim 2, wherein the encoder performs two part operations:
(1) Extracting hidden information between hidden layers, inputting the hidden information into a result fusion module, and calculating a final predicted value;
(2) Coding hidden information by taking standard normal distribution as a target, and outputting a mean value meeting the quasi-normal distribution through reparameterization setting
Figure QLYQS_1
And standard deviation->
Figure QLYQS_2
4. The method of claim 2, wherein the decoder randomly selects samples from the quasi-normal distribution generated by the encoder and outputs data that is consistent with the original feature structure of the data stream.
5. The method of claim 2, wherein the result fusion module performs weighted fusion on the plurality of features extracted by the encoder to obtain the current predicted value;
the weight is used for dynamically adjusting the contribution degree of each hidden layer, and the weight of each hidden layer is calculated in real time after each input sample is processed through the variation attention network.
6. The method of claim 5, wherein the variational attention network calculates the weight of each hidden layer according to the relationship between each hidden layer in the encoder and the necessity degree of each substructure.
7. The method of online learning of IoT observation data recited in claim 1, wherein the loss function of the online deep learning model comprises:
reconstructing loss, measuring the difference between the true value and the reconstructed value;
KL loss, which measures the difference between the quasi-normal distribution and the standard normal distribution;
predicting loss, and measuring the difference between the true value and the current predicted value;
and measuring the difference between the final predicted value and the true value through the weighted fusion of the three losses.
8. An online learning system of IoT observation data is characterized by comprising an initialization module, an acquisition module, a prediction module and a learning module:
an initialization module configured to: initializing an online deep learning model according to the obtained initial time sequence observation data;
an acquisition module configured to: acquiring time sequence observation data generated by a sensor in real time, and forming an input data stream according to the time sequence observation data;
a prediction module configured to: the online deep learning model processes an input data stream to generate and output a final prediction result;
a learning module configured to: in the process of processing the input data stream, the input data stream is immediately learned, and an online deep learning model is dynamically updated in real time;
the instant learning is based on the mean value and the variance of the data flow, learning data distribution, constructing quasi-normal distribution, reconstructing a new sample, realizing a variation attention network, and dynamically adjusting the model based on distribution difference, reconstruction difference and reasoning difference.
9. An electronic device, comprising:
a memory for non-transitory storage of computer readable instructions; and
a processor for executing the computer readable instructions,
wherein the computer readable instructions, when executed by the processor, perform the method of any of claims 1-7.
10. A storage medium storing non-transitory computer-readable instructions, wherein the non-transitory computer-readable instructions, when executed by a computer, perform the instructions of the method of any one of claims 1-7.
CN202310214205.9A 2023-03-08 2023-03-08 Online learning method and system for IoT observation data Active CN115879569B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310214205.9A CN115879569B (en) 2023-03-08 2023-03-08 Online learning method and system for IoT observation data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310214205.9A CN115879569B (en) 2023-03-08 2023-03-08 Online learning method and system for IoT observation data

Publications (2)

Publication Number Publication Date
CN115879569A true CN115879569A (en) 2023-03-31
CN115879569B CN115879569B (en) 2023-05-23

Family

ID=85762015

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310214205.9A Active CN115879569B (en) 2023-03-08 2023-03-08 Online learning method and system for IoT observation data

Country Status (1)

Country Link
CN (1) CN115879569B (en)

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109102126A (en) * 2018-08-30 2018-12-28 燕山大学 One kind being based on depth migration learning theory line loss per unit prediction model
US20200160207A1 (en) * 2018-11-15 2020-05-21 General Electric Company Automated model update based on model deterioration
CA3097655A1 (en) * 2019-10-30 2021-04-30 Royal Bank Of Canada System and method for machine learning architecture with differential privacy
CN112836808A (en) * 2021-03-22 2021-05-25 哈尔滨理工大学 Time-space coupling learning method for deep learning-based distributed parameter system modeling
WO2021121354A1 (en) * 2019-12-19 2021-06-24 Beijing Didi Infinity Technology And Development Co., Ltd. Model-based deep reinforcement learning for dynamic pricing in online ride-hailing platform
US20220036201A1 (en) * 2020-08-02 2022-02-03 Actimize Ltd. Real drift detector on partial labeled data in data streams
CN114186668A (en) * 2021-12-09 2022-03-15 山东省计算中心(国家超级计算济南中心) Online deep learning method, system and equipment for Internet of things data flow prediction
CN114676622A (en) * 2022-02-17 2022-06-28 国网冀北电力有限公司承德供电公司 Short-term photovoltaic power prediction method based on self-encoder deep learning model
CN114881331A (en) * 2022-05-09 2022-08-09 西安交通大学 Learner abnormal learning state prediction method facing online education
CN114911831A (en) * 2022-05-12 2022-08-16 大连大学 Agricultural real-time early warning method and system based on Flink streaming calculation and deep learning
CN114970674A (en) * 2022-04-15 2022-08-30 北京大学 Time sequence data concept drift adaptation method based on relevance alignment
CN115293662A (en) * 2022-10-10 2022-11-04 山东省计算中心(国家超级计算济南中心) Parallel and distributed ocean observation data fusion intelligent calculation method and system
CN115731396A (en) * 2022-11-28 2023-03-03 辽宁工程技术大学 Continuous learning method based on Bayesian variation inference

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109102126A (en) * 2018-08-30 2018-12-28 燕山大学 One kind being based on depth migration learning theory line loss per unit prediction model
US20200160207A1 (en) * 2018-11-15 2020-05-21 General Electric Company Automated model update based on model deterioration
CA3097655A1 (en) * 2019-10-30 2021-04-30 Royal Bank Of Canada System and method for machine learning architecture with differential privacy
WO2021121354A1 (en) * 2019-12-19 2021-06-24 Beijing Didi Infinity Technology And Development Co., Ltd. Model-based deep reinforcement learning for dynamic pricing in online ride-hailing platform
US20220036201A1 (en) * 2020-08-02 2022-02-03 Actimize Ltd. Real drift detector on partial labeled data in data streams
CN112836808A (en) * 2021-03-22 2021-05-25 哈尔滨理工大学 Time-space coupling learning method for deep learning-based distributed parameter system modeling
CN114186668A (en) * 2021-12-09 2022-03-15 山东省计算中心(国家超级计算济南中心) Online deep learning method, system and equipment for Internet of things data flow prediction
CN114676622A (en) * 2022-02-17 2022-06-28 国网冀北电力有限公司承德供电公司 Short-term photovoltaic power prediction method based on self-encoder deep learning model
CN114970674A (en) * 2022-04-15 2022-08-30 北京大学 Time sequence data concept drift adaptation method based on relevance alignment
CN114881331A (en) * 2022-05-09 2022-08-09 西安交通大学 Learner abnormal learning state prediction method facing online education
CN114911831A (en) * 2022-05-12 2022-08-16 大连大学 Agricultural real-time early warning method and system based on Flink streaming calculation and deep learning
CN115293662A (en) * 2022-10-10 2022-11-04 山东省计算中心(国家超级计算济南中心) Parallel and distributed ocean observation data fusion intelligent calculation method and system
CN115731396A (en) * 2022-11-28 2023-03-03 辽宁工程技术大学 Continuous learning method based on Bayesian variation inference

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
潘志松;唐斯琪;邱俊洋;胡谷雨;: "在线学习算法综述", 数据采集与处理 *

Also Published As

Publication number Publication date
CN115879569B (en) 2023-05-23

Similar Documents

Publication Publication Date Title
Yu et al. Probabilistic prediction of regional wind power based on spatiotemporal quantile regression
CN109165664B (en) Attribute-missing data set completion and prediction method based on generation of countermeasure network
Sehovac et al. Forecasting building energy consumption with deep learning: A sequence to sequence approach
Ma et al. A hybrid attention-based deep learning approach for wind power prediction
CN108900346B (en) Wireless network flow prediction method based on LSTM network
CN112364975B (en) Terminal running state prediction method and system based on graph neural network
CN113177633B (en) Depth decoupling time sequence prediction method
CN114529051A (en) Long-term power load prediction method based on hierarchical residual self-attention neural network
CN114493014A (en) Multivariate time series prediction method, multivariate time series prediction system, computer product and storage medium
CN114021836B (en) Multi-variable reservoir water inflow prediction system based on different angle fusion, training method and application
CN114817773A (en) Time sequence prediction system and method based on multi-stage decomposition and fusion
CN116432697A (en) Time sequence prediction method integrating long-term memory network and attention mechanism
Basterrech et al. Evolutionary Echo State Network: A neuroevolutionary framework for time series prediction
WO2023179609A1 (en) Data processing method and apparatus
CN117424208A (en) Probability photovoltaic power generation prediction method for comprehensive missing data
CN115713044B (en) Method and device for analyzing residual life of electromechanical equipment under multi-condition switching
CN115879569A (en) IoT observation data online learning method and system
CN116485025A (en) Dual-migration prediction model generation method and waste household appliance recovery amount prediction method
CN112667394B (en) Computer resource utilization rate optimization method
CN114298411A (en) GDP prediction method based on N-BEATS
CN114239934A (en) Power grid load sequence prediction method and system based on generative countermeasure network
Li et al. Mutual information variational autoencoders and its application to feature extraction of multivariate time series
Zhan et al. Learning Latent ODEs with Graph RNN for Multi-Channel Time Series Forecasting
CN110648183A (en) Grey correlation and QGNN-based resident consumption price index prediction method
Ma et al. A novel spatial–temporal generative autoencoder for wind speed uncertainty forecasting

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant