CN111191742A - Sliding window length self-adaptive adjustment method for multi-source heterogeneous data stream - Google Patents

Sliding window length self-adaptive adjustment method for multi-source heterogeneous data stream Download PDF

Info

Publication number
CN111191742A
CN111191742A CN202010087229.9A CN202010087229A CN111191742A CN 111191742 A CN111191742 A CN 111191742A CN 202010087229 A CN202010087229 A CN 202010087229A CN 111191742 A CN111191742 A CN 111191742A
Authority
CN
China
Prior art keywords
data
free energy
input
matrix
hidden layer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010087229.9A
Other languages
Chinese (zh)
Inventor
王为
张梦君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Tianjin Normal University
Original Assignee
Tianjin Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin Normal University filed Critical Tianjin Normal University
Priority to CN202010087229.9A priority Critical patent/CN111191742A/en
Publication of CN111191742A publication Critical patent/CN111191742A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The invention discloses a sliding window length self-adaptive adjusting method for multi-source heterogeneous data streams. Firstly, an expansion model of a Gaussian input limited Boltzmann machine under matrix input and multi-modal input is provided to realize feature extraction and feature fusion of heterogeneous data streams. And secondly, judging the probability distribution change of the data stream through a free energy function generated by the limited Boltzmann machine and an expansion model thereof. Finally, Hough's boundary is adopted to ensure that the data stream change between adjacent windows can be detected in time. The invention measures the change of the data stream by comparing the free energy values of the data stream between adjacent windows, adaptively adjusts the length of the sliding window, and divides the data stream into data blocks with different sizes for batch processing.

Description

Sliding window length self-adaptive adjustment method for multi-source heterogeneous data stream
Technical Field
The invention belongs to the technical field of data mining, and particularly relates to a sliding window adaptive adjustment algorithm for multi-source heterogeneous data streams.
Background
With the continuous growth of heterogeneous sensor networks, a large amount of data is continuously generated, the data size is huge, and the heterogeneous sensor networks have various characteristics such as dynamics, diversity and high-dimensional property. Such as: li Na and the like adopt a sliding window technology in a data stream clustering algorithm, and provide an incremental data stream processing method and a data stream clustering algorithm; lughofer proposes an incremental data stream clustering algorithm, and adjusts the clustering number by adopting separation and combination strategies in the process of continuously increasing data stream clustering. The sliding window-based data stream processing technology can relieve the problem of large number of data streams to a certain extent, but a fixed sliding window is often adopted, dynamic change of the data streams is ignored, and direct influence on analysis of the data streams is generated due to the change of the data streams.
When the input data is isomorphic data, the above method can adjust the sliding window by analyzing the changing characteristics of the data stream. However, most of data in the heterogeneous sensor network comes from a plurality of data sources, the data types thereof are various, and different data types contain different features, for example, text data contains a large number of discrete character vectors and picture data is composed of pixel features. Therefore, sliding window adjustment under multi-source heterogeneous input data requires fusion analysis of the data first and then adjustment of the change of the sliding window.
To this end, the present invention applies adaptive adjustment of sliding window length to multi-source heterogeneous data streams. The invention adopts a data fusion technology, fuses heterogeneous data streams, analyzes the change condition of the data streams, and introduces a Hough boundary to detect the change of the data streams in time, thereby realizing the self-adaptive adjustment of the length of a sliding window. In addition, window adjustment errors caused by single-mode information loss can be effectively avoided through fusion analysis of all mode information of the data stream.
Disclosure of Invention
The invention aims to detect the change in the multi-source heterogeneous data stream and divide the data stream in real time, and therefore, the invention provides a sliding window length self-adaptive adjustment algorithm for the multi-source heterogeneous data stream.
In order to achieve the purpose, the invention discloses the following technical contents:
a sliding window length self-adaptive adjusting method for multi-source heterogeneous data streams is characterized by comprising the following steps:
step 1, performing feature extraction and feature fusion on heterogeneous data streams through an extended model of a Gauss input limited Boltzmann machine under matrix variables and multi-modal input; wherein each node of the visible layer of the matrix variable limited Boltzmann machine follows Gaussian distribution with variance of
Figure 166204DEST_PATH_IMAGE001
The energy function of which can be expressed as
Figure 473689DEST_PATH_IMAGE002
Figure 567547DEST_PATH_IMAGE003
The representation matrix variables may see the layer input data,
Figure 567864DEST_PATH_IMAGE004
represents the output data of the hidden layer(s),
Figure 114383DEST_PATH_IMAGE005
for the input data after the normalization, the data is processed,
Figure 959979DEST_PATH_IMAGE006
the outer product of (a) may result in a weight matrix between the visible layer and the hidden layer,
Figure 111606DEST_PATH_IMAGE007
in order to be able to see the biasing of the layers,
Figure 17245DEST_PATH_IMAGE008
a bias to a hidden layer; according to the matrix input, the probability of each node of the hidden layer can be calculated
Figure 48130DEST_PATH_IMAGE009
Obtaining an activation output matrix of the hidden layer node according to the probability, wherein the process is a characteristic extraction process; the principle of the multi-mode input matrix variable limited Boltzmann machine is similar to that of the matrix variable limited Boltzmann machine, and hidden layer output can be obtained by two visible layer inputs
Figure 697417DEST_PATH_IMAGE010
Namely, feature fusion is realized, wherein
Figure 765868DEST_PATH_IMAGE011
For the input of the visible layer or layers,
Figure 842408DEST_PATH_IMAGE012
in order to hide the layer output,
Figure 97940DEST_PATH_IMAGE013
the connection weights between the visible layer and the hidden layer are synthesized separately,
Figure 550918DEST_PATH_IMAGE014
an offset for the hidden layer;
step 2, constructing a free energy function of the multi-source heterogeneous data stream through the model obtained by training in the step 1, and measuring the probability distribution change of the data stream according to the free energy change rate between adjacent windows;
the probability distributions of free energy and data flow are related as follows:
Figure 739454DEST_PATH_IMAGE015
wherein
Figure 986896DEST_PATH_IMAGE016
Representing input data
Figure 732653DEST_PATH_IMAGE017
The probability density distribution of (a) is,
Figure 927006DEST_PATH_IMAGE018
the free energy of the input data is represented,
Figure 704469DEST_PATH_IMAGE019
for a distribution function, also called a normalization constant, for a trained restricted boltzmann machine, the value is kept unchanged, so that the free energy of input data can reflect the probability distribution condition of the input data;
and 3, comparing the free energy change rate between adjacent windows with a threshold value determined by a Hough boundary, and adjusting the length of the sliding window according to the result.
The training of the matrix variable limited boltzmann machine under the gaussian input in the step 1 comprises the following steps:
(1) calculating the probability of each node of the hidden layer according to the matrix input
Figure 122812DEST_PATH_IMAGE020
Obtaining an activation output matrix of the hidden layer node according to the probability;
(2) calculating the outer product of the matrix input value and the activation output value, and defining the outer product as a positive gradient;
(3) reconstructing the activation value of the visual layer according to the activation output matrix obtained in the step (1)
Figure 618515DEST_PATH_IMAGE021
And repeating the step (1) to obtain the corresponding activation output of the hidden layer;
(4) calculating the outer product of the reconstructed input value of the visual layer and the output matrix of the corresponding hidden layer, and defining the outer product as a negative gradient;
(5) and updating the parameters of the model at a certain learning rate according to the difference value of the positive gradient and the negative gradient.
The training process of the multi-modal input matrix variable limited Boltzmann machine in the step 1 is similar to the training process of the matrix variable limited Boltzmann machine under 2-Gauss input, and only the probability distribution of a hidden layer of the matrix variable limited Boltzmann machine needs to be determined by two characteristic inputs, namely
Figure 413296DEST_PATH_IMAGE022
The free energy function in step 2 of the present invention can be obtained from the following relationship:
the probability distributions of free energy and input data are related as follows:
Figure 310845DEST_PATH_IMAGE023
the probability distribution of the input data can be expressed as
Figure 897159DEST_PATH_IMAGE024
Thus the free energy function can be expressed as
Figure 817842DEST_PATH_IMAGE025
Wherein
Figure 150734DEST_PATH_IMAGE026
Which represents the input data, is,
Figure 902790DEST_PATH_IMAGE027
the output data is represented by a representation of,
Figure 662935DEST_PATH_IMAGE028
representing the corresponding probability distribution of the data stream,
Figure 867652DEST_PATH_IMAGE029
in order to be a function of the allocation,
Figure 4235DEST_PATH_IMAGE030
is the free energy function corresponding to the input function,
Figure 610797DEST_PATH_IMAGE031
as a function of the energy of the entire model
The free energy change rate in step 2 is defined as the ratio of the difference of the free energy of the data streams in two adjacent windows to the free energy value of the previous window; when the data between adjacent windows has no obvious change, the probability distribution of the data streams is similar, and the free energy values are not greatly different, so that the free energy change rate is close to zero, otherwise, the free energy change rate is a larger value.
The threshold value of window adjustment in step 3 of the invention can be obtained through a Hough boundary:
according to the Hough inequality in
Figure 533054DEST_PATH_IMAGE032
Should be less than or equal to the window-adjusted threshold at confidence level ofWherein
Figure 165341DEST_PATH_IMAGE034
Represents the maximum variation range of the free energy variation rate,
Figure 626409DEST_PATH_IMAGE035
is the total number of samples.
The adaptive adjustment of the length of the sliding window in step 3 comprises the following two conditions:
(1) when the free energy change rate is less than or equal to the threshold value, the data stream between adjacent windows is not different or the change of the data stream is in a negligible range, and when the maximum size set by the algorithm is not exceeded, the window is expanded to be twice of the original size;
(2) when the free energy change rate is greater than the threshold, indicating a significant difference in data flow between adjacent windows, the window size should be reduced to the minimum window size set by the algorithm.
The invention further discloses the application of the sliding window length self-adaptive adjustment algorithm for the multi-source heterogeneous data stream in the aspect of data blocking; the experimental result proves that compared with single-mode data, the fusion analysis of the multi-mode data has higher sliding window adjustment accuracy.
The invention partitions the data stream by self-adaptive adjustment of the length of the sliding window of the multi-source heterogeneous data stream, and can improve the efficiency of data stream processing in the heterogeneous data network: when the data flow is abnormal, the window size is reduced, and abnormal data can be processed in time; when the data stream is not obviously changed, the window size is continuously enlarged, similar data can be uniformly processed at the moment, and the data processing efficiency is improved.
The invention is described in more detail below:
step 1, preprocessing multi-source heterogeneous signals respectively, converting picture signals into matrix data according to pixel values, and converting sound signals into vector data according to Mel frequency cepstrum coefficients;
step 2, respectively extracting characteristics of heterogeneous data by using a matrix variable limited Boltzmann machine and a matrix variable limited Boltzmann machine under Gaussian input to obtain corresponding characteristic vectors;
step 3, performing feature fusion on the features extracted from the heterogeneous data by adopting a multi-mode input limited Boltzmann machine;
step 4, constructing a free energy function of the heterogeneous data stream according to the models obtained in the step 2 and the step 3, wherein the free energy function can measure the probability distribution change condition of the data stream;
and 5, calculating the free energy change rate of the data stream in the adjacent window according to the free energy function, and combining the free energy change rate with the Hough boundary to realize the self-adaptive adjustment of the length of the sliding window, thereby realizing the blocking of the data stream.
In step 2, the matrix variable limited boltzmann machine under the gaussian input is an extension of the gaussian limited boltzmann machine, and an input layer and a hidden layer of the matrix variable limited boltzmann machine are both matrix variables, so that a space structure of high-dimensional data can be well stored. Similar to a limited Boltzmann machine, a visible layer and a hidden layer in the model are connected through a weight matrix, and a process of obtaining output data of the hidden layer according to input data of the visible layer is a process of feature extraction. And the training of the matrix variable limited Boltzmann machine under the Gaussian input is completed by adopting a CD algorithm.
In step 3, the multi-modal input matrix variable restricted boltzmann machine comprises two visible layers and a hidden layer, the visible layers and the hidden layer are connected through two independent weights, and a process of obtaining hidden layer data from input layer data is a process of feature fusion. Similar to the matrix variable limited boltzmann machine, the training under the multi-modal input also adopts a CD algorithm, and the only difference is that the output value of the hidden layer is determined by two input values together.
In step 4, the probability distribution of the free energy and the input data has the following relationship:
Figure 728358DEST_PATH_IMAGE036
the probability distribution of the input data can be expressed as
Figure 907666DEST_PATH_IMAGE037
The free energy function can thus be expressed as
Figure 651631DEST_PATH_IMAGE038
Wherein
Figure 433118DEST_PATH_IMAGE039
Which represents the input data, is,
Figure 971547DEST_PATH_IMAGE040
the output data is represented by a representation of,
Figure 372572DEST_PATH_IMAGE041
representing the corresponding probability distribution of the data stream,
Figure 857911DEST_PATH_IMAGE042
in order to be a function of the allocation,
Figure 27993DEST_PATH_IMAGE043
is the free energy function corresponding to the input function,
Figure 737323DEST_PATH_IMAGE044
as a function of the energy of the entire model. For a trained model, the value of the partition function remains unchanged, so that the variation of the probability distribution can be measured by the free energy value of the input data.
In step 5, the hough-b-t boundary can be obtained according to the hough-t inequality. The Hough's inequality gives the probability upper bound of the sum of random variables and its expected deviation, and for a given confidence, the difference between the free energy rate of change and its average rate of change, which can be derived from the concept of confidence interval, has the following relationship:
Figure 625644DEST_PATH_IMAGE045
the threshold value of the sliding window adjustment should be less than or equal to
Figure 445833DEST_PATH_IMAGE046
Wherein
Figure 738929DEST_PATH_IMAGE047
Represents the maximum variation range of the free energy variation rate,
Figure 619161DEST_PATH_IMAGE048
is the total number of samples. That is, when the free energy change rate is larger than the threshold, the window size is reduced, otherwise, the window size is continuously enlarged.
Drawings
FIG. 1 is a schematic diagram of heterogeneous data feature extraction and feature fusion;
FIG. 2 is a diagram of sliding window length adaptive rectification;
FIG. 3 is an overall process diagram of the algorithm;
FIGS. 4 (a) and (b) show the free energy variation of the data stream and the adjustment of the length of the sliding window under the voice input, respectively;
fig. 5 (a) and (b) show free energy variation of data stream under image input and length adjustment of sliding window, respectively;
FIGS. 6 (a) and (b) illustrate free energy variation of data stream and adjustment of length of sliding window in voice-image input, respectively;
fig. 7 is a comparison of sliding window length adaptive adjustment accuracy obtained after five independent experiments.
Detailed Description
The invention is described below by means of specific embodiments. Unless otherwise specified, the technical means used in the present invention are well known to those skilled in the art. In addition, the embodiments should be considered illustrative, and not restrictive, of the scope of the invention, which is defined solely by the claims. It will be apparent to those skilled in the art that various changes and modifications can be made in these embodiments without departing from the spirit and scope of the invention. The present invention will be described in further detail with reference to the accompanying drawings and specific embodiments.
Example 1
Fig. 1 is a schematic diagram of feature extraction and feature fusion of heterogeneous data streams, in which two-dimensional picture data and one-dimensional voice data are shown as an example, and the specific steps are as follows:
step 1, training a matrix variable limited Boltzmann machine under Gaussian input, and performing feature extraction on matrix data by using a trained model;
step 2, inputting a training gauss into a restricted Boltzmann machine, and performing feature extraction on vector data by using a trained model;
and 3, training a matrix variable limited Boltzmann machine under multi-modal input, and performing feature fusion by taking the feature vectors obtained in the step 1 and the step 2 as input.
In step 1, each node of the visible layer of the matrix variable limited Boltzmann machine follows Gaussian distribution with variance of
Figure 994778DEST_PATH_IMAGE049
The energy function of which can be expressed as
Figure 884237DEST_PATH_IMAGE050
Wherein
Figure 497752DEST_PATH_IMAGE051
The outer product of (a) may result in a weight matrix between the visible layer and the hidden layer,
Figure 548885DEST_PATH_IMAGE052
in order to be able to see the biasing of the layers,
Figure 411799DEST_PATH_IMAGE053
in order to hide the bias of the layers,
Figure 774122DEST_PATH_IMAGE054
(ii) a The specific training process comprises the following steps:
(1) calculating the probability of each node of the hidden layer according to the matrix input
Figure 38881DEST_PATH_IMAGE055
Obtaining an activation output matrix of the hidden layer node according to the probability;
(2) calculating the outer product of the matrix input value and the activation output value, and defining the outer product as a positive gradient;
(3) reconstructing the activation value of the visual layer according to the activation output matrix obtained in the step (1)
Figure 995336DEST_PATH_IMAGE056
And repeating the step (1) to obtain the corresponding activation output of the hidden layer;
(4) calculating the outer product of the reconstructed input value of the visual layer and the output matrix of the corresponding hidden layer, and defining the outer product as a negative gradient; (5) and updating the parameters of the model at a certain learning rate according to the difference value of the positive gradient and the negative gradient.
In step 3, the training process of the multi-modal input matrix variable limited boltzmann machine is similar to step 1, except that the probability distribution of the hidden layer of the multi-modal input matrix variable limited boltzmann machine needs to be jointly determined by two characteristic inputs, namely
Figure 345546DEST_PATH_IMAGE057
Wherein
Figure 311228DEST_PATH_IMAGE058
The connection weights between the visible layer and the hidden layer are synthesized separately,
Figure 696073DEST_PATH_IMAGE059
is the offset of the hidden layer.
Fig. 2 is a flowchart of adaptive adjustment of the length of the sliding window, which specifically includes the following steps:
step 1, setting the maximum size Nmax and the minimum size Nmin of a window in an algorithm, and setting the size of a current window as N;
step 2, calculating the free energy value of the data in the previous window according to the free energy function, and defining the free energy value as energy 1;
step 3, calculating the free energy value of the data in the next window according to the free energy function, and defining the free energy value as energy 2;
step 4, calculating the free energy change rate of the current window relative to the previous window according to the values obtained in the step 2 and the step 3, and comparing the free energy change rates with a threshold value, wherein the energy change rate is determined by the ratio of the difference of the free energy values in adjacent windows to the free energy value of the previous window, and the threshold value is obtained according to the Hough boundary;
step 5, adjusting the window, mainly including two conditions:
(1) when the free energy change rate is less than or equal to the threshold value, it indicates that the data stream between adjacent windows is not different or the change is within a negligible range, and the window size should be enlarged, specifically the window size is
Figure 89008DEST_PATH_IMAGE060
(2) When the free energy change rate is greater than the threshold, indicating a significant difference in data flow between adjacent windows, the window size should be reduced to its minimum size Nmin.
Fig. 3 is an overall process diagram of the algorithm. As shown in fig. 3, for a multi-source heterogeneous data stream in a heterogeneous sensor network, firstly, a part of the multi-source heterogeneous data stream is taken as training data, and a matrix variable limited boltzmann machine and a multi-mode matrix input limited boltzmann machine under gaussian input are trained according to the process shown in fig. 1, so that feature extraction and feature fusion of the heterogeneous data stream are completed; and then, constructing a free energy function according to the model obtained by training, and completing the self-adaptive adjustment of the length of the sliding window according to the algorithm shown in FIG. 2.
Example 2
The testing of the algorithm on the CUAVE data set is given below:
the CUAVE data set is composed of 10 numbers which are respectively read from 0 to 9 by 36 persons for a plurality of times, wherein the 10 numbers comprise two kinds of heterogeneous data of images and voice. The picture data can be converted into a matrix input of 75 × 50 according to the pixel values, and the sound data can be converted into vector data of 1 × 534 according to mel-frequency cepstrum coefficients. In the experiment, the sizes of hidden layers of a Gauss input limited Boltzmann machine and a Gauss input matrix limited Boltzmann machine are set as
Figure 878846DEST_PATH_IMAGE061
The maximum and minimum sizes of the sliding window were set to 60 and 20, respectively, and the experiment was performed with 95% confidence. Fig. 4, 5, 6 and 7 show graphs of the results of the experiment.
Fig. 4, 5, and 6 (a) and (b) show the energy change rate of the current window relative to the previous window and the size of the sliding window under the voice input, the image input, and the voice-image input, respectively. Comparing the free energy rate of change between adjacent windows to a threshold in plot (a), and when the rate of change exceeds the threshold, increasing the window size in plot (b) up to the maximum window size set by the algorithm; reducing the window size in fig. (b) to a minimum size when the rate of change is below a threshold. In the figure, black vertical lines indicate that the data stream changes at this point. Through comparison, the overall change of the data stream under voice input is very small, so that the abnormity occurring in the data stream cannot be detected in time, and the length of the sliding window is misadjusted; under the condition of image input, the data flow fluctuation is large, the change rate of the abnormal free energy of the data flow is suddenly increased, the size of a sliding window is reduced when the change rate exceeds a threshold value, and when the data are not changed, misadjustment can also occur due to algorithms such as external factors; although the sliding window size can also be misadjusted under the voice-image input, the method has higher accuracy compared with the voice input and the image input, because the free energy change rate of the data stream under the multi-modal input is jointly determined by information in the voice data and the image data, the method can not only compensate the phenomenon that the sliding window is missed to be adjusted under the voice input, but also avoid the phenomenon that the sliding window is misadjusted under the single image input.
Fig. 7 shows that the accuracy of the sliding window size adaptive adjustment in the case of voice input, image input, and voice-image input is compared with the accuracy of the sliding window size adaptive adjustment in the case of selecting five different sets of data, and it can be seen from fig. 7 that the accuracy of the sliding window length adaptive adjustment can be improved by performing fusion and re-analysis on heterogeneous data from different data sources in the case of multi-modal input.

Claims (8)

1. A sliding window length self-adaptive adjusting method for multi-source heterogeneous data streams is characterized by comprising the following steps:
step 1, performing feature extraction and feature fusion on heterogeneous data streams through an extended model of a Gauss input limited Boltzmann machine under matrix variables and multi-modal input; wherein each node of the visible layer of the matrix variable limited Boltzmann machine follows Gaussian distribution with variance of
Figure 277993DEST_PATH_IMAGE001
The energy function of which can be expressed as
Figure 535799DEST_PATH_IMAGE002
Figure 843939DEST_PATH_IMAGE003
The representation matrix variables may see the layer input data,
Figure 23248DEST_PATH_IMAGE004
represents the output data of the hidden layer(s),
Figure 298371DEST_PATH_IMAGE005
for the input data after the normalization, the data is processed,
Figure 613946DEST_PATH_IMAGE006
the outer product of (a) may result in a weight matrix between the visible layer and the hidden layer,
Figure 152375DEST_PATH_IMAGE007
in order to be able to see the biasing of the layers,
Figure 553400DEST_PATH_IMAGE008
a bias to a hidden layer; according to the matrix input, the probability of each node of the hidden layer can be calculated
Figure 835477DEST_PATH_IMAGE009
Obtaining an activation output matrix of the hidden layer node according to the probability, wherein the process is a characteristic extraction process; the principle of the multi-mode input matrix variable limited Boltzmann machine is similar to that of the matrix variable limited Boltzmann machine, and hidden layer output can be obtained by two visible layer inputs
Figure 271138DEST_PATH_IMAGE010
Namely, feature fusion is realized, wherein
Figure 711959DEST_PATH_IMAGE011
For the input of the visible layer or layers,
Figure 865860DEST_PATH_IMAGE012
in order to hide the layer output,
Figure 889310DEST_PATH_IMAGE013
the connection weights between the visible layer and the hidden layer are synthesized separately,
Figure 445057DEST_PATH_IMAGE014
an offset for the hidden layer;
step 2, constructing a free energy function of the multi-source heterogeneous data stream through the model obtained by training in the step 1, and measuring the probability distribution change of the data stream according to the free energy change rate between adjacent windows;
the probability distributions of free energy and data flow are related as follows:
Figure 59709DEST_PATH_IMAGE015
wherein
Figure 435326DEST_PATH_IMAGE016
Representing the probability density distribution of the input data,
Figure 59206DEST_PATH_IMAGE017
the free energy of the input data is represented,
Figure 218528DEST_PATH_IMAGE018
for a distribution function, also called a normalization constant, for a trained limited boltzmann machine,
Figure 269660DEST_PATH_IMAGE018
the free energy of the input data can reflect the probability distribution condition of the input data;
and 3, comparing the free energy change rate between adjacent windows with a threshold value determined by a Hough boundary, and adjusting the length of the sliding window according to the result.
2. The method of claim 1, wherein the training of the matrix variable restricted boltzmann machine under gaussian input in step 1 comprises the steps of:
(1) calculating the probability of each node of the hidden layer according to the matrix input
Figure 866995DEST_PATH_IMAGE019
And obtaining an activation output matrix of the hidden layer node according to the probability;
(2) calculating the outer product of the matrix input value and the activation output value, and defining the outer product as a positive gradient;
(3) according to the activation output matrix obtained in the step (1)Reconstructing activation values for a visual layer
Figure 28986DEST_PATH_IMAGE020
And repeating the step (1) to obtain the corresponding activation output of the hidden layer;
(4) calculating the outer product of the reconstructed input value of the visual layer and the output matrix of the corresponding hidden layer, and defining the outer product as a negative gradient;
(5) and updating the parameters of the model at a certain learning rate according to the difference value of the positive gradient and the negative gradient.
3. The method of claim 1, wherein the training process of the multi-modal input matrix variable limited boltzmann machine in step 1 is similar to the training process of the matrix variable limited boltzmann machine under 2 gauss input, except that the probability distribution of the hidden layer thereof needs to be determined by two feature inputs, namely
Figure DEST_PATH_IMAGE021
4. The method of claim 1, wherein the free energy function in step 2 is derived from the relationship:
the probability distributions of free energy and input data are related as follows:
Figure 497008DEST_PATH_IMAGE022
the probability distribution of the input data can be expressed as
Figure 719041DEST_PATH_IMAGE023
Thus the free energy function can be expressed as
Figure 538093DEST_PATH_IMAGE024
Wherein
Figure 766424DEST_PATH_IMAGE025
Representing input data, representing output data,
Figure 885690DEST_PATH_IMAGE026
representing the corresponding probability distribution of the data stream,
Figure DEST_PATH_IMAGE027
in order to be a function of the allocation,
Figure 216308DEST_PATH_IMAGE028
is the free energy function corresponding to the input function,
Figure 53814DEST_PATH_IMAGE029
as a function of the energy of the entire model.
5. The method of claim 1, wherein the free energy change rate in step 2 is defined as a ratio of a difference between free energy of data streams in two adjacent windows to a free energy value of a previous window; when the data between adjacent windows has no obvious change, the probability distribution of the data streams is similar, and the free energy values are not greatly different, so that the free energy change rate is close to zero, otherwise, the free energy change rate is a larger value.
6. The method of claim 1, wherein the threshold value for window adjustment in step 3 is obtained by a huffman boundary:
according to the Hough inequality, under the confidence coefficient, the threshold value of window adjustment should be less than or equal to
Figure 557608DEST_PATH_IMAGE030
Wherein
Figure 531380DEST_PATH_IMAGE031
Represents the maximum variation range of the free energy variation rate,
Figure 95217DEST_PATH_IMAGE032
is the total number of samples.
7. The method of claim 1, wherein the adaptive adjustment of the length of the sliding window in step 3 comprises the following two cases:
(1) when the free energy change rate is less than or equal to the threshold value, the data stream between adjacent windows is not different or the change of the data stream is in a negligible range, and when the maximum size set by the algorithm is not exceeded, the window is expanded to be twice of the original size;
(2) when the free energy change rate is greater than the threshold, indicating a significant difference in data flow between adjacent windows, the window size should be reduced to the minimum window size set by the algorithm.
8. The sliding window length adaptive adjustment method for multi-source heterogeneous data streams, according to claim 1, in combination with fusion of multi-source heterogeneous data, can fully consider information contained in each data source, and has a better adjustment accuracy rate compared with a single data source.
CN202010087229.9A 2020-02-11 2020-02-11 Sliding window length self-adaptive adjustment method for multi-source heterogeneous data stream Pending CN111191742A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010087229.9A CN111191742A (en) 2020-02-11 2020-02-11 Sliding window length self-adaptive adjustment method for multi-source heterogeneous data stream

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010087229.9A CN111191742A (en) 2020-02-11 2020-02-11 Sliding window length self-adaptive adjustment method for multi-source heterogeneous data stream

Publications (1)

Publication Number Publication Date
CN111191742A true CN111191742A (en) 2020-05-22

Family

ID=70710350

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010087229.9A Pending CN111191742A (en) 2020-02-11 2020-02-11 Sliding window length self-adaptive adjustment method for multi-source heterogeneous data stream

Country Status (1)

Country Link
CN (1) CN111191742A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114140462A (en) * 2021-12-10 2022-03-04 江苏牛犇轴承有限公司 Bearing wear degree evaluation method based on image processing
CN115238753A (en) * 2022-09-21 2022-10-25 西南交通大学 Self-adaptive SHM data cleaning method based on local outlier factor
WO2023168371A3 (en) * 2022-03-02 2024-01-18 Fdh Infrastructure Services, Llc Machine learning to predict bolt tension

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104318246A (en) * 2014-10-20 2015-01-28 西安电子科技大学 Depth self-adaption ridgelet network based polarimetric SAR (Synthetic Aperture Radar) image classification
US20160188726A1 (en) * 2014-12-31 2016-06-30 TCL Research America Inc. Scalable user intent mining using a multimodal restricted boltzmann machine
CN106060039A (en) * 2016-05-27 2016-10-26 广东工业大学 Classification detection method facing network abnormal data flow
WO2017066695A1 (en) * 2015-10-16 2017-04-20 D-Wave Systems Inc. Systems and methods for creating and using quantum boltzmann machines
CN107730040A (en) * 2017-09-30 2018-02-23 国网山东省电力公司电力科学研究院 Power information system log information comprehensive characteristics extracting method and device based on RBM
CN108470074A (en) * 2018-04-04 2018-08-31 河北北方学院 A kind of multi-source data under the environment based on big data polymerize sampling system
CN110008388A (en) * 2019-03-27 2019-07-12 东北大学 A kind of flow data classification method based on decision tree
WO2019157228A1 (en) * 2018-02-09 2019-08-15 D-Wave Systems Inc. Systems and methods for training generative machine learning models
CN110445726A (en) * 2019-08-16 2019-11-12 山东浪潮人工智能研究院有限公司 A kind of adaptive network stream concept drift detection method based on comentropy

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104318246A (en) * 2014-10-20 2015-01-28 西安电子科技大学 Depth self-adaption ridgelet network based polarimetric SAR (Synthetic Aperture Radar) image classification
US20160188726A1 (en) * 2014-12-31 2016-06-30 TCL Research America Inc. Scalable user intent mining using a multimodal restricted boltzmann machine
WO2017066695A1 (en) * 2015-10-16 2017-04-20 D-Wave Systems Inc. Systems and methods for creating and using quantum boltzmann machines
CN106060039A (en) * 2016-05-27 2016-10-26 广东工业大学 Classification detection method facing network abnormal data flow
CN107730040A (en) * 2017-09-30 2018-02-23 国网山东省电力公司电力科学研究院 Power information system log information comprehensive characteristics extracting method and device based on RBM
WO2019157228A1 (en) * 2018-02-09 2019-08-15 D-Wave Systems Inc. Systems and methods for training generative machine learning models
CN108470074A (en) * 2018-04-04 2018-08-31 河北北方学院 A kind of multi-source data under the environment based on big data polymerize sampling system
CN110008388A (en) * 2019-03-27 2019-07-12 东北大学 A kind of flow data classification method based on decision tree
CN110445726A (en) * 2019-08-16 2019-11-12 山东浪潮人工智能研究院有限公司 A kind of adaptive network stream concept drift detection method based on comentropy

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
ALBERT BIFET ;RICARD GAVALDÀ;: "Learning from Time-Changing Data with Adaptive Windowing", PROCEEDINGS OF THE 2007 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 30 April 2007 (2007-04-30) *
MACIEJ JAWORSKI; PIOTR DUDA; LESZEK RUTKOWSKI;: "Concept Drift Detection in Streams of Labelled Data Using the Restricted Boltzmann Machine", 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 14 October 2018 (2018-10-14) *
SIMENG LIU; YANFENG SUN; YONGLI HU; JUNBIN GAO; FUJIAO JU; BAOCAI YIN;: "Matrix variate RBM model with Gaussian distributions", 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 3 July 2017 (2017-07-03) *
WEI WANG, MENGJUN ZHANG;: "Data Stream Adaptive Partitioning of Sliding Window Based on Gaussian Restricted Boltzmann Machine", ARTIFICIAL INTELLIGENCE IN CHINA., no. 572, 1 February 2020 (2020-02-01) *
秦骏达;毕天姝;: "基于自由能理论的电力系统扰动能量评估", 电工技术学报, no. 03, 26 December 2017 (2017-12-26) *
郑志蕴;李步源;李伦;李钝;: "基于云计算的受限玻尔兹曼机推荐算法研究", 计算机科学, no. 12, 15 December 2013 (2013-12-15) *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114140462A (en) * 2021-12-10 2022-03-04 江苏牛犇轴承有限公司 Bearing wear degree evaluation method based on image processing
CN114140462B (en) * 2021-12-10 2023-09-08 江苏牛犇轴承有限公司 Bearing wear degree assessment method based on image processing
WO2023168371A3 (en) * 2022-03-02 2024-01-18 Fdh Infrastructure Services, Llc Machine learning to predict bolt tension
CN115238753A (en) * 2022-09-21 2022-10-25 西南交通大学 Self-adaptive SHM data cleaning method based on local outlier factor
CN115238753B (en) * 2022-09-21 2022-12-06 西南交通大学 Self-adaptive SHM data cleaning method based on local outlier factor

Similar Documents

Publication Publication Date Title
CN110245608B (en) Underwater target identification method based on half tensor product neural network
CN111191742A (en) Sliding window length self-adaptive adjustment method for multi-source heterogeneous data stream
CN112183501B (en) Depth counterfeit image detection method and device
CN111933124B (en) Keyword detection method capable of supporting self-defined awakening words
CN107392919B (en) Adaptive genetic algorithm-based gray threshold acquisition method and image segmentation method
CN112215054B (en) Depth generation countermeasure method for denoising underwater sound signal
CN111564163A (en) RNN-based voice detection method for various counterfeit operations
CN112307714A (en) Character style migration method based on double-stage deep network
CN112861982A (en) Long-tail target detection method based on gradient average
CN114694255B (en) Sentence-level lip language recognition method based on channel attention and time convolution network
CN113971735A (en) Depth image clustering method, system, device, medium and terminal
Zhuang et al. A handwritten Chinese character recognition based on convolutional neural network and median filtering
CN111931572B (en) Target detection method for remote sensing image
CN112085668B (en) Image tone mapping method based on region self-adaptive self-supervision learning
CN109919235B (en) Deep learning image classification model training method based on manual intervention sample set weight
CN116756391A (en) Unbalanced graph node neural network classification method based on graph data enhancement
CN107766838B (en) Video scene switching detection method
CN116309213A (en) High-real-time multi-source image fusion method based on generation countermeasure network
CN115861930A (en) Crowd counting network modeling method based on hierarchical difference feature aggregation
CN113420870B (en) U-Net structure generation countermeasure network and method for underwater sound target recognition
CN110378466A (en) Quantization method and system based on neural network difference
CN112488238B (en) Hybrid anomaly detection method based on countermeasure self-encoder
CN113177599A (en) Enhanced sample generation method based on GAN
CN112818152A (en) Data enhancement method and device of deep clustering model
CN113609970A (en) Underwater target identification method based on grouping convolution depth U _ Net

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination