Heterogeneous sensor potential correlation learning method based on deep learning
Technical Field
The invention belongs to the technical field of data processing, relates to a deep learning-based heterogeneous sensor potential correlation learning method, and particularly relates to the field of data acquisition and analysis and the field of data correlation of a sensor of a smart phone used by a user.
Background
In recent years, thanks to the development of technologies such as intelligent sensing and pervasive computing, a cooperative sensing technology widely applied to network convergence becomes a research hotspot. With the rapid development of embedded devices, wireless sensor networks, internet of things and intelligent mobile terminals, a universal intelligent sensing system with integrated sensing, computing and communication capabilities is gradually integrated into the daily life of the society, and the capability of intelligently sensing and acquiring data is remarkably enhanced.
The perception capability obtained by using a single perception device is limited, and in recent years, popularization of various intelligent devices, particularly smart phones, enables common people to share various information such as geographic positions, surrounding environments and the like in real time. The performance of the processor is continuously enhanced, and the smart phone is greatly innovated. Increasingly complex sensors such as sensors capable of measuring acceleration, gyroscope sensors, gravity sensors and direction sensors become standard configurations of smart phones. The sensors enable each individual carrying a mobile phone to become a terminal with strong sensing capability, so that a great number of individuals carrying intelligent terminals can form a coordinator sensing network to observe, track and the like specific targets. Learning of the heterogeneous sensor potential relevance is required.
The document "Rajesh G, Charturedi A. correlation analysis and statistical characterization of heterologous sensor data in environmental sensor Networks [ J ]. Computer Networks, 2019" proposes four classical and robust correlation coefficient measurement methods. The paper utilizes the correlation between sensor modalities to complete the subsequent sensor data interpolation and new modality data prediction, and defines and expresses the correlation between modalities by measuring the correlation coefficient between different multi-modality variables.
The document "Jiang M, Gao X, An H, et al.Reconstructing complex network for chromatography of the time-varying evaluation analysis viewer of multi-varying time series [ J ]. Scientific Reports,2017,7 (1): 10486-. The method has the potential of analyzing the multivariate time sequence and provides important information for investors and decision makers;
the document "Gupta S, Buriro a, Crispo b. driverauth: a Risk-based Multi-modal Biometric-based Driver Authentication Scheme [ J ] Computers & Security, 2019' through learning and utilizing the correlation among a plurality of heterogeneous sensors carried in smart phone equipment, a novel Multi-modal Biometric Authentication solution based on risks is realized, and the aim of ensuring the certification service of a network car booking Driver to be safer and more reliable is achieved.
The above documents briefly introduce the existing correlation analysis method and its main application scenario, and the existing correlation analysis method mainly represents the correlation by calculating the corresponding correlation coefficient. But for heterogeneous sensor devices, the data it produces is heterogeneous time series data. For the correlation analysis between the heterogeneous time series data in a large scale, the existing correlation analysis method based on correlation coefficient calculation cannot meet the requirement.
Disclosure of Invention
Technical problem to be solved
In order to solve the problem that the traditional correlation analysis method based on correlation coefficient calculation cannot effectively perform correlation analysis on the perception data of the heterogeneous sensor, the invention provides a heterogeneous sensor potential correlation learning method based on deep learning. The method is used for carrying out potential correlation analysis on a large amount of heterogeneous time sequence data generated by various sensors, and a real-time correlation analysis task is realized by utilizing a deep learning technology. The invention also optimizes the traditional time convolution network model and creatively realizes the task of analyzing the correlation of the time sequence data.
Technical scheme
A heterogeneous sensor potential correlation learning method based on deep learning is characterized by comprising the following steps:
step 1: acquiring perception data of a terminal user by using a sensor;
step 2: storing and preprocessing the acquired sensing data of the sensor;
and step 3: executing a data alignment task on the sensor data file of the acquired data;
and 4, step 4: taking the sensor data after executing the data alignment task as input data of a deep learning model, wherein the output data of the deep learning model is data of any sensor; after learning, the deep learning model returns a two-dimensional matrix to represent the correlation between various sensors which are input currently and output sensors;
and 5: learning all the sensor data as output data of the model, and finally summarizing the corresponding relations among all the sensors to generate a correlation two-dimensional matrix among all the sensors;
step 6: a two-dimensional matrix of all sensor correlations is visualized.
Preferably: the terminal in the step 1 is a smart phone.
Preferably: the sensor in the step 1 is a gyroscope sensor, a gravity sensor or a direction sensor.
Preferably: the file type of the stored sensor data in the step 2 is a CSV file.
Preferably: the data alignment task in step 3 is specifically: selecting the sensor S with the longest data lengthkThen, S iskAnd performing an alignment task on the data of the other sensors as a reference element of the data, selecting the data which is collected by the sensor for the last time to be supplemented when the alignment task is executed, and finally, all the sensors have data with relevant lengths after the execution of the data-optimal task is finished.
Preferably: the deep learning model in the step 4 comprises causal convolution, dilation convolution, residual connection and attention mechanism.
Preferably: the correlation in step 4 is specifically as follows: the decimal between 0 and 1 is used for representation, 0 represents the minimum correlation, and 1 represents the highest correlation.
Preferably: step 6 is visualized using thermodynamic diagrams.
Advantageous effects
The heterogeneous sensor correlation learning method based on deep learning can be used for completing the task of analyzing the correlation between heterogeneous sensors. For a large amount of time sequence data generated by heterogeneous sensors, the method can learn the correlation among the data through an improved time convolution network model, and the improved time convolution network model can express the correlation in a digital form through an attention mechanism of the improved time convolution network model, and finally, potential correlation relations among the heterogeneous sensors can be visually observed through a data visualization technology. The perception capability of the heterogeneous sensors can be fully exerted in intelligent perception and pervasive computing tasks by using the intuitive correlation results, the perception tasks are completed in a more efficient collaborative perception mode, cost reduction and efficiency improvement of the perception tasks are realized, and a plurality of complex user behavior recognition, user context perception and other high-level perception tasks are completed in a more energy-saving mode.
Drawings
FIG. 1 is a flowchart of a deep learning-based method for learning potential correlations of heterogeneous sensors according to an embodiment of the present invention
FIG. 2 is a flow chart of the execution of data alignment tasks during data pre-processing phase in an embodiment of the present invention
FIG. 3 is a diagram of a deep learning-based relevance latent learning neural network model structure constructed in an embodiment of the present invention
Detailed Description
The invention will now be further described with reference to the following examples and drawings:
the method comprises the steps that firstly, sensing data of all sensors are acquired from intelligent equipment carrying a plurality of heterogeneous sensors, and specifically, the same sampling frequency is used for acquiring all acquired data of the heterogeneous sensors; the neural network model is then used to learn the correlation between these sensors, and in particular the inputs and outputs of the model, here the corresponding sensor data, need to be specified. Meanwhile, in order to train the neural network model better, corresponding data is divided into two parts, namely training and testing. Secondly, in order to learn the correlation between the heterogeneous sensors, the influence relationship of the input data relative to the output data needs to be extracted in the model. Therefore, the part is specifically set in the model to learn the correlations, i.e., a matrix is used to hold the correlations. In order to represent the correlation between heterogeneous sensors, a decimal between 0 and 1 is used for representation, 0 represents the minimum correlation, and 1 represents the highest correlation. Next, in order to learn the potential correlation between a plurality of heterogeneous sensors, it is necessary to use all the heterogeneous sensors as the output of the model, and to sequentially input the data of the other sensors, and then to perform model learning and recording a plurality of times. Finally, the learned correlation coefficient data among the various heterogeneous sensor data are sorted, and finally the data are expressed as a two-dimensional matrix. The two-dimensional matrix records the correlations between all heterogeneous sensors and other heterogeneous sensors. Finally, in order to better show the potential correlation among the sensors, the invention also adopts a visualization strategy to express the potential correlation among the heterogeneous sensors learned by the neural network.
In order to realize the task, the invention adopts the following technical scheme: the heterogeneous sensor potential correlation learning method based on deep learning comprises the following steps:
step one, a user uses a portable smart phone to carry out daily activities, and meanwhile, a sensor mounted on the smart phone starts to collect data sensed currently.
And step two, sending the acquired sensing data of the sensor to a server side for storage, wherein the acquired sensor data file type is a CSV file. And after all the files are uploaded, the data acquisition task of the smart phone end is finished.
And thirdly, carrying out data preprocessing operation on the acquired sensor data file to reduce the influence of invalid data. And deleting the sensor files without the acquired data by data preprocessing, and only leaving the sensor data files with the acquired data.
Step four, executing data alignment task on the sensor data file of the acquired data, and executing data alignment taskThe execution flow chart of (2) is as shown in fig. 2. Selecting the sensor S with the longest data lengthk. Then the S iskAs a reference element of the data, the data of the remaining sensors are subjected to an alignment task. And when the alignment task is executed, the data acquired by the sensor at the last time is selected for completion, and finally all the sensors have data with relevant length after the execution of the data completion task is finished.
And step five, taking the sensor data after the data alignment task is executed as input data of the deep learning model, wherein the output data of the deep learning model is data of any sensor. The structure diagram of the deep learning based neural network model is shown in fig. 3, and the model comprises four components: causal convolution, dilation convolution, residual concatenation and attention mechanism. Consists of two layers: a time convolutional network layer, an attention layer. After the input and output data of the model are specified, a model learning task is started, and after the model is learned, a correlation matrix is returned to represent the potential correlation relationship existing between the currently input sensor and the currently specified output sensor.
And step six, selecting other sensors as output data of the model to perform a task of learning the potential correlation between the sensors and the input sensors, and finally performing potential correlation learning by using all the sensors as the output data of the model. And summarizing the learned correlation matrix, respectively displaying the potential correlation between each sensor relative to other sensors, and generating a correlation two-dimensional matrix among all the sensors.
Finally, a two-dimensional matrix visualization thermodynamic diagram representing all sensor correlations is output as a final result.