CN113428167B

CN113428167B - ECU (electronic control Unit) abnormality recognition method

Info

Publication number: CN113428167B
Application number: CN202110978223.5A
Authority: CN
Inventors: 卢继武; 许鹤; 刘平; 孟锦豪; 刘义; 张景哲
Original assignee: Changsha Deyi Technology Co ltd
Current assignee: Changsha Deyi Technology Co ltd
Priority date: 2021-08-25
Filing date: 2021-08-25
Publication date: 2021-11-26
Anticipated expiration: 2041-08-25
Also published as: CN113428167A

Abstract

The invention discloses an ECU (electronic control unit) abnormality identification method, which is based on the high-speed acquisition and processing capacity of an FPGA (field programmable gate array) on the basis of fully researching a CAN bus protocol communication mechanism, improves the sampling rate and the characteristic calculation rate to the maximum extent, combines the advantages of an artificial neural network, designs an Enhanced LSTM neural network model based on an attention mechanism aiming at CAN bus time sequence characteristic data, is used for identifying the working state of a vehicle-mounted ECU (electronic control unit), and has stable and reliable performance.

Description

ECU (electronic control Unit) abnormality recognition method

Technical Field

The invention belongs to the field of vehicle-mounted safety, and particularly relates to an ECU (electronic control unit) abnormity identification method.

Background

With the continuous development of the automatic driving technology, Electronic Control Units (ECUs) and networking communication devices inside automobiles are more and more complex, the ECUs on high-grade cars may be hundreds, and vehicle-mounted networking modes include WiFi, 2G, 3G, 4G, 5G and the like. The increasing complexity and networking nature of modern automotive functions make the security of automobiles extremely challenging.

Any industry faces information security problems in the process of networking development, and the development of the internet of vehicles is no exception. Along with the continuous emergence of intelligent networking automobiles, threats such as malicious attack, illegal control, privacy disclosure and the like are increased day by day, and in addition, the application of an automatic driving auxiliary system and a new generation of electronic information technology on the automobiles ensures that the automobiles are more and more intelligent and can ensure the driving safety to a great extent, but at the same time, once the intelligent auxiliary functions are suddenly disabled or abnormal under a certain condition, the driving safety is seriously threatened. The networking of automobiles enables attackers to have an opportunity to access the vehicle-mounted network through various wired or wireless modes, acquire vehicle information and even remotely control the vehicles. No matter what kind of attack form, the final target of attacker is vehicle-mounted CAN bus, through the mode of sending malicious frame data, realizes the purpose of attacking, so CAN bus safety becomes the car networking safety and is also the most important line of defense at last, and the research CAN bus safety technique has very important meaning to whole car networking safety.

Generally, the CAN bus security protection means CAN be generally divided into two types, one is a protection means based on encryption and authentication, and the other is a protection means based on intrusion detection. The protection means based on encryption and authentication mainly authenticates frame data and ECU or encrypts messages by using a security key, because the number of effective load bits of a CAN data frame format is only 64 bits, the encryption inevitably increases the network load burden of a CAN bus, so that the communication efficiency is reduced, and the upgrading of the encryption means needs to upgrade the existing firmware. The protection method based on intrusion detection is to establish a detection model by analyzing the physical characteristics, statistical characteristics and the like of frames. The method has the characteristics that the communication burden of the CAN bus is not increased, but the detection speed is limited by the sampling rate and the feature extraction. At present, most of intrusion identification systems are still detection models established based on statistical characteristics, and the requirements of modern intelligent networked automobiles on safety are difficult to meet in the aspect of detection precision.

Most of the traditional abnormality identification systems detect the state of the ECU based on the data frame data message, the method cannot identify the interception and forgery attacks of the ECU, meanwhile, the data needs to be analyzed in an upper computer, and a certain time delay exists in the calculation processing process; however, as the number of the vehicle-mounted ECUs increases, the complexity also increases, and a general machine learning model cannot obtain high detection accuracy.

The CAN ID is usually generated automatically by the vehicle through information of an Electronic Control Unit (ECU), so the invention realizes the identification of ECU abnormality by adopting the correlation between the information of the ECU and the CAN ID, and improves the detection efficiency and precision.

Disclosure of Invention

In order to solve the problems, the invention discloses an ECU abnormality identification method.

In order to achieve the purpose, the technical scheme of the invention is as follows:

an ECU abnormality recognition method includes the steps of:

the method comprises the following steps of firstly, obtaining multidimensional time sequence data of the vehicle-mounted ECU characteristics, and specifically:

s1: two differential data lines of the CAN bus are connected to an ADC data acquisition circuit module through a wiring terminal, and the ADC data acquisition circuit module is arranged on an FPGA;

s2: the FPGA transmits data to the ADC data acquisition module through a wired communication protocol to acquire ADC sampling data, and acquires CAN ID and extraction characteristics through ADC waveform data time sequence frame ID analysis and waveform physical characteristic extraction; the CAN ID and the extracted features form multidimensional time sequence data of the fingerprint features of the vehicle-mounted ECU;

s3: sending the multidimensional time sequence data of the ECU fingerprint characteristics to an upper computer;

step two, obtaining an identity recognition model of an abnormality judgment logic for the vehicle-mounted CAN bus abnormality detection system, and specifically comprising the following steps:

l1: inputting the multidimensional time sequence data of the ECU fingerprint characteristics into an Enhanced LSTM neural network model;

l2: and the multidimensional time series data of the ECU fingerprint features are encoded and decoded in an Enhanced LSTM neural network model, so that the identification of the multidimensional time series data of the ECU fingerprint features is completed, and an ECU fingerprint feature identification result is obtained.

In a further improvement, the Enhanced LSTM neural network model is divided into an input layer, a hidden layer and an output layer in sequence.

In a further improvement, the Enhanced LSTM neural network model includes an encoding stage and a decoding stage, the encoding stage calculates respective weights for input multidimensional time series data features by using an attention mechanism of the Enhanced LSTM model, and extracts the most relevant features, and the encoding process is as follows:

a. when a given multi-dimensional time series dataset (X, y):

wherein X represents input data of one sample,

Representing the dimension of the input time series data,

The mark of the sample is shown,

Adopts a one-hot coding mode,

Indicating a normal probability of recognition,

Representing a probability of being identified as an anomaly;

b. the input data first passes through an encoding layer of an Enhanced LSTM model attention mechanism, and the q-th input characteristic is assumed to be encoded at the time t, and the encoding process is as follows:

wherein

Representing a parameter matrix of an output layer of an encoding stage,

Expressed are (weight parameter matrix of middle hidden layer in encoding stage),

Representing the deflection parameter matrix of the middle hidden layer in the encoding stage,

Representing the similarity between the hidden state vector and the q-th feature of the input multi-dimensional time-series data at time t,

The q-th feature of the input sample data at time t,

Indicating the state of the hidden unit at time t-1,

Showing the state of the memory cell at time t-1,

() Representing a hyperbolic tangent activation function;

c. according to the similarity of all the features, the attention weight ratio of the qth feature at the time t is obtained

The formula is as follows:

represents a normalized exponential function, wherein

According toThe attention weight of all the features at the moment t is obtained by the formula

As shown in formula:

，

wherein

Representing a weight ratio of the nth feature;

d. after the weight of each feature of the input multi-dimensional time sequence data is obtained, multiplying the original input X of the sample by the corresponding weight to obtain the input data which is coded at the time t

As shown in formula:

representing the encoded new input samples;

e. the traditional LSTM model is reformed, the states of all the stacked LSTM hidden layers are combined to obtain an Enhanced LSTM model,thidden unit state at time 1 is as follows:

where m is the number of LSTM hidden layers,

representing the hidden unit state of the mth layer neural network;

f. the state of the memory cell at time t-1 is similarly obtained as follows:

wherein

The memory unit state of the mth layer neural network is shown;

g. inputting the encoded data as an internal Enhanced LSTM unit of an Enhanced LSTM neural network model, which is as follows:

wherein p is the number of output layer nerve units of Enhanced LSTM,

Showing the improved LSTM neural network model,

Represents the state of the p-th hidden unit at the time t,

Is the first time of t-1pA hidden unit state

And memory cell state

A tuple of formula:

；

same as that

One tuple at time t.

h. In the decoding phase, the output of the Enhanced LSTM model is used

As input to the decoding layer, the similarity of the hidden unit states at time t is first calculated

As shown in formula:

wherein p is the number of output layer neural units in the coding stage is consistent with the number of Enhanced LSTM output layer hidden units,

Represents the weight coefficient matrix of the middle hidden layer in the decoding stage,

A matrix representing the bias coefficient of the middle hidden layer in the decoding stage,

Representing a decoding stage output layer parameter matrix;

i. and then calculating to obtain a weight ratio according to the similarity of the single output characteristics of the decoding layer and the similarity of all the characteristics of the decoding layer at the time t:

representing the output of the weighted ratio of the p-th hidden layer state of the Enhanced LSTM model;

j. then, the weight ratio and the hidden layer state are output

Multiplying to obtain decoded data

As shown in formula:

；

k. finally, the final output value at time t is obtained by using the full link layer

As shown in formula:

in a further improvement, the physical characteristics of the waveform include a mean value of the waveform sample, a standard deviation of the waveform sample, a covariance of the waveform sample, a rising slope of the waveform sample, a kurtosis of the waveform sample, a root mean square deviation of the waveform sample, a maximum value of the waveform sample, a minimum value of the waveform sample, and an energy of the waveform sample.

In a further development, the mean value of the waveform samples

The description of (A) is as follows:

standard deviation of the waveform samples

The description of (A) is as follows:

covariance of the waveform samples

Description of (1)Comprises the following steps:

kurtosis of the waveform sample

The description of (A) is as follows:

root mean square deviation of the waveform samples

The description of (A) is as follows:

maximum value of the waveform sample

The description of (A) is as follows:

minimum value of the waveform sample

The description of (A) is as follows:

energy of the waveform sample

The description of (A) is as follows:

wherein in the above formula

Representing the waveform sample values in the time domain; n is a frameThe number of sampling points of the data;

representing the sample value at the ith time instant.

In a further improvement, the multidimensional time sequence data of the ECU characteristics are uploaded to an upper computer in a serial port transmission mode.

The invention has the following advantages:

on the basis of fully researching a CAN bus protocol communication mechanism, the invention improves the sampling rate and the characteristic calculation rate to the maximum extent based on the high-speed acquisition processing capacity of the FPGA, combines the advantages of an artificial neural network, designs an Enhanced LSTM neural network model based on an attention mechanism aiming at CAN bus time sequence characteristic data, is used for identifying the working state of a vehicle-mounted ECU, and has stable and reliable performance.

Drawings

FIG. 1 is a vehicle CAN bus attack model of the present invention;

FIG. 2 is a data frame feature extraction algorithm idea framework of the present invention;

FIG. 3 is a block diagram of an ECU anomaly detection algorithm of the present invention;

FIG. 4 is a diagram of an Enhanced LSTM structure;

fig. 5 is a process diagram of an encoding stage of a prediction model.

Detailed Description

The invention is further explained with reference to the drawings and the embodiments.

Example 1:

as shown in fig. 1, in the vehicle-mounted CAN bus attack model, the in-vehicle gateway connects the CAN bus subnetwork and other vehicle-mounted communication networks, the ECU 2 and the ECU 3 are ECUs in a normal operating state, the ECU 1 is an abnormal ECU which CAN be an external intrusion ECU or controlled by a malicious attacker, and CAN transmit abnormal frames to the bus to realize attacks such as eavesdropping, disguising, replaying, counterfeiting and the like. Because data transmission on the CAN bus has a broadcasting characteristic and does not have any encryption and authentication mechanism, the ECUs which CAN be mounted on the same CAN subnet CAN receive all data frames on a channel, and the transmitted data frames CAN not effectively verify the identity information of a sender, so that the clamped ECU 1 CAN eavesdrop and disguise as other normal ECUs to transmit data information to implement replay attack. In addition, the arbitration scheme of the CAN bus is that the frame rate of the high priority is transmitted first, and if the ECU 1 continues to transmit the information of the high priority, the channel is blocked.

The detection principle is as follows: the vehicle-mounted ECU is an independent module, generally has an independent clock domain and a CAN transceiver, the circuit structure of the vehicle-mounted ECU is also quite different, and the clock circuit and data frames sent to the CAN bus have unique fingerprint characteristics due to different degrees of crosstalk characteristics generated by the circuit. The physical fingerprint characteristics of the data frame sent by the ECU are detected as the identification ID of the ECU.

The method comprises the steps of firstly, training and identifying the physical fingerprint characteristics of the original vehicle-mounted ECU of the CAN bus through a neural network, enabling the physical fingerprint characteristics to be in one-to-one correspondence with IDs in data frames, wherein when an external ECU invades, the physical fingerprint characteristics cannot be identified and CAN be judged as abnormal ECUs, and when an internal ECU is clamped and attacked, the physical fingerprint characteristics cannot be matched with the IDs and CAN also be identified as abnormal ECUs. By using the method, a series of attack behaviors caused by the invasion of the external ECU on the CAN bus or the clamping of the internal ECU CAN be effectively avoided.

As shown in fig. 2, the data frame feature extraction algorithm idea framework is that the CAN bus communication protocol includes two data lines, the signals of which are differential signals, the differential data lines are connected to a 14-bit high-speed ADC data acquisition circuit module through a connection terminal, the FPGA performs data transmission with the ADC data acquisition module through a wired communication protocol to acquire ADC sampling data, the FPGA performs analysis of ADC waveform data time sequence frame ID and extraction of waveform physical features inside the FPGA, and the extracted features are shown in table 1 below. Each frame of data has a CAN ID identifier, the CAN ID and the extracted features are packaged and transmitted to the upper computer in a serial port transmission mode to serve as the basis of next physical fingerprint feature identification.

TABLE 1 characterization and description thereof

Feature(s)	Description of the invention
		Mean value
Standard deviation of
		Covariance
Rising slope
		Kurtosis
Root mean square difference
		Maximum value
Minimum value
		(Energy)

Representing the waveform sample values in the time domain; n is the number of sampling points of one frame data.

As shown in a framework diagram of an ECU anomaly detection algorithm shown in fig. 3, a PC upper computer receives a data packet collected and processed by an FPGA terminal to obtain time series data of fingerprint features of a vehicle-mounted ECU, 9 features are extracted from each frame of data, the data has multidimensional characteristics, aiming at the characteristics of the time series data, an Enhanced LSTM neural network model based on an attention mechanism is designed for feature recognition, the attention mechanism CAN be used for distinguishing the weight ratio of data with different dimensions to a prediction result, the LSTM (long-term memory network) model has a memory unit and is suitable for processing the time series data with a long-term effect, for a CAN data frame, the waveform physical characteristics of the CAN data frame are influenced by the integral effect of a hardware circuit and have correlation in time, so that the CAN be particularly suitable for processing by using the LSTM model, and researches show that the depth of the LSTM network is more important than the number of the memory units in improving the learning ability of the model, the attention-based LSTM neural network model designed herein is therefore particularly well suited for feature recognition anomaly classification of multi-dimensional time-series data.

The Enhanced LSTM structure shown in fig. 4 is composed of a plurality of basic LSTMs, and is different from the Stacked LSTM structure, and the Enhanced LSTM structure binds the hidden unit state and the memory state of each layer of LSTM unit at a certain time, so that the hidden unit state information and the memory state information of the current layer can be fully utilized, and meanwhile, the hidden unit state information and the memory state information of all layers except the current layer are used as auxiliary input information, thereby improving the prediction capability of the model network on time series data.

As shown in fig. 5, the encoding stage of the prediction model, the Enhanced LSTM neural network model based on attention mechanism proposed herein is mainly composed of two parts, i.e. an encoding stage and a decoding stage, each of which takes the Enhanced LSTM proposed herein as a basic encoder and decoder. As shown in fig. 5, the encoding stage of the prediction model is mainly to calculate respective weights for input multidimensional time series data features by using an attention mechanism, so as to extract the most relevant features; the decoding stage mainly performs attention weight distribution on different units of the hidden layer generated in the encoding stage to extract the most relevant hidden units. The main process is as follows:

in the encoding process, given a multi-dimensional time-series dataset (X, y):

wherein

N is the characteristic quantity of the input multi-dimensional time sequence data,

and representing the classification of the target features at the time t, and adopting a one-hot coding form.

Next, the input data first passes through an encoding layer of an attention mechanism, and it is assumed that the qth input feature is encoded at time t, and the encoding process is as follows:

wherein

All are parameter matrixes obtained by model training,

representing the similarity of the hidden state vector to the q-th feature of the input multi-dimensional time series data at time t. According to the similarity of all the features, the attention weight ratio of the qth input feature at the time t can be obtained according to the formula:

according to the above method, attention weights of all features at time t can be obtained as follows:

after the weight of each feature of the input multi-dimensional time sequence data is obtained, multiplying the original data by the corresponding weight, thereby obtaining the input data which is coded at the time t as shown in the formula:

next, the encoded data is used as the input of the Enhanced LSTM model part, so as to obtain the state of the hidden unit at the time t

It is taken as input to the decoding stage:

wherein

Is the first time of t-1pA hidden unit state

And memory cell state

A tuple of formula:

the conventional LSTM model is reconstructed, and the states of all stacked LSTM hidden layers are combined to obtain the hidden unit state at the t-1 moment as follows:

where m is the number of LSTM hidden layers.

The state of the memory cell at time t-1 can also be obtained as follows:

in the decoding stage, the output of the Enhanced LSTM model is used as the input of the decoding layer, and similar to the idea of the encoding stage, the similarity of the hidden unit state at time t needs to be calculated first, as shown in the following formula:

where p is the number of output layer neural units of Enhanced LSTM.

And then calculating according to the similarity of the single dimension and the similarity of all dimensions to obtain a weight ratio:

then, the weight ratio and the hidden layer state are output

The multiplication may result in decoded data. As shown in formula:

finally, the final output value at time t is obtained by using the full link layer, as:

the above process is the entire process of attention-based Enhanced LSTM neural network model anomaly identification presented herein.

While embodiments of the invention have been disclosed above, it is not limited to the applications set forth in the description and embodiments, which are fully applicable to various fields of endeavor for which the invention is intended, and further modifications may readily be effected therein by those skilled in the art, without departing from the general concept defined by the claims and their equivalents, which are to be limited not to the specific details shown and described herein.

Claims

1. An ECU abnormality recognition method characterized by comprising the steps of:

l2: the multidimensional time series data of the ECU fingerprint features are encoded and decoded in an Enhanced LSTM neural network model, so that the identification of the multidimensional time series data of the ECU fingerprint features is completed, and an ECU fingerprint feature identification result is obtained;

the Enhanced LSTM neural network model comprises an encoding stage and a decoding stage, wherein the encoding stage is used for calculating respective weights of input multi-dimensional time series data characteristics by using an attention mechanism of the Enhanced LSTM model and extracting the most relevant characteristics, and the encoding process is as follows:

a. when a given multi-dimensional time series dataset (X, y):

wherein X represents input data of one sample,

representing the dimensions of the input multi-dimensional time series data,

what is shown is the identity of the sample,

the one-hot coding form is adopted,

indicating the probability of being identified as normal,

representing a probability of being identified as an anomaly;

wherein

Representing a parameter matrix of an output layer of an encoding stage,

Representing the weight parameter matrix of the middle hidden layer in the encoding stage,

The q-th feature of the input sample data at time t,

Indicating the state of the hidden unit at time t-1,

Showing the state of the memory cell at time t-1,

() Representing a hyperbolic tangent activation function;

The formula is as follows:

represents a normalized exponential function, wherein

According to the formula, the attention weight of all the features at the moment t is obtained

As shown in formula:

，

wherein

Representing a weight ratio of the nth feature;

As shown in formula:

representing the encoded new input samples;

e. the traditional LSTM model is modified, the states of all stacked LSTM hidden layers are combined to obtain an Enhanced LSTM model, and the hidden unit state at the t-1 moment is as follows:

where m is the number of LSTM hidden layersThe amount of the compound (A) is,

representing the hidden unit state of the mth layer neural network;

f. the state of the memory cell at time t-1 is similarly obtained as follows:

wherein

The memory unit state of the mth layer neural network is shown;

wherein p is the number of output layer nerve units of Enhanced LSTM,

Showing the improved LSTM neural network model,

Represents the state of the p-th hidden unit at the time t,

Is the first time of t-1pA hidden unit state

And memory cell state

A tuple, e.g. ofFormula (II):

；

same as that

Is a tuple at time t;

h. in the decoding phase, the output of the Enhanced LSTM model is used

As shown in formula:

Representing a decoding stage output layer parameter matrix;

j. then, the weight ratio and the hidden layer state are output

Multiplying to obtain decoded data

As shown in formula:

；

k. finally, the final output value at time t is obtained using the full link layer

As shown in formula:

indicating the probability that the fingerprint feature of the ECU input at the time t is normal

And probability of being an anomaly

。

2. The method of claim 1, wherein the Enhanced LSTM neural network model is sequentially divided into an input layer, a hidden layer, and an output layer.

3. The ECU abnormality recognition method according to claim 1, wherein the waveform physical characteristics include a mean value of the waveform samples, a standard deviation of the waveform samples, a covariance of the waveform samples, a rising slope of the waveform samples, a kurtosis of the waveform samples, a root mean square deviation of the waveform samples, a maximum value of the waveform samples, a minimum value of the waveform samples, and an energy of the waveform samples.

4. The ECU abnormality recognition method according to claim 3, characterized in that the mean value of the waveform samples

The description of (A) is as follows:

standard deviation of the waveform samples

The description of (A) is as follows:

covariance of the waveform samples

The description of (A) is as follows:

kurtosis of the waveform sample

Is describedThe method comprises the following steps:

root mean square deviation of the waveform samples

The description of (A) is as follows:

maximum value of the waveform sample

The description of (A) is as follows:

minimum value of the waveform sample

The description of (A) is as follows:

energy of the waveform sample

The description of (A) is as follows:

wherein in the above formula

Representing the waveform sample values in the time domain; n is the number of sampling points of one frame of data;

represents the ithThe sampled value of the moment.

5. The ECU abnormality recognition method according to claim 1, wherein the multidimensional time series data of the ECU characteristics are uploaded to an upper computer in a serial port transmission mode.