CN108134780B

CN108134780B - Intelligent home security equipment safety judgment method based on improved decision tree algorithm

Info

Publication number: CN108134780B
Application number: CN201711319190.3A
Authority: CN
Inventors: 彭大芹; 项磊; 李司坤; 谢金凤
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Chongqing University of Post and Telecommunications
Priority date: 2017-12-12
Filing date: 2017-12-12
Publication date: 2021-03-16
Anticipated expiration: 2037-12-12
Also published as: CN108134780A

Abstract

The invention relates to a method for judging the security of intelligent home security equipment based on an improved decision tree algorithm, and belongs to the technical field of network information security of intelligent home security equipment. The method comprises the steps of grabbing a Pcap data packet, analyzing the data packet, training a decision tree model, extracting control command data, forging transmission data, realizing control over intelligent home security equipment and further judging whether the intelligent home security equipment is safe or not. The modeling of the invention fully utilizes the mainstream intelligent household security equipment in the market, and can provide reliable technical support for consumers to judge the safety of the intelligent security equipment in the market.

Description

Intelligent home security equipment safety judgment method based on improved decision tree algorithm

Technical Field

The invention belongs to the technical field of network information security of intelligent home security equipment, and relates to a security judgment method of intelligent home security equipment based on an improved decision tree algorithm.

Background

In recent years, with the rapid development of the Internet of Things, smart homes and intelligent security products have become popular with consumers. To meet the needs of different consumer groups, a wide variety of smart home products have appeared, and some manufacturers do not hesitate to lower product quality in order to cut costs. Investigation shows that a number of unencrypted intelligent security products are on the market, so the security of such equipment is insufficient and cannot give consumers a strong guarantee.

Disclosure of Invention

In view of this, the invention aims to provide an intelligent home security equipment security judgment method based on an improved decision tree algorithm, which identifies the security of intelligent security equipment and provides a good help for users to select products.

In order to achieve the purpose, the invention provides the following technical scheme:

the intelligent home security equipment safety judgment method based on the improved decision tree algorithm comprises the following steps:

s1: constructing a wifi environment, controlling the intelligent home security equipment with a mobile phone, and capturing the Pcap data packets generated while the equipment is being controlled;

s2: analyzing data carried in a TCP protocol of a transmission layer in the captured Pcap data packet, and filtering the Pcap data packet;

s3: generating a training set and a test set of the decision tree according to the obtained and analyzed result;

s4: training a decision tree model by using a training set, and checking the decision tree model by using a test set to determine an improved decision tree model;

s5: and judging whether the control command data in the Pcap data packet is encrypted or not by using the trained improved decision tree model, if so, judging that the safety of the intelligent home security equipment is high, and if not, judging that the safety of the intelligent home security equipment is low.

Further, step S1 specifically includes the following steps:

s11: starting a wifi hotspot through a personal computer;

s12: connecting a mobile phone and intelligent security equipment to the wifi hotspot;

s13: logging in to the APP on the mobile phone to arm or disarm the intelligent security equipment, and opening the Wireshark software to capture the Pcap data packets.

Further, in step S13 the intelligent home security equipment to be controlled is selected as required during capture, and the capture time is kept longer than 20 minutes.

Further, step S2 specifically includes the following steps:

s21: filtering out non-TCP protocol data frames in the Pcap data packet;

s22: checking whether the length of the data carried in each TCP data frame is greater than 0, and filtering out data frames whose data length is not greater than 0;

s23: acquiring the timestamps in the Pcap data packet, calculating the time difference between each two adjacent frames, and filtering out data frames whose time difference is not fixed;

s24: parsing the remaining data frames of the Pcap data packet, and recording each IP and the corresponding data length.
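Steps S21 to S24 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the `Frame` record, the grouping of frames into flows by source and destination IP, the `tolerance` used to decide whether an interval counts as "fixed", and the requirement of at least two intervals per flow are all hypothetical choices that the source does not specify.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    """Hypothetical, already-decoded view of one captured frame."""
    timestamp: float  # seconds, taken from the Pcap record header
    protocol: str     # transport-layer protocol, e.g. "TCP" or "UDP"
    src_ip: str
    dst_ip: str
    payload: bytes    # data carried by the transport segment


def filter_frames(frames, tolerance=0.5):
    """Apply S21-S24 and return (src_ip, dst_ip, data_length) records."""
    # S21 and S22: keep only TCP frames that actually carry data.
    tcp = [f for f in frames if f.protocol == "TCP" and len(f.payload) > 0]

    # S23: group frames by IP pair (an assumption) and order them by time.
    flows = defaultdict(list)
    for f in sorted(tcp, key=lambda f: f.timestamp):
        flows[(f.src_ip, f.dst_ip)].append(f)

    kept = []
    for flow in flows.values():
        gaps = [b.timestamp - a.timestamp for a, b in zip(flow, flow[1:])]
        # A flow is kept only if its adjacent-frame gaps are (nearly) constant.
        if len(gaps) >= 2 and max(gaps) - min(gaps) <= tolerance:
            kept.extend(flow)

    # S24: record the IPs and the corresponding data length.
    return [(f.src_ip, f.dst_ip, len(f.payload)) for f in kept]
```

With a capture that contains a 30-second heartbeat, a UDP frame, an empty TCP acknowledgement, and an irregular TCP flow, only the heartbeat flow survives the filter.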

Further, step S4 specifically includes the following steps:

s41: the total number of samples in the training set and the test set is assumed to be N, and each sample comprises M characteristic attributes;

s42: randomly extracting N1 samples from N samples to be used as training sets, and taking the rest N-N1 samples to be used as testing sets;

s43: generating a decision tree T according to N1 samples of the training set;

s44: evaluating the accuracy of the decision tree T with the remaining N-N1 test samples; if T classifies them accurately, outputting T as the decision tree model; if not, exchanging the misclassified test samples with an equal number of training samples to form a new test set and a new training set, and repeating step S43 until the decision tree T classifies accurately.
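The loop in S41 to S44 can be illustrated with a small pure-Python sketch. Several details here are assumptions rather than part of the source: the tree is grown with a Gini impurity criterion and numeric thresholds, `max_depth` and `max_rounds` are hypothetical safeguards that guarantee termination, and the training samples exchanged for misclassified test samples are simply taken from the front of the training list.

```python
import random
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def build_tree(samples, depth=0, max_depth=5):
    """Grow a binary tree from (features, label) pairs (S43)."""
    labels = [y for _, y in samples]
    majority = Counter(labels).most_common(1)[0][0]
    if depth == max_depth or len(set(labels)) == 1:
        return {"leaf": majority}
    best = None
    for j in range(len(samples[0][0])):
        for t in sorted({x[j] for x, _ in samples}):
            left = [s for s in samples if s[0][j] <= t]
            right = [s for s in samples if s[0][j] > t]
            if not left or not right:
                continue
            score = (len(left) * gini([y for _, y in left])
                     + len(right) * gini([y for _, y in right])) / len(samples)
            if best is None or score < best[0]:
                best = (score, j, t, left, right)
    if best is None:
        return {"leaf": majority}
    _, j, t, left, right = best
    return {"feature": j, "threshold": t,
            "left": build_tree(left, depth + 1, max_depth),
            "right": build_tree(right, depth + 1, max_depth)}


def predict(tree, x):
    """Walk the tree until a leaf is reached."""
    while "leaf" not in tree:
        branch = "left" if x[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree["leaf"]


def train_with_swaps(samples, n1, max_rounds=20, seed=0):
    """S42-S44: split, train, test, and swap errors into the training set."""
    rng = random.Random(seed)
    pool = samples[:]
    rng.shuffle(pool)
    train, test = pool[:n1], pool[n1:]   # requires n1 < len(samples)
    tree, accuracy = None, 0.0
    for _ in range(max_rounds):
        tree = build_tree(train)
        wrong = [i for i, (x, y) in enumerate(test) if predict(tree, x) != y]
        accuracy = 1.0 - len(wrong) / len(test)
        if not wrong:
            break
        # Exchange misclassified test samples with as many training samples.
        k = min(len(wrong), len(train))
        swap = set(wrong[:k])
        errors = [test[i] for i in wrong[:k]]
        test = [s for i, s in enumerate(test) if i not in swap] + train[:k]
        train = train[k:] + errors
    return tree, accuracy
```

A call such as `train_with_swaps(samples, n1=8)` returns the last tree together with its accuracy on the test set of that round; if the round limit is reached, the accuracy may be below 1.0 and the caller has to decide whether the tree is usable.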

Further, step S5 specifically includes the following steps:

s51: control command data in the Pcap data packet are obtained twice continuously;

s52: matching character strings of the control command data captured twice one by one, and comparing the control command data captured twice;

s53: if fewer than 10 bytes differ between the two captures of the control command data, and TCP data forged by the personal computer can control the intelligent home security equipment, the control command data is judged to be unencrypted and the safety is low;

if 10 or more bytes differ between the two captures of the control command data, and TCP data forged by the personal computer cannot control the intelligent home security equipment, the control command data is judged to be encrypted and the safety is high.
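The comparison in S51 to S53 can be sketched as follows. The function names are hypothetical, a difference in length is counted as changed bytes (the source does not say how captures of unequal length are handled), and the "inconclusive" result for the two combinations the source does not cover is an assumption of this sketch.

```python
def count_changed_bytes(first: bytes, second: bytes) -> int:
    """Count byte positions that differ between two captured commands."""
    changed = sum(a != b for a, b in zip(first, second))
    # Assumption: bytes present in only one capture count as changed.
    return changed + abs(len(first) - len(second))


def judge_security(first: bytes, second: bytes,
                   replay_succeeded: bool, threshold: int = 10) -> str:
    """Apply the 10-byte rule of S53 together with the forged-data check."""
    changed = count_changed_bytes(first, second)
    if changed < threshold and replay_succeeded:
        return "unencrypted: low security"
    if changed >= threshold and not replay_succeeded:
        return "encrypted: high security"
    # The source defines only the two cases above.
    return "inconclusive"
```

Here `replay_succeeded` stands for the outcome of the manual experiment in which the personal computer sends forged TCP data to the equipment; the sketch does not perform that experiment itself.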

The invention has the beneficial effects that: the method provided by the invention can help consumers to accurately judge the safety of the intelligent home security equipment on the market on one hand, and can also provide technical support for a specific government department to supervise the intelligent home market on the other hand.

Drawings

In order to make the object, technical scheme and beneficial effect of the invention more clear, the invention provides the following drawings for explanation:

FIG. 1 is a flow chart of the present invention;

FIG. 2 is a schematic diagram of an IP filtered Pcap packet;

FIG. 3 is a diagram illustrating a basic format of a captured Pcap file;

FIG. 4 is a diagram of an improved decision tree model for extracting control commands for smart home security devices.

Detailed Description

Preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

The safety of the equipment is judged from the control commands of the intelligent home security equipment captured in the wifi environment. The control commands depend directly on the brand of the equipment, so the control commands must be captured in order to judge whether a given brand is worth purchasing.

The invention is further described below with reference to an embodiment and the accompanying drawings.

The embodiment is a method for identifying the safety of intelligent home security equipment based on an improved decision tree algorithm. FIG. 1 is a flow chart of the entire identification method.

1. Capturing the Pcap data packets of the intelligent home security equipment in the wifi environment.

2. Filtering the Pcap data packets to determine which intelligent home security devices are present in the environment.

3. Establishing a decision tree model, and analyzing and identifying the Pcap data packets.

A personal computer opens a wifi hotspot to simulate a router, and the mobile phone, the computer, and the intelligent security equipment are all connected to this hotspot. The mobile phone logs in to the APP of the intelligent security equipment (brands such as sharp and Xiaomi) and arms or disarms the equipment, while the Wireshark software is opened to capture the data packets. During capture, the intelligent home security equipment to be controlled is selected as required, and the capture time is kept longer than 20 minutes. The captured data packets are then divided into the training set and the test set of the decision tree model.

The IP of the intelligent home security equipment is identified among the IPs connected to the router by using the characteristic that the equipment interacts with its server at regular intervals, as shown in FIG. 2. The heartbeat data of the device is clearly visible there: it is sent repeatedly and its data length is greater than 0.

The structure of the Pcap packet is shown in FIG. 3. The timestamp carried in each data packet is located according to the Pcap structure, and from these timestamps the heartbeat interval of each intelligent home security device is found.
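As an illustration of locating the timestamps, the sketch below reads the record headers of a classic Pcap file with Python's `struct` module. It assumes the classic microsecond-resolution Pcap layout (a 24-byte global header followed by 16-byte record headers), not the newer pcapng format; the function name is hypothetical, and whether FIG. 3 shows exactly this layout cannot be confirmed from the text.

```python
import struct


def read_pcap_records(data: bytes):
    """Return (timestamp_seconds, frame_bytes) for each record in a pcap file."""
    magic = data[:4]
    if magic == b"\xd4\xc3\xb2\xa1":
        endian = "<"   # file written on a little-endian machine
    elif magic == b"\xa1\xb2\xc3\xd4":
        endian = ">"   # file written on a big-endian machine
    else:
        raise ValueError("not a classic microsecond-resolution pcap file")

    offset = 24        # skip the global header
    records = []
    while offset + 16 <= len(data):
        # Record header: seconds, microseconds, captured length, original length.
        ts_sec, ts_usec, incl_len, _orig_len = struct.unpack_from(
            endian + "IIII", data, offset)
        offset += 16
        frame = data[offset:offset + incl_len]
        records.append((ts_sec + ts_usec / 1_000_000, frame))
        offset += incl_len
    return records
```

Subtracting the timestamps of consecutive records that belong to the same device then gives a candidate heartbeat interval.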

A decision tree model is built from the captured data to analyze whether the Pcap data packet contains a control command, as shown in FIG. 4. The decision tree model is a binary tree. The total duration of the Pcap file is the difference between the timestamps of its first and last frames. The transport layer protocol and frame data size (Frame_Data_size) nodes obtain the transport protocol and the data size of each frame in the Pcap file. After this decision filtering, the timestamp of each frame is extracted, the frames are classified by IP address, the timestamp differences are calculated, and the source IP and destination IP with a fixed timestamp difference are found. The source IP and destination IP are then exchanged, data frames carrying information are searched for in the data packet, and whether a data frame is a control command is judged according to whether its data is repeated excessively.
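Two of the decisions described above lend themselves to a short sketch: the total duration of the capture and the separation of heartbeat data from candidate control commands by how often a payload repeats. The function names and the `max_repeats` cut-off are hypothetical; the source only says that excessive repetition is the criterion and gives no number.

```python
from collections import Counter


def capture_duration(timestamps):
    """Total duration: difference between the last and first frame timestamps."""
    return max(timestamps) - min(timestamps)


def split_heartbeats_and_commands(payloads, max_repeats=3):
    """Treat payloads repeated more than max_repeats times as heartbeat data."""
    counts = Counter(payloads)
    heartbeats = {p for p, c in counts.items() if c > max_repeats}
    # Everything that is not repeated excessively is a candidate command.
    commands = [p for p in payloads if p not in heartbeats]
    return heartbeats, commands
```

The duration can be compared against the 20-minute minimum before the payloads of the device's flow are split; the resulting candidates are what the later encryption comparison operates on.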

In the initial state, the Pcap data packets captured by Wireshark are filtered and classified by software. The decision tree algorithm model is trained with the training set, and the trained decision tree is verified with the test set. The training set is obtained by testing various mainstream products on the market (EZVIZ, sharpening, Xiaomi and the like) and is used to train the decision tree model; data packets of related products serve as the test set to test the accuracy of the decision tree. The results are then compared, the decision tree algorithm is adjusted according to the errors, and the tree with the highest accuracy is selected as the final decision tree. The specific steps of the improved algorithm are as follows:

1) N samples are assumed, and each sample comprises M characteristic attributes;

2) randomly extracting N1 samples from N samples to be used as training sets, and taking the rest N-N1 samples to be used as testing sets;

3) generating a decision tree T from the N1 samples of the training set;

4) evaluating the accuracy of the decision tree T with the N-N1 test samples; if T classifies them accurately, outputting the decision tree model; if not, exchanging the misclassified test samples with an equal number of training samples to form a new test set and a new training set, and returning to step 3) until the decision tree T classifies accurately.

Based on the analysis of the Pcap file structure, the data analysis of intelligent security equipment, and the capture of Pcap data packets in a specific smart home environment, a decision tree algorithm model can be established to determine from the captured Pcap data packets whether intelligent security equipment is present. If so, the control commands of the equipment, that is, its arming and disarming commands, are found according to the characteristics of the equipment, and the security level of the equipment is judged from them. Two control commands are obtained in succession and compared with a comparison function to judge whether the data is encrypted, that is, to judge the complexity of the data. If fewer than 10 bytes differ between the two captures and the computer can control the equipment by forging TCP data, the data is judged to be unencrypted and the safety is low; otherwise the data is judged to be encrypted.

Finally, it is noted that the above-mentioned preferred embodiments illustrate rather than limit the invention, and that, although the invention has been described in detail with reference to the above-mentioned preferred embodiments, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the scope of the invention as defined by the appended claims.

Claims

1. The intelligent home security equipment safety judgment method based on the improved decision tree algorithm is characterized in that the method comprises the following steps:

s4: training a decision tree model by using a training set, and checking the decision tree model by using a test set to determine an improved decision tree model; the method specifically comprises the following steps:

s43: generating a decision tree T according to N1 samples of the training set;

s44: judging the accuracy of the decision tree T by using N-N1 samples of the remaining test sets, outputting the decision tree T as a decision tree model if the accuracy can be judged, replacing error data with samples of an equal amount of training sets to form a new test set and a new training set if the accuracy cannot be judged, and repeating the step S43 until the decision tree T can be accurately judged;

s5: judging whether control command data in the Pcap data packet is encrypted or not by using the trained improved decision tree model, if so, judging that the safety of the intelligent home security equipment is high, and if not, judging that the safety of the intelligent home security equipment is low; the method specifically comprises the following steps:

2. The intelligent home security device safety judgment method based on the improved decision tree algorithm as claimed in claim 1, wherein: step S1 specifically includes the following steps:

s11: starting a wifi hotspot through a personal computer;

3. The intelligent home security device safety judgment method based on the improved decision tree algorithm as claimed in claim 2, wherein: in the step S13, the intelligent home security equipment is selectively controlled according to the requirement in the grabbing process, and meanwhile the grabbing time is ensured to be more than 20 minutes.

4. The intelligent home security device safety judgment method based on the improved decision tree algorithm as claimed in claim 2, wherein: step S2 specifically includes the following steps:

s21: filtering out non-TCP protocol data frames in the Pcap data packet;