CN115801604A - Method for predicting network flow characteristic value - Google Patents
- Publication number
- CN115801604A (application number CN202310101070.5A)
- Authority
- CN
- China
- Prior art keywords
- network flow
- time sequence
- characteristic value
- model
- predicting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/50—Reducing energy consumption in communication networks in wire-line communication networks, e.g. low power modes or reduced link rate
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a method for predicting a network flow characteristic value, comprising: obtaining a network flow in an Internet of Things network; extracting a characteristic value from the network flow and obtaining a time series based on the characteristic value; testing the time series for randomness; and, based on the test result, predicting the time series with an ARIMA model or a normal distribution to obtain a network flow characteristic value prediction result.
Description
Technical Field
The invention relates to the technical field of Internet of Things device identification, and in particular to a method for predicting a network flow characteristic value.
Background
With the development of the Internet of Things ecosystem, there is a growing trend toward characterizing and fingerprinting Internet of Things devices. A fingerprint identification model needs a network flow feature vector to make its judgment, but the feature vector can only be extracted from a complete network flow. The feature vector therefore cannot be obtained in real time, and the Internet of Things device cannot be identified correctly.
To ensure accurate identification of Internet of Things devices, how to accurately obtain the feature vector of a network flow, namely the network flow characteristic value, is an urgent problem to be solved.
Disclosure of Invention
To solve the problems in the prior art, the invention provides a method for predicting a network flow characteristic value, which can accurately obtain the feature vector of a network flow.
To achieve this technical purpose, the invention provides the following technical solution: a method for predicting network flow characteristic values, comprising the following steps:
acquiring a network flow in an Internet of Things network; extracting a characteristic value of the network flow, and obtaining a time series based on the characteristic value;
testing the time series to judge whether it is sufficiently random; and, based on the test result, predicting the time series with an ARIMA model or a normal distribution to obtain a network flow characteristic value prediction result.
Optionally, the network flow acquired in the Internet of Things network contains fewer data packets than the actual number of data packets in the network flow; that is, the network flow is an incomplete network flow.
Optionally, after the randomness of the time series is tested, the time series is predicted with an ARIMA model if it is not sufficiently random, and with a normal distribution if it is sufficiently random.
Optionally, the specific process of testing the time series includes:
constructing and initializing a lag value, and iteratively updating the lag value until it reaches a preset condition;
performing significance analysis on the time series, based on the lag value obtained when updating stops, to obtain a statistic;
and comparing the statistic with a preset threshold, wherein the time series is sufficiently random when the statistic is smaller than the preset threshold, and is otherwise not sufficiently random.
Optionally, the preset condition is:
where $h$ is the lag value, $\lfloor\cdot\rfloor$ is the round-down (floor) function, and $n$ is the number of network flow packets.
Optionally, the process of predicting the time series with the ARIMA model includes:
establishing an ARIMA model, inputting the characteristic value time series into the ARIMA model for parameter fitting to obtain model parameters, substituting the model parameters into the ARIMA model to obtain an updated ARIMA model, and inputting the characteristic value sequence number $t$ into the updated ARIMA model to obtain the predicted characteristic value $\hat{x}_t$, namely the network flow characteristic value prediction result:
$$\left(1 - \sum_{i=1}^{p} \phi_i L^i\right)(1 - L)^D\,\hat{x}_t = c + \left(1 + \sum_{i=1}^{q} \theta_i L^i\right)\varepsilon_t$$
where $p$ is the order of the autoregressive model, $D$ is the degree of differencing, $q$ is the order of the moving average model, $L$ is the lag operator, $L^i$ is the $i$-th lag operator, $\phi_i$ are the parameters of the autoregressive model, $\theta_i$ are the parameters of the moving average model, $\varepsilon_t$ is the error term, and $c$ is the constant term.
Optionally, the process of predicting the time series with a normal distribution includes: calculating the mean $\mu$ of the characteristic values in the time series; calculating the variance $\sigma^2$ of the characteristic values in the time series; and obtaining the predicted characteristic value $\hat{x}_t$ based on the mean and variance, namely the network flow characteristic value prediction result:
$$\hat{x}_t = \mathrm{Normal}(\mu, \sigma^2)$$
where $\mathrm{Normal}(\mu, \sigma^2)$ returns a random sample from the normal distribution defined by the given mean and variance.
The invention has the following technical effects:
The invention judges whether a network flow in transmission is sufficiently random by analyzing its time series. If the network flow is not sufficiently random, it is predicted with an ARIMA model; if it is sufficiently random, it is predicted with a normal distribution. The feature vector of the network flow can thus be predicted accurately and effectively, so that subsequent work can be carried out better.
Drawings
To illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings needed in the embodiments are briefly described below. The drawings described below show only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without inventive effort.
Fig. 1 is a schematic flow chart of a method according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in Fig. 1, the invention provides a method for predicting a network flow characteristic value. The method captures a network flow being transmitted in an Internet of Things network, extracts the network flow characteristic values and their time series, and selects a prediction method according to the randomness of the characteristic value time series: an ARIMA model if the time series is not sufficiently random, and a normal distribution if it is sufficiently random. The predicted network flow characteristic value is then available for subsequent work.
The overall process comprises the following steps: S1, capturing a network flow to obtain an incomplete network flow; S2, extracting characteristic values and their time series; S3, judging whether the time series is sufficiently random; S4, predicting with an ARIMA model when the time series is not sufficiently random; and S5, predicting with a normal distribution when the time series is sufficiently random.
The method comprises the following specific steps:
S1. Capture a network flow being transmitted in the Internet of Things network, and record the captured network flow as $F = \{p_1, p_2, \ldots, p_n\}$, where $p_i$ is the $i$-th data packet, $M$ is the actual number of packets in the flow, $n$ is the number of packets captured, and $M - n$ is the difference between the actual number of packets and the number of packets captured. The result is a partial network flow.
S2. Extract the characteristic values and their time series.
S21. Extract the feature vector, i.e. the characteristic values, of each packet in the captured network flow $F$ and store it in $V$, expressed as follows:
$$V = \{f_1(F), f_2(F), \ldots, f_r(F)\}$$
where $r$ is the number of characteristic values, determined by the characteristic values selected, and $f_1, \ldots, f_r$ are the extraction functions for the different characteristic values, so that $r$ characteristic values are extracted in total.
S22. From the extracted characteristic values, compute the time series of each characteristic value, recorded as $X = \{x_1, x_2, \ldots\}$.
Note: taking the average packet arrival time as an example, the value is computed for each packet in turn, and so on, so that the time series of each characteristic value is obtained.
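Step S22 can be sketched in a few lines of Python. The snippet below is only an illustration under stated assumptions: it takes the mean packet inter-arrival time as the characteristic value and treats the $i$-th entry of the time series as the running mean over the first $i$ gaps. The function names and the sample timestamps are hypothetical and do not come from the patent.

```python
from itertools import accumulate


def interarrival_times(timestamps):
    """Gaps between consecutive packet arrival times (same unit as the input)."""
    return [later - earlier for earlier, later in zip(timestamps, timestamps[1:])]


def running_mean_series(values):
    """Return the series whose i-th entry is the mean of the first i values."""
    return [total / i for i, total in enumerate(accumulate(values), start=1)]


# Hypothetical arrival timestamps (seconds) of the captured packets.
timestamps = [0.00, 0.10, 0.30, 0.60, 1.00]
gaps = interarrival_times(timestamps)
series = running_mean_series(gaps)  # one time-series entry per observed gap
```

Each newly captured packet appends one entry to the series, which is why the series can be built from an incomplete flow.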
S3. Detect whether the characteristic value time series is sufficiently random, using the Ljung-Box test for the randomness of the time series.
The lag value $h$ used in the test is set by the preset condition, where $\min$ returns the minimum of its parameters and $\lfloor\cdot\rfloor$ is the round-down (floor) function.
S321. Compute the test statistic $Q$, which is used to judge whether the time series is sufficiently random:
$$Q = n(n+2)\sum_{k=1}^{h}\frac{\hat{\rho}_k^{2}}{n-k}$$
where $n$ is the size of the time series of flow characteristic values, $\hat{\rho}_k$ is the autocorrelation at lag $k$, and $h$ is the lag value used in the test.
Note: the threshold is an empirical value that can be set as needed; the invention sets it to 0.05.
When the statistic is not smaller than the preset threshold, the time series $X$ is not sufficiently random and future characteristic values are predicted with an ARIMA model; when the statistic is smaller than the preset threshold, the time series $X$ is sufficiently random and future characteristic values are predicted with a normal distribution.
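The randomness check in step S3 can be illustrated with a small pure-Python sketch of the standard Ljung-Box statistic, $Q = n(n+2)\sum_{k=1}^{h} \hat{\rho}_k^2/(n-k)$. The function names are hypothetical. The `threshold` argument is left to the caller: reading the 0.05 setting as a significance level and comparing $Q$ with the chi-square critical value for $h$ degrees of freedom (for example 5.991 for $h = 2$) is an assumption of this sketch, not something the text states.

```python
def autocorrelation(x, k):
    """Sample autocorrelation of the series x at lag k."""
    n = len(x)
    mean = sum(x) / n
    denom = sum((v - mean) ** 2 for v in x)
    if denom == 0:
        return 0.0  # a constant series carries no autocorrelation signal
    num = sum((x[t] - mean) * (x[t - k] - mean) for t in range(k, n))
    return num / denom


def ljung_box_q(x, h):
    """Ljung-Box statistic Q over lags 1..h."""
    n = len(x)
    if not 1 <= h < n:
        raise ValueError("h must satisfy 1 <= h < len(x)")
    return n * (n + 2) * sum(
        autocorrelation(x, k) ** 2 / (n - k) for k in range(1, h + 1)
    )


def is_sufficiently_random(x, h, threshold):
    """Follow the text: the series is sufficiently random when Q < threshold."""
    return ljung_box_q(x, h) < threshold


# A steadily increasing series is strongly autocorrelated, so Q is large.
trend = list(range(1, 21))
use_arima = not is_sufficiently_random(trend, h=2, threshold=5.991)
```

In practice a statistics library would normally supply both the statistic and its p-value; the hand-written version only makes the formula concrete.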
S41. Establish the ARIMA model. The predicted characteristic value $\hat{x}_t$ is given by:
$$\left(1 - \sum_{i=1}^{p} \phi_i L^i\right)(1 - L)^D\,\hat{x}_t = c + \left(1 + \sum_{i=1}^{q} \theta_i L^i\right)\varepsilon_t$$
where $p$ is the order of the autoregressive model, $D$ is the degree of differencing, $q$ is the order of the moving average model, $L^i$ is the $i$-th lag operator, $L$ is the lag operator, $\phi_i$ is the $i$-th parameter of the autoregressive model, $\theta_i$ is the $i$-th parameter of the moving average model, $\varepsilon_t$ is the error term, $c$ is the constant term, and $i = 1, 2, \ldots, N$, where $N$ is the number of parameters of the autoregressive or moving average model.
Note: autoregressive model: a method of processing a time series that uses the same variable, for example the previous periods of $x$, i.e. $x_{t-1}$ to $x_{t-p}$, to predict the current period $x_t$, assuming a linear relationship between them;
moving average model: a model in which the current value of the time series is a linear function of the random error term and the lagged error terms.
S42. Input the characteristic value time series $X$ into the ARIMA model and perform parameter fitting to obtain the order $p$ of the autoregressive model, the degree of differencing $D$, and the order $q$ of the moving average model.
S43. Input the sequence number $t$ of the characteristic value to be predicted into the ARIMA model to obtain the predicted characteristic value $\hat{x}_t$, using the formula given in S41.
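To make steps S41 to S43 concrete, the sketch below fits a deliberately simplified special case, ARIMA(1, 1, 0): one autoregressive term, one round of differencing, and no moving-average term, estimated by ordinary least squares. This is an assumption chosen to keep the example dependency-free; it is not the patent's fitting procedure, the function names are hypothetical, and a full ARIMA fit with moving-average terms and order selection would normally come from a time-series library.

```python
def difference(x, d=1):
    """Apply d rounds of first differencing to the series x."""
    for _ in range(d):
        x = [later - earlier for earlier, later in zip(x, x[1:])]
    return x


def fit_ar1(w):
    """Least-squares fit of w_t = c + phi * w_(t-1) + e_t; returns (c, phi)."""
    prev, curr = w[:-1], w[1:]
    mean_prev = sum(prev) / len(prev)
    mean_curr = sum(curr) / len(curr)
    sxx = sum((p - mean_prev) ** 2 for p in prev)
    if sxx == 0:
        return mean_curr, 0.0  # constant differences: only the constant term
    sxy = sum((p - mean_prev) * (q - mean_curr) for p, q in zip(prev, curr))
    phi = sxy / sxx
    return mean_curr - phi * mean_prev, phi


def forecast_next(x):
    """One-step-ahead forecast of x under the simplified ARIMA(1, 1, 0) model."""
    w = difference(x, 1)
    if len(w) < 2:
        raise ValueError("need at least three observations")
    c, phi = fit_ar1(w)
    # Undo the differencing: next value = last value + predicted next difference.
    return x[-1] + c + phi * w[-1]


predicted = forecast_next([1.0, 2.0, 3.0, 4.0, 5.0])
```

For the linear series in the usage line, the differences are constant, so the forecast simply continues the trend.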
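S5. If the time series $X$ is judged sufficiently random, future characteristic values are predicted using a normal distribution.
S51. Compute the mean $\mu$ of the characteristic values in the time series.
S52. Compute the variance $\sigma^2$ of the characteristic values in the time series.
S53. Draw a random sample from the Gaussian distribution defined by $\mu$ and $\sigma^2$ and use it as the future characteristic value $\hat{x}_t$:
$$\hat{x}_t = \mathrm{Normal}(\mu, \sigma^2)$$
where $\mathrm{Normal}(\mu, \sigma^2)$ returns a random sample from the normal distribution defined by the given mean and variance.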
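The normal-distribution branch of step S5 amounts to estimating a mean and a spread from the observed series and drawing one sample. The sketch below uses only the Python standard library; the function name is hypothetical, and the use of the population standard deviation (rather than the sample one) is an assumption, since the text does not say which is meant.

```python
import random
import statistics


def predict_from_normal(series, rng=None):
    """Draw one future value from Normal(mean, variance) of the observed series."""
    if rng is None:
        rng = random.Random()
    mu = statistics.fmean(series)
    sigma = statistics.pstdev(series)  # population standard deviation (assumed)
    return rng.gauss(mu, sigma)


# Passing a seeded generator makes the draw reproducible.
observed = [0.10, 0.12, 0.09, 0.11, 0.10]
prediction = predict_from_normal(observed, random.Random(42))
```

Because the result is a random draw, repeated calls give different predictions unless the generator is seeded.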
The invention extracts the characteristic values and the characteristic value time series of a network flow being transmitted and selects a prediction method according to the degree of randomness of the time series: an ARIMA model if the series is not sufficiently random, and a normal distribution if it is sufficiently random. The predicted characteristic value supports subsequent work such as network flow length prediction and network flow deformation.
Current network flow analysis methods focus mainly on the non-payload characteristics of a network flow, that is, characteristics derived from data other than the data being transmitted. Such derived characteristic values may be called original characteristics, for example the mean, maximum, minimum and standard deviation of the PDU inter-arrival time, derived from the sequence of inter-arrival times between packets.
By extracting the characteristic values and the characteristic value time series of a network flow being transmitted, the invention judges whether the characteristic value time series is sufficiently random. If it is not sufficiently random, an ARIMA model is used to predict it; if it is sufficiently random, a normal distribution is used. Subsequent work, such as network flow length prediction and network flow deformation, can then be carried out better.
The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.
Claims (7)
1. A method for predicting network flow characteristic values is characterized by comprising the following steps:
acquiring a network flow in an Internet of things network; extracting a characteristic value of the network flow, and obtaining a time sequence based on the characteristic value;
testing the time series to judge whether it is sufficiently random; and, based on the test result, predicting the time series with an ARIMA model or a normal distribution to obtain a network flow characteristic value prediction result.
2. The prediction method according to claim 1, characterized in that:
the network flow acquired in the Internet of Things network contains fewer data packets than the actual number of data packets in the network flow; that is, the network flow is an incomplete network flow.
3. The prediction method according to claim 1, characterized in that:
after the randomness of the time series is tested, the time series is predicted with an ARIMA model if it is not sufficiently random, and with a normal distribution if it is sufficiently random.
4. The prediction method according to claim 1, characterized in that:
the specific process of testing the time series comprises the following steps:
constructing and initializing a lag value, and iteratively updating the lag value until it reaches a preset condition;
performing significance analysis on the time series, based on the lag value obtained when updating stops, to obtain a statistic;
and comparing the statistic with a preset threshold, wherein the time series is sufficiently random when the statistic is smaller than the preset threshold, and is otherwise not sufficiently random.
6. The prediction method according to claim 3, characterized in that:
the process of predicting the time series by the ARIMA model comprises the following steps:
establishing an ARIMA model, inputting the characteristic value time series into the ARIMA model for parameter fitting to obtain model parameters, substituting the model parameters into the ARIMA model to obtain an updated ARIMA model, and inputting the characteristic value sequence number $t$ into the updated ARIMA model to obtain the predicted characteristic value $\hat{x}_t$, namely the network flow characteristic value prediction result:
$$\left(1 - \sum_{i=1}^{p} \phi_i L^i\right)(1 - L)^D\,\hat{x}_t = c + \left(1 + \sum_{i=1}^{q} \theta_i L^i\right)\varepsilon_t$$
where $p$ is the order of the autoregressive model, $D$ is the degree of differencing, $q$ is the order of the moving average model, $L$ is the lag operator, $L^i$ is the $i$-th lag operator, $\phi_i$ are the parameters of the autoregressive model, $\theta_i$ are the parameters of the moving average model, $\varepsilon_t$ is the error term, and $c$ is the constant term.
7. The prediction method according to claim 3, characterized in that:
the process of predicting the time series by the normal distribution includes:
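obtaining the predicted characteristic value $\hat{x}_t$ based on the mean $\mu$ and variance $\sigma^2$ of the characteristic values in the time series, namely the network flow characteristic value prediction result:
$$\hat{x}_t = \mathrm{Normal}(\mu, \sigma^2)$$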
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310101070.5A CN115801604B (en) | 2023-02-13 | 2023-02-13 | Prediction method of network flow characteristic value |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310101070.5A CN115801604B (en) | 2023-02-13 | 2023-02-13 | Prediction method of network flow characteristic value |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115801604A (en) | 2023-03-14 |
CN115801604B (en) | 2023-05-02 |
Family
ID=85430836
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310101070.5A Active CN115801604B (en) | 2023-02-13 | 2023-02-13 | Prediction method of network flow characteristic value |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115801604B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002039254A1 (en) * | 2000-11-09 | 2002-05-16 | Spss Inc. | System and method for building a time series model |
CN106990763A (en) * | 2017-04-20 | 2017-07-28 | 浙江大学 | A kind of Vertical Mill operation regulator control system and method based on data mining |
CN109951358A (en) * | 2019-03-21 | 2019-06-28 | 北京交通大学 | Data network method for predicting |
CN111882135A (en) * | 2020-08-05 | 2020-11-03 | 杭州安恒信息技术股份有限公司 | Internet of things equipment intrusion detection method and related device |
CN112929214A (en) * | 2021-02-02 | 2021-06-08 | 北京明朝万达科技股份有限公司 | Model construction method, device, equipment and storage medium |
CN115695046A (en) * | 2022-12-28 | 2023-02-03 | 广东工业大学 | Network intrusion detection method based on reinforcement ensemble learning |
Non-Patent Citations (1)
Title |
---|
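Tian Zhongda; Li Shujiang; Wang Yanhong; Gao Xianwen: "Application of empirical mode decomposition and time series analysis in network traffic prediction" * |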
Also Published As
Publication number | Publication date |
---|---|
CN115801604B (en) | 2023-05-02 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||