CN105183780B - Based on the protocol classification method for improving AGNES algorithms - Google Patents
Based on the protocol classification method for improving AGNES algorithms Download PDFInfo
- Publication number
- CN105183780B CN105183780B CN201510492631.4A CN201510492631A CN105183780B CN 105183780 B CN105183780 B CN 105183780B CN 201510492631 A CN201510492631 A CN 201510492631A CN 105183780 B CN105183780 B CN 105183780B
- Authority
- CN
- China
- Prior art keywords
- cluster
- class cluster
- similarity
- class
- similar
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L69/00—Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
- H04L69/06—Notations for structuring of protocol data, e.g. abstract syntax notation one [ASN.1]
Abstract
The invention discloses a kind of based on the protocol classification method for improving AGNES algorithms, includes the following steps:Input data set, arrange parameter;Initial similarity reference threshold is set;Similarity-rough set is simultaneously clustered, and the cluster result for the condition that meets is put into result set;It reduces and updates similarity reference threshold, be compared cluster again, until similarity reference threshold merges similarity no more than minimum.The present invention provides a kind of based on the protocol classification method for improving AGNES algorithms, the number of cluster can be automatically determined, there are one similarity evaluation index, algorithms can investigate current cluster result in cluster process, the class cluster for the condition that met is extracted in time for each class cluster of gained.
Description
Technical field
The present invention relates to a kind of based on the protocol classification method for improving AGNES algorithms.
Background technology
The network information security and confrontation have become the major issue extremely paid close attention to the information age.In fields such as electronic countermeasures,
It is often used the bit stream that any special measures obtain intercommunication, the communication protocol that general communicating pair uses is customized, non-public
's.In addition, when using protocol analysis tool during network communication, the protocol bits stream that can not be parsed often is encountered;Solution
It is relatively difficult to analyse these totally unknown agreements, but for as fields such as network supervision, information protection, information stealths, knowing
Other unknown protocol is a vital job again;Therefore further identification communication is made from the bit stream sequence obtained
Unknown protocol is an important topic.
A kind of basic ideas of unknown protocol identification at present are for a certain unknown protocol, using data mining and pattern
Matched method finds the feature of the unknown protocol with data digging method, is then carried out with method for mode matching matching characteristic
Identification;Such method on condition that obtain single protocol data frame for study use, single protocol data frame to multi-protocol data frames into
Row cluster obtains, and needs to use hierarchical clustering algorithm, that is, AGNES algorithms.
Traditional AGNES algorithm ideas are:First using each object as a cluster, then according to one step of the criterion of setting
Cluster is merged into increasing cluster by one step, it is known that is met the cluster number intentionally got or other setting conditions, is usually merged
Similarity of the criterion between object between class cluster.
Traditional AGNES algorithms are described as follows:Input:Data set containing c object;The cluster number k intentionally got;
Output:K class cluster;Step:(1) using each object as a class cluster, total c is a;(2)Repeat;(3) according to distance criterion
Definition, finds two most like clusters;(4) merge two most like clusters, obtain the set of new cluster;(5) know and reach finger
The number k of fixed class cluster.
AGNES algorithms are simple, accuracy rate is high, but the algorithm does not have good scalability;Algorithm is in the selection for merging point
It is very crucial, if there is no preferable selection combining point in a certain step, it will have a direct impact on subsequent Clustering Effect.
Invention content
It is an object of the invention to overcome the deficiencies of the prior art and provide a kind of based on the agreement point for improving AGNES algorithms
Class method can automatically determine the number of cluster, and there are one similarity evaluation indexs for each class cluster of gained, and algorithm is poly-
Current cluster result can be investigated in class process, and the class cluster met is extracted in time.
The purpose of the present invention is achieved through the following technical solutions:Based on the protocol classification side for improving AGNES algorithms
Method, it includes the following steps:
S1. the data set DataSet for inputting n data object sets the minimum similarity lowestSimi, minimum of merging
Class cluster object number lowestSize and similarity reduce the value of step-length temp, wherein minimum merge similarity lowestSimi
Less than 1;
S2. by each object as an initial class cluster, and similarity reference threshold is set, similarity reference threshold
Similar=1;
S3. by i-th of class cluster in data set DataSet in addition to itself all class clusters be compared and cluster, wherein 1
≤ i≤n, and i is integer;
S4. the value of i is converted, cycle executes comparison and the cluster of step S3, until i takes integer all in 1~n
Value;
S5. judge whether the class cluster object number clustered in S3~S4 is more than minimum class cluster object number
lowestSize:
(1) the obtained class cluster object number of cluster is more than minimum class cluster object number lowestSize, by cluster result plus
Enter cluster result set clusterResultSet, the object number set that class cluster contains is added in the object number that class cluster contains
indexResultSet;Similarity evaluation set similarSet is added in the value of current similar, and jumps to step
S6;
(2) the class cluster object number that cluster obtains is not more than minimum class cluster object number lowestSize;It gos to step
S6;
S6. the value of similar is reduced, updated similar values take the similar values before update to subtract similarity reduction
Step-length temp, and judge whether the value of similar after update is more than and minimum merge similarity lowestSimi:
(1) value of similar merges similarity lowestSimi more than minimum after updating, and go to step S3;
(2) value of similar merges similarity lowestSimi no more than minimum after updating, and go to step S7;
S7. cluster terminates, and checks the remaining data frame that cannot merge, and is added into residue and does not form the data preferably clustered
Object set leftDataSet.
The step S3 includes following sub-step:
S31. by class cluster i respectively compared with all class clusters of the current data set DataSet in addition to itself, current number is found out
According to collection DataSet in the highest class cluster j of class cluster i similarities;Wherein, class cluster i is i-th of class in initial data set DataSet
Cluster;
S32. judge whether the similarity of class cluster i and class cluster j is more than the value of current similar;
(1) when the similarity of class cluster i and class cluster j is more than the value of current similar:Class cluster i is merged with class cluster j;
(2) when the similarity of class cluster i and class cluster j is not more than the value of current similar, go to step S4;
S33. j-th of class cluster is deleted from data set DataSet, updates the data collection DataSet, and the S31 that gos to step.
The step S31 includes following sub-step:
S311. it calculates separately class cluster i and asks similar respectively to all class clusters in current data set DataSet in addition to itself
Degree;
S312. current data is found out to concentrate and the highest class cluster j of class cluster i similarities.
Similarity between two class clusters is solved according to following formula:
p∈ci,p′∈cj
In formula, davg(ci, cj) indicate two class cluster ci, cjBetween similarity, niIndicate class cluster ciIn include data frame
Item number, njIndicate class cluster cjIn include data frame number, similarities of the d (p, p') between data frame p and data frame p';
There are two types of the method for solving of the d (p, p'):
Method one:Similarity between direct solution data frame solves d (p, p'):
d(p,p')=sam (p, p')/sum (p, p'),
Wherein sam (p, p') is by the following result for operating and obtaining:It is with left alignment, with nibble by data frame p, p'
Unit, from left to right to data frame p, the alignment characters of p' are compared, and the number for encountering the identical situation of alignment characters is
sam(p,p');And sum (p, p') is to calculate the number compared when sam (p, p');
Method two:A character string will be treated as per data frame, the similarity similar (p, p') between character string is
Required d (p, p'):
In formula, length (p), length (p') are respectively the length of character string p and character string p', Distance (p, p')
For the editing distance of two character strings, indicate character string p becoming what character string p' needed by insertion, replacement, delete operation
Number of operations.
The beneficial effects of the invention are as follows:(1) present invention can automatically determine the number of cluster by changing end condition, eventually
Only condition is:Under the similarity reference threshold of some setting, the class cluster that can not merge;The AGNES algorithms of the present invention are first
With high similarity cluster, then gradually reduce similarity until setting minimum similarity value.
(2) it is used as present invention uses different grades of similarity and refers to threshold value, to obtain the class cluster of different Clustering Effects,
There are one similarity evaluation indexs for each class cluster of gained, can intuitively find out which class cluster from cluster result in this way
It is preferably to cluster.
(3) invention specifies satisfied class cluster object number threshold value, cluster result is investigated in cluster process, when there is ratio
When relatively satisfactory cluster class cluster occurs (for example the object number of some class cluster is more than given threshold), such cluster is extracted and is added
Enter in result set;The preferable class cluster formed can be found in time, to prevent being destroyed by subsequent undesirable merging;Another party
Face can reduce influence of the bad merging to later step, improve the scalability of algorithm.
Description of the drawings
Fig. 1 is the flow chart of the present invention.
Specific implementation mode
Technical scheme of the present invention is described in further detail below in conjunction with the accompanying drawings, but protection scope of the present invention is not limited to
It is as described below.
As shown in Figure 1, based on the protocol classification method for improving AGNES algorithms, it includes the following steps:
S1. the data set DataSet for inputting n data object sets the minimum similarity lowestSimi, minimum of merging
Class cluster object number lowestSize and similarity reduce the value of step-length temp, wherein minimum merge similarity lowestSimi
Less than 1;
S2. by each object as an initial class cluster, and similarity reference threshold is set, similarity reference threshold
Similar=1;
S3. by i-th of class cluster in data set DataSet in addition to itself all class clusters be compared and cluster, wherein 1
≤ i≤n, and i is integer;
S4. the value of i is converted, cycle executes comparison and the cluster of step S3, until i takes integer all in 1~n
Value;
S5. judge whether the class cluster object number clustered in S3~S4 is more than minimum class cluster object number
lowestSize:
(1) the obtained class cluster object number of cluster is more than minimum class cluster object number lowestSize, by cluster result plus
Enter cluster result set clusterResultSet, the object number set that class cluster contains is added in the object number that class cluster contains
indexResultSet;Similarity evaluation set similarSet is added in the value of current similar, and jumps to step
S6;
(2) the class cluster object number that cluster obtains is not more than minimum class cluster object number lowestSize;It gos to step
S6;
S6. the value of similar is reduced, updated similar values take the similar values before update to subtract similarity reduction
Step-length temp, and judge whether the value of similar after update is more than and minimum merge similarity lowestSimi:
(1) value of similar merges similarity lowestSimi more than minimum after updating, and go to step S3;
(2) value of similar merges similarity lowestSimi no more than minimum after updating, and go to step S7;
S7. cluster terminates, and checks the remaining data frame that cannot merge, and is added into residue and does not form the data preferably clustered
Object set leftDataSet.
The step S3 includes following sub-step:
S31. by class cluster i respectively compared with all class clusters of the current data set DataSet in addition to itself, current number is found out
According to collection DataSet in the highest class cluster j of class cluster i similarities;Wherein, class cluster i is i-th of class in initial data set DataSet
Cluster;
S32. judge whether the similarity of class cluster i and class cluster j is more than the value of current similar;
(1) when the similarity of class cluster i and class cluster j is more than the value of current similar:Class cluster i is merged with class cluster j;
(2) when the similarity of class cluster i and class cluster j is not more than the value of current similar, go to step S4;
S33. j-th of class cluster is deleted from data set DataSet, updates the data collection DataSet, and the S31 that gos to step.
The step S31 includes following sub-step:
S311. it calculates separately class cluster i and asks similar respectively to all class clusters in current data set DataSet in addition to itself
Degree;
S312. current data is found out to concentrate and the highest class cluster j of class cluster i similarities.
Similarity between two class clusters is solved according to following formula:
p∈ci,p′∈cj
In formula, davg(ci, cj) indicate two class cluster ci, cjBetween similarity, niIndicate class cluster ciIn include data frame
Item number, njIndicate class cluster cjIn include data frame number, similarities of the d (p, p') between data frame p and data frame p';
There are two types of the method for solving of the d (p, p'):
Method one:Similarity between direct solution data frame solves d (p, p'):
d(p,p')=sam (p, p')/sum (p, p'),
Wherein sam (p, p') is by the following result for operating and obtaining:It is with left alignment, with nibble by data frame p, p'
Unit, from left to right to data frame p, the alignment characters of p' are compared, and the number for encountering the identical situation of alignment characters is
sam(p,p');And sum (p, p') is to calculate the number compared when sam (p, p');
Method two:A character string will be treated as per data frame, the similarity similar (p, p') between character string is
Required d (p, p'):
In formula, length (p), length (p') are respectively the length of character string p and character string p', Distance (p, p')
For the editing distance of two character strings, indicate character string p becoming what character string p' needed by insertion, replacement, delete operation
Number of operations.
What the tcpdump experimental data sets that the present invention is announced using Lincoln laboratory were tested, extract wherein 9
The binary data frame of kind agreement is tested as unknown protocol;It is respectively:Dns, http, ntp, rip, smtp, ssh,
Arp, llc, loop agreement.
It is first hexadecimal format by binary data stream, for arp, llc, loop link layer protocols take data frame
Preceding x (taking x=68) byte, remaining agreement removes ip stems and the heads tcp or udp (for the spy of agreement itself is more preferably presented
Sign) after take its preceding x (taking x=68) byte, the whole of inadequate x bytes to take;The value of x is selected by rule of thumb, and the value of x includes preferably
Whole characteristic informations of data frame, but should not be too big, it may include otherwise a large amount of data information, influence the accurate of result
Degree, and increase operand.
Randomly select each 300 of above 9 kinds of agreements, totally 2700 data frames form input data, by input sequence from 0 to
2699 number for it.
Embodiment one, arrange parameter is minimum to merge similarity lowestSimi=0.1, minimum class cluster object number
LowestSize=200, similarity reduce step-length temp=0.1, using the present invention is based on the protocol classifications for improving AGNES algorithms
Method, and using the method one in step S312, obtained experimental result such as following table 1-1:
Table 1-1 cluster experimental results (method one)
Data frame bar number of the similarity less than 0.1 has 162 between remaining class cluster;In remaining 162 data frame, with phase
It is more than the 0.1 class cluster merged like degree, does not meet 200 object requirements, but data frame bar number is as follows more than or equal to 5 class clusters
Table 1-2:
Table 1-2 experiment gained group clusters (method one)
Embodiment two, identical as one parameter setting of embodiment, arrange parameter is minimum to merge similarity lowestSimi=
0.1, minimum class cluster object number lowestSize=200, similarity reduce step-length temp=0.1, and using in step S312
Method two, obtained experimental result such as following table 2-1:
Table 2-1 cluster experimental results (method two)
Data frame of the similarity less than 0.1 has 58 between remaining class cluster.In remaining 58 data frame, with similarity
The class cluster merged more than 0.1 does not meet 200 object requirements, but data frame bar number is more than or equal to 5 class cluster such as following table 2-
2:
The group cluster (method two) of table 2-2 experiment gained
Comparative example one and embodiment two:Embodiment one the result shows that:Input is 9 classifications, has successfully clustered out 8
A, similarity is less than 0.1 between remaining 162 data, or does not reach the requirement of 200 objects;Overall cluster is correct
Rate is (2700-314-162)/(2700-162)=85.88%, and cluster result is preferable;Embodiment two the result shows that:Overall cluster
Accuracy be (2700-502-58)/(2700-58)=81.00%, slightly below test one as a result, but application method two phase
It calculates and more data frames can be clustered like degree, but simultaneously between processing similarity relatively low (such as 0.2 to 0.3)
It is easier to produce the cluster of mistake when data frame.
Embodiment three carries out cluster calculation to above 9 kinds of agreements using common clustering algorithm in weka tools, first uses
StringToWordVector makees data prediction, and it is 9 then to specify the cluster number parameter of each clustering algorithm, is used
Classes toclusters evaluation are assessed, each clustering algorithm is clustered 3 times using different random seeds,
As a result its average value is taken, obtained result is respectively as shown in following table 3-1,3-2 and 3-3:
The result of SimplKMeans clustering algorithms in table 3-1 weka
The result of sIB clustering algorithms in table 3-2 weka
The result of EM clustering algorithms in table 3-3 weka
Comparative example one and embodiment three, it can be seen that using the present invention is based on the protocol classifications for improving AGNES algorithms
Method, and using the method one in step S312, overall accuracy is higher than SimpleKMeans algorithms 20.43%, is calculated higher than sIB
Method 11.48% is higher than EM algorithms 13.08%.
Claims (5)
1. based on the protocol classification method for improving AGNES algorithms, it is characterised in that:It includes the following steps:
S1. the data set DataSet of n data object is inputted, setting is minimum to merge similarity lowestSimi, minimum class cluster
Object number lowestSize and similarity reduce the value of step-length temp, wherein the minimum similarity lowestSimi that merges is less than
1;
S2. by each object as an initial class cluster, and similarity reference threshold is set, similarity reference threshold
Similar=1;
S3. by i-th of class cluster in data set DataSet in addition to itself all class clusters be compared and cluster, wherein 1≤i≤
N, and i is integer;
S4. the value of i is converted, cycle executes comparison and the cluster of step S3, until i takes integer value all in 1~n;
S5. judge whether the class cluster object number clustered in S3~S4 is more than minimum class cluster object number lowestSize:
(1) the class cluster object number that cluster obtains is more than minimum class cluster object number lowestSize, cluster result is added poly-
The object number set that class cluster contains is added in the object number that class cluster contains by class results set clusterResultSet
indexResultSet;Similarity evaluation set similarSet is added in the value of current similar, and jumps to step
S6;
(2) the class cluster object number that cluster obtains is not more than minimum class cluster object number lowestSize;Go to step S6;
S6. the value of similar is reduced, updated similar values, which take the similar values before update to subtract similarity, reduces step-length
Temp, and judge whether the value of similar after update is more than and minimum merge similarity lowestSimi:
(1) value of similar merges similarity lowestSimi more than minimum after updating, and go to step S3;
(2) value of similar merges similarity lowestSimi no more than minimum after updating, and go to step S7;
S7. cluster terminates, and checks the remaining data frame that cannot merge, and is added into residue and does not form the data object preferably clustered
Set leftDataSet.
2. according to claim 1 based on the protocol classification method for improving AGNES algorithms, it is characterised in that:The step
Rapid S3 includes following sub-step:
S31. by class cluster i respectively with currently'sAll class clusters of the data set DataSet in addition to itself compare, and find out current'sNumber
According to collection DataSet in the highest class cluster j of class cluster i similarities;Wherein, class cluster i is initial'sI-th in data set DataSet
Class cluster;
S32. judge whether the similarity of class cluster i and class cluster j is more than the value of current similar;
(1) when the similarity of class cluster i and class cluster j is more than the value of current similar:Class cluster i is merged with class cluster j;
(2) when the similarity of class cluster i and class cluster j is not more than the value of current similar, go to step S4;
S33. j-th of class cluster is deleted from data set DataSet, updates the data collection DataSet, and the S31 that gos to step.
3. according to claim 2 based on the protocol classification method for improving AGNES algorithms, it is characterised in that:The step
Rapid S31 includes following sub-step:
S311. calculate separately class cluster i with it is current'sAll class clusters in data set DataSet in addition to itself ask similar respectively
Degree;
S312. current data is found out to concentrate and the highest class cluster j of class cluster i similarities.
4. according to claim 3 based on the protocol classification method for improving AGNES algorithms, it is characterised in that:Two class clusters
Between similarity solved according to following formula:
p∈ci, p' ∈ cj,
In formula, davg(ci,cj) indicate two class cluster ci, cjBetween similarity, niIndicate class cluster ciIn include data frame bar
Number, njIndicate class cluster cjIn include data frame number, similarities of the d (p, p ') between data frame p and data frame p '.
5. according to claim 4 based on the protocol classification method for improving AGNES algorithms, it is characterised in that:The d
There are two types of the method for solving of (p, p '):
Method one:Similarity between direct solution data frame, i.e. solution d (p, p '):
d(p,p')=sam (p, p')/sum (p, p'),
Wherein sam (p, p ') it is by the following result for operating and obtaining:By data frame p, p ' is single with nibble with left alignment
Position, from left to right to data frame p, the alignment characters of p' are compared, the number for encountering character same case be sam (p,
P '), and sum (p, p ') is to calculate the number compared when sam (p, p ');
Method two:A character string will be treated as per data frame, and similarity similar between character string (p, p ') it is required
D (p, p '):
In formula, length (p), length (p ') are respectively the length of character string p and character string p ', and Distance (p, p ') it is two
The editing distance of character string indicates character string p becoming the operation that character string p ' needs by insertion, replacement, delete operation
Number.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510492631.4A CN105183780B (en) | 2015-08-12 | 2015-08-12 | Based on the protocol classification method for improving AGNES algorithms |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510492631.4A CN105183780B (en) | 2015-08-12 | 2015-08-12 | Based on the protocol classification method for improving AGNES algorithms |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105183780A CN105183780A (en) | 2015-12-23 |
CN105183780B true CN105183780B (en) | 2018-09-18 |
Family
ID=54905864
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510492631.4A Active CN105183780B (en) | 2015-08-12 | 2015-08-12 | Based on the protocol classification method for improving AGNES algorithms |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105183780B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106021361A (en) * | 2016-05-10 | 2016-10-12 | 中国空间技术研究院 | Sequence alignment-based self-adaptive application layer network protocol message clustering method |
CN107292701A (en) * | 2017-05-25 | 2017-10-24 | 北京小度信息科技有限公司 | Order group technology and device |
CN108833156B (en) * | 2018-06-08 | 2022-08-30 | 中国电力科学研究院有限公司 | Evaluation method and system for simulation performance index of power communication network |
CN112148942B (en) * | 2019-06-27 | 2024-04-09 | 北京达佳互联信息技术有限公司 | Business index data classification method and device based on data clustering |
CN116166960B (en) * | 2023-02-07 | 2023-09-29 | 山东经鼎智能科技有限公司 | Big data characteristic cleaning method and system for neural network training |
CN115842875B (en) * | 2023-02-21 | 2023-06-02 | 德力西集团仪器仪表有限公司 | Method, device, computer equipment and medium for determining similar data frames |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008154029A1 (en) * | 2007-06-11 | 2008-12-18 | The Trustees Of Columbia University In The City Of New York | Data classification and hierarchical clustering |
KR101130734B1 (en) * | 2010-08-12 | 2012-03-28 | 연세대학교 산학협력단 | Method for generating context hierachyand, system for generating context hierachyand |
CN102436480B (en) * | 2011-10-15 | 2013-11-06 | 西安交通大学 | Incidence relation excavation method for text-oriented knowledge unit |
-
2015
- 2015-08-12 CN CN201510492631.4A patent/CN105183780B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN105183780A (en) | 2015-12-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105183780B (en) | Based on the protocol classification method for improving AGNES algorithms | |
CN111865815B (en) | Flow classification method and system based on federal learning | |
Yu et al. | PBCNN: packet bytes-based convolutional neural network for network intrusion detection | |
CN107431663A (en) | Net flow assorted | |
CN106790170B (en) | Data packet filtering method and device | |
CN110012029A (en) | A kind of method and system for distinguishing encryption and non-encrypted compression flow | |
CN103200133A (en) | Flow identification method based on network flow gravitation cluster | |
US20140136538A1 (en) | Method and Apparatus for Communications Analysis | |
CN109299742A (en) | Method, apparatus, equipment and the storage medium of automatic discovery unknown network stream | |
CN104468273A (en) | Method and system for recognizing application type of flow data | |
CN108846275A (en) | Unknown Method of Detecting Operating System based on RIPPER algorithm | |
CN109286576A (en) | A kind of network agent encryption traffic characteristic extracting method of data packet frequency analysis | |
CN112887291A (en) | I2P traffic identification method and system based on deep learning | |
CN104796407B (en) | A kind of extracting method of unknown protocol feature | |
CN107689899A (en) | A kind of unknown protocol recognition methods and system based on bit stream | |
CN104079450B (en) | Feature mode set creation method and device | |
CN105429817A (en) | Illegal business identification device and illegal business identification method based on DPI and DFI | |
CN107124410A (en) | Network safety situation feature clustering method based on machine deep learning | |
CN112291226A (en) | Method and device for detecting abnormality of network traffic | |
CN107222343A (en) | Dedicated network stream sorting technique based on SVMs | |
CN104767739B (en) | The method that unknown multi-protocols blended data frame is separated into single protocol data frame | |
Wang et al. | Bitstream protocol classification mechanism based on feature extraction | |
CN116248530A (en) | Encryption flow identification method based on long-short-time neural network | |
CN112367325B (en) | Unknown protocol message clustering method and system based on closed frequent item mining | |
CN114118255B (en) | Unknown protocol cluster analysis method, device and medium based on spectral clustering |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Zhang Chunrui Inventor after: Liu Yuan Inventor after: Zhao Qiang Inventor after: Meng Fanzhi Inventor after: Yue Yang Inventor before: Liu Yuan Inventor before: Zhang Chunrui Inventor before: Zhao Qiang Inventor before: Meng Fanzhi Inventor before: Yue Yang |
|
CB03 | Change of inventor or designer information |