CN114448906A - Network traffic identification method and system

Info

Publication number: CN114448906A
Application number: CN202210101924.5A
Authority: CN
Inventors: 陈迎春; 冉晓旻; 董芳; 刘广怡; 孙昱; 莫有权; 王晓梅; 张静; 余道杰
Original assignee: Information Engineering University of PLA Strategic Support Force
Current assignee: Information Engineering University of PLA Strategic Support Force
Priority date: 2022-01-27
Filing date: 2022-01-27
Publication date: 2022-05-06

Abstract

The application provides a network traffic identification method and system. Hexadecimal first network traffic data is converted into binary second network traffic data, and the second network traffic data is mapped into grayscale image data according to a mapping rule; feature extraction is performed on the grayscale image data by an L2-constrained triplet residual network (L2-triplet Resnet) to obtain first feature data; linear dimensionality reduction is performed on the first feature data by a principal component analysis (PCA) algorithm to obtain second feature data; nonlinear dimensionality reduction is performed on the second feature data by a t-SNE algorithm to obtain visualized feature data; and the visualized feature data is clustered and identified by a K-means algorithm, and an identification result is output.

Description

Network traffic identification method and system

Technical Field

The present application relates to the field of network security technologies, and in particular, to a method and a system for identifying network traffic.

Background

In the field of network security, network traffic carries the behavioral characteristics of network applications and is an important carrier for characterizing their properties. With the rapid growth of network traffic, more and more encrypted traffic and a large number of private protocols have emerged. A network traffic identification method that improves identification accuracy and the degree of automation is therefore of practical significance for network security control, and providing one is a problem to be solved by those skilled in the art.

Disclosure of Invention

The application provides a network traffic identification method and a network traffic identification system, which can improve the accuracy and the automation degree of network traffic identification.

In order to achieve the above object, the present application provides the following technical solutions:

A network traffic identification method, comprising the following steps:

converting hexadecimal first network traffic data into binary second network traffic data, and mapping the second network traffic data into grayscale image data according to a mapping rule;

performing feature extraction on the grayscale image data by using an L2-constrained triplet residual network (L2-triplet Resnet network) to obtain first feature data;

performing linear dimensionality reduction on the first feature data by using a principal component analysis (PCA) algorithm to obtain second feature data;

performing nonlinear dimensionality reduction on the second feature data by using a t-SNE algorithm to obtain visualized feature data; and

performing clustering identification on the visualized feature data by using a K-means algorithm, and outputting an identification result.

Preferably, converting the hexadecimal first network traffic data into the binary second network traffic data comprises:

determining whether the bit stream length of the first network traffic data exceeds 1024 bits; if so, deleting the part of the first network traffic data that exceeds 1024 bits to obtain the second network traffic data;

if not, zero-padding the first network traffic data to obtain the second network traffic data, the bit stream length of the second network traffic data being 1024 bits.

Preferably, the L2-triplet Resnet network comprises a deep residual network Resnet-18;

the deep residual network Resnet-18 comprises:

17 convolutional layers and 1 fully connected layer;

the deep residual network Resnet-18 does not include a classification layer.

Preferably, the L2-triplet Resnet network comprises deep residual networks Resnet-18 and an L2 constraint and scaling module;

performing feature extraction on the grayscale image data by using the L2-constrained triplet residual network (L2-triplet Resnet network) to obtain the first feature data comprises:

inputting three images of the grayscale image data into three deep residual networks Resnet-18 of the L2-triplet Resnet network, respectively, to obtain three embedded features;

adding an L2 constraint to the three embedded features by means of the L2 constraint and scaling module to obtain the first feature data corresponding to each of the three embedded features, according to the following formula:

$$\| f(x_i) \|_2 = r, \quad i \in N$$

wherein $f(x_i)$ is the embedded feature of image $x_i$, $r$ is the scaling parameter of the L2 constraint, $N$ is the set of natural numbers, and $\| f(x_i) \|_2 = r$ is the constraint imposed on the first feature data.

Preferably, the method further comprises:

calculating the similarity of images $x_i$ and $x_j$ in the grayscale image dataset by:

$$L_p(x_i, x_j) = \left( \sum_{k=1}^{d} \left| f(x_i)^{(k)} - f(x_j)^{(k)} \right|^p \right)^{1/p}$$

wherein $L_p$ is the Minkowski distance, $p$ is the order of the norm and $p \ge 1$; when $p = 2$, the distance between images $x_i$ and $x_j$ is the Euclidean distance; the smaller $L_p$ is, the more similar images $x_i$ and $x_j$ are; the grayscale image dataset is $\chi$, $x_i, x_j \in \chi$ are two different images in the grayscale image dataset, $d$ is the dimension of the Euclidean space, and

$$f(x_i) = \left( f(x_i)^{(1)}, f(x_i)^{(2)}, \ldots, f(x_i)^{(d)} \right)^T, \quad f(x_j) = \left( f(x_j)^{(1)}, f(x_j)^{(2)}, \ldots, f(x_j)^{(d)} \right)^T;$$

calculating the distance between the positive image pair and the negative image pair by:

$$\| f(x_i) - f(x_i^+) \|_2 + \alpha < \| f(x_i) - f(x_i^-) \|_2$$

wherein $x_i$, $x_i^+$ and $x_i^-$ are respectively a sample image, a positive image and a negative image; $(x_i, x_i^+)$ is the positive image pair and $(x_i, x_i^-)$ is the negative image pair; $\| f(x_i) - f(x_i^+) \|_2$ is the Euclidean distance of the positive image pair and $\| f(x_i) - f(x_i^-) \|_2$ is the Euclidean distance of the negative image pair; the positive image is an image that belongs to the same application class as the sample image but comes from different network traffic data; the negative image is an image that does not belong to the same application class as the sample image and comes from different network traffic data; and $\alpha$ is the margin between the distance of the positive image pair and the distance of the negative image pair;

calculating the hinge loss of the triplet $(x_i, x_i^+, x_i^-)$:

$$\left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

calculating a triplet loss function from the similarity of images $x_i$ and $x_j$ in the grayscale image dataset, the distance between the positive image pair and the negative image pair, and the hinge loss:

$$L = \min \sum_{i} \left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

wherein $L$ is the minimum value of the triplet loss function;

updating the L2-triplet Resnet network with the minimum value of the triplet loss function.

A network traffic identification system, comprising:

a data acquisition module, configured to convert hexadecimal first network traffic data into binary second network traffic data, and to map the second network traffic data into grayscale image data according to a mapping rule;

a feature extraction module, configured to perform feature extraction on the grayscale image data by using an L2-constrained triplet residual network (L2-triplet Resnet network) to obtain first feature data, to perform linear dimensionality reduction on the first feature data by using a principal component analysis (PCA) algorithm to obtain second feature data, and to perform nonlinear dimensionality reduction on the second feature data by using a t-SNE algorithm to obtain visualized feature data; and

a traffic identification module, configured to perform clustering identification on the visualized feature data by using a K-means algorithm and to output an identification result.

Preferably, the data acquisition module includes:

Preferably, the L2-triplet Resnet network in the feature extraction module comprises a deep residual network Resnet-18;

the deep residual network Resnet-18 comprises:

17 convolutional layers and 1 fully connected layer;

the deep residual network Resnet-18 does not include a classification layer.

Preferably, the L2-triplet Resnet network in the feature extraction module comprises deep residual networks Resnet-18 and an L2 constraint and scaling module;

the L2 constraint and scaling module adds an L2 constraint to the three embedded features to obtain the first feature data corresponding to each of the three embedded features, according to the following formula:

$$\| f(x_i) \|_2 = r, \quad i \in N$$

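Preferably, the feature extraction module is further configured to:

calculate the similarity of images $x_i$ and $x_j$ in the grayscale image dataset by:

$$L_p(x_i, x_j) = \left( \sum_{k=1}^{d} \left| f(x_i)^{(k)} - f(x_j)^{(k)} \right|^p \right)^{1/p}$$

wherein $L_p$ is the Minkowski distance, $p$ is the order of the norm and $p \ge 1$; when $p = 2$, the distance between images $x_i$ and $x_j$ is the Euclidean distance; the smaller $L_p$ is, the more similar images $x_i$ and $x_j$ are; the grayscale image dataset is $\chi$, $x_i, x_j \in \chi$ are two different images in the grayscale image dataset, and $d$ is the dimension of the Euclidean space;

calculate the distance between the positive image pair and the negative image pair by:

$$\| f(x_i) - f(x_i^+) \|_2 + \alpha < \| f(x_i) - f(x_i^-) \|_2$$

wherein $x_i$, $x_i^+$ and $x_i^-$ are respectively a sample image, a positive image and a negative image; $(x_i, x_i^+)$ is the positive image pair and $(x_i, x_i^-)$ is the negative image pair; $\| f(x_i) - f(x_i^+) \|_2$ is the Euclidean distance of the positive image pair and $\| f(x_i) - f(x_i^-) \|_2$ is the Euclidean distance of the negative image pair;

calculate the hinge loss of the triplet $(x_i, x_i^+, x_i^-)$:

$$\left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

calculate a triplet loss function from the similarity of images $x_i$ and $x_j$ in the grayscale image dataset, the distance between the positive image pair and the negative image pair, and the hinge loss:

$$L = \min \sum_{i} \left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

wherein $L$ is the minimum value of the triplet loss function;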

The application provides a network traffic identification method and system, in which hexadecimal first network traffic data is converted into binary second network traffic data, and the second network traffic data is mapped into grayscale image data according to a mapping rule; feature extraction is performed on the grayscale image data by an L2-constrained triplet residual network (L2-triplet Resnet network) to obtain first feature data; linear dimensionality reduction is performed on the first feature data by a principal component analysis (PCA) algorithm to obtain second feature data; nonlinear dimensionality reduction is performed on the second feature data by a t-SNE algorithm to obtain visualized feature data; and the visualized feature data is clustered and identified by a K-means algorithm, and an identification result is output. The L2-triplet Resnet network improves the efficiency and precision of feature extraction; combining PCA linear dimensionality reduction with t-SNE nonlinear dimensionality reduction preserves the structure of the data while reducing the amount of computation, and enables visual analysis of unknown feature data; and the K-means algorithm iterates quickly to classify the network traffic. The accuracy and degree of automation of network traffic identification can thereby be improved.

Drawings

In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present application, and other drawings can be obtained by those skilled in the art according to the drawings.

Fig. 1 is a schematic flowchart of a network traffic identification method according to an embodiment of the present application;

Fig. 2a is a schematic structural diagram of a single deep residual network Resnet-18 according to an embodiment of the present application;

Fig. 2b is a schematic structural diagram of a Layer in the deep residual network Resnet-18 according to an embodiment of the present application;

Fig. 2c is a schematic structural diagram of a basic block in a Layer according to an embodiment of the present application;

Fig. 3 is a schematic structural diagram of a network traffic identification system according to an embodiment of the present application.

Detailed Description

The application provides a network traffic identification method and a network traffic identification system, which are used for improving the accuracy and the automation degree of network traffic identification.

In order that those skilled in the art will better understand the disclosure, the following detailed description will be given with reference to the accompanying drawings. It is to be understood that the embodiments described are only a few embodiments of the present application and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the application. As used in this application and the appended claims, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items.

Referring to fig. 1, fig. 1 is a flowchart of a network traffic identification method according to an embodiment of the present application, where the embodiment of the present application at least includes the following steps:

101. Converting hexadecimal first network traffic data into binary second network traffic data, and mapping the second network traffic data into grayscale image data according to a mapping rule.

In this embodiment, the bit stream lengths of different network traffic data differ, so grayscale images generated directly from them would not be uniform. The first network traffic data is therefore converted into binary second network traffic data of a uniform bit stream length, and the binary second network traffic data is then mapped into grayscale image data according to a mapping rule: binary value 1 corresponds to image gray value 255, and binary value 0 corresponds to image gray value 0. Unifying the bit stream length of the network traffic data also unifies the grayscale image data, which improves the efficiency of network traffic data processing.

Specifically, in an optional implementation, step 101 may include:

determining whether the bit stream length of the first network traffic data exceeds 1024 bits; if so, deleting the part of the first network traffic data that exceeds 1024 bits to obtain the second network traffic data; if not, zero-padding the first network traffic data to obtain the second network traffic data, the bit stream length of the second network traffic data being 1024 bits.

In this implementation, a bit stream length of 1024 bits is used as the unified standard for network traffic data; this length was selected through network traffic identification experiments. After the first network traffic data is acquired, it is determined whether its bit stream length exceeds 1024 bits. If it does, the part exceeding 1024 bits is deleted to obtain the second network traffic data; if it does not, the first network traffic data is zero-padded to obtain the second network traffic data. The bit stream length of the second network traffic data is thus 1024 bits, and processing the second network traffic data after the length has been unified improves processing efficiency.
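As a minimal sketch of step 101, the following Python code expands a hexadecimal string into bits, truncates or zero-pads it to 1024 bits, and maps bit 1 to gray value 255 and bit 0 to gray value 0. The function names, the placement of the zero padding at the end of the bit stream, and the 32 × 32 image layout (1024 = 32 × 32) are assumptions made for illustration; the description does not specify them.

```python
BIT_LENGTH = 1024  # unified bit stream length stated in the description
SIDE = 32          # assumption: 1024 bits arranged row by row as a 32 x 32 image


def hex_to_bits(hex_string):
    """Expand each hexadecimal digit into four binary digits, MSB first."""
    bits = []
    for digit in hex_string:
        value = int(digit, 16)
        bits.extend((value >> shift) & 1 for shift in (3, 2, 1, 0))
    return bits


def unify_length(bits, length=BIT_LENGTH):
    """Delete bits beyond `length`, or zero-pad up to `length`.

    Assumption: padding zeros are appended at the end of the bit stream.
    """
    if len(bits) > length:
        return bits[:length]
    return bits + [0] * (length - len(bits))


def bits_to_gray_image(bits, side=SIDE):
    """Apply the mapping rule: bit 1 -> gray value 255, bit 0 -> gray value 0."""
    pixels = [255 if bit else 0 for bit in bits]
    return [pixels[row * side:(row + 1) * side] for row in range(side)]


def traffic_to_gray_image(hex_string):
    """Hexadecimal traffic data -> 1024-bit stream -> grayscale image rows."""
    return bits_to_gray_image(unify_length(hex_to_bits(hex_string)))


# Hypothetical 4-byte payload; real traffic data would be much longer.
image = traffic_to_gray_image("4500003c")
```

Here `image` is a list of 32 rows of 32 gray values; only the first 32 pixels depend on the payload, and the rest come from zero padding.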

102. Performing feature extraction on the grayscale image data by using an L2-constrained triplet residual network (L2-triplet Resnet network) to obtain first feature data.

In this embodiment, the L2-constrained triplet residual network (L2-triplet Resnet network) includes an L2 constraint and scaling module and three deep residual networks Resnet-18. The composition of the deep residual network Resnet-18 is shown in fig. 2a, fig. 2b and fig. 2c. Fig. 2a is a structural diagram of a single deep residual network Resnet-18, fig. 2b is a structural diagram of a Layer in the deep residual network Resnet-18, and fig. 2c is a structural diagram of a basic block in the Layer.

The deep residual network Resnet-18 does not contain a classification layer; instead, it directly outputs the 32-dimensional embedded features of the fully connected layer.

As can be seen from fig. 2a, the deep residual network Resnet-18 includes one 3 × 3 convolutional layer, Layer1, Layer2, Layer3, Layer4, an average pooling layer and a fully connected layer.

As can be seen from fig. 2b, each Layer contains two basic blocks.

As can be seen from fig. 2c, each basic block contains two convolutional layers. This gives 1 + 4 × 2 × 2 = 17 convolutional layers in total, consistent with the 17 convolutional layers and 1 fully connected layer stated above.

The specific parameters of each layer of the single depth residual network Resnet-18 are shown in table 1:

TABLE 1

Specifically, in an optional implementation, step 102 includes the following steps A1-A2:

Step A1: inputting three images of the grayscale image data into the three deep residual networks Resnet-18 of the L2-triplet Resnet network, respectively, to obtain three embedded features.

In this implementation, three images are taken from the grayscale image data: $x_i$, $x_i^+$ and $x_i^-$, which are respectively a sample image, a positive image and a negative image. The positive image is an image that belongs to the same application class as the sample image but comes from different network traffic data, and the negative image is an image that does not belong to the same application class as the sample image and comes from different network traffic data.

Step A2: adding an L2 constraint to the three embedded features by means of the L2 constraint and scaling module to obtain the first feature data corresponding to each of the three embedded features, according to the following formula:

$$\| f(x_i) \|_2 = r, \quad i \in N$$

In this implementation, the L2 constraint with scaling parameter r places the embedded features on a hypersphere of fixed radius r, and the scaling parameter r can reduce the triplet loss. Adding the L2 constraint to the three embedded features through the L2 constraint and scaling module therefore lets the embedded features converge quickly and reduces their triplet loss as far as possible.
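The effect of the L2 constraint and scaling module can be sketched in a few lines of Python: each embedded feature is divided by its L2 norm and multiplied by r, so that it lies on a hypersphere of radius r. This is an illustration under stated assumptions, not the network's actual layer; the function names, the value r = 10.0 and the two-dimensional example vectors are hypothetical.

```python
import math


def l2_norm(vector):
    """Euclidean (L2) norm of a vector given as a list of floats."""
    return math.sqrt(sum(v * v for v in vector))


def l2_constrain(embedding, r):
    """Rescale an embedded feature so that its L2 norm equals r."""
    norm = l2_norm(embedding)
    if norm == 0.0:
        raise ValueError("a zero vector cannot be projected onto the hypersphere")
    return [r * v / norm for v in embedding]


# Hypothetical embedded features of a sample, a positive and a negative image.
embedded = [[3.0, 4.0], [1.0, 0.0], [-2.0, 2.0]]
R = 10.0  # assumption: the description gives no concrete value for r
first_feature_data = [l2_constrain(e, R) for e in embedded]
```

After the rescaling, all three vectors have norm r, so distances between them depend only on their directions.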

103. Performing linear dimensionality reduction on the first feature data by using a principal component analysis (PCA) algorithm to obtain second feature data.

In this embodiment, the principal component analysis (PCA) algorithm is computationally fast. Using PCA to reduce the dimensionality of the first feature data avoids the excessive resource consumption caused by very high-dimensional data, brings the data down to a lower dimension, and improves the computational efficiency of the dimensionality reduction.
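To illustrate what the linear reduction in step 103 does, the sketch below implements PCA with the standard library only: it centers the data, builds the covariance matrix, and extracts leading eigenvectors by power iteration with deflation. Power iteration is a technique chosen here for brevity and is not taken from the description; the function names, the iteration count and the example data are assumptions, and a numerical library would normally be used instead.

```python
import math


def mat_vec(matrix, vec):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]


def pca(rows, n_components, iterations=200):
    """Project `rows` onto their first `n_components` principal components."""
    n, d = len(rows), len(rows[0])
    means = [sum(row[j] for row in rows) / n for j in range(d)]
    centered = [[row[j] - means[j] for j in range(d)] for row in rows]
    # Sample covariance matrix of the centered data.
    cov = [[sum(r[a] * r[b] for r in centered) / (n - 1) for b in range(d)]
           for a in range(d)]
    components = []
    for _component in range(n_components):
        # Arbitrary normalized start vector; power iteration can fail if it
        # happens to be orthogonal to the leading eigenvector.
        scale = math.sqrt(sum((j + 1) ** 2 for j in range(d)))
        vec = [(j + 1) / scale for j in range(d)]
        for _step in range(iterations):
            nxt = mat_vec(cov, vec)
            norm = math.sqrt(sum(v * v for v in nxt))
            if norm < 1e-12:
                break
            vec = [v / norm for v in nxt]
        eigenvalue = sum(v * w for v, w in zip(vec, mat_vec(cov, vec)))
        components.append(vec)
        # Deflation: remove the found component from the covariance matrix.
        cov = [[cov[a][b] - eigenvalue * vec[a] * vec[b] for b in range(d)]
               for a in range(d)]
    return [[sum(x * c for x, c in zip(row, comp)) for comp in components]
            for row in centered]


# Hypothetical first feature data: five 3-dimensional feature vectors.
first_feature_data = [[2.0, 0.0, 1.0], [0.0, 1.0, 3.0], [4.0, 1.0, 0.0],
                      [1.0, 3.0, 2.0], [3.0, 2.0, 5.0]]
second_feature_data = pca(first_feature_data, n_components=2)
```

Each row of `second_feature_data` is the lower-dimensional representation of the corresponding input row.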

104. Performing nonlinear dimensionality reduction on the second feature data by using a t-SNE algorithm to obtain visualized feature data.

In this embodiment, the t-SNE (t-distributed Stochastic Neighbor Embedding) algorithm can project high-dimensional data into a low-dimensional space for visualization while preserving the local structure of the network traffic data. The t-SNE algorithm is therefore used to visualize the second feature data in 2 dimensions, so that nonlinear dimensionality reduction is performed on the second feature data while its local structure is preserved. This achieves visualization of the feature data and addresses the crowding problem and the difficulty of optimization of the feature data.

105. Performing clustering identification on the visualized feature data by using a K-means algorithm, and outputting an identification result.

In this embodiment, the K-means clustering algorithm iterates quickly, is convenient to use and clusters well. The K-means algorithm is therefore used to iteratively train on the visualized feature data to determine the centroid of each cluster of visualized feature data. The distance between each data point taking part in the iterative training and the centroid of each cluster is calculated, and the first network traffic data corresponding to the data point is then identified according to the distance between the data point and the centroids.
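The iteration described for step 105 can be sketched as plain K-means in Python: centroids are updated as cluster means, and each data point is identified by its nearest centroid. This is an illustrative sketch only; the function names, the seeded random initialization, the iteration limit and the six 2-dimensional example points are assumptions not found in the description.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point, centroids):
    """Index of the centroid closest to `point`."""
    return min(range(len(centroids)),
               key=lambda c: squared_distance(point, centroids[c]))


def k_means(points, k, iterations=100, seed=0):
    """Return (centroids, labels) after iterating until the labels are stable."""
    rng = random.Random(seed)
    # Assumption: initial centroids are k distinct data points chosen at random.
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [nearest(p, centroids) for p in points]
    for _ in range(iterations):
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
        new_labels = [nearest(p, centroids) for p in points]
        if new_labels == labels:
            break
        labels = new_labels
    return centroids, labels


# Hypothetical 2-dimensional visualized feature data from two applications.
visual_features = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
                   (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
centroids, labels = k_means(visual_features, k=2)
# A new data point is identified by the cluster of its nearest centroid.
cluster_of_new_point = nearest((9.5, 10.5), centroids)
```

The cluster indices are arbitrary; mapping a cluster to an application class would require labeled reference samples, which this sketch does not model.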

On the basis of the above method, the method further includes updating the L2-triplet Resnet network with the minimum value of the triplet loss function, as follows:

A grayscale image dataset $\chi$ is defined, where $x_i, x_j \in \chi$ are two different images in the dataset. $f(x) \in R^d$ is a feature embedding function that maps an image of the grayscale image dataset to a feature point in $d$-dimensional Euclidean space, such that the distance between similar images of the dataset becomes shorter:

$$f(x_i) = \left( f(x_i)^{(1)}, f(x_i)^{(2)}, \ldots, f(x_i)^{(d)} \right)^T, \quad f(x_j) = \left( f(x_j)^{(1)}, f(x_j)^{(2)}, \ldots, f(x_j)^{(d)} \right)^T.$$

From the distance between the feature points of images $x_i$ and $x_j$ in $d$-dimensional Euclidean space, the similarity of images $x_i$ and $x_j$ in the grayscale image dataset is calculated by:

$$L_p(x_i, x_j) = \left( \sum_{k=1}^{d} \left| f(x_i)^{(k)} - f(x_j)^{(k)} \right|^p \right)^{1/p}$$

where $L_p$ is the Minkowski distance, $p$ is the order of the norm and $p \ge 1$; when $p = 2$, the distance between images $x_i$ and $x_j$ is the Euclidean distance; the smaller $L_p$ is, the more similar images $x_i$ and $x_j$ are.

The distance between the positive image pair and the negative image pair is then calculated by:

$$\| f(x_i) - f(x_i^+) \|_2 + \alpha < \| f(x_i) - f(x_i^-) \|_2$$

where $x_i$, $x_i^+$ and $x_i^-$ are respectively a sample image, a positive image and a negative image; $(x_i, x_i^+)$ is the positive image pair and $(x_i, x_i^-)$ is the negative image pair; $\| f(x_i) - f(x_i^+) \|_2$ is the Euclidean distance of the positive image pair and $\| f(x_i) - f(x_i^-) \|_2$ is the Euclidean distance of the negative image pair. The positive image is an image that belongs to the same application class as the sample image but comes from different network traffic data; the negative image is an image that does not belong to the same application class as the sample image and comes from different network traffic data; and $\alpha$ is the margin between the distance of the positive image pair and the distance of the negative image pair.

The hinge loss of the triplet $(x_i, x_i^+, x_i^-)$ is then calculated:

$$\left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

From the similarity of images $x_i$ and $x_j$ in the grayscale image dataset, the distance between the positive image pair and the negative image pair, and the hinge loss, the triplet loss function is calculated:

$$L = \min \sum_{i} \left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

where $L$ is the minimum value of the triplet loss function.

Finally, the L2-triplet Resnet network is updated with the minimum value $L$ of the triplet loss function, which improves the accuracy with which the L2-triplet Resnet network extracts feature data.
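The quantities used in the loss calculation can be illustrated with a short Python sketch: a Minkowski distance between embedded features, the hinge loss of one sample/positive/negative triplet, and the sum of hinge losses over several triplets. The function names, the margin value 0.5, the use of the plain (not squared) Euclidean distance and the toy 2-dimensional embeddings are assumptions for illustration; how the network weights are updated from the loss is not modeled.

```python
def minkowski(u, v, p=2):
    """Minkowski distance of order p >= 1; p = 2 gives the Euclidean distance."""
    return sum(abs(a - b) ** p for a, b in zip(u, v)) ** (1.0 / p)


def triplet_hinge(sample, positive, negative, margin):
    """Hinge loss of one triplet: max(0, d(sample, pos) - d(sample, neg) + margin)."""
    d_pos = minkowski(sample, positive)  # Euclidean distance of the positive pair
    d_neg = minkowski(sample, negative)  # Euclidean distance of the negative pair
    return max(0.0, d_pos - d_neg + margin)


def triplet_loss(triplets, margin):
    """Sum of the hinge losses over all (sample, positive, negative) triplets."""
    return sum(triplet_hinge(s, p, n, margin) for s, p, n in triplets)


# Hypothetical embedded features for two triplets.
triplets = [
    ([0.0, 0.0], [0.0, 1.0], [3.0, 4.0]),  # negative far away: loss 0
    ([0.0, 0.0], [0.0, 1.0], [1.0, 0.0]),  # negative as close as positive
]
loss = triplet_loss(triplets, margin=0.5)
```

A triplet contributes nothing once its negative pair is farther apart than its positive pair by at least the margin, so training effort concentrates on triplets that still violate the constraint.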

In summary, the network traffic identification method provided in this embodiment converts hexadecimal first network traffic data into binary second network traffic data and maps the second network traffic data into grayscale image data according to a mapping rule; performs feature extraction on the grayscale image data by using an L2-constrained triplet residual network (L2-triplet Resnet network) to obtain first feature data; performs linear dimensionality reduction on the first feature data by using a principal component analysis (PCA) algorithm to obtain second feature data; performs nonlinear dimensionality reduction on the second feature data by using a t-SNE algorithm to obtain visualized feature data; and performs clustering identification on the visualized feature data by using a K-means algorithm and outputs an identification result.

Fig. 3 is a schematic structural diagram of a network traffic identification system provided in an embodiment of the present application. The system is described below; for related content, refer to the foregoing method embodiment. The network traffic identification system includes:

the data acquisition module 201, configured to convert hexadecimal first network traffic data into binary second network traffic data, and to map the second network traffic data into grayscale image data according to a mapping rule;

the feature extraction module 202, configured to perform feature extraction on the grayscale image data by using an L2-triplet Resnet network to obtain first feature data, to perform linear dimensionality reduction on the first feature data by using a principal component analysis (PCA) algorithm to obtain second feature data, and to perform nonlinear dimensionality reduction on the second feature data by using a t-SNE algorithm to obtain visualized feature data;

the traffic identification module 203, configured to perform clustering identification on the visualized feature data by using a K-means algorithm and to output an identification result.

Optionally, the data obtaining module includes:

and if not, zero padding is carried out in the first network traffic data to obtain second network traffic data, and the bit stream length of the second network traffic data is 1024-bit stream length.

Optionally, the L2-triplet Resnet network in the feature extraction module includes a deep residual network Resnet-18;

the deep residual network Resnet-18 includes:

17 convolutional layers and 1 fully connected layer;

the deep residual network Resnet-18 does not include a classification layer.

Optionally, performing feature extraction on the grayscale image data by using the L2-constrained triplet residual network (L2-triplet Resnet network) in the feature extraction module to obtain first feature data includes:

inputting three images of the grayscale image data into three deep residual networks Resnet-18 of the L2-triplet Resnet network, respectively, to obtain three embedded features;

adding an L2 constraint to the three embedded features by means of the L2 constraint and scaling module to obtain three first feature data corresponding to the three embedded features, according to the following formula:

$$\| f(x_i) \|_2 = r, \quad i \in N$$

where $f(x_i)$ is the embedded feature of image $x_i$, $r$ is the scaling parameter of the L2 constraint, $N$ is the set of natural numbers, and $\| f(x_i) \|_2 = r$ is the constraint imposed on the first feature data.

Optionally, the feature extraction module is further configured to:

calculate the similarity of images $x_i$ and $x_j$ in the grayscale image dataset:

$$L_p(x_i, x_j) = \left( \sum_{k=1}^{d} \left| f(x_i)^{(k)} - f(x_j)^{(k)} \right|^p \right)^{1/p}$$

where $L_p$ is the Minkowski distance, $p$ is the order of the norm and $p \ge 1$; when $p = 2$, the distance between images $x_i$ and $x_j$ is the Euclidean distance; the smaller $L_p$ is, the more similar images $x_i$ and $x_j$ are; the grayscale image dataset is $\chi$, $x_i, x_j \in \chi$ are two different images in the grayscale image dataset, and $d$ is the dimension of the Euclidean space;

calculate the distance between the positive image pair and the negative image pair:

$$\| f(x_i) - f(x_i^+) \|_2 + \alpha < \| f(x_i) - f(x_i^-) \|_2$$

where $x_i$, $x_i^+$ and $x_i^-$ are respectively a sample image, a positive image and a negative image; $(x_i, x_i^+)$ is the positive image pair and $(x_i, x_i^-)$ is the negative image pair; $\| f(x_i) - f(x_i^+) \|_2$ is the Euclidean distance of the positive image pair and $\| f(x_i) - f(x_i^-) \|_2$ is the Euclidean distance of the negative image pair; the positive image is an image that belongs to the same application class as the sample image but comes from different network traffic data; the negative image is an image that does not belong to the same application class as the sample image and comes from different network traffic data; and $\alpha$ is the margin between the distance of the positive image pair and the distance of the negative image pair;

calculate the hinge loss of the triplet $(x_i, x_i^+, x_i^-)$:

$$\left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

calculate the triplet loss function from the similarity of images $x_i$ and $x_j$ in the grayscale image dataset, the distance between the positive image pair and the negative image pair, and the hinge loss:

$$L = \min \sum_{i} \left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

where $L$ is the minimum value of the triplet loss function; and

update the L2-triplet Resnet network with the minimum value of the triplet loss function.

The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.

The above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions in the embodiments of the present application.

Claims

1. A network traffic identification method is characterized by comprising the following steps:

2. The method of claim 1, wherein converting the first hexadecimal network traffic data into the second binary network traffic data comprises:

3. The method of claim 1, wherein the L2-triplet Resnet network comprises a deep residual network Resnet-18;

the deep residual network Resnet-18 comprises:

17 convolutional layers and 1 fully connected layer;

the deep residual network Resnet-18 does not include a classification layer.

4. The method of claim 1, wherein the L2-triplet Resnet network comprises deep residual networks Resnet-18 and an L2 constraint and scaling module;

$$\| f(x_i) \|_2 = r, \quad i \in N$$

wherein $f(x_i)$ is the embedded feature of image $x_i$, $r$ is the scaling parameter of the L2 constraint, $N$ is the set of natural numbers, and $\| f(x_i) \|_2 = r$ is the constraint imposed on the first feature data.

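5. The method of claim 1, further comprising:

calculating the similarity of images $x_i$ and $x_j$ in the grayscale image dataset by:

$$L_p(x_i, x_j) = \left( \sum_{k=1}^{d} \left| f(x_i)^{(k)} - f(x_j)^{(k)} \right|^p \right)^{1/p}$$

calculating the distance between the positive image pair and the negative image pair by:

$$\| f(x_i) - f(x_i^+) \|_2 + \alpha < \| f(x_i) - f(x_i^-) \|_2$$

wherein $x_i$, $x_i^+$ and $x_i^-$ are respectively a sample image, a positive image and a negative image; $(x_i, x_i^+)$ is the positive image pair and $(x_i, x_i^-)$ is the negative image pair; $\| f(x_i) - f(x_i^+) \|_2$ is the Euclidean distance of the positive image pair and $\| f(x_i) - f(x_i^-) \|_2$ is the Euclidean distance of the negative image pair;

calculating the hinge loss of the triplet $(x_i, x_i^+, x_i^-)$:

$$\left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

calculating a triplet loss function from the similarity of images $x_i$ and $x_j$ in the grayscale image dataset, the distance between the positive image pair and the negative image pair, and the hinge loss:

$$L = \min \sum_{i} \left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

wherein $L$ is the minimum value of the triplet loss function;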

6. A network traffic identification system, comprising:

7. The system of claim 6, wherein the data acquisition module comprises:

8. The system according to claim 6, wherein the L2-triplet Resnet network in the feature extraction module comprises a deep residual network Resnet-18;

the deep residual network Resnet-18 comprises:

17 convolutional layers and 1 fully connected layer;

the deep residual network Resnet-18 does not include a classification layer.

9. The system according to claim 6, wherein the L2-triplet Resnet network in the feature extraction module comprises deep residual networks Resnet-18 and an L2 constraint and scaling module;

$$\| f(x_i) \|_2 = r,$$

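10. The system of claim 6, wherein the feature extraction module is further configured to:

calculate the distance between the positive image pair and the negative image pair by:

$$\| f(x_i) - f(x_i^+) \|_2 + \alpha < \| f(x_i) - f(x_i^-) \|_2$$

wherein $x_i$, $x_i^+$ and $x_i^-$ are respectively a sample image, a positive image and a negative image; $(x_i, x_i^+)$ is the positive image pair and $(x_i, x_i^-)$ is the negative image pair; $\| f(x_i) - f(x_i^+) \|_2$ is the Euclidean distance of the positive image pair and $\| f(x_i) - f(x_i^-) \|_2$ is the Euclidean distance of the negative image pair;

calculate the hinge loss of the triplet $(x_i, x_i^+, x_i^-)$:

$$\left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

calculate a triplet loss function from the similarity of images $x_i$ and $x_j$ in the grayscale image dataset, the distance between the positive image pair and the negative image pair, and the hinge loss:

$$L = \min \sum_{i} \left[ \| f(x_i) - f(x_i^+) \|_2 - \| f(x_i) - f(x_i^-) \|_2 + \alpha \right]_+$$

wherein $L$ is the minimum value of the triplet loss function;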