CN115834251A - Hypergraph Transformer-based threat hunting model establishment method - Google Patents

Hypergraph Transformer-based threat hunting model establishment method

Info

Publication number
CN115834251A
CN115834251A (application CN202310108673.8A)
Authority
CN
China
Prior art keywords
log
hypergraph
layer
matrix
threat
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310108673.8A
Other languages
Chinese (zh)
Other versions
CN115834251B (en
Inventor
邱日轩
孙欣
梁良
周欣
付俊峰
张俊峰
汪一波
林楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Information and Telecommunication Branch of State Grid Jiangxi Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
Information and Telecommunication Branch of State Grid Jiangxi Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Information and Telecommunication Branch of State Grid Jiangxi Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN202310108673.8A priority Critical patent/CN115834251B/en
Publication of CN115834251A publication Critical patent/CN115834251A/en
Application granted granted Critical
Publication of CN115834251B publication Critical patent/CN115834251B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a hypergraph Transformer-based threat hunting model establishment method, comprising the following steps: threat intelligence and system logs are used as input data, a log graph is generated by a processing module and input into the threat hunting model; the threat hunting model encodes the input data, constructs a hypergraph, and generates matrix data through a hypergraph neural network layer; features are extracted from the preprocessed data by a multi-head attention mechanism and mapped to a hyperedge matrix; finally, similarity scores between log graphs are computed by hyperedge matching, the novel power system kernel audit logs that match the cyber threat intelligence are found, and threat hunting is completed. The model can adapt to continuously evolving APT attacks, complete threat hunting of APT attacks on the novel power system, and achieve rapid response and active defense against APT attacks.

Description

Hypergraph Transformer-based threat hunting model establishment method
Technical Field
The invention relates to the technical field of threat hunting model establishment, and in particular to a hypergraph Transformer-based threat hunting model establishment method.
Background
Because power distribution in the novel power system is becoming increasingly distributed, the growing number of cross-space vulnerabilities raises the risk of APT attacks: an attacker can intrude from an external network, hide inside the novel power system's information network, tamper with the novel power system's service layer, and ultimately damage the power system.
Based on the above, this application provides a hypergraph Transformer-based threat hunting model establishment method to solve the above problems.
Disclosure of Invention
The invention aims to provide a hypergraph Transformer-based threat hunting model establishment method which, when building the log graph, preserves APT attack traces of the novel power system to the greatest extent possible in view of the long latency of APT attacks, and which uses cyber threat intelligence to adapt to continuously evolving APT attacks, so as to overcome the defects described in the background.
In order to achieve the above purpose, the invention provides the following technical scheme: a hypergraph Transformer-based threat hunting model establishment method comprising the following steps:
S1: using threat intelligence and system logs as input data, encoding the input data and constructing a hypergraph, and processing the hypergraph with a hypergraph neural network layer to generate preprocessed data;
S2: extracting feature data from the preprocessed data through a Transformer multi-head attention mechanism;
S3: computing scores for the feature data with a hyperedge matching algorithm, completing the matching of threat intelligence against the power system log library, and establishing the HTTN threat hunting model for power system APT attacks.
In a preferred embodiment, obtaining threat intelligence in step S1 includes the following steps:
S1.1: collecting kernel audit log streams of the power system through the operating system's various kernel audit engines, and building the power system log graph by passing the log streams through a processing unit module;
S1.2: collecting cyber threat intelligence from various open-source or private threat intelligence libraries, and generating the threat intelligence log graph with a threat intelligence processing module;
S1.3: feeding the power system log graph and the threat intelligence log graph into the HTTN threat hunting model together, and computing scores between subgraphs of the novel power system log graph and the threat intelligence log graph through log graph matching;
S1.4: obtaining all operating system logs in the novel power system log library that match the threat intelligence by setting a score threshold for the HTTN threat hunting model, discovering unknown APT attacks through the HTTN threat hunting model, and completing the threat hunting of APT attacks.
In a preferred embodiment, the HTTN threat hunting model comprises a graph information input layer, a hypergraph construction layer, a hypergraph neural network layer, a hypergraph Transformer coding layer, a hyperedge matching layer, and a score calculation layer;
the graph information input layer is generated as follows:
N log graph pairs constitute the data input, each pair denoted (G_i, G_j); the number of nodes and edges of each log graph G_i or G_j is arbitrary;
for any log graph G in a pair, the log graph is written G = (V, E), where |V| = n and |E| = m denote the numbers of nodes and edges respectively;
an adjacency matrix A ∈ R^(n×n) characterizes the connection information of log graph G, where R is the set of real numbers;
X ∈ R^(n×d) denotes the node feature matrix of log graph G, where d is the node feature dimension; the paired log graph is represented in the same way.
In a preferred embodiment, in the hypergraph construction layer, the log hypergraph is defined as G_hyper = (V, E, X, W), comprising a log node set V, a log hyperedge set E, a log node feature matrix X, and a diagonal hyperedge weight matrix W. Each hyperedge of the log hypergraph contains at least two nodes, and an incidence matrix H ∈ R^(|V|×|E|) models the non-pairwise node relationships; the entries of H are defined as:
h(v, e) = 1 if v ∈ e, and h(v, e) = 0 otherwise,
i.e. an element of the incidence matrix is 1 if an edge joins node v to hyperedge e and 0 otherwise. The degree of a node v is d(v) = Σ_(e∈E) w(e) h(v, e), and the degree of a hyperedge e is δ(e) = Σ_(v∈V) h(v, e); the diagonal node degree matrix and the diagonal hyperedge degree matrix are denoted D_v and D_e respectively.
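As a concrete illustration of these definitions, here is a minimal NumPy sketch over a toy hypergraph (illustrative only, not the patent's implementation) of the incidence matrix H and the diagonal degree matrices D_v and D_e:

```python
import numpy as np

def incidence_matrix(num_nodes, hyperedges):
    """Build the |V| x |E| incidence matrix H: h(v, e) = 1 iff node v lies in hyperedge e."""
    H = np.zeros((num_nodes, len(hyperedges)))
    for j, edge in enumerate(hyperedges):
        for v in edge:
            H[v, j] = 1.0
    return H

def degree_matrices(H, w=None):
    """Diagonal node-degree matrix D_v (edge-weighted) and hyperedge-degree matrix D_e."""
    if w is None:
        w = np.ones(H.shape[1])          # unit hyperedge weights
    D_v = np.diag(H @ w)                 # d(v) = sum_e w(e) h(v, e)
    D_e = np.diag(H.sum(axis=0))         # delta(e) = sum_v h(v, e)
    return D_v, D_e

# Toy log hypergraph: 4 nodes, hyperedges {0,1,2} and {2,3}
H = incidence_matrix(4, [(0, 1, 2), (2, 3)])
D_v, D_e = degree_matrices(H)
```

Node 2 sits in both hyperedges, so its degree is 2; the first hyperedge has degree 3 because it contains three nodes.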
In a preferred embodiment, in the hypergraph construction layer, the log hypergraph of the power system is constructed by random walk: for each log node v, a random walk of step length K is performed on the ordinary log graph G, and the sampled node sequence is then taken as a hyperedge, yielding the hyperedge incidence matrix H.
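The random-walk hyperedge sampling described above can be sketched as follows (a simplified illustration over a toy adjacency list; the patent gives no code, so the function and graph here are assumptions):

```python
import random

def random_walk_hyperedges(adj, k):
    """For each node v, take a length-k random walk on the ordinary log graph G
    and use the visited node sequence as one hyperedge (one hyperedge per node)."""
    hyperedges = []
    for v in adj:
        walk, cur = [v], v
        for _ in range(k):
            if not adj[cur]:
                break                      # dead end: stop the walk early
            cur = random.choice(adj[cur])
            walk.append(cur)
        hyperedges.append(set(walk))       # a hyperedge contains 2+ distinct nodes
    return hyperedges

# Toy log graph as an adjacency list (nodes 0..3 in a path)
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
edges = random_walk_hyperedges(adj, k=3)
```

Each sampled hyperedge contains its start node plus the nodes visited along the walk, which is what the incidence matrix H is then built from.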
In a preferred embodiment, in the hypergraph neural network layer, an HGNN layer is added to the HTTN threat hunting model. For the l-th HGNN layer, the log hypergraph incidence matrix H and the hidden representation matrix X^(l) are taken as input, and the nodes of the next layer are computed as:
X^(l+1) = σ(D_v^(−1/2) H W D_e^(−1) H^T D_v^(−1/2) X^(l) Θ^(l)),
where σ is a nonlinear activation function, Θ^(l) is the trainable parameter matrix of the l-th layer, and D_v, D_e, and W are the diagonal node degree, hyperedge degree, and hyperedge weight matrices respectively.
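The layer update above follows the standard HGNN propagation rule; a minimal NumPy sketch (toy shapes, ReLU standing in for σ; not the patent's implementation):

```python
import numpy as np

def hgnn_layer(H, X, W_edge, Theta):
    """One HGNN layer: X' = sigma(Dv^{-1/2} H W De^{-1} H^T Dv^{-1/2} X Theta)."""
    w = np.diag(W_edge)                      # hyperedge weights as a vector
    Dv = np.diag(H @ w)                      # weighted node degrees
    De = np.diag(H.sum(axis=0))              # hyperedge degrees
    Dv_inv_sqrt = np.diag(1.0 / np.sqrt(np.diag(Dv)))
    De_inv = np.diag(1.0 / np.diag(De))
    agg = Dv_inv_sqrt @ H @ W_edge @ De_inv @ H.T @ Dv_inv_sqrt @ X @ Theta
    return np.maximum(agg, 0.0)              # ReLU as the nonlinearity sigma

# 4 nodes, 2 hyperedges, 3-d input features -> 2-d hidden representation
H = np.array([[1, 0], [1, 0], [1, 1], [0, 1]], dtype=float)
X = np.random.randn(4, 3)
out = hgnn_layer(H, X, W_edge=np.eye(2), Theta=np.random.randn(3, 2))
```

The H ... H^T product is exactly the node-edge-node conversion described next: features are gathered onto hyperedges and then scattered back to nodes.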
In a preferred embodiment, the HGNN layer performs a node-edge-node conversion of the log graph, so that the log hypergraph structure refines the hyperedge features of the logs.
In a preferred embodiment, in the hypergraph Transformer coding layer, the log hyperedge matrix E produced by the hypergraph neural network layer is input into the Transformer coding layer, which extracts the core features of the log hyperedge matrix; the hypergraph Transformer coding layer comprises a multi-head attention mechanism and a feed-forward neural network.
The self-attention mechanism is computed as:
Attention(Q, K, V) = softmax(Q K^T / √d_k) V, with Q = E W^Q, K = E W^K, V = E W^V,
where E is the log hyperedge matrix; Q, K, and V are the Query, Key, and Value vectors derived from E; d_k is the dimension of the Q and K vectors; and W^Q, W^K, W^V are randomly initialized matrices.
the multi-head attention mechanism passes through h different linear transformation pairs
Figure SMS_47
Projection mapping is carried out, and finally, the calculation results of the self-attention modules are spliced, wherein the expression is as follows:
Figure SMS_48
Figure SMS_49
initializing multiple sets of weight matrices
Figure SMS_51
Figure SMS_52
Figure SMS_54
, wherein
Figure SMS_56
Respectively calculate the respective
Figure SMS_58
Figure SMS_59
Figure SMS_60
Then obtaining the result according to the attention mechanism calculation formula
Figure SMS_50
Each group of
Figure SMS_53
Spliced sum weight matrix
Figure SMS_55
Multiplying, and finally mapping to the original space to obtain the product with the same dimension as the input dimension of the original super-edge matrix
Figure SMS_57
A feed-forward neural network: the method is composed of a full-connection layer with an activation function of RELU and a full-connection layer with a linear activation function, and is used for solving the problem that the fitting degree of a multi-head attention mechanism on data processed by a hypergraph neural network layer is not enough.
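The attention and multi-head formulas above can be sketched in NumPy as follows (toy dimensions; the projection matrices are random stand-ins for the learned W_i^Q, W_i^K, W_i^V, W^O, not trained values):

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)    # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    return softmax(Q @ K.T / np.sqrt(d_k)) @ V

def multi_head(E, Wq, Wk, Wv, Wo):
    """h heads over the hyperedge matrix E: per-head projections, concat,
    then W_o maps back to the input dimension (weights not shared across heads)."""
    heads = [attention(E @ wq, E @ wk, E @ wv) for wq, wk, wv in zip(Wq, Wk, Wv)]
    return np.concatenate(heads, axis=-1) @ Wo

rng = np.random.default_rng(0)
m, d, h, d_h = 5, 8, 2, 4                    # hyperedges, model dim, heads, head dim
E = rng.normal(size=(m, d))
Wq = [rng.normal(size=(d, d_h)) for _ in range(h)]
Wk = [rng.normal(size=(d, d_h)) for _ in range(h)]
Wv = [rng.normal(size=(d, d_h)) for _ in range(h)]
Wo = rng.normal(size=(h * d_h, d))
out = multi_head(E, Wq, Wk, Wv, Wo)
```

As the text states, the output has the same dimension as the input hyperedge matrix.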
In a preferred embodiment, the hyperedge matching layer scores the hyperedges of a hypergraph pair G_1 and G_2, constructing a score matrix for the hypergraph pair. For each hyperedge e_i^1 of G_1, a score is computed from a Gaussian kernel over all hyperedges of the paired graph G_2:
s(e_i^1) = Σ_(j=1…m_2) exp(−‖e_i^1 − e_j^2‖² / (2σ²)),
where m_2 is the number of hyperedges in G_2; e_i^1 and e_j^2 denote hyperedge representations of hypergraphs G_1 and G_2; and σ controls the range of action of the Gaussian kernel, with larger σ giving a larger local influence range.
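A minimal sketch of this Gaussian-kernel hyperedge scoring (toy 2-d hyperedge embeddings; illustrative only):

```python
import numpy as np

def hyperedge_scores(E1, E2, sigma=1.0):
    """For each hyperedge embedding in E1, sum the Gaussian kernel
    exp(-||e1 - e2||^2 / (2 sigma^2)) over every hyperedge of the paired graph."""
    diff = E1[:, None, :] - E2[None, :, :]            # (m1, m2, d) pairwise differences
    sq = (diff ** 2).sum(-1)                          # squared Euclidean distances
    K = np.exp(-sq / (2.0 * sigma ** 2))              # larger sigma -> wider influence
    return K.sum(axis=1)                              # one score per hyperedge of E1

E1 = np.array([[0.0, 0.0], [3.0, 4.0]])
E2 = np.array([[0.0, 0.0]])
scores = hyperedge_scores(E1, E2, sigma=1.0)          # identical edge scores ~1, distant edge ~0
```

An identical pair of hyperedge embeddings scores exp(0) = 1, while a distant pair is driven toward 0, which is what lets a score threshold separate matching from non-matching log hyperedges.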
In a preferred embodiment, in the score calculation layer, the matrix is processed by fully connected layers to produce the predicted score ŝ, and the loss is computed as:
L = (1/|G|) Σ_((G_i,G_j)∈G) (ŝ(G_i, G_j) − s(G_i, G_j))²,
where G is the set of training graph pairs and s(G_i, G_j) is the actual score between log graphs G_i and G_j.
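The loss here is an ordinary mean squared error over graph pairs; a tiny worked example:

```python
import numpy as np

def mse_loss(pred, true):
    """Mean squared error over the set of training graph pairs G:
    L = (1/|G|) * sum over pairs of (s_hat - s)^2."""
    pred, true = np.asarray(pred), np.asarray(true)
    return float(((pred - true) ** 2).mean())

# Three graph pairs: predicted vs. actual similarity scores
loss = mse_loss([0.9, 0.2, 0.5], [1.0, 0.0, 0.5])   # (0.01 + 0.04 + 0) / 3
```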
In the above technical scheme, the invention provides the following technical effects and advantages:
a hypergraph is built from cyber threat intelligence and novel power system kernel audit logs; the relationships between higher-order hypergraph nodes are learned through the HGNN layer and the features are mapped into a hyperedge matrix; a multi-head attention mechanism is applied to the hyperedge matrix by the Transformer coding layer; and similarity scores between log graphs are finally computed through hyperedge matching, finding the novel power system kernel audit logs that match the cyber threat intelligence.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed in the embodiments are briefly described below; it is obvious that the drawings described below are only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flow chart of threat hunting according to the present invention.
FIG. 2 is a schematic diagram of the HTTN threat hunting model according to the present invention.
FIG. 3 is a flow chart of the construction of a Trojan log hypergraph according to the invention.
FIG. 4 is a schematic view of a multi-headed attention mechanism of the present invention.
FIG. 5 is a graph of the mean square error variation of each model training process according to the present invention.
FIG. 6 is a graph of the Spearman rank correlation coefficient ρ variation of each model training process according to the present invention.
FIG. 7 is a graph of the precision@10 variation of each model training process of the present invention.
FIG. 8 is a comparison chart of hunting time of models according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example 1
In this embodiment, the hypergraph Transformer-based threat hunting model establishment method includes the following steps:
S1, using threat intelligence and system logs as input data, encoding the input data, constructing a hypergraph, and processing the hypergraph with a hypergraph neural network layer to generate preprocessed data;
S2, extracting feature data from the preprocessed data through a Transformer multi-head attention mechanism;
S3, computing similarity scores for the feature data with a hyperedge matching algorithm, completing the matching of threat intelligence against the power system log library, and establishing the HTTN threat hunting model for power system APT attacks.
A hypergraph is constructed from cyber threat intelligence and novel power system kernel audit logs; the relationships between higher-order hypergraph nodes are learned through the HGNN layer and the features are mapped into a hyperedge matrix; the Transformer coding layer applies a multi-head attention mechanism to the hyperedge matrix; and similarity scores between log graphs are finally computed through hyperedge matching to find the novel power system kernel audit logs matching the cyber threat intelligence. The model can thus adapt to continuously evolving APT attacks, complete threat hunting of APT attacks on the novel power system, and achieve rapid response and active defense against APT attacks.
Referring to fig. 1, in step S1, obtaining threat intelligence includes the following steps:
1) The novel power system kernel audit log streams are collected through the various operating system kernel audit engines, and the novel power system log graph is built by passing the log streams through a processing unit module;
2) Cyber threat intelligence is manually collected from various open-source or private threat intelligence libraries, and the threat intelligence log graph is generated through a threat intelligence processing module;
3) The novel power system log graph and the threat intelligence log graph are input into the HTTN threat hunting model together, and similarity scores between subgraphs of the novel power system log graph and the threat intelligence log graph are computed through log graph similarity matching;
4) The threat hunting expert obtains all operating system logs in the novel power system log library that match the threat intelligence by setting a similarity score threshold for the HTTN threat hunting model, discovers unknown APT attacks through the HTTN threat hunting model, and completes the threat hunting of APT attacks.
Example 2
Referring to fig. 2, the HTTN threat hunting model is composed of a graph information input layer, a hypergraph construction layer, a hypergraph neural network layer, a hypergraph Transformer coding layer, a hyperedge matching layer, and a similarity score calculation layer.
wherein ,
graph information input layer: the data input of the HTTN threat hunting model is composed of N log graph pairs, and each log graph pair can be represented as
Figure SMS_83
Wherein for each log graph
Figure SMS_85
Or log map
Figure SMS_87
The number of nodes and edges of the log graph can be arbitrary; for any set of log graph entries
Figure SMS_89
The log map is shown as
Figure SMS_91
Figure SMS_93
And
Figure SMS_95
representing the number of nodes and edges separately, and then using the adjacency matrix
Figure SMS_82
To characterize a log graph
Figure SMS_84
Wherein R is a real number set; use of
Figure SMS_86
To represent a log graph
Figure SMS_88
A feature matrix of nodes, wherein
Figure SMS_90
Is the dimension of a node, log graph
Figure SMS_92
Is expressed by a log graph
Figure SMS_94
The same is true.
Hypergraph structural layer: in order to complete the super-edge matching of the system log graph, a super graph needs to be constructed for the log graph data input by the information input layer, and the log super graph is defined as
Figure SMS_96
The log hypergraph consists of a set of log nodes
Figure SMS_97
Log edge set
Figure SMS_98
Log node feature matrix
Figure SMS_99
And log diagonal edge weight matrix
Figure SMS_100
The composition is different from that of a common log graph G, and each hyper-edge of the log hyper-graph comprises two or more nodes; and use the incidence matrix
Figure SMS_101
To model the unpaired node relationship, the entries in H are defined as:
Figure SMS_102
(1),
wherein ,
Figure SMS_103
representing the assignment of elements in the incidence matrix, wherein if an edge exists between two nodes, the value is 1, and if no edge exists between the two nodes, the value is 0; the number of nodes v is represented as
Figure SMS_104
Figure SMS_105
Representing the degree of the fixed point; the number of times of the edge e is expressed as
Figure SMS_106
The node degree diagonal matrix and the super-edge degree diagonal matrix are respectively expressed as
Figure SMS_107
And
Figure SMS_108
In the hypergraph construction layer, the novel power system log hypergraph is constructed by the random walk method:
for each log node v, a random walk of step length K is performed on the ordinary log graph G, and the sampled node sequence is then taken as a hyperedge, yielding the hyperedge incidence matrix H.
Referring to fig. 3, a process for constructing a log hypergraph in a Trojan attack scenario of an APT attack is shown, wherein,
node a represents an untrusted external address;
node B represents a browser;
the node C represents a Trojan file;
node D represents the executed Trojan process;
node E represents a dash script command line;
node F represents a command to display the server network configuration;
node G represents a command to display the host name;
node H represents a command to monitor the server TCP/IP network connection;
the node I represents a configuration file in the server containing sensitive information such as account numbers and passwords;
leakage of such configuration files can directly enable an attacker to intrude into the novel power system's service layer and tamper with service layer data.
Hypergraph neural network layer: the Hyper Graph Neural Network (HGNN) is a neural network model considering a high-order node relationship rather than a pair node relationship, and because the kernel audit log graph nodes of the novel power system have the characteristics of complexity and stage APT attack, the correlation of the nodes between the log graphs cannot be fully extracted only by matching and training the pair nodes of the log graphs, and thus the trained model has poor matching effect on the APT attack threat intelligence logs.
And because the HGNN shows better performance than a traditional graph volume network (GCN) in terms of encoding log node position correlation, in order to better capture complex node relations in the log hypergraph, a HGNN layer is added in the HTTN threat hunting model. Wherein, for the l-th layer in the HGNN layer, the log hypergraph H and the hidden representation matrix
Figure SMS_110
As input, the nodes of the next layer are then computed as follows:
Figure SMS_111
Figure SMS_112
(2),
wherein
Figure SMS_113
Is a non-linear activation function and,
Figure SMS_114
represents the training parameter matrix of the l < th > layer,
Figure SMS_115
Figure SMS_116
Figure SMS_117
respectively diagonal node degree, edge degree and edge weight matrix,
Figure SMS_118
is a matrix of trainable parameters.
The HGNN layer performs a node-edge-node conversion of the log graph, so that the log hypergraph structure better refines the hyperedge features of the logs. In the HTTN threat hunting model, to improve the hyperedge matching effect of the subsequent hyperedge matching layer, the node-edge conversion is applied to the novel power system log graph so that node features are embedded into the hyperedge matrix.
In the HTTN threat hunting model, the initial log node features X^(0) are transformed by a learnable parameter matrix Θ; the log node features are then gathered along the hyperedges to form the hyperedge feature matrix E; finally, multiplying by the incidence matrix H aggregates the related hyperedge features. The HGNN layer can thus fully extract the positional and feature information of nodes in the novel power system and threat intelligence log graphs, improving the similarity scores of the subsequent hyperedge matching.
Hypergraph Transformer coding layer: and inputting the log hyper-edge matrix E processed by the hyper-graph neural network layer into a Transformer coding layer. The Transformer coding layer can extract core characteristics in the log super-edge matrix, and the problem of dependence between log super-edges is weakened. The Transformer coding layer mainly comprises the following two structures:
a multi-head attention mechanism: the self-attention mechanism is an improvement of the original attention mechanism and is a core technology in a Transformer model. The self-attention calculation formula is as follows:
Figure SMS_123
Figure SMS_124
Figure SMS_125
Figure SMS_126
(3),
wherein E is a log super-edge matrix, Q, K, V are Query, key and Value vectors, respectively, from E,
Figure SMS_127
represents the dimension of the vector of Q and K,
Figure SMS_128
Figure SMS_129
Figure SMS_130
the matrix is randomly initialized, and the model can learn proper parameters in back propagation;
the multi-head attention mechanism can find the dayPosition features in log hyperedges are calculated simultaneously by multiple sets of weights, the weights are not shared among the position features, nodes of each hyperedge in the log hypergraph pay attention to features of surrounding nodes by stacking attention layers, and the multi-head attention mechanism is realized by h different linear transformation pairs
Figure SMS_131
Performing projection mapping;
as shown in fig. 4, the calculation results of the self-attention module are finally concatenated, and the formula is as follows:
Figure SMS_132
Figure SMS_133
(4),
first, multiple sets of weight matrices are initialized
Figure SMS_135
Figure SMS_136
Figure SMS_138
, wherein
Figure SMS_140
Respectively calculate each of
Figure SMS_142
Figure SMS_143
Figure SMS_144
Then obtaining the result according to the attention mechanism calculation formula
Figure SMS_134
Each group of
Figure SMS_137
Post-concatenation (Concat) with weight matrix
Figure SMS_139
Multiplying, and finally mapping to the original space to obtain the product with the same dimension as the input dimension of the original super-edge matrix
Figure SMS_141
A feed-forward neural network: the feedforward neural network of the hypergraph Transformer coding layer mainly solves the problem that the fitting degree of a multi-head attention mechanism to data processed by the hypergraph neural network layer is not enough so as to better generalize a function, and the feedforward neural network is composed of a full-connection layer with an activation function being a RELU and a full-connection layer with a linear activation function.
Hyperedge matching layer: the correlation between log hyperedges is very important for a graph matching model, so a hyperedge matching mechanism is used in the HTTN threat hunting model. Traditional graph matching mostly matches node by node; because APT attacks are covert and long-lasting, considering only the correlation of log graph nodes or single edges matches APT attack threat intelligence poorly in the novel power system log library. The HTTN threat hunting model therefore matches hyperedges instead of node features, which is more efficient and more accurate than matching all nodes of the whole graph;
the core part of the super edge matching layer is a pair of super graphs
Figure SMS_145
And
Figure SMS_146
the similarity scores between the super edges are calculated by first constructing a similarity score matrix of the graph pair
Figure SMS_147
To a
Figure SMS_148
Each of the super edges
Figure SMS_149
Calculating it from the other graph of the pair
Figure SMS_150
The gaussian kernel function of all hyper-edges of (a) calculates a score, i.e.:
Figure SMS_151
(5),
wherein ,
Figure SMS_152
is that
Figure SMS_153
The number of the middle-out edges,
Figure SMS_154
and
Figure SMS_155
representation hypergraph
Figure SMS_156
And
Figure SMS_157
the super-edge in (1) indicates that,
Figure SMS_158
the larger the value of the action range of the Gaussian kernel function is, the larger the local influence range of the Gaussian kernel function is.
Similarity score calculation layer: after the log graph similarity score matrix is obtained, a fully connected neural network gradually reduces its dimension and fits the function, realizing the similarity score calculation of the log graphs; the principle of a fully connected layer is to transform one feature space linearly into another through matrix-vector products, finally achieving the dimensionality reduction of the matrix;
the similarity matrix, processed by the fully connected layers, produces the predicted similarity score ŝ, which is compared with the actual similarity score by the following mean squared error loss function to measure how well the model matches the novel power system log graph with the threat intelligence log graph:
L = (1/|G|) Σ_((G_i,G_j)∈G) (ŝ(G_i, G_j) − s(G_i, G_j))² (6),
where G is the set of training graph pairs and s(G_i, G_j) is the actual similarity score between log graphs G_i and G_j.
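The fully connected dimension reduction described here can be sketched as a small two-layer (ReLU then linear) scoring head; the weights below are random stand-ins for trained parameters, and the layer sizes are assumptions for illustration:

```python
import numpy as np

def fc_score_head(scores, W1, b1, W2, b2):
    """Reduce a hyperedge score vector to a scalar similarity score
    via two fully connected layers: ReLU hidden layer, then linear output."""
    h = np.maximum(scores @ W1 + b1, 0.0)    # linear map to a smaller feature space
    return float(h @ W2 + b2)                # final linear layer -> scalar s_hat

rng = np.random.default_rng(1)
scores = rng.normal(size=8)                  # per-hyperedge matching scores
W1, b1 = rng.normal(size=(8, 4)), np.zeros(4)
W2, b2 = rng.normal(size=4), 0.0
s_hat = fc_score_head(scores, W1, b1, W2, b2)
```

Each matrix product is exactly the linear feature-space transformation the text describes, with the hidden width shrinking at each layer until a single score remains.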
Example 3
To verify the accuracy and efficiency of the HTTN threat hunting model for APT attack threat hunting, this application uses a data set mixing Linux kernel audit logs with several APT attack scenarios and runs comparison experiments against traditional graph regression models such as SimGNN, GraphSim, H2MN, and HGMN, finally showing that the proposed HTTN threat hunting model performs better at matching APT attack threat intelligence.
Experimental preparation and environment: the experiments ran on Ubuntu 16.04 with 4 NVIDIA TITAN TX 2080Ti graphics cards and CUDA 10.2; the experimental environment was Python 3.7, written with the PyTorch framework. The optimal hyperparameters of the HTTN threat hunting model were determined by grid search; the relevant hyperparameters are shown in Table 1:
Figure SMS_164
Table 1,
During training of the HTTN threat hunting model, the Adam algorithm is used to optimize model parameters; Adam is a first-order optimization algorithm that can replace the traditional gradient descent process, needs less memory during training, and computes more efficiently, making it suitable for the large scale of power system kernel audit log data.
Evaluation method: to accurately evaluate the matching effect of the proposed HTTN threat hunting model, model performance is measured by the mean squared error (MSE), the Spearman rank correlation coefficient ρ, and precision@10 (p@10);
MSE measures the mean squared deviation between the predicted and true similarity scores, as in equation (6); ρ evaluates the rank correlation between the predicted ranking and the true ranking; p@10 computes the intersection of the top 10 results by predicted similarity score with the top 10 by actual similarity score, divided by 10.
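These metrics can be computed as follows (a simplified sketch: the Spearman rank routine here assumes no tied scores, and p@k compares top-k index sets; not the authors' evaluation code):

```python
import numpy as np

def spearman_rho(pred, true):
    """Spearman rank correlation: Pearson correlation of the rank vectors
    (simple double-argsort ranking; assumes no ties)."""
    def ranks(x):
        return np.argsort(np.argsort(x)).astype(float)
    rp, rt = ranks(np.asarray(pred)), ranks(np.asarray(true))
    rp -= rp.mean()
    rt -= rt.mean()
    return float((rp @ rt) / np.sqrt((rp @ rp) * (rt @ rt)))

def precision_at_k(pred, true, k=10):
    """|top-k by predicted score  intersect  top-k by true score| / k."""
    top_p = set(np.argsort(pred)[-k:])
    top_t = set(np.argsort(true)[-k:])
    return len(top_p & top_t) / k

pred = [0.9, 0.1, 0.5, 0.7]
true = [1.0, 0.0, 0.4, 0.8]
rho = spearman_rho(pred, true)               # identical ordering -> 1.0
p2 = precision_at_k(pred, true, k=2)         # same top-2 set -> 1.0
```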
Data set introduction and preprocessing: the experimental data set comes from Linux kernel audit logs collected under several APT attack scenarios. The novel power system has a distributed architecture in which most services are deployed on Linux servers, so server security requirements are high; the kernel audit log records programs, processes, and user operations at the Linux kernel level, and can capture log information for every stage of an APT attack. In a log graph, a node represents a command or program, and an edge represents a dependency between commands or programs.
In the data set, 1000 log graph pairs are randomly selected and divided into training, test, and validation sets at 60%, 20%, and 20%. Because APT attacks are stealthy, log graphs generated from threat intelligence generally have no more than 15 nodes. The A* algorithm is applied to the data set to generate the similarity score of each log graph pair.
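The 60%/20%/20% split described above can be sketched as follows (illustrative only; the function name and the fixed seed are assumptions, not from the application):

```python
import random

def split_pairs(pairs, seed=0):
    # Shuffle then cut 60% / 20% / 20% into train / test / validation sets,
    # mirroring the experimental setup described above.
    pairs = list(pairs)
    random.Random(seed).shuffle(pairs)
    n = len(pairs)
    n_train, n_test = int(n * 0.6), int(n * 0.2)
    return (pairs[:n_train],
            pairs[n_train:n_train + n_test],
            pairs[n_train + n_test:])

# 1000 log graph pair identifiers, as in the experiment.
train, test, val = split_pairs(range(1000))
```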
Analysis of experimental results across models: the HTTN threat hunting model proposed in the present application was compared with the traditional SimGNN, GraphSim, HGMN, and H2MN graph regression models; the experimental results are shown in Table 2:

[Table 2: comparison of HTTN with SimGNN, GraphSim, HGMN, and H2MN (table rendered as an image in the original)]
As shown in Figures 5, 6, and 7, on the Linux log data set containing APT attacks, the mean squared error of the proposed HTTN threat hunting model is 0.81 lower than that of SimGNN, about 0.27 lower than GraphSim, about 0.166 lower than HGMN, and about 0.046 lower than H2MN.

On the Spearman rank correlation coefficient, the proposed HTTN model improves on the SimGNN, GraphSim, HGMN, and H2MN models by 0.06, 0.0226, 0.0076, and 0.0126 respectively.

On the p@10 index, the HTTN model improves on SimGNN by about 0.1, on GraphSim by about 0.015, on HGMN by 0.0147, and on H2MN by 0.011. Taken together, the MSE, ρ, and p@10 results demonstrate the effectiveness of applying the Transformer encoding layer's multi-head attention to the log graph hyperedge matrix, and show that HTTN matches threat intelligence better than the other four models.
When the new power system suffers an APT attack based on a zero-day vulnerability, the longer the attack persists in the system, the greater the damage; the shorter the response time of the threat hunting model, the better. We therefore ran a comparison of the time different models need to compute log graph similarity scores. As shown in Figure 8, the HTTN threat hunting model computes the log graph similarity score 6.14, 7.1, and 5.35 milliseconds faster than the SimGNN, GraphSim, and HGMN models respectively, and differs only slightly from the time consumed by the H2MN model. The HTTN threat hunting model thus also optimizes computation time on log graphs.
The above embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, the above-described embodiments may be implemented in whole or in part in the form of a computer program product. The computer program product comprises one or more computer instructions or computer programs. The procedures or functions according to the embodiments of the present application are wholly or partially generated when the computer instructions or the computer program are loaded or executed on a computer. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable device. The computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another; for example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center in a wired or wireless (e.g., infrared, radio, microwave) manner. The computer-readable storage medium can be any available medium that can be accessed by a computer, or a data storage device such as a server or data center that integrates one or more available media. The usable medium may be a magnetic medium (e.g., floppy disk, hard disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium. The semiconductor medium may be a solid state disk.
It should be understood that the term "and/or" in this application describes only an association relationship between associated objects, meaning that three relationships may exist; e.g., A and/or B may mean: A exists alone, A and B exist simultaneously, or B exists alone, where A and B may be singular or plural. In addition, "/" in the present application generally indicates an "or" relationship between the objects before and after it, but may also indicate an "and/or" relationship; refer to the surrounding text for the specific meaning.
In the present application, "at least one" means one or more, "a plurality" means two or more. "at least one of the following" or similar expressions refer to any combination of these items, including any combination of the singular or plural items. For example, at least one (one) of a, b, or c, may represent: a, b, c, a-b, a-c, b-c, or a-b-c, wherein a, b, c may be single or multiple.
It should be understood that, in the various embodiments of the present application, the sequence numbers of the above-mentioned processes do not imply any order of execution, and the order of execution of the processes should be determined by their functions and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present application.
Those of ordinary skill in the art would appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application or portions thereof that substantially contribute to the prior art may be embodied in the form of a software product stored in a storage medium and including instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a portable hard disk, a read-only memory (ROM), a Random Access Memory (RAM), a magnetic disk, an optical disk, or other various media capable of storing program codes.
The above description is only for the specific embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present application, and shall be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (10)

1. A hypergraph Transformer-based threat hunting model establishing method, characterized in that the establishing method comprises the following steps:
s1: using threat intelligence and system logs as input data, encoding the input data and constructing a hypergraph, and processing the hypergraph by a hypergraph neural network layer to generate preprocessed data;
s2: extracting characteristic data from the preprocessed data through a Transformer multi-head attention mechanism;
s3: and calculating the score of the characteristic data through a super-edge matching algorithm, completing the matching of threat intelligence in a power system log library, and establishing an HTTN threat hunting model of the APT attack of the power system.
2. The hypergraph Transformer-based threat hunting model building method of claim 1, wherein: in step S1, the threat intelligence acquisition includes the following steps:
s1.1: acquiring the kernel audit log streams of the power system through the various kernel audit engines of the operating system, and constructing a log graph of the power system from the log streams through a stream processing unit module;
s1.2: collecting network threat intelligence in various open sources or private threat intelligence libraries, and generating a threat intelligence log graph through a threat intelligence processing module;
s1.3: inputting the power system log graph and the threat intelligence log graph into the HTTN threat hunting model together, and calculating matching scores between subgraphs of the novel power system log graph and the threat intelligence log graph through log graph matching;
s1.4: obtaining all operating system logs matched with threat intelligence in the novel power system log library by setting a score threshold for the HTTN threat hunting model, so that unknown APT attacks are found through the HTTN threat hunting model and the threat hunting of APT attacks is completed.
3. The hypergraph Transformer-based threat hunting model building method of claim 2, wherein: the HTTN threat hunting model comprises a graph information input layer, a hypergraph construction layer, a hypergraph neural network layer, a hypergraph Transformer coding layer, a hypergraph matching layer and a function calculation layer;
the graph information input layer operates as follows:
N log graph pairs constitute the data input, each log graph pair being denoted (G₁, G₂), where each log graph G₁ or G₂ may have an arbitrary number of nodes and edges;
any log graph in a pair is written G = (V, E), where n = |V| and m = |E| denote the number of nodes and the number of edges respectively;
an adjacency matrix A ∈ R^(n×n) characterizes the connection information of log graph G₁, where R is the set of real numbers;
X ∈ R^(n×d) denotes the node feature matrix of log graph G₁, where d is the node feature dimension; log graph G₂ is represented in the same way.
4. The hypergraph Transformer-based threat hunting model establishing method according to claim 3, wherein: in the hypergraph construction layer, the log hypergraph is defined as G = (V, E, X, W), comprising a log node set V, a log hyperedge set E, a log node feature matrix X, and a diagonal log hyperedge weight matrix W; each hyperedge of the log hypergraph contains at least two nodes, and an incidence matrix H ∈ R^(|V|×|E|) is used to model the non-pairwise node relationships, with the entries of H defined as:

h(v, e) = 1 if node v belongs to hyperedge e, and h(v, e) = 0 otherwise;

that is, an element of the incidence matrix takes the value 1 if the node lies on the hyperedge and 0 if it does not; the degree of a node v is expressed as d(v) = Σ_{e∈E} w(e)·h(v, e), and the degree of a hyperedge e is expressed as δ(e) = Σ_{v∈V} h(v, e); the node-degree diagonal matrix and the hyperedge-degree diagonal matrix are denoted D_v and D_e respectively.
5. The hypergraph Transformer-based threat hunting model building method of claim 4, wherein: in the hypergraph construction layer, the log hypergraph of the power system is constructed by a random walk method: for each log node v, a random walk of step length K is performed on the ordinary log graph G, and the sampled node sequence is taken as a hyperedge, thereby obtaining the hyperedge incidence matrix H of the hypergraph.
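The random-walk hyperedge sampling recited in this claim can be sketched as follows (an illustrative reading of the claim only; the adjacency-matrix input, the function name, and the choice of exactly one hyperedge per start node are assumptions):

```python
import random

def random_walk_hyperedges(adj, walk_len=4, seed=0):
    # For each node v, run a random walk of length walk_len on the ordinary
    # log graph and take the set of visited nodes as one hyperedge.
    rng = random.Random(seed)
    hyperedges = []
    for v in range(len(adj)):
        walk, cur = {v}, v
        for _ in range(walk_len):
            neighbours = [u for u, a in enumerate(adj[cur]) if a]
            if not neighbours:          # dead end: stop this walk early
                break
            cur = rng.choice(neighbours)
            walk.add(cur)
        hyperedges.append(sorted(walk))
    return hyperedges

# Toy 4-node path graph 0-1-2-3 standing in for an ordinary log graph.
adj = [[0, 1, 0, 0],
       [1, 0, 1, 0],
       [0, 1, 0, 1],
       [0, 0, 1, 0]]
edges = random_walk_hyperedges(adj)
```

Each sampled node set can then be written as one column of the hypergraph incidence matrix H.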
6. The hypergraph Transformer-based threat hunting model establishing method according to claim 3, wherein: in the hypergraph neural network layer, an HGNN layer is added to the HTTN threat hunting model; for the l-th HGNN layer, the log hypergraph incidence matrix H and the hidden representation matrix X^(l) are taken as input, and the node representations of the next layer are computed as:

X^(l+1) = σ( D_v^(−1/2) · H · W · D_e^(−1) · Hᵀ · D_v^(−1/2) · X^(l) · Θ^(l) )

where σ(·) is the nonlinear sigmoid activation function, Θ^(l) is the trainable parameter matrix of the l-th layer, and D_v, D_e, and W are the diagonal node-degree, hyperedge-degree, and hyperedge-weight matrices respectively.
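The layer-wise propagation above can be sketched as a minimal NumPy implementation of the standard hypergraph convolution with a sigmoid activation (the toy hypergraph, feature values, and identity parameter matrix are illustrative assumptions):

```python
import numpy as np

def hgnn_layer(X, H, w, Theta):
    # One hypergraph convolution:
    #   X' = sigmoid(Dv^-1/2  H  W  De^-1  H^T  Dv^-1/2  X  Theta)
    dv = H @ w                       # node degrees d(v) = sum_e w(e) h(v, e)
    de = H.sum(axis=0)               # hyperedge degrees delta(e)
    Dv_is = np.diag(1.0 / np.sqrt(dv))
    De_inv = np.diag(1.0 / de)
    S = Dv_is @ H @ np.diag(w) @ De_inv @ H.T @ Dv_is
    Z = S @ X @ Theta
    return 1.0 / (1.0 + np.exp(-Z))  # sigmoid activation

# Toy log hypergraph: 3 nodes, 2 hyperedges, 2-dimensional node features.
H = np.array([[1.0, 1.0],
              [1.0, 0.0],
              [0.0, 1.0]])           # incidence matrix h(v, e)
w = np.ones(2)                       # hyperedge weights
X = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.5, 0.5]])           # node feature matrix
Theta = np.eye(2)                    # trainable parameters (identity here)
X_next = hgnn_layer(X, H, w, Theta)
```

The normalized propagation matrix S mixes each node's features with those of all nodes sharing a hyperedge with it, which is the node-edge-node refinement described in claim 7.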
7. The hypergraph Transformer-based threat hunting model building method of claim 6, wherein: the HGNN layer performs a node-edge-node transformation of the log graph, so that the log hypergraph structure refines the hyperedge features of the log.
8. The hypergraph Transformer-based threat hunting model establishing method according to claim 3, wherein: the hypergraph Transformer coding layer inputs the log hyperedge matrix E processed by the hypergraph neural network layer into the Transformer coding layer, which extracts the core features of the log hyperedge matrix; the hypergraph Transformer coding layer comprises a multi-head attention mechanism and a feed-forward neural network;

the self-attention mechanism is computed as:

Q = E·W^Q,  K = E·W^K,  V = E·W^V

Attention(Q, K, V) = softmax( Q·Kᵀ / √d_k ) · V

where E is the log hyperedge matrix; Q, K, and V are the Query, Key, and Value matrices, respectively, derived from E; d_k is the dimension of the Q and K vectors; and W^Q, W^K, and W^V are randomly initialized matrices;

the multi-head attention mechanism applies h different linear transformations to project (Q, K, V) and finally concatenates the outputs of the self-attention modules:

head_i = Attention( E·W_i^Q, E·W_i^K, E·W_i^V ),  i = 1, …, h

MultiHead(E) = Concat( head_1, …, head_h ) · W^O

that is, h groups of weight matrices W_i^Q, W_i^K, W_i^V are initialized; for each group, Q_i, K_i, and V_i are computed and the attention formula yields head_i; the h heads are concatenated and multiplied by the weight matrix W^O, mapping the result back to the original space so that the output has the same dimension as the input hyperedge matrix E;
the feed-forward neural network consists of a fully connected layer with a ReLU activation function followed by a fully connected layer with a linear activation function, and compensates for the insufficient fitting of the multi-head attention mechanism to the data processed by the hypergraph neural network layer.
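The scaled dot-product attention and multi-head concatenation described in this claim can be sketched as follows (an illustrative NumPy version; the dimensions, random initialization, and variable names are assumptions):

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def multi_head_attention(E, Wq, Wk, Wv, Wo):
    # Wq/Wk/Wv are per-head projection matrices; Wo maps the concatenated
    # heads back to the input dimension of the hyperedge matrix E.
    heads = []
    for q, k, v in zip(Wq, Wk, Wv):
        Q, K, V = E @ q, E @ k, E @ v
        A = softmax(Q @ K.T / np.sqrt(K.shape[-1]))   # scaled dot-product
        heads.append(A @ V)
    return np.concatenate(heads, axis=-1) @ Wo

rng = np.random.default_rng(0)
m, d, h, dk = 5, 8, 2, 4            # 5 hyperedges, dim 8, 2 heads of dim 4
E = rng.normal(size=(m, d))         # log hyperedge matrix (toy values)
Wq = [rng.normal(size=(d, dk)) for _ in range(h)]
Wk = [rng.normal(size=(d, dk)) for _ in range(h)]
Wv = [rng.normal(size=(d, dk)) for _ in range(h)]
Wo = rng.normal(size=(h * dk, d))
out = multi_head_attention(E, Wq, Wk, Wv, Wo)
```

As the claim states, the output has the same dimension as the input hyperedge matrix, so the coding layer can be stacked or followed by the feed-forward network.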
9. The hypergraph Transformer-based threat hunting model establishing method according to claim 3, wherein: the hyperedge matching layer scores the hyperedges of a hypergraph pair G₁ and G₂ and constructs a score matrix of the hypergraph pair; for each hyperedge e_i of G₁, a score is computed from the Gaussian kernel function against all hyperedges of the other graph G₂ of the pair:

s_i = (1 / |E₂|) · Σ_{e_j ∈ E₂} exp( −‖e_i − e_j‖² / (2σ²) )

where |E₂| is the number of hyperedges in G₂; e_i and e_j denote the hyperedge representations of hypergraphs G₁ and G₂ respectively; and σ controls the range of action of the Gaussian kernel function: the larger the value of σ, the larger the local influence range of the Gaussian kernel function.
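The Gaussian-kernel hyperedge scoring of this claim can be sketched as follows (illustrative only; the toy hyperedge embeddings and function name are made up for demonstration):

```python
import numpy as np

def hyperedge_scores(E1, E2, sigma=1.0):
    # For each hyperedge embedding in E1, average the Gaussian-kernel
    # similarity exp(-||e_i - e_j||^2 / (2 sigma^2)) over all of E2's edges.
    diff = E1[:, None, :] - E2[None, :, :]       # pairwise differences
    k = np.exp(-np.sum(diff ** 2, axis=-1) / (2.0 * sigma ** 2))
    return k.mean(axis=1)                        # average over E2's edges

# Two toy hyperedge embeddings per hypergraph.
E1 = np.array([[0.0, 0.0], [1.0, 1.0]])
E2 = np.array([[0.0, 0.0], [2.0, 2.0]])
s = hyperedge_scores(E1, E2)
```

A larger sigma flattens the kernel, so even distant hyperedge embeddings contribute noticeably to the score, matching the claim's remark on the kernel's local influence range.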
10. The hypergraph Transformer-based threat hunting model building method of claim 9, wherein: in the function calculation layer, the matrix is processed by a fully connected layer to generate the predicted score ŝ(G₁, G₂), and the loss is calculated as:

L = (1 / |G|) · Σ_{(G₁, G₂) ∈ G} ( ŝ(G₁, G₂) − s(G₁, G₂) )²

where G is the set of training graph pairs and s(G₁, G₂) is the ground-truth score between log graph G₁ and log graph G₂.
CN202310108673.8A 2023-02-14 2023-02-14 Hypergraph-transform-based threat hunting model building method Active CN115834251B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310108673.8A CN115834251B (en) 2023-02-14 2023-02-14 Hypergraph-transform-based threat hunting model building method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310108673.8A CN115834251B (en) 2023-02-14 2023-02-14 Hypergraph-transform-based threat hunting model building method

Publications (2)

Publication Number Publication Date
CN115834251A true CN115834251A (en) 2023-03-21
CN115834251B CN115834251B (en) 2023-09-29

Family

ID=85521200

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310108673.8A Active CN115834251B (en) 2023-02-14 2023-02-14 Hypergraph-transform-based threat hunting model building method

Country Status (1)

Country Link
CN (1) CN115834251B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117240598A (en) * 2023-11-07 2023-12-15 国家工业信息安全发展研究中心 Attack detection method, attack detection device, terminal equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150128274A1 (en) * 2013-11-04 2015-05-07 Crypteia Networks S.A. System and method for identifying infected networks and systems from unknown attacks
US20160162690A1 (en) * 2014-12-05 2016-06-09 T-Mobile Usa, Inc. Recombinant threat modeling
CN112269316A (en) * 2020-10-28 2021-01-26 中国科学院信息工程研究所 High-robustness threat hunting system and method based on graph neural network
US11128649B1 (en) * 2019-03-06 2021-09-21 Trend Micro Incorporated Systems and methods for detecting and responding to anomalous messaging and compromised accounts
CN115221511A (en) * 2022-09-20 2022-10-21 国网江西省电力有限公司信息通信分公司 Power distribution Internet of things threat hunting method
CN115543951A (en) * 2022-11-30 2022-12-30 浙江工业大学 Log acquisition, compression and storage method based on origin map
CN115664696A (en) * 2022-08-30 2023-01-31 华北电力大学 APT attack active defense method based on threat hunting

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
KHAN, Salman Muhammad; RICHARD, Rene; MOLYNEAUX, Heather; COTE MARTEL, Danick; KAMALANATHAN ELANGO, Jackson Henry; LIVINGSTONE, Steve; GAUDET: "Cyber Threat Hunting: A Cognitive Endpoint Behavior Analytic System" *
XU, Jiacen; WANG, Yijun; XUE, Zhi: "A Survey of Cyberspace Threat Hunting" *
HU, Zhao; JIN, Wenxian; CHEN, Yuxu: "Research and Analysis on Threat Intelligence" *

Also Published As

Publication number Publication date
CN115834251B (en) 2023-09-29


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant