CN114490995A - Multistage self-attention network security cooperative disposal battle room semantic abstraction method - Google Patents

Multistage self-attention network security cooperative disposal battle room semantic abstraction method Download PDF

Info

Publication number
CN114490995A
Authority
CN
China
Prior art keywords
vector
sentence
semantic
text
vectors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210329999.9A
Other languages
Chinese (zh)
Inventor
孙捷
车洵
胡牧
孙翰墨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Zhongzhiwei Information Technology Co ltd
Original Assignee
Nanjing Zhongzhiwei Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Zhongzhiwei Information Technology Co ltd
Priority to CN202210329999.9A
Publication of CN114490995A
Legal status: Pending (current)

Links

Images

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 - Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 - Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 - Querying
    • G06F16/332 - Query formulation
    • G06F16/3329 - Natural language query formulation or dialogue systems
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 - Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 - Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 - Querying
    • G06F16/335 - Filtering based on additional data, e.g. user or group profiles
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 - Handling natural language data
    • G06F40/10 - Text processing
    • G06F40/12 - Use of codes for handling textual entities
    • G06F40/126 - Character encoding
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 - Handling natural language data
    • G06F40/20 - Natural language analysis
    • G06F40/279 - Recognition of textual entities
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 - Handling natural language data
    • G06F40/30 - Semantic analysis
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/04 - Architecture, e.g. interconnection topology
    • G06N3/048 - Activation functions
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/08 - Learning methods
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00 - Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/03 - Data mining

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a multistage self-attention network security cooperative disposal battle room semantic abstraction method. Given a section of network security cooperative disposal battle room chat records, the method comprises the following steps: recording the operation process of the network security cooperative disposal battle room to form a corpus; converting the text content in the corpus into token vectors through a word vector matrix, and combining the token vectors, sentence segmentation vectors and position vectors into complete word vectors as the input of the network structure; sending the result to the encoder of the transformer; inputting the result into the transformer module containing the attention mechanism to eliminate repeated redundant content in the text; generating a semantic abstract through the decoder in the transformer model; and outputting the semantic abstract corresponding to the text according to the k keywords with the largest weights to form a task instruction. The operation records are thereby simplified, and a concise and fluent abstract is generated.

Description

Multistage self-attention network security cooperative disposal battle room semantic abstraction method
Technical Field
The invention relates to the technical field of network security, in particular to a multistage self-attention network security cooperative disposal battle room semantic abstraction method.
Background
Against the background of the rapid development of the information age, network information security has become a topic of great public concern. Network security technology protects the software and hardware of network systems, and the data in those systems, from malicious interference. Facing an increasingly intense and persistent network attack-and-defense environment, Security Operations (SecOps) is oriented toward the integration and fusion of people, technologies and processes; it improves the global coordination of security defense resources and has become the most direct and critical link in deploying security capabilities, bringing the defense system into play, and resisting advanced threats. The network security cooperative disposal battle room, an important link in the security operations system, combines operators with an automated operations system to formulate and execute defense strategies for network security events, forming an efficient, accurate and timely network security defense system.
In the network security cooperative disposal battle room, artificial intelligence cooperates with operators to complete network security emergency response tasks. The artificial intelligence obtains task instructions by understanding the content of the operators' chat records in the battle room: it first recognizes and understands the content of the battle room chat records, extracts the key information in them, forms a script for responding to network security events according to the semantic abstract of the battle room chat records, and completes the corresponding tasks.
Natural language processing algorithms can be used to analyze the text data of the chat records in the network security cooperative disposal battle room. In recent natural language processing algorithms, the introduction of the attention mechanism has greatly improved the accuracy of semantic abstract extraction; combined with the transformer model, it can complete semantic abstract extraction from the battle room chat records.
Existing methods for analyzing network security cooperative disposal battle room chat records model the problem as a sequence labeling task: each sentence in the battle room is processed and learned, simple keyword combinations meeting the requirements are screened out, and the tasks to be executed are identified by a classifier, with no real understanding of the text semantics. Such processing of the battle room semantic abstract has the following three defects:
(1) the text semantic abstract extraction task under supervised learning requires a large amount of data to train the model, but data set resources in the network security field are scarce, and the cost of manually labeled supervision data is too high;
(2) the chat records of a network security cooperative disposal battle room contain different semantic roles, such as combat participants, general commanders and executive staff; the traditional processing method pays the same attention to the features of the text sequences input by different roles, so the understanding of the key information in a passage depends on the physical position of a keyword rather than the meaning of the word, causing redundancy in the memorized information;
(3) before the transformer model was proposed, recurrent neural networks were generally used to process text; their limitation is that they are effective on short texts but perform poorly on long texts. The battle room chat records that need to be processed, however, are complex long texts whose hierarchical structure must be understood, requiring a model able to process unstructured multi-role text.
Based on the above considerations, there is an urgent need for a multistage self-attention network security cooperative disposal battle room semantic abstraction method that solves the above problems.
Disclosure of Invention
In order to achieve the above object, the inventor provides a multistage self-attention network security cooperative disposal battle room semantic abstraction method. Given a section of network security cooperative disposal battle room chat records, semantic abstract extraction from the chat records comprises the following steps:
S1: recording the operation process of the network security cooperative disposal battle room to form a corpus; building tables for the dialog and chat-record texts in the corpus, serializing the text content, marking the roles of the operators, and organizing the analyzed problems and corresponding tasks, to form the network security cooperative disposal battle room chat-record corpus;
S2: converting the text content in the corpus into token vectors through a word vector matrix; assigning the values 0 and 1 to different sentences to obtain sentence segmentation vectors that distinguish them; using the transformer to allocate a position vector to each token vector; and combining the token vectors, the sentence segmentation vectors and the position vectors into complete word vectors as the input of the network structure;
S3: sending the output of step S2 to the encoder of the transformer;
S4: inputting the output of the transformer encoder of S3 into the transformer module containing the attention mechanism, compressing it into a vector representation, and extracting key information in combination with the context vector to eliminate repeated redundant content in the text;
S5: after the text content in the corpus has been compressed into a vector, generating a semantic abstract through the decoder in the transformer model;
S6: processing the output of step S5 with a softmax layer, and outputting the semantic abstract corresponding to the text according to the k keywords with the largest weights to form a task instruction;
S7: forming a task instruction from the semantic abstract of the network security cooperative disposal battle room chat records, thereby simplifying the operation records and generating a concise and fluent abstract.
As a preferred mode of the present invention, S2 includes the following steps: the input text $T$ is composed of $n$ sentences,

$$T = \{s_1, s_2, \ldots, s_n\},$$

where $s_i$ denotes the $i$-th sentence of the text. The text is preprocessed in order: it is segmented into words with the LTP word segmenter, noise words and stop words are removed, and normalized training corpora are generated; each sentence is assigned a label $y_i \in \{0, 1\}$, where 0 means the sentence is not recognized and 1 means it is recognized.
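As an illustration of this preprocessing step, here is a minimal Python sketch; the `segment` stub and the tiny stop-word list are placeholder assumptions (in practice the LTP segmenter and a domain stop-word list would be plugged in), and the label $y_i$ would come from annotation rather than the default used here.

```python
# Minimal preprocessing sketch (assumptions: `segment` stands in for the LTP
# word segmenter; STOP_WORDS is a placeholder domain stop-word list).
from typing import List, Tuple

STOP_WORDS = {"的", "了", "吗", "呢"}  # placeholder; a real list would be larger

def segment(sentence: str) -> List[str]:
    # Stand-in for LTP word segmentation; here we simply split on whitespace.
    return sentence.split()

def preprocess(sentences: List[str]) -> List[Tuple[List[str], int]]:
    corpus = []
    for s in sentences:
        words = [w for w in segment(s) if w not in STOP_WORDS]
        label = 0  # y_i in {0, 1}; set to 1 for sentences annotated as recognized
        corpus.append((words, label))
    return corpus
```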
As a preferred embodiment of the present invention, S2 further includes the following steps: the character symbols of the preprocessed text are converted into numeric word vectors by a word vector layer, with a head marker [CLS] and a tail marker [SEP] added; a sentence segmentation vector that distinguishes sentences and a position vector that represents the absolute position of each word are generated. The token vector, the sentence segmentation vector and the position vector all have dimension $z$. The three vectors corresponding to the input sequence are then combined, denoted by $X$:

$$X = E_{\mathrm{tok}} + E_{\mathrm{seg}} + E_{\mathrm{pos}},$$

where $E_{\mathrm{tok}}$ is the token vector of each word in the sentence; $E_{\mathrm{seg}} \in \{E_A, E_B\}$ is the sentence segmentation vector, whose alternating parity divides the sentences into A and B blocks; $E_{\mathrm{pos}} = (p_1, p_2, \ldots, p_L)$ is the position vector, with $L$ the maximum sentence length; and $X$, the input representation of the text $T$, is obtained by combining the three vectors, its row and column dimensions in the vector space all being $z$.
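A compact PyTorch sketch of this input representation is shown below; the vocabulary size, maximum length and dimension $z$ are arbitrary illustrative values, and, following the BERT convention, the three embeddings are summed.

```python
import torch
import torch.nn as nn

class InputEmbedding(nn.Module):
    """Token + segment + position embeddings, all of dimension z (BERT-style)."""
    def __init__(self, vocab_size: int = 21128, z: int = 768, max_len: int = 256):
        super().__init__()
        self.tok = nn.Embedding(vocab_size, z)   # E_tok
        self.seg = nn.Embedding(2, z)            # E_seg: A/B blocks (0/1 parity)
        self.pos = nn.Embedding(max_len, z)      # E_pos: absolute positions

    def forward(self, token_ids, segment_ids):
        positions = torch.arange(token_ids.size(1), device=token_ids.device)
        return self.tok(token_ids) + self.seg(segment_ids) + self.pos(positions)

# Usage: a batch of 2 sequences of length 8
emb = InputEmbedding()
X = emb(torch.randint(0, 21128, (2, 8)), torch.zeros(2, 8, dtype=torch.long))
print(X.shape)  # torch.Size([2, 8, 768])
```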
As a preferable aspect of the present invention, S3 includes: the input sequence $X$ of the input text $T$ is first passed into a multi-head attention block composed of several attention modules; the number of heads of the multi-head attention block is a hyper-parameter, customized here as $t$ heads, and the output dimension remains $z$. Three initialization matrices $W^{Q}$, $W^{K}$ and $W^{V}$ are then multiplied with the input sequence $X$ to obtain

$$Q = X W^{Q}, \qquad K = X W^{K}, \qquad V = X W^{V},$$

corresponding respectively to the query vector matrix, the key vector matrix and the value vector matrix.
As a preferred embodiment of the present invention, S3 further includes the following steps: since there are $t$ attention heads, the matrices are divided into $Q_h, K_h, V_h$ $(h = 1, \ldots, t)$ for the attention weights at the current moment. First the degree of association between the current word and the other words is calculated: the query vector $q_i$ and the key vectors $k_j$ of the other words are used to compute the similarity

$$e_{ij} = q_i \cdot k_j .$$

The similarity $e_{ij}$, computed as the product of the query vector and the key vectors, is scaled down by dividing by a common factor $\sqrt{d_k}$ and then normalized with the softmax function; the value obtained is the weight of the current word with respect to each word:

$$\alpha_{ij} = \frac{\exp\big(e_{ij}/\sqrt{d_k}\big)}{\sum_{j'} \exp\big(e_{ij'}/\sqrt{d_k}\big)} .$$

The weights obtained for the current word are then used with the value vectors $v_j$ to update the attention representation $z_i$ of the current word:

$$z_i = \sum_{j} \alpha_{ij} v_j .$$

The same steps are cycled over the other input positions to obtain all the outputs.
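The following is a small PyTorch sketch of this scaled dot-product attention for a single head, matching the formulas above; the tensor shapes are illustrative.

```python
import math
import torch

def scaled_dot_product_attention(Q, K, V):
    """alpha = softmax(QK^T / sqrt(d_k)); output z_i = sum_j alpha_ij * v_j."""
    d_k = Q.size(-1)
    e = Q @ K.transpose(-2, -1) / math.sqrt(d_k)  # similarities e_ij, scaled
    alpha = torch.softmax(e, dim=-1)              # attention weights alpha_ij
    return alpha @ V                              # weighted sum of value vectors

# Usage: a sequence of 8 positions with head dimension 64
Q, K, V = torch.randn(8, 64), torch.randn(8, 64), torch.randn(8, 64)
print(scaled_dot_product_attention(Q, K, V).shape)  # torch.Size([8, 64])
```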
As a preferred embodiment of the present invention, S3 further includes the following steps: the attention weight of each head is updated by the following formula:

$$\mathrm{head}_h = \mathrm{Attention}(Q_h, K_h, V_h) = \mathrm{softmax}\left(\frac{Q_h K_h^{\top}}{\sqrt{d_k}}\right) V_h .$$

The multi-head attention module then concatenates the outputs of the several attention heads, performs a residual (skip) connection with the input sequence $X$, and feeds the result into the normalization layer LN to output a new value; the attention representation of the input sequence is

$$A = \mathrm{LN}\big(X + \mathrm{Concat}(\mathrm{head}_1, \ldots, \mathrm{head}_t)\,W^{O}\big) .$$

The calculated output vector serves as the input of the fully connected layer, which likewise passes through a residual connection and the normalization layer LN and is activated by the $\mathrm{GELU}$ function of the stacked linear layers:

$$F = \mathrm{LN}\big(A + \mathrm{GELU}(A W_1 + b_1)\,W_2 + b_2\big) .$$

The output vector $F$ of the fully connected layer serves as the input of the encoding part of the next transformer layer:

$$X^{(l+1)} = F^{(l)} .$$
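A minimal PyTorch encoder layer consistent with these expressions is sketched below (a post-norm variant, as the formulas above suggest); the dimension and head count are illustrative.

```python
import torch
import torch.nn as nn

class EncoderLayer(nn.Module):
    """Multi-head attention + residual + LN, then GELU feed-forward + residual + LN."""
    def __init__(self, z: int = 768, t: int = 8):
        super().__init__()
        self.mha = nn.MultiheadAttention(z, t, batch_first=True)
        self.ln1 = nn.LayerNorm(z)
        self.ffn = nn.Sequential(nn.Linear(z, 4 * z), nn.GELU(), nn.Linear(4 * z, z))
        self.ln2 = nn.LayerNorm(z)

    def forward(self, X):
        attn, _ = self.mha(X, X, X)       # Concat(head_1 .. head_t) W^O
        A = self.ln1(X + attn)            # residual skip + normalization layer LN
        return self.ln2(A + self.ffn(A))  # F: input to the next encoder layer

# Usage: stack 12 layers as in step S4
layers = nn.ModuleList(EncoderLayer() for _ in range(12))
X = torch.randn(2, 8, 768)
for layer in layers:
    X = layer(X)
print(X.shape)  # torch.Size([2, 8, 768])
```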
as a preferred mode of the present invention, the S4 includes the steps of: the calculation process is repeated to process the features by using 12 layers of transformer coding parts, and the vectors are output after passing through the training layers of the stacked bidirectional transformer coding parts
Figure 42918DEST_PATH_IMAGE050
Figure 955379DEST_PATH_IMAGE050
Is the beginning of each sentence at the time of input [ CLS]The symbolic tag vector is also an information vector containing the whole sentence, and the expression is as follows:
Figure 264001DEST_PATH_IMAGE051
as a preferred mode of the present invention, the S5 includes the steps of: to obtain
Figure 479082DEST_PATH_IMAGE050
And then, introducing the input of a decoding part of the multilayer transformer for decoding, and splicing the output of each layer, wherein the expression is as follows:
Figure 381310DEST_PATH_IMAGE052
wherein
Figure 972828DEST_PATH_IMAGE054
Obtained by the weight summation and the average of the information vectors of the multilayer transformer and additionally input into a sigmod function
Figure 401535DEST_PATH_IMAGE056
To predict the semantic extraction score of each sentence, the expression is:
Figure 787517DEST_PATH_IMAGE057
Figure 691888DEST_PATH_IMAGE059
is shown as
Figure DEST_PATH_IMAGE060
The result of each sentence.
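A sketch of this scoring head is given below: the per-layer [CLS] vectors are averaged (a simple stand-in for the weighted summation described above, whose exact weights the text does not specify) and passed through a sigmoid-activated linear layer.

```python
import torch
import torch.nn as nn

class SentenceScorer(nn.Module):
    """Average the per-layer [CLS] information vectors, then sigmoid(W h + b)."""
    def __init__(self, z: int = 768):
        super().__init__()
        self.linear = nn.Linear(z, 1)

    def forward(self, layer_cls):          # layer_cls: (num_layers, n_sentences, z)
        h_bar = layer_cls.mean(dim=0)      # stand-in for the weighted summation
        return torch.sigmoid(self.linear(h_bar)).squeeze(-1)  # y_hat_i per sentence

# Usage: 12 layers, 5 sentences
scores = SentenceScorer()(torch.randn(12, 5, 768))
print(scores)  # five values in (0, 1)
```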
As a preferred mode of the present invention, S6 includes the following steps: the sentences output by the sequential training are ranked according to their scores $\hat{y}_i$; an added softmax layer selects the label corresponding to the highest-scoring sentences, and the corresponding semantic abstract is mapped from the operation set $S$ according to that label:

$$\hat{s} = \operatorname*{arg\,max}_{s \in S}\ \mathrm{softmax}\big(\hat{y}_1, \ldots, \hat{y}_n\big) .$$
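As a usage illustration, a minimal top-k selection over the sentence scores might look as follows; the softmax here only normalizes the scores before ranking, and the sample sentences are invented for the example.

```python
import torch

def select_top_k(sentences, scores, k=3):
    """Rank sentences by softmax-normalized score and keep the k best (step S6)."""
    probs = torch.softmax(scores, dim=0)
    top = torch.topk(probs, k=min(k, len(sentences))).indices
    return [sentences[i] for i in sorted(top.tolist())]  # preserve original order

sents = ["isolate host A", "weather chat", "block IP 10.0.0.5", "lunch?", "reset creds"]
print(select_top_k(sents, torch.tensor([2.1, -1.0, 1.7, -2.0, 0.9]), k=3))
# ['isolate host A', 'block IP 10.0.0.5', 'reset creds']
```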
Different from the prior art, the above technical scheme has the following beneficial effects:
The multistage-attention-based network security cooperative disposal battle room semantic abstraction method of this scheme fully mines the chat-record content produced during secure operations in the network security cooperative disposal battle room, attends to the key content in the chat-record text to obtain effective information, improves the efficiency of the battle room through the semantic abstract, quickly converts the operators' communication into a language that a computer can process, and forms an efficient, high-precision and low-cost automated security operations system.
Drawings
FIG. 1 is a flow chart of a method according to an embodiment.
FIG. 2 is a block diagram of a method according to an embodiment.
FIG. 3 is a diagram illustrating the method of the present invention in greater detail.
Detailed Description
To explain technical contents, structural features, and objects and effects of the technical solutions in detail, the following detailed description is given with reference to the accompanying drawings in conjunction with the embodiments.
This technique addresses the problem of operators cooperating efficiently with an automated operations system: operation instructions are mostly colloquial, unstructured texts such as dialog records and chat records, and to execute tasks based on them the automated operations system must extract the key information in the text, generate a semantic abstract through neural network processing, and then form task instructions from that abstract. Most existing text processing methods focus on structured text, and their semantic recognition of unstructured text is not ideal. During secure operation work in the network security cooperative disposal battle room, tasks such as supervision and handling, security preparation, operation and maintenance, protection and defense, security analysis, collection and operation, and security investigation involve different professional fields and work roles; each role exchanges information and analyzes data through dialogue. Artificial intelligence is introduced to assist operators in completing network security operation tasks, and it must first understand the meaning of the operators' text input. This technique therefore takes the transformer model as the basic structure of the abstract language model: it performs semantic recognition on the text according to key information and context, uses the multistage attention mechanism to read the text more deeply according to the semantic roles and task content in the dialog and chat records, removes redundancy in the dialog text, generates an abstract with complete and effective semantics, simplifies the chat records in the network security cooperative disposal battle room, and provides efficient help for the work.
Specifically, the embodiment provides a multistage self-attention network security cooperative disposal battle room semantic abstraction method. Given a section of network security cooperative disposal battle room chat records, semantic abstract extraction from the chat records comprises the following steps:
S1: recording the operation process of the network security cooperative disposal battle room to form a corpus; building tables for the dialog and chat-record texts in the corpus, serializing the text content, marking the roles of the operators, and organizing the analyzed problems and corresponding tasks, to form the network security cooperative disposal battle room chat-record corpus;
S2: converting the text content in the corpus into token vectors through a word vector matrix; assigning the values 0 and 1 to different sentences to obtain sentence segmentation vectors that distinguish them; using the transformer to allocate a position vector to each token vector; and combining the token vectors, the sentence segmentation vectors and the position vectors into complete word vectors as the input of the network structure;
S3: sending the output of step S2 to the encoder of the transformer;
S4: inputting the output of the transformer encoder of S3 into the transformer module containing the attention mechanism, compressing it into a vector representation, and extracting key information in combination with the context vector to eliminate repeated redundant content in the text;
S5: after the text content in the corpus has been compressed into a vector, generating a semantic abstract through the decoder in the transformer model;
S6: processing the output of step S5 with a softmax layer, and outputting the semantic abstract corresponding to the text according to the k keywords with the largest weights to form a task instruction;
S7: forming a task instruction from the semantic abstract of the network security cooperative disposal battle room chat records, thereby simplifying the operation records and generating a concise and fluent abstract.
In the specific implementation of the above embodiment, as shown in FIG. 1 to FIG. 3, the method mainly includes the processing of a recorded operation process of the network security battle room, specifically an operator's analysis of a certain network security event: tables are built for the dialog and chat-record texts in the corpus, the text content is serialized, operator roles are marked, and the analyzed problems and corresponding tasks are organized, forming the network security cooperative disposal battle room chat-record corpus.
The input text $T$ is composed of $n$ sentences,

$$T = \{s_1, s_2, \ldots, s_n\},$$

where $s_i$ denotes the $i$-th sentence of the text. The text is preprocessed in order: it is segmented into words with the LTP word segmenter, noise words and stop words are removed, and normalized training corpora are generated; each sentence is assigned a label $y_i \in \{0, 1\}$, where 0 means the sentence is not recognized and 1 means it is recognized.
The character symbols of the preprocessed text are converted into numeric word vectors by a word vector layer, with a head marker [CLS] and a tail marker [SEP] added; a sentence segmentation vector that distinguishes sentences and a position vector that represents the absolute position of each word are generated. The token vector, the sentence segmentation vector and the position vector all have dimension $z$. The three vectors corresponding to the input sequence are then combined, denoted by $X$:

$$X = E_{\mathrm{tok}} + E_{\mathrm{seg}} + E_{\mathrm{pos}},$$

where $E_{\mathrm{tok}}$ is the token vector of each word in the sentence; $E_{\mathrm{seg}} \in \{E_A, E_B\}$ is the sentence segmentation vector, whose alternating parity divides the sentences into A and B blocks; $E_{\mathrm{pos}} = (p_1, p_2, \ldots, p_L)$ is the position vector, with $L$ the maximum sentence length; and $X$, the input representation of the text $T$, is obtained by combining the three vectors, its row and column dimensions in the vector space all being $z$.
The input sequence $X$ of the input text $T$ is first passed into a multi-head attention block composed of several attention modules; the number of heads of the multi-head attention block is a hyper-parameter, customized here as $t$ heads, and the output dimension remains $z$. Three initialization matrices $W^{Q}$, $W^{K}$ and $W^{V}$ are then multiplied with the input sequence $X$ to obtain

$$Q = X W^{Q}, \qquad K = X W^{K}, \qquad V = X W^{V},$$

corresponding respectively to the query vector matrix, the key vector matrix and the value vector matrix.
Since there are $t$ attention heads, the matrices are divided into $Q_h, K_h, V_h$ $(h = 1, \ldots, t)$ for the attention weights at the current moment. First the degree of association between the current word and the other words is calculated: the query vector $q_i$ and the key vectors $k_j$ of the other words are used to compute the similarity

$$e_{ij} = q_i \cdot k_j .$$

The similarity $e_{ij}$, computed as the product of the query vector and the key vectors, is scaled down by dividing by a common factor $\sqrt{d_k}$ and then normalized with the softmax function; the value obtained is the weight of the current word with respect to each word:

$$\alpha_{ij} = \frac{\exp\big(e_{ij}/\sqrt{d_k}\big)}{\sum_{j'} \exp\big(e_{ij'}/\sqrt{d_k}\big)} .$$

Finally, the weights obtained for the current word are used with the value vectors $v_j$ to update the attention representation $z_i$ of the current word:

$$z_i = \sum_{j} \alpha_{ij} v_j .$$

All the outputs are obtained by cycling the same steps over the other input positions.
The attention weight of each head is updated by the following formula:

$$\mathrm{head}_h = \mathrm{Attention}(Q_h, K_h, V_h) = \mathrm{softmax}\left(\frac{Q_h K_h^{\top}}{\sqrt{d_k}}\right) V_h .$$

The multi-head attention module then concatenates the outputs of the several attention heads, performs a residual (skip) connection with the input sequence $X$, and feeds the result into the normalization layer LN to output a new value; the attention representation of the input sequence is

$$A = \mathrm{LN}\big(X + \mathrm{Concat}(\mathrm{head}_1, \ldots, \mathrm{head}_t)\,W^{O}\big) .$$

The calculated output vector serves as the input of the fully connected layer, which likewise passes through a residual connection and the normalization layer LN and is activated by the $\mathrm{GELU}$ function of the stacked linear layers:

$$F = \mathrm{LN}\big(A + \mathrm{GELU}(A W_1 + b_1)\,W_2 + b_2\big) .$$

The output vector $F$ of the fully connected layer serves as the input of the encoding part of the next transformer layer:

$$X^{(l+1)} = F^{(l)} .$$
The calculation process is repeated so that the features are processed by 12 transformer encoder layers; after the training layers of the stacked bidirectional transformer encoder, the vector $C$ is output. $C$ is the tag vector of the [CLS] symbol placed at the beginning of each sentence at input time, and is also an information vector containing the whole sentence:

$$C = \mathrm{Encoder}_{12}(X)\big|_{[\mathrm{CLS}]} .$$
After $C$ is obtained, it is introduced as the input of the decoding part of the multi-layer transformer for decoding, and the outputs of the layers are concatenated:

$$H = \mathrm{Concat}\big(h^{(1)}, h^{(2)}, \ldots, h^{(m)}\big),$$

where $\bar{h}_i$, obtained by weighted summation and averaging of the information vectors of the multi-layer transformer, is additionally input into a sigmoid function to predict the semantic-extraction score $\hat{y}_i$ of each sentence:

$$\hat{y}_i = \sigma\big(W \bar{h}_i + b\big),$$

where $\hat{y}_i$ denotes the result of the $i$-th sentence.
The sentences output by the sequential training are ranked according to their scores $\hat{y}_i$; the softmax layer selects the label corresponding to the highest-scoring sentences, and the corresponding semantic abstract is mapped from the operation set $S$ according to that label:

$$\hat{s} = \operatorname*{arg\,max}_{s \in S}\ \mathrm{softmax}\big(\hat{y}_1, \ldots, \hat{y}_n\big) .$$
In some embodiments, the whole process framework shown in FIG. 2 needs to be trained in advance: the training stage adopts a few-shot learning framework to learn meta-knowledge, and the testing stage uses the multistage-attention-based transformer to complete the semantic abstract extraction task. The details are as follows:
Pre-training with public dialogue data sets: the pre-training task uses a semantic abstract model based on the multistage-attention transformer. When a text is input, the sentences in the text are processed in segments to generate subsequences of limited length (a chunking sketch is given below); the words in each subsequence are converted into numeric word vectors, and the semantic abstract is generated by transformer processing.
After pre-training is completed, the network model is fine-tuned for 12,000 steps on the open-source dataset CMCSE.
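The following is a minimal sketch of that limited-length segmentation; the window length of 256 matches the maximum input-sequence length reported below, while the overlap parameter is an illustrative assumption.

```python
from typing import List

def chunk_tokens(tokens: List[str], max_len: int = 256, overlap: int = 32) -> List[List[str]]:
    """Split a long token sequence into subsequences of limited length.

    `max_len` mirrors the model's maximum input length; `overlap` (assumed)
    lets adjacent chunks share context so sentences are not cut blindly.
    """
    step = max_len - overlap
    return [tokens[i:i + max_len] for i in range(0, max(len(tokens), 1), step)]

# Usage: a 600-token transcript becomes three overlapping chunks
chunks = chunk_tokens([f"tok{i}" for i in range(600)])
print([len(c) for c in chunks])  # [256, 256, 152]
```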
The invention initializes the network model with the parameters of the Chinese pre-trained BERT-Base model released by Google, trains with the cross-entropy loss function, and adopts the AdamW optimizer. The learning rate is set dynamically: the model warms up over the first 10,000 training steps and then decays, for 100k training steps in total, with an L2 weight-decay parameter of 0.01 and GELU replacing RELU as the activation function. Fine-tuning is then performed with the parameters fixed: the hidden-layer vector dimension (the embedding size) is 768, the maximum length of the input sequence is 256, the training batch size is 16, and the learning rate is set to a fixed value that does not participate in training. The number of model layers is set to 12 and the number of attention heads to 8; the input-layer dimension is 256, since an overly long input would affect the training speed, and the fine-tuning stage differs little from the pre-training stage.
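A hedged PyTorch sketch of this optimization setup is shown below: AdamW with weight decay 0.01 and a warm-up-then-decay schedule over 100k steps, matching the figures above. The peak learning rate of 2e-5 and the single linear layer standing in for the model are illustrative assumptions, not values taken from the patent.

```python
import torch
import torch.nn as nn

model = nn.Linear(768, 1)  # stand-in for the summarization network
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5, weight_decay=0.01)

warmup_steps, total_steps = 10_000, 100_000

def lr_lambda(step: int) -> float:
    # Linear warm-up for the first 10k steps, then linear decay to 100k.
    if step < warmup_steps:
        return step / warmup_steps
    return max(0.0, (total_steps - step) / (total_steps - warmup_steps))

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)
loss_fn = nn.BCEWithLogitsLoss()  # binary cross-entropy over the 0/1 sentence labels

# One illustrative training step with a batch of 16
x, y = torch.randn(16, 768), torch.randint(0, 2, (16, 1)).float()
loss = loss_fn(model(x), y)
loss.backward()
optimizer.step(); scheduler.step(); optimizer.zero_grad()
```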
Based on the scheme of the above embodiment, the open-source dataset CMCSE (Comprehensive, Multi-Source Cyber-Security Events) is used: because most chat records of a network security cooperative disposal battle room are confidential, the semantic abstract is trained and extracted with a few-shot learning model. During training, a text dataset with a sufficient amount of open data trains the model to obtain parameters capable of extracting semantic abstracts; during testing, a small amount of secure-operation chat-record text is input to test the model's effectiveness. The performance of the multistage-attention-based network security cooperative disposal battle room semantic abstraction method is measured by three evaluation criteria, precision, recall and F-measure (a comprehensive evaluation index), as shown in the following table. On the same dataset, the method outperforms the other model methods in semantic abstract extraction. In a lateral comparison of different models on the CMCSE dataset, against basic recurrent-neural-network frameworks such as LSTM (long short-term memory network), BiLSTM (bidirectional long short-term memory network) and GRU (gated recurrent unit) models, adding the transformer-based multistage attention mechanism to extract the semantic abstract information of battle room text achieves the best results in both abstract-extraction precision and redundancy removal: the precision, recall and F-measure values are improved by 9.82%, 7.23% and 3.70% respectively, and the few-shot learning framework reduces the demand for labeled data.
[Table: comparison of precision, recall and F-measure for each model on the CMCSE dataset]
It should be noted that, although the above embodiments have been described herein, the scope of the present invention is not limited thereby. Therefore, based on the innovative concepts of the present invention, the technical solutions of the present invention can be directly or indirectly applied to other related technical fields by making changes and modifications to the embodiments described herein, or by using equivalent structures or equivalent processes performed in the content of the present specification and the attached drawings, which are included in the scope of the present invention.

Claims (9)

1. A multistage self-attention network security cooperative disposal battle room semantic abstraction method, characterized in that, given a section of network security cooperative disposal battle room chat records, semantic abstract extraction from the chat records comprises the following steps:
S1: recording the operation process of the network security cooperative disposal battle room to form a corpus; building tables for the dialog and chat-record texts in the corpus, serializing the text content, marking the roles of the operators, and organizing the analyzed problems and corresponding tasks, to form the network security cooperative disposal battle room chat-record corpus;
S2: converting the text content in the corpus into token vectors through a word vector matrix; assigning the values 0 and 1 to different sentences to obtain sentence segmentation vectors that distinguish them; using the transformer to allocate a position vector to each token vector; and combining the token vectors, the sentence segmentation vectors and the position vectors into complete word vectors as the input of the network structure;
S3: sending the output of step S2 to the encoder of the transformer;
S4: inputting the output of the transformer encoder of S3 into the transformer module containing the attention mechanism, compressing it into a vector representation, and extracting key information in combination with the context vector to eliminate repeated redundant content in the text;
S5: after the text content in the corpus has been compressed into a vector, generating a semantic abstract through the decoder in the transformer model;
S6: processing the output of step S5 with a softmax layer, and outputting the semantic abstract corresponding to the text according to the k keywords with the largest weights to form a task instruction;
S7: forming a task instruction from the semantic abstract of the network security cooperative disposal battle room chat records, thereby simplifying the operation records and generating a concise and fluent abstract.
2. The multistage self-attention network security cooperative disposal battle room semantic abstraction method of claim 1, wherein S2 comprises the following steps:
the input text $T$ is composed of $n$ sentences,

$$T = \{s_1, s_2, \ldots, s_n\},$$

where $s_i$ denotes the $i$-th sentence of the text; the text is preprocessed in order: it is segmented into words with the LTP word segmenter, noise words and stop words are removed, and normalized training corpora are generated; each sentence is assigned a label $y_i \in \{0, 1\}$, where 0 means the sentence is not recognized and 1 means it is recognized.
3. The multistage self-attention network security cooperative disposal battle room semantic abstraction method of claim 2, wherein S2 further comprises the following steps: the character symbols of the preprocessed text are converted into numeric word vectors by a word vector layer, with a head marker [CLS] and a tail marker [SEP] added; a sentence segmentation vector that distinguishes sentences and a position vector that represents the absolute position of each word are generated; the token vector, the sentence segmentation vector and the position vector all have dimension $z$, and the three vectors corresponding to the input sequence are then combined, denoted by $X$:

$$X = E_{\mathrm{tok}} + E_{\mathrm{seg}} + E_{\mathrm{pos}},$$

where $E_{\mathrm{tok}}$ is the token vector of each word in the sentence; $E_{\mathrm{seg}} \in \{E_A, E_B\}$ is the sentence segmentation vector, whose alternating parity divides the sentences into A and B blocks; $E_{\mathrm{pos}} = (p_1, p_2, \ldots, p_L)$ is the position vector, with $L$ the maximum sentence length; and $X$, the input representation of the text $T$, is obtained by combining the three vectors, its row and column dimensions in the vector space all being $z$.
4. The multistage self-attention network security cooperative disposal battle room semantic abstraction method of claim 3, wherein S3 comprises the following steps: the input sequence $X$ of the input text $T$ is first passed into a multi-head attention block composed of several attention modules; the number of heads of the multi-head attention block is a hyper-parameter, customized here as $t$ heads, and the output dimension remains $z$; three initialization matrices $W^{Q}$, $W^{K}$ and $W^{V}$ are then multiplied with the input sequence $X$ to obtain

$$Q = X W^{Q}, \qquad K = X W^{K}, \qquad V = X W^{V},$$

corresponding respectively to the query vector matrix, the key vector matrix and the value vector matrix.
5. The multistage self-attention network security cooperative disposal battle room semantic abstraction method of claim 4, wherein S3 further comprises the following steps: since there are $t$ attention heads, the matrices are divided into $Q_h, K_h, V_h$ $(h = 1, \ldots, t)$ for the attention weights at the current moment; first the degree of association between the current word and the other words is calculated: the query vector $q_i$ and the key vectors $k_j$ of the other words are used to compute the similarity

$$e_{ij} = q_i \cdot k_j ;$$

the similarity $e_{ij}$, computed as the product of the query vector and the key vectors, is scaled down by dividing by a common factor $\sqrt{d_k}$ and then normalized with the softmax function, the value obtained being the weight of the current word with respect to each word:

$$\alpha_{ij} = \frac{\exp\big(e_{ij}/\sqrt{d_k}\big)}{\sum_{j'} \exp\big(e_{ij'}/\sqrt{d_k}\big)} ;$$

the weights obtained for the current word are then used with the value vectors $v_j$ to update the attention representation $z_i$ of the current word:

$$z_i = \sum_{j} \alpha_{ij} v_j ;$$

and the same steps are cycled over the other input positions to obtain all the outputs.
6. The multistage self-attention network security cooperative disposal battle room semantic abstraction method of claim 5, wherein S3 further comprises the following steps: the attention weight of each head is updated by the following formula:

$$\mathrm{head}_h = \mathrm{Attention}(Q_h, K_h, V_h) = \mathrm{softmax}\left(\frac{Q_h K_h^{\top}}{\sqrt{d_k}}\right) V_h ;$$

the multi-head attention module then concatenates the outputs of the several attention heads, performs a residual (skip) connection with the input sequence $X$, and feeds the result into the normalization layer LN to output a new value, the attention representation of the input sequence being

$$A = \mathrm{LN}\big(X + \mathrm{Concat}(\mathrm{head}_1, \ldots, \mathrm{head}_t)\,W^{O}\big) ;$$

the calculated output vector serves as the input of the fully connected layer, which likewise passes through a residual connection and the normalization layer LN and is activated by the $\mathrm{GELU}$ function of the stacked linear layers:

$$F = \mathrm{LN}\big(A + \mathrm{GELU}(A W_1 + b_1)\,W_2 + b_2\big) ;$$

and the output vector $F$ of the fully connected layer serves as the input of the encoding part of the next transformer layer:

$$X^{(l+1)} = F^{(l)} .$$
7. The multistage self-attention network security cooperative disposal battle room semantic abstraction method of claim 3, wherein S4 comprises the following steps:
the above calculation process is repeated so that the features are processed by 12 transformer encoder layers; after the training layers of the stacked bidirectional transformer encoder, the vector $C$ is output, where $C$ is the tag vector of the [CLS] symbol at the beginning of each sentence at input time and is also an information vector containing the whole sentence:

$$C = \mathrm{Encoder}_{12}(X)\big|_{[\mathrm{CLS}]} .$$
8. The multistage self-attention network security cooperative disposal battle room semantic abstraction method of claim 7, wherein S5 comprises the following steps:
after $C$ is obtained, it is introduced as the input of the decoding part of the multi-layer transformer for decoding, and the outputs of the layers are concatenated:

$$H = \mathrm{Concat}\big(h^{(1)}, h^{(2)}, \ldots, h^{(m)}\big),$$

where $\bar{h}_i$, obtained by weighted summation and averaging of the information vectors of the multi-layer transformer, is additionally input into a sigmoid function to predict the semantic-extraction score $\hat{y}_i$ of each sentence:

$$\hat{y}_i = \sigma\big(W \bar{h}_i + b\big),$$

where $\hat{y}_i$ denotes the result of the $i$-th sentence.
9. The multistage self-attention network security cooperative disposal battle room semantic abstraction method of claim 8, wherein S6 comprises the following steps:
the sentences output by the sequential training are ranked according to their scores $\hat{y}_i$; an added softmax layer selects the label corresponding to the highest-scoring sentences, and the corresponding semantic abstract is mapped from the operation set $S$ according to that label:

$$\hat{s} = \operatorname*{arg\,max}_{s \in S}\ \mathrm{softmax}\big(\hat{y}_1, \ldots, \hat{y}_n\big) .$$
CN202210329999.9A 2022-03-31 2022-03-31 Multistage self-attention network security cooperative disposal battle room semantic abstraction method Pending CN114490995A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210329999.9A CN114490995A (en) 2022-03-31 2022-03-31 Multistage self-attention network security cooperative disposal battle room semantic abstraction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210329999.9A CN114490995A (en) 2022-03-31 2022-03-31 Multistage self-attention network security cooperative disposal battle room semantic abstraction method

Publications (1)

Publication Number Publication Date
CN114490995A 2022-05-13

Family

ID=81489079

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210329999.9A Pending CN114490995A (en) 2022-03-31 2022-03-31 Multistage self-attention network security cooperative disposal battle room semantic abstraction method

Country Status (1)

Country Link
CN (1) CN114490995A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114818721A (en) * 2022-06-30 2022-07-29 湖南工商大学 Event joint extraction model and method combined with sequence labeling

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114254655A (en) * 2022-02-28 2022-03-29 南京众智维信息科技有限公司 Network security traceability semantic identification method based on prompt self-supervision learning

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114254655A (en) * 2022-02-28 2022-03-29 南京众智维信息科技有限公司 Network security traceability semantic identification method based on prompt self-supervision learning

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114818721A (en) * 2022-06-30 2022-07-29 湖南工商大学 Event joint extraction model and method combined with sequence labeling
CN114818721B (en) * 2022-06-30 2022-11-01 湖南工商大学 Event joint extraction model and method combined with sequence labeling

Similar Documents

Publication Publication Date Title
Zhang et al. SG-Net: Syntax guided transformer for language representation
CN114254655B (en) Network security tracing semantic identification method based on prompt self-supervision learning
CN107562792A (en) A kind of question and answer matching process based on deep learning
CN109635109A (en) Sentence classification method based on LSTM and combination part of speech and more attention mechanism
CN113065358B (en) Text-to-semantic matching method based on multi-granularity alignment for bank consultation service
CN110188348B (en) Chinese language processing model and method based on deep neural network
Zhao et al. Enhancing Chinese character representation with lattice-aligned attention
Scholak et al. DuoRAT: towards simpler text-to-SQL models
Zhang et al. n-BiLSTM: BiLSTM with n-gram Features for Text Classification
CN114492460B (en) Event causal relationship extraction method based on derivative prompt learning
CN114254102B (en) Natural language-based collaborative emergency response SOAR script recommendation method
CN114155477B (en) Semi-supervised video paragraph positioning method based on average teacher model
CN115687609A (en) Zero sample relation extraction method based on Prompt multi-template fusion
CN114490995A (en) Multistage self-attention network security cooperative disposal battle room semantic abstraction method
Xiong et al. A multi-gate encoder for joint entity and relation extraction
CN112559741B (en) Nuclear power equipment defect record text classification method, system, medium and electronic equipment
CN116521857A (en) Method and device for abstracting multi-text answer abstract of question driven abstraction based on graphic enhancement
Li Analysis of semantic comprehension algorithms of natural language based on robot’s questions and answers
CN114330352A (en) Named entity identification method and system
Niu et al. Video captioning by learning from global sentence and looking ahead
Wang et al. Evolutionary Relationship Extraction of Emergencies Based on Two-way GRU and Multi-channel Self-attention Mechanism
CN114611487B (en) Unsupervised Thai dependency syntax analysis method based on dynamic word embedding alignment
CN117744657B (en) Medicine adverse event detection method and system based on neural network model
Wang et al. Chinese Text Implication Recognition Method based on ERNIE-Gram and CNN
CN114818644B (en) Text template generation method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination