WO2022219787A1

WO2022219787A1 - Labeling device, labeling method, and program

Info

Publication number: WO2022219787A1
Application number: PCT/JP2021/015632
Authority: WO
Inventors: 勝真田; 英俊川口
Original assignee: 日本電信電話株式会社
Priority date: 2021-04-15
Filing date: 2021-04-15
Publication date: 2022-10-20
Also published as: JPWO2022219787A1; JP7544260B2

Abstract

This labeling device comprises: a training unit which trains, on the basis of a first monitoring data group in which each of a plurality of pieces of monitoring data each showing characteristics of specific communication data is given a label indicating whether or not the monitoring data is necessary and indicating a method for coping with the specific communication data, a model that receives the monitoring data as an input and outputs the label corresponding to the monitoring data; and a setting unit which sets monitoring data with a label indicating that it is necessary among the first monitoring data group, to a monitoring device that monitors communication data on the basis of the monitoring data, whereby the load of setting work for a method for coping with the monitoring data is reduced.

Description

LABELING APPARATUS, LABELING METHOD AND PROGRAM

The present invention relates to a labeling device, a labeling method, and a program.

A communication security monitoring device such as an IDS (Intrusion Detection System) or an IPS (Intrusion Prevention System) is installed in a communication channel, monitors communication data, discovers malicious communication data (threat data), and deals with it (saving, notification, blocking, etc.) (FIG. 1).

A list of threat data (monitoring data) to be discovered is set in the IDS/IPS, and the threat data is discovered by comparing the communication data and the monitoring data. In the monitoring data, a countermeasure (save/notify/block, etc.) is set, and the IDS/IPS takes action according to the setting.
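The comparison described above can be illustrated with a minimal sketch. The rule layout (a `match` dictionary plus an `action`), the use of `None` as a wildcard, and the function names are assumptions made for illustration only; they are not taken from any particular IDS/IPS product.

```python
# Minimal sketch of matching communication data against monitoring data.
# The rule format and the wildcard convention (None) are assumptions.

def matches(rule, packet):
    """Return True if every non-wildcard field of the rule equals the packet's field."""
    return all(
        expected is None or packet.get(field) == expected
        for field, expected in rule["match"].items()
    )


def decide_action(rules, packet):
    """Return the countermeasure of the first matching rule, or None if nothing matches."""
    for rule in rules:
        if matches(rule, packet):
            return rule["action"]
    return None


rules = [
    {"match": {"protocol": "tcp", "dst_port": 23, "src_addr": None}, "action": "block"},
    {"match": {"protocol": "udp", "dst_port": 53, "src_addr": None}, "action": "save"},
]
packet = {"protocol": "tcp", "src_addr": "192.0.2.10", "dst_port": 23}
print(decide_action(rules, packet))  # block
```

Every packet is checked against every rule in the worst case, which is why a very large rule list translates directly into per-packet cost.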

IDS/IPS monitoring data is provided by the security vendor that provides the IDS/IPS. Because it must be versatile, the monitoring data of security vendors (hereinafter referred to as "general-purpose monitoring data") is exhaustive and huge in number.

If telecommunications carriers that deploy and operate an IDS/IPS use the general-purpose monitoring data from security vendors as it is, communication data will be compared against a huge amount of general-purpose monitoring data, which will cause degradation of communication performance, such as communication delays.

Therefore, the telecommunications carrier selects only the necessary monitoring data (hereinafter referred to as "individual monitoring data") according to the conditions of its own communication system, and sets the coping method for its own communication system.

Patent Document 1: JP 2019-174988
Patent Document 2: JP 2020-024513
Patent Document 3: JP 2020-160642

For telecommunications carriers that apply and operate IDS/IPS, it takes a lot of work to sort out individual monitoring data from the vast amount of general-purpose monitoring data from security vendors and set individual countermeasures.

The present invention has been made in view of the above points, and an object of the present invention is to reduce the load of setting work for coping methods for monitoring data.

Therefore, in order to solve the above problem, the labeling device includes: a learning unit that learns, based on a first monitoring data group in which each of a plurality of pieces of monitoring data, each indicating characteristics of specific communication data, is assigned a label indicating whether or not the monitoring data is necessary and how to deal with the specific communication data, a model that receives the monitoring data as input and outputs the label corresponding to the monitoring data; and a setting unit that sets, among the first monitoring data group, the monitoring data assigned a label indicating that it is necessary in a monitoring device that monitors communication data based on the monitoring data.

It is possible to reduce the load of the work of setting coping methods for monitoring data.

FIG. 1 is a diagram for explaining an IDS/IPS.
FIG. 2 is a diagram showing a hardware configuration example of the labeling device 10 according to the embodiment of the present invention.
FIG. 3 is a diagram showing a functional configuration example of the labeling device 10 according to the embodiment of the present invention.
FIG. 4 is a diagram showing a functional configuration example of the labeling model 13.
FIG. 5 is a diagram for explaining initial learning of the labeling model 13.
FIG. 6 is a diagram showing a configuration example of labeled general-purpose monitoring data.
FIG. 7 is a diagram for explaining operation using the trained labeling model 13a and re-learning of the trained labeling model 13a.
FIG. 8 is a diagram for explaining a learning procedure of the labeling model 13.

Embodiments of the present invention will be described below based on the drawings. FIG. 2 is a diagram showing a hardware configuration example of the labeling device 10 according to the embodiment of the present invention. The labeling device 10 of FIG. 2 has a drive device 100, an auxiliary storage device 102, a memory device 103, a processor 104, an interface device 105, and the like, which are interconnected by a bus B.

A program that implements the processing in the labeling device 10 is provided by a recording medium 101 such as a CD-ROM. When the recording medium 101 storing the program is set in the drive device 100, the program is installed from the recording medium 101 to the auxiliary storage device 102 via the drive device 100. However, the program does not necessarily need to be installed from the recording medium 101, and may be downloaded from another computer via a network. The auxiliary storage device 102 stores installed programs, as well as necessary files and data.

The memory device 103 reads and stores the program from the auxiliary storage device 102 when a program activation instruction is received. The processor 104 is a CPU or a GPU (Graphics Processing Unit), or a CPU and a GPU, and executes functions related to the labeling device 10 according to programs stored in the memory device 103. The interface device 105 is used as an interface for connecting to a network.

FIG. 3 is a diagram showing a functional configuration example of the labeling device 10 according to the embodiment of the present invention. In FIG. 3, the labeling device 10 includes a learning unit 11, a setting unit 12, and a labeling model 13. These units are implemented by processing that one or more programs installed in the labeling device 10 cause the processor 104 to execute.

The labeling model 13 receives as input a set (hereinafter referred to as a "general-purpose monitoring data group") of monitoring data provided by a security vendor (hereinafter referred to as "general-purpose monitoring data"), and outputs, for each piece of general-purpose monitoring data, a classification result and a correct/incorrect determination result for that classification result. The classification results are output in the form of labels. A label is information indicating whether the general-purpose monitoring data to which the label is attached is necessary and how to deal with it. The labeling model 13 assigns a label as a classification result to the input general-purpose monitoring data.

The learning unit 11 causes the labeling model 13 to learn the correspondence relationship between each piece of general-purpose monitoring data included in the general-purpose monitoring data group and the label for that general-purpose monitoring data. In the present embodiment, initial learning and re-learning are performed as the learning of the labeling model 13. In the initial learning, labels are manually assigned by the user to the general-purpose monitoring data used as training data.

The setting unit 12 determines whether or not the monitoring data is necessary based on the label indicated by the classification result of the monitoring data, and sets the necessary monitoring data in the communication security monitoring device 20. The communication security monitoring device 20 is, for example, an IDS (Intrusion Detection System) or an IPS (Intrusion Prevention System). That is, the communication security monitoring device 20 monitors communication data based on the set monitoring data, detects (discovers) specific communication data corresponding to the monitoring data, and executes processing according to the coping method indicated by the label assigned to the monitoring data.

FIG. 4 is a diagram showing a functional configuration example of the labeling model 13. As shown in FIG. 4, the labeling model 13 includes three units: a classification estimation unit 131, a classification estimation process observation unit 132, and an error determination unit 133. Each of these units may be the same as the functional unit with the same name described in Patent Document 2.

Specifically, the classification estimation unit 131 estimates the label of the input general-purpose monitoring data and outputs the label as the classification result. The classification estimation unit 131 can be implemented using artificial intelligence-related technology such as an SVM (support vector machine), a neural network, a Bayesian network, or a decision tree, for example.

The classification estimation process observation unit 132 observes the calculation process (estimation process) when the classification estimation unit 131 estimates the label of the general-purpose monitoring data, acquires data in the estimation process, converts the data into a feature vector, and outputs the feature vector to the error determination unit 133.

For example, when the classification estimation unit 131 estimates a label using a neural network, the classification estimation process observation unit 132 may output, as the feature vector, the values output from each node (activation function) of each intermediate layer and the output layer of the neural network. For example, if the values of the nodes in the intermediate layer are 0.5, 0.4, and 0.7 and the values of the nodes in the output layer are 0.2, 0.7, and 0.1, the feature vector can be configured as [0.5 0.4 0.7 0.2 0.7 0.1].
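The neural network case amounts to concatenating the node outputs layer by layer. A minimal sketch follows; the function name `build_feature_vector` is a hypothetical name introduced here, and the values are the ones given in the example above.

```python
from typing import Sequence


def build_feature_vector(layer_outputs: Sequence[Sequence[float]]) -> list[float]:
    """Concatenate node outputs in layer order (intermediate layers first, output layer last)."""
    return [value for layer in layer_outputs for value in layer]


intermediate_layer = [0.5, 0.4, 0.7]
output_layer = [0.2, 0.7, 0.1]
feature_vector = build_feature_vector([intermediate_layer, output_layer])
print(feature_vector)  # [0.5, 0.4, 0.7, 0.2, 0.7, 0.1]
```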

Alternatively, when the classification estimation unit 131 estimates a label using a decision tree, the classification estimation process observation unit 132 observes the route leading to the classification decision and constructs a feature vector from it. For example, when a certain label is estimated via the route node 1 -> node 3 -> node 6, the classification estimation process observation unit 132 can output [1 0 1 0 0 1 0 0 0], which indicates the route, as the feature vector. In this example, the index of each vector element corresponds to a node number of the decision tree: the element is 1 if the route passes through that node and 0 otherwise.
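The decision tree case can be sketched as a simple route encoding. The function name and the assumption that the tree has nine nodes numbered from 1 (inferred from the length of the example vector) are illustrative.

```python
def route_to_feature_vector(route: list[int], num_nodes: int) -> list[int]:
    """Return a vector whose i-th element (1-based) is 1 if node i lies on the route, else 0."""
    visited = set(route)
    return [1 if node in visited else 0 for node in range(1, num_nodes + 1)]


# Route node 1 -> node 3 -> node 6 in a tree assumed to have 9 nodes.
print(route_to_feature_vector([1, 3, 6], num_nodes=9))  # [1, 0, 1, 0, 0, 1, 0, 0, 0]
```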

Examples of other feature vectors are as disclosed in Patent Document 2.

The error determination unit 133 receives the feature vector from the classification estimation process observation unit 132 and determines whether the label estimated by the classification estimation unit 131 is "correct" or "wrong" based on the feature vector.

The configuration method of the error determination unit 133 is not limited to a specific method. For example, the error determination unit 133 can determine whether the label estimated by the classification estimation unit 131 is "correct" or "wrong" by determining whether a specific value of the feature vector (in particular, a value of the output layer of the neural network or the number of votes of a random forest) exceeds a threshold.
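A threshold-based determiner of this kind can be sketched in a few lines. Which value is compared (here, the largest output-layer value) and the threshold of 0.6 are assumptions chosen for illustration; the source does not fix either.

```python
def judge_label(output_layer_values, threshold=0.6):
    """Judge the estimated label "correct" if the top output-layer value exceeds the threshold.

    Both the choice of the maximum value and the default threshold are
    illustrative assumptions.
    """
    return "correct" if max(output_layer_values) > threshold else "wrong"


print(judge_label([0.2, 0.7, 0.1]))    # correct
print(judge_label([0.4, 0.35, 0.25]))  # wrong
```

A low top score means the estimator was not decisive, so the label is routed to the user for confirmation instead of being trusted.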

Also, the error determination unit 133 may be configured with a model that is often used in the field of machine learning. For example, the error determination unit 133 can be configured with an SVM, neural network, or the like. When these models are used, the error determination unit 133 can be implemented by performing model parameter tuning by supervised learning.

FIG. 5 is a diagram for explaining the initial learning of the labeling model 13.

At the time of initial learning, there is no training data for learning the labeling model 13. Therefore, the learning unit 11 receives, from the user, input of a label for all or part of the general-purpose monitoring data in a set of a plurality of pieces of general-purpose monitoring data provided by the security vendor during a predetermined period up to the time of initial learning (hereinafter referred to as "general-purpose monitoring data group X"), and generates, as training data, a labeled general-purpose monitoring data group X to which the input labels are assigned (S101). When the labeling target is limited to a portion of the general-purpose monitoring data, the portion may be selected at random, for example.

FIG. 6 is a diagram showing a configuration example of labeled general-purpose monitoring data. One row in FIG. 6 shows one piece of labeled general-purpose monitoring data. One piece of labeled general-purpose monitoring data includes general-purpose monitoring data and a label. General-purpose monitoring data is data that characterizes specific (e.g., malicious) communication data (threat data), and includes, for example, a protocol, a source address, a source port, a destination address, a destination port, and communication content data.

In the present embodiment, the label values are "unnecessary", "save", "notify", or "block".

"Unnecessary" indicates that the labeled general-purpose monitoring data is unnecessary for the telecommunications carrier.

"Save", "notify", and "block" are labels given to general-purpose monitoring data that is necessary for the telecommunications carrier. In other words, "save", "notify", and "block" indicate that the labeled general-purpose monitoring data is necessary for the telecommunications carrier, and also indicate how to deal with communication data corresponding to the general-purpose monitoring data when it is discovered.

"Save" indicates that the communication data is to be saved. "Notify" indicates that the telecommunications carrier is to be notified of the detection of the communication data. "Block" indicates that the communication data is to be blocked.

For example, the telecommunications carrier determines whether each piece of general-purpose monitoring data included in the general-purpose monitoring data group X is necessary, based on other information related to the general-purpose monitoring data (reports of cyberattacks, information on troubles within the carrier, etc.). The carrier assigns "unnecessary" to unnecessary general-purpose monitoring data and, for necessary general-purpose monitoring data, determines the coping method to be used when communication data corresponding to that general-purpose monitoring data is discovered.
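One way to represent a row of labeled general-purpose monitoring data is sketched below. The four label values and the six fields come from the description; the class names, field names, field types, and example values are assumptions introduced for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class Label(Enum):
    """The four label values of the embodiment."""
    UNNECESSARY = "unnecessary"
    SAVE = "save"
    NOTIFY = "notify"
    BLOCK = "block"

    @property
    def is_necessary(self) -> bool:
        # Every label other than "unnecessary" marks the data as necessary
        # and doubles as the coping method.
        return self is not Label.UNNECESSARY


@dataclass(frozen=True)
class MonitoringData:
    """Characteristics of specific communication data (field names are assumptions)."""
    protocol: str
    src_addr: str
    src_port: int
    dst_addr: str
    dst_port: int
    content: str


@dataclass(frozen=True)
class LabeledMonitoringData:
    data: MonitoringData
    label: Label


row = LabeledMonitoringData(
    data=MonitoringData("tcp", "192.0.2.10", 40000, "198.51.100.5", 23, "login attempt"),
    label=Label.BLOCK,
)
print(row.label.value, row.label.is_necessary)  # block True
```

Encoding necessity and coping method in a single label keeps the model's task a plain four-class classification.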

The setting unit 12 extracts, from the labeled general-purpose monitoring data group X, the general-purpose monitoring data assigned a label other than "unnecessary" (hereinafter referred to as "labeled individual monitoring data"), and sets each piece of labeled individual monitoring data in the communication security monitoring device 20 (S102). At this time, the setting unit 12 specifies, based on the label assigned to each piece of labeled individual monitoring data, the coping method to be used when communication data corresponding to that labeled individual monitoring data is discovered, and sets the coping method in the communication security monitoring device 20.
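Step S102 can be sketched as a filter followed by a configuration call. `MonitoringDeviceStub` and its `set_rule` method are hypothetical stand-ins for the communication security monitoring device 20, whose real configuration interface is not described; the signature identifiers are also made up.

```python
class MonitoringDeviceStub:
    """Hypothetical stand-in for the communication security monitoring device 20."""

    def __init__(self):
        self.rules = {}

    def set_rule(self, monitoring_data, action):
        self.rules[monitoring_data] = action


def extract_individual(labeled_group):
    """Keep only the entries whose label is not "unnecessary"."""
    return [(data, label) for data, label in labeled_group if label != "unnecessary"]


def configure_device(device, labeled_group):
    """Set each piece of individual monitoring data with its label as the coping method."""
    for data, label in extract_individual(labeled_group):
        device.set_rule(data, action=label)
    return device


group_x = [("sig-001", "block"), ("sig-002", "unnecessary"), ("sig-003", "notify")]
device = configure_device(MonitoringDeviceStub(), group_x)
print(device.rules)  # {'sig-001': 'block', 'sig-003': 'notify'}
```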

Meanwhile, the learning unit 11 trains the labeling model 13 using the labeled general-purpose monitoring data group X as training data (S103). As a result, a trained labeling model 13a is generated.

Next, operation using the trained labeling model 13a and re-learning of the trained labeling model 13a will be described.

FIG. 7 is a diagram for explaining operation using the trained labeling model 13a and re-learning of the trained labeling model 13a. Re-learning is performed in parallel with operation using the trained labeling model 13a.

FIG. 7 shows an example of operation and of re-learning of the trained labeling model 13a when a new set of a plurality of pieces of general-purpose monitoring data (hereinafter referred to as "general-purpose monitoring data group Y") is provided by a security vendor after generation of the trained labeling model 13a. The general-purpose monitoring data group Y may include the general-purpose monitoring data group X or may contain only new data.

First, each piece of general-purpose monitoring data included in the general-purpose monitoring data group Y is input to the trained labeling model 13a (S201). The trained labeling model 13a outputs, for each piece of general-purpose monitoring data, a label for the general-purpose monitoring data and a correct/incorrect determination result for the label. The determination result is either "correct" or "wrong". Hereinafter, the set of labeled general-purpose monitoring data whose determination result is "correct" is referred to as the "reliable labeled general-purpose monitoring data group Y (automatic)", and the set of labeled general-purpose monitoring data whose determination result is "wrong" is referred to as the "uncertain labeled general-purpose monitoring data group Y". Note that "(automatic)" in "reliable labeled general-purpose monitoring data group Y (automatic)" is identification information used for convenience to distinguish this group from the reliable labeled general-purpose monitoring data group Y (manual), described later, which is generated manually by the user.

The learning unit 11 receives input of a correct label from the user for each piece of labeled general-purpose monitoring data included in the uncertain labeled general-purpose monitoring data group Y, and corrects (replaces) the assigned label with the label input by the user (S202). As a result, the uncertain labeled general-purpose monitoring data group Y becomes the reliable labeled general-purpose monitoring data group Y (manual). That is, because the label is manually corrected by the user, labeled general-purpose monitoring data with the correct label is generated.

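The setting unit 12 extracts, from the reliable labeled general-purpose monitoring data group Y (automatic) and the reliable labeled general-purpose monitoring data group Y (manual), the general-purpose monitoring data assigned a label other than "unnecessary" (hereinafter referred to as "labeled individual monitoring data"), and sets each piece of labeled individual monitoring data in the communication security monitoring device 20 (S203).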

The learning unit 11 also re-learns the labeling model 13 using the reliable labeled general-purpose monitoring data group Y (automatic) and the reliable labeled general-purpose monitoring data group Y (manual) as training data (S204). As a result, a relearned labeling model 13b is generated.
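The operation and re-learning cycle of steps S201 to S204 can be sketched end to end. `LookupLabelingModel` is a deliberately trivial, hypothetical stand-in for the labeling model: it memorizes labels and judges its own output "wrong" for data it has not seen. It also updates itself in place, whereas the embodiment distinguishes models 13a and 13b. The `ask_user` callback and the signature identifiers are likewise assumptions.

```python
class LookupLabelingModel:
    """Hypothetical stand-in for the labeling model (not the patent's estimator)."""

    def __init__(self):
        self.known = {}

    def fit(self, labeled_group):
        # labeled_group is an iterable of (data, label) pairs.
        self.known.update(labeled_group)
        return self

    def predict(self, data):
        """Return (label, determination result)."""
        if data in self.known:
            return self.known[data], "correct"
        return "unnecessary", "wrong"


def operate_and_relearn(model, group_y, ask_user, device_rules):
    reliable_auto, uncertain = [], []
    for data in group_y:  # S201: label each piece and split by determination result
        label, judgement = model.predict(data)
        target = reliable_auto if judgement == "correct" else uncertain
        target.append((data, label))
    # S202: the user replaces each uncertain label with a correct one
    reliable_manual = [(data, ask_user(data, label)) for data, label in uncertain]
    reliable = reliable_auto + reliable_manual
    for data, label in reliable:  # S203: set necessary data in the monitoring device
        if label != "unnecessary":
            device_rules[data] = label
    model.fit(reliable)  # S204: re-learn with both reliable groups
    return reliable_auto, reliable_manual


model = LookupLabelingModel().fit([("sig-001", "block"), ("sig-002", "unnecessary")])
rules = {}
auto, manual = operate_and_relearn(
    model,
    ["sig-001", "sig-002", "sig-003"],
    ask_user=lambda data, label: "notify",
    device_rules=rules,
)
print(rules)  # {'sig-001': 'block', 'sig-003': 'notify'}
```

Only the uncertain group reaches the user, so manual effort scales with the number of labels the model doubts, not with the size of group Y.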

When the general-purpose monitoring data group Y includes the general-purpose monitoring data group X, the trained labeling model 13a can be re-learned using the reliable labeled general-purpose monitoring data group Y based on it as training data. The larger the amount of training data, the greater the learning effect and the higher the probability of obtaining correct results. Even if the general-purpose monitoring data group Y contains only new data, new training data is obtained, and re-learning with this new training data can be expected to improve the performance of the labeling model 13.

After that, each time a new set of general-purpose monitoring data is provided by the security vendor, the relearned labeling model 13b is operated and re-learned by the same procedure as in FIG. 7.

Next, the details of step S103 in FIG. 5 and step S204 in FIG. 7 will be described. FIG. 8 is a diagram for explaining the learning procedure of the labeling model 13. The (labeled) general-purpose monitoring data group Z in FIG. 8 corresponds to the (labeled) general-purpose monitoring data group X in the case of FIG. 5, and to the (reliable labeled) general-purpose monitoring data group Y (automatic or manual) in the case of FIG. 7.

First, the learning unit 11 uses the labeled general-purpose monitoring data group Z to cause the classification estimation unit 131 to learn the correspondence relationship between the general-purpose monitoring data and the labels (S301).

Subsequently, the learning unit 11 inputs the general-purpose monitoring data group Z to the learned classification estimation unit 131 (S302). The classification estimation unit 131 outputs a list of labels (hereinafter referred to as "estimated label list") estimated for each general-purpose monitoring data included in the general-purpose monitoring data group Z (S303). At this time, the classification estimation process observation unit 132 acquires data of the label estimation process for each general-purpose monitoring data (S304), and outputs a feature vector for each data (S305).

Subsequently, the learning unit 11 compares, element by element (that is, for each pair of labels corresponding to the same general-purpose monitoring data), the estimated label list with a list of the correct labels assigned to each piece of labeled general-purpose monitoring data included in the labeled general-purpose monitoring data group Z (hereinafter referred to as a "correct label list"), and generates a list indicating whether each label included in the estimated label list is correct or incorrect (hereinafter referred to as a "correct/incorrect list") (S306). The correct/incorrect list is a sequence of 1s and 0s, such as "1011...". A 0 indicates a correct label and a 1 indicates an incorrect label.
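Generating the correct/incorrect list in step S306 is an element-wise comparison. A minimal sketch follows; the function name and the example labels are assumptions, chosen so that the result begins like the "1011..." example.

```python
def build_correct_incorrect_list(correct_labels, estimated_labels):
    """Return 0 where the estimated label equals the correct label and 1 where it differs."""
    if len(correct_labels) != len(estimated_labels):
        raise ValueError("both lists must cover the same general-purpose monitoring data")
    return [0 if c == e else 1 for c, e in zip(correct_labels, estimated_labels)]


correct = ["block", "save", "notify", "unnecessary"]
estimated = ["save", "save", "block", "notify"]
print(build_correct_incorrect_list(correct, estimated))  # [1, 0, 1, 1]
```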

Subsequently, the learning unit 11 causes the error determination unit 133 to learn the correspondence relationship between the feature vector list and the correct/incorrect list (S307). As a result, the error determination unit 133 is trained. Note that the learning of the error determination unit 133 is also described in detail in Patent Document 2.
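The whole procedure of steps S301 to S307 can be sketched with toy components. Everything here is an illustrative assumption: monitoring data is represented as numeric vectors, a nearest-centroid classifier stands in for the classification estimation unit 131, the distances to each centroid serve as the observed estimation-process data, and a margin threshold tuned on the correct/incorrect list stands in for the error determination unit 133. The sketch assumes at least two distinct labels.

```python
import math


class NearestCentroid:
    """Toy classification estimator (stand-in for unit 131)."""

    def fit(self, xs, ys):
        self.labels = sorted(set(ys))
        self.centroids = {}
        for label in self.labels:
            rows = [x for x, y in zip(xs, ys) if y == label]
            self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def distances(self, x):
        """Estimation-process data (role of unit 132): distance to each centroid."""
        return [math.dist(x, self.centroids[label]) for label in self.labels]

    def predict(self, x):
        d = self.distances(x)
        return self.labels[d.index(min(d))]


class MarginErrorDeterminer:
    """Toy error determiner (stand-in for unit 133): small margins are judged wrong."""

    @staticmethod
    def _margin(feature_vector):
        nearest, second = sorted(feature_vector)[:2]
        return second - nearest

    def fit(self, feature_vectors, correct_incorrect):
        margins = [self._margin(v) for v in feature_vectors]
        candidates = sorted(set(margins)) + [math.inf]

        def agreement(threshold):
            return sum((1 if m < threshold else 0) == c
                       for m, c in zip(margins, correct_incorrect))

        # Supervised tuning: pick the threshold that best reproduces the list.
        self.threshold = max(candidates, key=agreement)
        return self

    def predict(self, feature_vector):
        return "wrong" if self._margin(feature_vector) < self.threshold else "correct"


def train_labeling_model(xs, ys):
    estimator = NearestCentroid().fit(xs, ys)                          # S301
    estimated = [estimator.predict(x) for x in xs]                     # S302, S303
    features = [estimator.distances(x) for x in xs]                    # S304, S305
    correct_incorrect = [0 if e == y else 1 for e, y in zip(estimated, ys)]  # S306
    determiner = MarginErrorDeterminer().fit(features, correct_incorrect)    # S307
    return estimator, determiner, correct_incorrect


xs = [[0, 0], [1, 0], [4, 0], [5, 0], [6, 0]]
ys = ["block", "block", "block", "save", "save"]
estimator, determiner, correct_incorrect = train_labeling_model(xs, ys)
print(correct_incorrect)  # [0, 0, 1, 0, 0]
```

The third sample lies closer to the "save" centroid, so the estimator mislabels it; the determiner then learns that such narrow margins signal an unreliable label.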

As described above, according to the present embodiment, the labeling model 13 can automatically label general-purpose monitoring data. As a result, it is possible to reduce the load of the work of setting the coping method for the monitoring data. Further, by re-learning the labeling model 13, the classification accuracy can be improved.

In addition, in the present embodiment, the learning unit 11 is also an example of a relearning unit.

Although the embodiments of the present invention have been described in detail above, the present invention is not limited to such specific embodiments, and various modifications and changes can be made within the scope of the gist of the present invention described in the claims.

10 labeling device
11 learning unit
12 setting unit
13 labeling model
13a learned labeling model
13b relearned labeling model
20 communication security monitoring device
100 drive device
101 recording medium
102 auxiliary storage device
103 memory device
104 processor
105 interface device
131 classification estimation unit
132 classification estimation process observation unit
133 error determination unit
B bus

Claims

1. A labeling device comprising:
a learning unit that learns, based on a first monitoring data group in which each of a plurality of pieces of monitoring data, each indicating characteristics of specific communication data, is assigned a label indicating whether or not the monitoring data is necessary and how to deal with the specific communication data, a model that receives the monitoring data as input and outputs the label corresponding to the monitoring data; and
a setting unit that sets, among the first monitoring data group, the monitoring data assigned a label indicating that it is necessary in a monitoring device that monitors communication data based on the monitoring data.

2. The labeling device according to claim 1, wherein
the learning unit learns the model such that the model receives the monitoring data as input and outputs the label corresponding to the monitoring data and a correct/incorrect determination result for the label, and
the labeling device further comprises a relearning unit that relearns the model based on: first labeled monitoring data in which, among the monitoring data included in a second monitoring data group, a first label is assigned to the monitoring data for which the model output the first label determined to be correct; and second labeled monitoring data in which a second label input by a user is assigned to the monitoring data for which the model output a label determined to be erroneous.

3. The labeling device according to claim 2, wherein the setting unit sets the first labeled monitoring data and the second labeled monitoring data in the monitoring device.

4. A labeling method in which a computer executes:
a learning procedure for learning, based on a first monitoring data group in which each of a plurality of pieces of monitoring data, each indicating characteristics of specific communication data, is assigned a label indicating whether or not the monitoring data is necessary and how to deal with the specific communication data, a model that receives the monitoring data as input and outputs the label corresponding to the monitoring data; and
a setting procedure for setting, among the first monitoring data group, the monitoring data assigned a label indicating that it is necessary in a monitoring device that monitors communication data based on the monitoring data.

5. The labeling method according to claim 4, wherein
the learning procedure learns the model such that the model receives the monitoring data as input and outputs the label corresponding to the monitoring data and a correct/incorrect determination result for the label, and
the computer further executes a relearning procedure for relearning the model based on: first labeled monitoring data in which, among the monitoring data included in a second monitoring data group, a first label is assigned to the monitoring data for which the model output the first label determined to be correct; and second labeled monitoring data in which a second label input by a user is assigned to the monitoring data for which the model output a label determined to be erroneous.

6. The labeling method according to claim 5, wherein the setting procedure sets the first labeled monitoring data and the second labeled monitoring data in the monitoring device.

7. A program causing a computer to execute the labeling method according to any one of claims 4 to 6.