JP6845125B2

JP6845125B2 - Learning equipment, learning methods and learning programs

Info

Publication number: JP6845125B2
Application number: JP2017235752A
Authority: JP
Inventors: 友貴山中
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-12-08
Filing date: 2017-12-08
Publication date: 2021-03-17
Anticipated expiration: 2037-12-08
Also published as: JP2019102011A

Description

本発明は、学習装置、学習方法及び学習プログラムに関する。 The present invention relates to a learning device, a learning method and a learning program.

ＩｏＴ（Internet of Things）時代の到来に伴い、多種のデバイス（ＩｏＴ機器）がインターネットに接続され、多様な使われ方をされるようになっている。これにともない、ＩｏＴ機器向けのトラフィックセッション異常検知システムや侵入検知システム（ＩＤＳ：Intrusion Detection System）等のＩｏＴ機器のセキュリティ対策が期待されている。 With the advent of the IoT (Internet of Things) era, various devices (IoT devices) are connected to the Internet and are used in various ways. Along with this, security measures for IoT devices such as traffic session abnormality detection systems and intrusion detection systems (IDS) for IoT devices are expected.

このような技術として、例えば、ＶＡＥ（Variational Auto Encoder）等の教師なし学習による確率密度推定器を用いるものがある。この技術では、正常な通信データの確率密度を学習後、確率密度の低い通信を異常として検知する。このため、この技術では、正常な通信データのみが分かればよく、全ての悪性データを学習せずとも異常検知が可能である。したがって、この技術は、未だ過渡期にあり全ての脅威情報を知り尽くされていないＩｏＴ機器に対する脅威の検知に有効である。 As such a technique, for example, there is a technique using a probability density estimator by unsupervised learning such as VAE (Variational Auto Encoder). In this technique, after learning the probability density of normal communication data, communication with a low probability density is detected as an abnormality. Therefore, in this technique, only normal communication data needs to be known, and abnormality detection is possible without learning all malicious data. Therefore, this technology is effective in detecting threats to IoT devices that are still in a transitional period and do not know all the threat information.

Diederik P. Kingma, Max Welling，“Auto-Encoding Variational Bayes”，［平成２９年１１月１７日検索］，インターネット＜ＵＲＬ：https://arxiv.org/abs/1312.6114＞Diederik P. Kingma, Max Welling, "Auto-Encoding Variational Bayes", [Searched November 17, 2017], Internet <URL: https://arxiv.org/abs/1312.6114>

しかしながら、ＶＡＥ等の確率密度推定器は、学習対象の正常な通信データ間でデータ数に偏りがある場合に、精度よく学習できないことがある。 However, a probability density estimator such as VAE may not be able to learn accurately when the number of data is biased among the normal communication data to be learned.

通信データでは、データ数に偏りがある状況がよく発生する。例えば、ＨＴＴＰ（Hypertext Transfer Protocol）通信等は、よく使用されるため、短時間で大量のデータが集まる。これに対し、稀にしか通信を行わないＮＴＰ（Network Time Protocol）通信等は、データ数があまり集まらない。このような状況で、ＶＡＥによる学習を行うと、データ数が少ないＮＴＰ通信の学習が進まず、発生確率を低く見積もってしまい、正常な通信データに対する誤検知の原因になる場合がある。 In communication data, it often happens that the number of data is biased. For example, since HTTP (Hypertext Transfer Protocol) communication and the like are often used, a large amount of data is collected in a short time. On the other hand, in NTP (Network Time Protocol) communication, which rarely communicates, the number of data is not collected so much. If learning by VAE is performed in such a situation, learning of NTP communication with a small number of data does not proceed, and the probability of occurrence is underestimated, which may cause false detection of normal communication data.

本発明は、上記に鑑みてなされたものであって、学習対象の通信データ間でデータ数に偏りがある場合であっても、通信データの確率密度を精度よく学習できる学習装置、学習方法及び学習プログラムを提供することを目的とする。 The present invention has been made in view of the above, and a learning device, a learning method, and a learning device capable of accurately learning the probability density of communication data even when the number of data is biased among the communication data to be learned. The purpose is to provide a learning program.

上述した課題を解決し、目的を達成するために、本発明に係る学習装置は、学習対象である正常な通信データを複数収集する収集部と、正常な通信データに対し、一または複数の宛先の通信データの複製を加えることで、各宛先の通信データ数を揃える前処理部と、前処理部によって処理された正常な通信データについて学習を行い、正常な通信データの確率密度を推定して正常な通信データの確率密度の特徴を表すモデルのパラメータを更新する学習部と、を有することを特徴とする。 In order to solve the above-mentioned problems and achieve the object, the learning device according to the present invention has a collecting unit that collects a plurality of normal communication data to be learned, and one or a plurality of destinations for the normal communication data. By duplicating the communication data of, the pre-processing unit that equalizes the number of communication data of each destination and the normal communication data processed by the pre-processing unit are learned, and the probability density of the normal communication data is estimated. It is characterized by having a learning unit that updates the parameters of a model representing the characteristics of the probability density of normal communication data.

本発明によれば、学習対象の通信データ間でデータ数に偏りがある場合であっても、通信データの確率密度を精度よく学習できる。 According to the present invention, even when the number of data is biased among the communication data to be learned, the probability density of the communication data can be learned accurately.

図１は、実施の形態１に係る検知システムの構成の一例を示す図である。FIG. 1 is a diagram showing an example of the configuration of the detection system according to the first embodiment. 図２は、図１に示す前処理部の処理内容を説明する図である。FIG. 2 is a diagram for explaining the processing content of the preprocessing unit shown in FIG. 図３は、図１に示す学習装置が実行する学習処理の処理手順を示すフローチャートである。FIG. 3 is a flowchart showing a processing procedure of the learning process executed by the learning device shown in FIG. 図４は、実施の形態１に係る検知システムの適用例を説明する図である。FIG. 4 is a diagram illustrating an application example of the detection system according to the first embodiment. 図５は、従来手法による学習状態による各検知対象に対するＲＯＣ（Receiver Operatorating Characteristic）曲線を示す図である。FIG. 5 is a diagram showing a ROC (Receiver Operatorating Characteristic) curve for each detection target according to the learning state by the conventional method. 図６は、図１に示す学習装置の学習によって得られたモデルを用いた場合の各検知対象に対するＲＯＣ曲線を示す図である。FIG. 6 is a diagram showing ROC curves for each detection target when the model obtained by learning of the learning device shown in FIG. 1 is used. 図７は、実施の形態２に係る学習方法によって得られたモデルを用いた場合の各検知対象に対するＲＯＣ曲線を示す図である。FIG. 7 is a diagram showing ROC curves for each detection target when the model obtained by the learning method according to the second embodiment is used. 図８は、プログラムが実行されることにより、学習装置が実現されるコンピュータの一例を示す図である。FIG. 8 is a diagram showing an example of a computer in which a learning device is realized by executing a program.

以下、図面を参照して、本発明の一実施の形態を詳細に説明する。なお、この実施の形態により本発明が限定されるものではない。また、図面の記載において、同一部分には同一の符号を付して示している。 Hereinafter, an embodiment of the present invention will be described in detail with reference to the drawings. The present invention is not limited to this embodiment. Further, in the description of the drawings, the same parts are indicated by the same reference numerals.

［実施の形態１］
本発明の実施の形態１について説明する。図１は、実施の形態１に係る検知システムの構成の一例を示す図である。 [Embodiment 1]
Embodiment 1 of the present invention will be described. FIG. 1 is a diagram showing an example of the configuration of the detection system according to the first embodiment.

図１に示すように、実施の形態に係る検知システム１は、学習装置１０及び検知装置２０を有する。学習装置１０及び検知装置２０は、例えば、ネットワーク等を介して接続する。また、学習装置１０及び検知装置２０は、例えば、ネットワーク等を介して、外部装置と接続する。 As shown in FIG. 1, the detection system 1 according to the embodiment includes a learning device 10 and a detection device 20. The learning device 10 and the detection device 20 are connected via, for example, a network or the like. Further, the learning device 10 and the detection device 20 are connected to an external device via, for example, a network or the like.

学習装置１０は、複数の正常な通信データを学習して、確率密度を推定し、正常な通信データの確率密度の特徴を表すモデルのパラメータを更新し、更新したモデルのパラメータ（更新パラメータ）を検知装置２０に出力する。学習装置１０は、確率密度推定器として、例えば、ＶＡＥを有する。学習装置１０は、学習対象の通信データに対し、各宛先の通信データ数を揃える前処理を行ってから、学習を行い、確率密度を推定する。 The learning device 10 learns a plurality of normal communication data, estimates the probability density, updates the parameters of the model representing the characteristics of the probability density of the normal communication data, and updates the parameters (update parameters) of the updated model. Output to the detection device 20. The learning device 10 has, for example, a VAE as a probability density estimator. The learning device 10 performs preprocessing for aligning the number of communication data of each destination with respect to the communication data to be learned, and then performs learning to estimate the probability density.

検知装置２０は、学習装置１０によって更新されたモデルのパラメータを適用して、検知対象の通信データ（検知対象データ）の確率密度を推定する。検知装置２０は、確率密度推定器として、例えば、ＶＡＥを有する。続いて、検知装置２０は、推定した確率密度を基に検知対象データの異常の有無を検知する。具体的には、検知装置２０は、推定した確率密度が所定値よりも低い場合に、検知対象の通信データが異常であることを検知する。検知装置２０は、外部の対処装置等に検知結果を出力する。 The detection device 20 applies the parameters of the model updated by the learning device 10 to estimate the probability density of the communication data (detection target data) to be detected. The detection device 20 has, for example, a VAE as a probability density estimator. Subsequently, the detection device 20 detects the presence or absence of an abnormality in the detection target data based on the estimated probability density. Specifically, the detection device 20 detects that the communication data to be detected is abnormal when the estimated probability density is lower than a predetermined value. The detection device 20 outputs the detection result to an external coping device or the like.

なお、検知装置２０は、例えば、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）、ＣＰＵ（Central Processing Unit）等を含むコンピュータ等に所定のプログラムが読み込まれて、ＣＰＵが所定のプログラムを実行することで実現される。また、検知装置２０は、ネットワーク等を介して接続された他の装置との間で、各種情報を送受信する通信インタフェースを有する。例えば、検知装置２０は、ＮＩＣ（Network Interface Card）等を有し、ＬＡＮ（Local Area Network）やインターネットなどの電気通信回線を介した他の装置との間の通信を行う。 In the detection device 20, for example, a predetermined program is read into a computer or the like including a ROM (Read Only Memory), a RAM (Random Access Memory), a CPU (Central Processing Unit), and the CPU executes the predetermined program. It is realized by doing. Further, the detection device 20 has a communication interface for transmitting and receiving various information with other devices connected via a network or the like. For example, the detection device 20 has a NIC (Network Interface Card) or the like, and communicates with other devices via a telecommunication line such as a LAN (Local Area Network) or the Internet.

［学習装置の構成］
次に、学習装置１０の構成について説明する。学習装置１０は、図１に示すように、収集部１１、前処理部１２及び学習部１３を有する。 [Configuration of learning device]
Next, the configuration of the learning device 10 will be described. As shown in FIG. 1, the learning device 10 has a collecting unit 11, a preprocessing unit 12, and a learning unit 13.

なお、学習装置１０は、例えば、ＲＯＭ、ＲＡＭ、ＣＰＵ等を含むコンピュータ等に所定のプログラムが読み込まれて、ＣＰＵが所定のプログラムを実行することで実現される。また、学習装置１０は、ネットワーク等を介して接続された他の装置との間で、各種情報を送受信する通信インタフェースを有する。例えば、学習装置１０は、ＮＩＣ等を有し、ＬＡＮやインターネットなどの電気通信回線を介した他の装置との間の通信を行う。 The learning device 10 is realized, for example, by reading a predetermined program into a computer or the like including a ROM, RAM, CPU, etc., and executing the predetermined program by the CPU. Further, the learning device 10 has a communication interface for transmitting and receiving various information with other devices connected via a network or the like. For example, the learning device 10 has a NIC or the like and communicates with another device via a telecommunication line such as a LAN or the Internet.

収集部１１は、学習対象である正常な通信データ（学習対象データ）を複数収集する。例えば、収集部１１は、ネットワークを介して、学習対象である正常なトラフィックセッションの収集を行う。 The collecting unit 11 collects a plurality of normal communication data (learning target data) to be learned. For example, the collecting unit 11 collects a normal traffic session to be learned via the network.

この場合、収集部１１が収集する通信データのデータ間でデータ数に偏りが生じる場合がある。具体的には、通信がＨＴＴＰ通信については、しばしば使用されるため、収集部１１は、正常なＨＴＴＰ通信データを収集できる。これに対し、管理用ＦＴＰ通信については、使用頻度が少ないため、収集部１１は、正常なＦＴＰ通信データを少量しか得ることができない場合がある。このように、収集部１１が収集する学習対象データのデータ間では、データ数の偏りがある場合がある。学習装置１０では、データ数の偏りがある場合であっても、前処理部１２（後述）が、各宛先の通信データ数を揃える前処理を学習前に行うことによって、精度よく異常検知を行えるモデルを得る。 In this case, the number of data may be biased among the communication data collected by the collecting unit 11. Specifically, since communication is often used for HTTP communication, the collecting unit 11 can collect normal HTTP communication data. On the other hand, since the management FTP communication is used infrequently, the collecting unit 11 may be able to obtain only a small amount of normal FTP communication data. As described above, there may be a bias in the number of data among the data of the learning target data collected by the collecting unit 11. In the learning device 10, even if the number of data is biased, the preprocessing unit 12 (described later) can accurately detect an abnormality by performing preprocessing for aligning the number of communication data of each destination before learning. Get the model.

前処理部１２は、正常な通信データに対し、一または複数の宛先の通信データの複製を加えることで、各宛先の通信データ数を揃える前処理を行う。言い換えると、前処理部１２は、学習対象データに対し、各宛先の通信データ数が揃うように、通信データ数が少ない宛先については該宛先の通信データの複製を加える前処理を行う。図２は、図１に示す前処理部１２の処理内容を説明する図である。 The pre-processing unit 12 performs pre-processing to equalize the number of communication data of each destination by duplicating the communication data of one or a plurality of destinations with respect to the normal communication data. In other words, the preprocessing unit 12 performs preprocessing for the learning target data so that the number of communication data of each destination is the same, and the communication data of the destination is duplicated for the destination with a small number of communication data. FIG. 2 is a diagram for explaining the processing content of the preprocessing unit 12 shown in FIG.

例えば、学習対象データが、図２に示すデータＤ１である場合を例に説明する。データＤ１は、１０００データのＨＴＴＰ通信（８０ｐｏｒｔ）、５００データのＤＮＳ（Domain Name System）通信（５３ｐｏｒｔ）及び１００００データのＭＱＴＴ（Message Queueing Telemetry Transport）通信（１８８３ｐｏｒｔ）を有する。このデータＤ１は、ＭＱＴＴ通信が最もデータ数が多い。 For example, a case where the learning target data is the data D1 shown in FIG. 2 will be described as an example. The data D1 has 1000 data HTTP communication (80 port), 500 data DNS (Domain Name System) communication (53 port), and 10000 data MQTT (Message Queueing Telemetry Transport) communication (1883 port). As for this data D1, MQTT communication has the largest number of data.

このため、前処理部１２は、ＭＱＴＴ通信以外のＨＴＴＰ通信及びＤＮＳ通信のデータ数を、ＭＱＴＴ通信のデータ数と同数となるように複製する。すなわち、前処理部１２は、ＨＴＴＰ通信を、複製して１０倍にする（図２の(１)参照）。そして、前処理部１２は、ＤＮＳ通信を、複製して２０倍にする（図２の（２）参照）。 Therefore, the preprocessing unit 12 duplicates the number of data of HTTP communication and DNS communication other than MQTT communication so as to be the same as the number of data of MQTT communication. That is, the preprocessing unit 12 duplicates the HTTP communication and multiplyes it by 10 (see (1) in FIG. 2). Then, the preprocessing unit 12 duplicates the DNS communication and multiplyes it by 20 (see (2) in FIG. 2).

この結果、前処理部１２は、ＨＴＴＰ通信及びＤＮＳ通信のデータ数は、ＭＱＴＴ通信のデータ数と同数の１００００データとなるデータＤ１´を得る。前処理部１２は、得られたデータＤ１´を、学習対象データとして、学習部１３に出力する。 As a result, the preprocessing unit 12 obtains data D1'in which the number of data for HTTP communication and DNS communication is 10000, which is the same as the number of data for MQTT communication. The preprocessing unit 12 outputs the obtained data D1'as learning target data to the learning unit 13.

学習部１３は、前処理部１２によって処理された学習対象データについて学習を行い、該学習対象データの確率密度を推定して学習対象データの確率密度の特徴を表すモデルのパラメータを更新する。例えば、学習部１３は、前処理部１２から入力されたデータＤ１´を学習対象データとして学習する。学習部１３は、確率密度推定器として、ＶＡＥ１３１を有する。ＶＡＥ１３１は、推定対象の入力データの入力を受け付けると、このデータの確率密度の推定値を出力する演算器である。 The learning unit 13 learns about the learning target data processed by the preprocessing unit 12, estimates the probability density of the learning target data, and updates the parameters of the model representing the characteristics of the probability density of the learning target data. For example, the learning unit 13 learns the data D1'input from the preprocessing unit 12 as the learning target data. The learning unit 13 has a VAE 131 as a probability density estimator. The VAE 131 is an arithmetic unit that outputs an estimated value of the probability density of this data when it receives an input of input data to be estimated.

具体的には、ＶＡＥ１３１は、あるデータ点ｘ_ｉの入力を受け付けると、そのデータに対応したアノマリスコア（ｓｃｏｒｅ）（異常度）を出力する。確率密度の推定値をｐ（ｘ_ｉ）とすると、アノマリスコアは、−ｌｏｇｐ（ｘ_ｉ）の近似値となる。このように、ＶＡＥ１３１が出力するアノマリスコアは、値が高いほど、この通信データの異常度が高いことを示す。 Specifically, VAE131 accepts an input of a data point _{x i,} and outputs the anomaly score corresponding to the data (score) (degree of abnormality). Assuming that the estimated value of the probability density is p (x _i ), the anomaly score is an approximation of _{-logp (x i).} As described above, the higher the value of the anomaly score output by VAE131, the higher the degree of abnormality of this communication data.

学習部１３は、このような演算を行うＶＡＥ１３１を有し、入力された複数の学習対象データを学習し、各学習対象データのアノマリスコアを出力する。そして、学習部１３は、学習結果に応じてＶＡＥ１３１のモデルパラメータを更新する。学習部１３が更新したモデルパラメータは、検知装置２０に出力される。 The learning unit 13 has a VAE 131 that performs such an operation, learns a plurality of input learning target data, and outputs an anomaly score of each learning target data. Then, the learning unit 13 updates the model parameters of the VAE 131 according to the learning result. The model parameters updated by the learning unit 13 are output to the detection device 20.

［学習処理の処理手順］
次に、学習装置１０が実行する学習処理の処理手順について説明する。図３は、図１に示す学習装置１０が実行する学習処理の処理手順を示すフローチャートである。 [Processing procedure of learning process]
Next, the processing procedure of the learning process executed by the learning device 10 will be described. FIG. 3 is a flowchart showing a processing procedure of the learning process executed by the learning device 10 shown in FIG.

図３に示すように、学習装置１０では、収集部１１が、学習対象データを収集し（ステップＳ１１）、収集した学習対象データを前処理部１２に出力する。前処理部１２は、正常な通信データに対し、一または複数の宛先の通信データの複製を加えることで、各宛先の通信データ数を揃える前処理を行う（ステップＳ１２）。 As shown in FIG. 3, in the learning device 10, the collecting unit 11 collects the learning target data (step S11), and outputs the collected learning target data to the preprocessing unit 12. The preprocessing unit 12 performs preprocessing for equalizing the number of communication data of each destination by duplicating the communication data of one or a plurality of destinations with respect to the normal communication data (step S12).

そして、学習部１３は、前処理部１２によって前処理が施された学習対象データに対し、ＶＡＥ１３１を用いて学習を行い、学習対象データの確率密度を推定して学習対象データの確率密度の特徴を表すモデルのパラメータを更新する学習処理を行う（ステップＳ１３）。続いて、学習部１３は、更新したモデルのパラメータを検知装置２０に出力する出力処理を行う（ステップＳ１４）。検知装置２０では、学習装置１０によって更新されたモデルのパラメータを適用して、検知対象データの確率密度を推定し、該確率密度を基に検知対象データの異常の有無を検知する。 Then, the learning unit 13 learns the learning target data preprocessed by the preprocessing unit 12 using VAE131, estimates the probability density of the learning target data, and features the probability density of the learning target data. A learning process for updating the parameters of the model representing the above is performed (step S13). Subsequently, the learning unit 13 performs an output process of outputting the updated model parameters to the detection device 20 (step S14). The detection device 20 applies the parameters of the model updated by the learning device 10 to estimate the probability density of the detection target data, and detects the presence or absence of an abnormality in the detection target data based on the probability density.

［実施例］
例えば、本実施の形態１に係る検知システム１は、ＩｏＴ機器の異常検知に適用することができる。図４は、実施の形態１に係る検知システム１の適用例を説明する図である。図４に示すように、複数のＩｏＴ機器２が接続されたネットワーク３上に、検知システム１を設ける。この場合、検知システム１は、ＩｏＴ機器２が送受信するトラフィックセッション情報を収集し、正常トラフィックセッションの確率密度の学習、及び、異常トラフィックセッションの検知を行う。 [Example]
For example, the detection system 1 according to the first embodiment can be applied to the abnormality detection of the IoT device. FIG. 4 is a diagram illustrating an application example of the detection system 1 according to the first embodiment. As shown in FIG. 4, the detection system 1 is provided on the network 3 to which a plurality of IoT devices 2 are connected. In this case, the detection system 1 collects the traffic session information sent and received by the IoT device 2, learns the probability density of the normal traffic session, and detects the abnormal traffic session.

正常トラフィックセッションの確率密度の学習には、上記前処理を適用し、トラフィックセッションデータ間にデータ数の偏りがある場合にも精度よく学習を行う。また、異常トラフィックセッションの検知には、上記検知処理を適用し、学習処理において学習されたモデルパラメータを適用した確率密度推定を行い、高精度の異常検知を行う。 The above preprocessing is applied to the learning of the probability density of the normal traffic session, and the learning is performed accurately even when the number of data is biased between the traffic session data. Further, the above detection process is applied to detect an abnormal traffic session, probability density estimation is performed by applying the model parameters learned in the learning process, and highly accurate abnormality detection is performed.

［評価実験］
次に、実際のＩｏＴ機器間のトラフィックセッションデータに対し、従来手法と、本実施の形態１に係る手法とで評価を行った結果を示す。例えば、学習対象通信データは、正常な２１０５３データのＭＱＴＴ通信と、正常な１１０７データのカメラストリーミングとである。このデータ数に偏りのある上記学習対象データを、従来手法を用いて学習させた後に、この学習によって得られたモデルを用いた検知装置２０による各種通信データに対する検知率を求めた。一方、同じ学習対象データを、学習装置１０で学習させた後に、この学習によって得られたモデルを用いた検知装置２０による各種通信データに対する検知率を求めた。なお、検知率として、ＡＵＣ（Area Under Curve）値を求める。 [Evaluation experiment]
Next, the results of evaluation of the actual traffic session data between IoT devices by the conventional method and the method according to the first embodiment are shown. For example, the learning target communication data is MQTT communication of normal 21053 data and camera streaming of normal 1107 data. After learning the learning target data having a bias in the number of data by using the conventional method, the detection rate for various communication data by the detection device 20 using the model obtained by this learning was obtained. On the other hand, after learning the same learning target data with the learning device 10, the detection rate for various communication data by the detection device 20 using the model obtained by this learning was obtained. The AUC (Area Under Curve) value is obtained as the detection rate.

まず、従来手法を用いて評価を行った結果について説明する。従来手法は、本実施の形態１で示す前処理を実行せずに、そのままの学習対象データの確率密度を推定する手法である。 First, the result of evaluation using the conventional method will be described. The conventional method is a method of estimating the probability density of the learning target data as it is without executing the preprocessing shown in the first embodiment.

図５は、従来手法による学習状態による各検知対象に対するＲＯＣ曲線を示す図である。図５の（ａ）は、異常データであるマルウェア（ｍｉｒａｉｂｏｔ）に対する検知結果のＲＯＣ曲線である。図５の（ｂ）は、正常データであるＭＱＴＴ通信に対する検知結果のＲＯＣ曲線である。図５の（ｃ）は、正常データであるカメラストリーミングに対する検知結果のＲＯＣ曲線である。 FIG. 5 is a diagram showing ROC curves for each detection target according to the learning state by the conventional method. FIG. 5A is an ROC curve of the detection result for malware (mirai bot) which is abnormal data. FIG. 5B is a ROC curve of the detection result for MQTT communication which is normal data. FIG. 5C is an ROC curve of the detection result for camera streaming which is normal data.

従来手法を用いて検知を行った場合、図５の（ａ）に示すように、マルウェアの悪性通信は、ＡＵＣ値が０．９９９８と高い精度となっており、検知に成功していることが分かる。また、正常なＭＱＴＴ通信については、図５の（ｂ）に示すように、ＡＵＣ値が０．４７４６であり、異常データと検知しておらず、正常なふるまいを示す。このため、従来手法において、正常なＭＱＴＴ通信については、検知が成功していることが分かる。これは、ＭＱＴＴ通信については、学習対象データのデータ数が大量にあったため、確率密度推定の学習が精度よく行なえたためであると考えられる。 When detection is performed using the conventional method, as shown in FIG. 5 (a), the malicious communication of malware has a high accuracy of 0.9998, which means that the detection has been successful. I understand. Further, regarding the normal MQTT communication, as shown in FIG. 5 (b), the AUC value is 0.4746, which is not detected as abnormal data and shows normal behavior. Therefore, it can be seen that the detection of normal MQTT communication is successful in the conventional method. It is considered that this is because the MQTT communication has a large number of data to be learned, so that the learning of the probability density estimation can be performed accurately.

一方、図５の（ｃ）に示すように、正常データのカメラストリーミングは、正常であるにもかかわらず、ＡＵＣ値が０．９７１２と高く出てしまっている。これは、数の少ない正常データの確率密度推定の学習が進まない結果、誤検知してしまっているものと考えられる。 On the other hand, as shown in FIG. 5 (c), although the camera streaming of normal data is normal, the AUC value is as high as 0.9712. It is considered that this is a false detection as a result of the learning of the probability density estimation of the few normal data not progressing.

したがって、従来手法では、学習対象データのデータ数に偏りがある場合、データ数が少ない学習対象データについては学習が進まず、学習済みモデルを使用した検知においても、データ数が少ない種別のデータについては、正しい検知ができなかった。 Therefore, in the conventional method, when the number of data of the training target data is biased, the learning does not proceed for the learning target data with a small number of data, and even in the detection using the trained model, the data of the type with a small number of data is used. Could not detect correctly.

次に、本実施の形態１に係る学習装置１０の学習結果を用いて評価を行った結果について説明する。本実施の形態１では、前述の各宛先の通信データ数を揃える前処理を実行し、前処理後の学習対象データについて確率密度推定の学習を行う手法である。 Next, the result of evaluation using the learning result of the learning device 10 according to the first embodiment will be described. The first embodiment is a method of executing the pre-processing for aligning the number of communication data of each destination described above, and learning the probability density estimation for the learning target data after the pre-processing.

図６は、図１に示す学習装置１０の学習によって得られたモデルを用いた場合の各検知対象に対するＲＯＣ曲線を示す図である。図６の（ａ）は、異常データであるマルウェア（ｍｉｒａｉｂｏｔ）に対する検知結果のＲＯＣ曲線である。図６の（ｂ）は、正常データであるＭＱＴＴ通信に対する検知結果のＲＯＣ曲線である。図６の（ｃ）は、正常データであるカメラストリーミングに対する検知結果のＲＯＣ曲線である。 FIG. 6 is a diagram showing ROC curves for each detection target when the model obtained by learning of the learning device 10 shown in FIG. 1 is used. FIG. 6A is an ROC curve of the detection result for malware (mirai bot) which is abnormal data. FIG. 6B is a ROC curve of the detection result for MQTT communication which is normal data. FIG. 6C is an ROC curve of the detection result for camera streaming which is normal data.

学習装置１０によるモデルを用いて検知を行った場合、図６の（ａ）に示すように、マルウェアの悪性通信は、ＡＵＣ値が１．０と高い精度となっており、検知に成功していることが分かる。また、正常なＭＱＴＴ通信については、図６の（ｂ）に示すように、ＡＵＣ値が０．３８６４であり、異常データと検知しておらず、正しいふるまいを示す。さらに、図６の（ｃ）に示すように、正常データのカメラストリーミングは、学習対象データ数が少なかったにも関わらず、ＡＵＣ値が０．２７８７であり、異常データとして検知しておらず、正しいふるまいを示す。 When detection is performed using the model by the learning device 10, as shown in FIG. 6A, the malicious communication of malware has a high accuracy of AUC value of 1.0, and the detection is successful. You can see that there is. Further, for normal MQTT communication, as shown in FIG. 6B, the AUC value is 0.3864, which is not detected as abnormal data and shows correct behavior. Further, as shown in FIG. 6 (c), the camera streaming of the normal data has an AUC value of 0.2787 even though the number of data to be learned is small, and is not detected as abnormal data. Show correct behavior.

このように、本実施の形態１に係る学習装置１０によるモデルを用いて評価を行った結果、異常通信データの検知を正常に行うことができるとともに、正常な通信データに対する誤検知も低減できることが分かった。 As described above, as a result of evaluation using the model by the learning device 10 according to the first embodiment, it is possible to detect abnormal communication data normally and reduce false detections for normal communication data. Do you get it.

［実施の形態１の効果］
上記のように、実施の形態１では、正常な通信データに対し、一または複数の宛先の通信データの複製を加えることで、各宛先の通信データ数を揃える前処理を行い、前処理された学習対象データについて学習を行い、学習対象データの確率密度を推定して学習対象データの確率密度の特徴を表すモデルのパラメータを更新する。したがって、本実施の形態１によれば、学習対象データのデータ間にデータ数に偏りがある状態が解消され、確率密度を推定する学習処理を精度よく進めることができる。この結果、実施の形態１によれば、この学習処理により得られたモデルを用いることによって、誤検知の低減し、検知精度の向上を図ることができる。 [Effect of Embodiment 1]
As described above, in the first embodiment, the normal communication data is preprocessed by duplicating the communication data of one or more destinations to equalize the number of communication data of each destination, and the preprocessing is performed. Training is performed on the training target data, the probability density of the training target data is estimated, and the parameters of the model representing the characteristics of the probability density of the training target data are updated. Therefore, according to the first embodiment, the state where the number of data is biased among the data to be learned can be eliminated, and the learning process for estimating the probability density can be accurately advanced. As a result, according to the first embodiment, by using the model obtained by this learning process, it is possible to reduce false positives and improve the detection accuracy.

［実施の形態２］
次に、実施の形態２について説明する。実施の形態２に係る学習装置は、図１に示す学習装置と同様の構成を有する。 [Embodiment 2]
Next, the second embodiment will be described. The learning device according to the second embodiment has the same configuration as the learning device shown in FIG.

実施の形態２では、前処理部は、学習対象データに対し、各宛先の通信データ数を揃えるために通信データを複製するとともに、通信データの複製時にノイズを加える前処理を行う。この際、前処理部は、通信データの複製時に各次元に適用的なノイズを加える。 In the second embodiment, the preprocessing unit duplicates the communication data in order to make the number of communication data of each destination uniform with respect to the learning target data, and performs preprocessing for adding noise when the communication data is duplicated. At this time, the preprocessing unit adds applicable noise to each dimension when duplicating the communication data.

例えば、前処理部は、通信データ中に１００のオーダである特徴Ａと０．０１のオーダである特徴Ｂとが表れている場合、特徴Ａに対応させたノイズとして１のオーダのノイズを加え、特徴Ｂに対応させたノイズとして０．０００１のオーダのノイズを加える。また、前処理部は、複製した各通信データに加えるノイズとして、各通信データで発生位置を変えたノイズを加える。言い換えると、前処理部は、複製した各通信データに加えるノイズをばらつかせている。この場合、前処理部は、例えば、予め定めたルールにしたがって、各特徴に対応させたオーダのノイズであって、各通信データ間で発生位置がばらつくようにノイズを生成し、複製した各通信データに加える。 For example, when the feature A which is an order of 100 and the feature B which is an order of 0.01 appear in the communication data, the preprocessing unit adds the noise of the order 1 as the noise corresponding to the feature A. , The noise on the order of 0.0001 is added as the noise corresponding to the feature B. Further, the preprocessing unit adds noise whose generation position is changed in each communication data as noise to be added to each duplicated communication data. In other words, the pre-processing unit disperses the noise added to each duplicated communication data. In this case, the preprocessing unit generates, for example, noise of the order corresponding to each feature according to a predetermined rule, and generates noise so that the generation position varies among each communication data, and each communication duplicated. Add to data.

実施の形態２に係る学習装置は、図３に示すステップＳ１１と同様の処理を行った後、通信データ数が少ない宛先の通信データについて、通信データを複製するとともに、通信データの複製時にノイズを加える前処理を行う。そして、実施の形態２に係る学習装置は、ステップＳ１３，Ｓ１４と同様の処理を行う。 The learning device according to the second embodiment performs the same processing as in step S11 shown in FIG. 3, and then duplicates the communication data for the destination communication data having a small number of communication data, and also causes noise when the communication data is duplicated. Perform pretreatment to add. Then, the learning device according to the second embodiment performs the same processing as in steps S13 and S14.

実施の形態１では、各宛先の通信データ数が揃うように、通信データ数が少ない宛先については該宛先の通信データの複製を加えるため、この宛先については、学習部１３による学習時時に同じデータが複数現れることとなる。ここで、ＶＡＥ１３１は、同じデータを何度も学習させると学習が不安定になってしまう場合がある。本実施の形態２では、通信データの複製時にノイズを加えることによって、データ間のデータ数の偏りをなくすとともに、学習の安定化を図る。 In the first embodiment, the communication data of the destination is duplicated for the destination with a small number of communication data so that the number of communication data of each destination is the same. Therefore, the same data is used for this destination at the time of learning by the learning unit 13. Will appear more than once. Here, in VAE131, learning may become unstable if the same data is learned many times. In the second embodiment, noise is added when the communication data is duplicated to eliminate the bias in the number of data between the data and to stabilize the learning.

［評価実験］
実際に、ＩｏＴ機器間のトラフィックセッションデータに対し、本実施の形態２に係る手法とで評価を行った結果を示す。実施の形態１に対する評価実験と同様に、学習対象通信データは、正常な２１０５３データのＭＱＴＴ通信と、正常な１１０７データのカメラストリーミングとである。この学習対象データを、通信データの複製時にノイズを加える前処理を行った後に、ＶＡＥを用いて学習し、この学習によって得られたモデルを用いた検知装置による各種通信データに対する検知率を求めた。なお、検知率として、ＡＵＣ値を求める。 [Evaluation experiment]
The result of actually evaluating the traffic session data between IoT devices by the method according to the second embodiment is shown. Similar to the evaluation experiment for the first embodiment, the learning target communication data is MQTT communication of normal 21053 data and camera streaming of normal 1107 data. This learning target data was trained using VAE after preprocessing to add noise when duplicating the communication data, and the detection rate for various communication data by the detection device using the model obtained by this learning was obtained. .. The AUC value is obtained as the detection rate.

図７は、実施の形態２に係る学習方法によって得られたモデルを用いた場合の各検知対象に対するＲＯＣ曲線を示す図である。図７の（ａ）は、異常データであるマルウェア（ｍｉｒａｉｂｏｔ）に対する検知結果のＲＯＣ曲線である。図７の（ｂ）は、正常データであるＭＱＴＴ通信に対する検知結果のＲＯＣ曲線である。図７の（ｃ）は、正常データであるカメラストリーミングに対する検知結果のＲＯＣ曲線である。 FIG. 7 is a diagram showing ROC curves for each detection target when the model obtained by the learning method according to the second embodiment is used. FIG. 7A is an ROC curve of the detection result for malware (mirai bot) which is abnormal data. FIG. 7B is a ROC curve of the detection result for MQTT communication which is normal data. FIG. 7C is an ROC curve of the detection result for camera streaming which is normal data.

実施の形態２に係る学習方法によって得られたモデルを用いて検知を行った場合、図７の（ａ）に示すように、マルウェアの悪性通信は、ＡＵＣ値が０．９９９９と高い精度となっており、検知に成功していることが分かる。また、正常なＭＱＴＴ通信については、図７の（ｂ）に示すように、ＡＵＣ値が０．３７０６であり、異常データと検知しておらず、正しいふるまいを示す。さらに、図７の（ｃ）に示すように、正常データのカメラストリーミングは、学習対象データ数が少なかったにも関わらず、ＡＵＣ値が０．３１３４であり、異常データとして検知しておらず、正しいふるまいを示す。 When detection is performed using the model obtained by the learning method according to the second embodiment, the malicious communication of malware has a high accuracy of 0.9999 as shown in FIG. 7A. It can be seen that the detection was successful. Further, for normal MQTT communication, as shown in FIG. 7 (b), the AUC value is 0.3706, which is not detected as abnormal data and shows correct behavior. Further, as shown in FIG. 7 (c), the camera streaming of normal data has an AUC value of 0.3134 and is not detected as abnormal data even though the number of data to be learned is small. Show correct behavior.

このように、本実施の形態２に係る学習方法によって得られたモデルを用いて評価を行った結果、実施の形態１と同様に、異常通信データの検知を正常に行うことができるとともに、正常な通信データに対する誤検知も低減できることが分かった。 As a result of evaluation using the model obtained by the learning method according to the second embodiment in this way, as in the first embodiment, the abnormal communication data can be detected normally and is normal. It was found that false positives for various communication data can be reduced.

［実施の形態２の効果］
上記のように、実施の形態２では、学習対象データに対し、各宛先の通信データ数を揃えるために通信データを複製するとともに、通信データの複製時にノイズを加える前処理を行い、前処理された学習対象データについて学習を行っている。したがって、本実施の形態２によれば、学習対象データのデータ間にデータ数に偏りがある状態が解消されるとともに、学習の安定化を図ることができるため、実施の形態１と同様の効果を奏する。 [Effect of Embodiment 2]
As described above, in the second embodiment, the communication data is duplicated with respect to the learning target data in order to make the number of communication data of each destination uniform, and the preprocessing for adding noise at the time of duplicating the communication data is performed and the preprocessing is performed. We are learning about the data to be learned. Therefore, according to the second embodiment, the state in which the number of data is biased among the data to be learned can be eliminated, and the learning can be stabilized. Therefore, the same effect as that of the first embodiment can be obtained. Play.

［システム構成等］
図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部又は一部を、各種の負荷や使用状況等に応じて、任意の単位で機能的又は物理的に分散・統合して構成することができる。さらに、各装置にて行なわれる各処理機能は、その全部又は任意の一部が、ＣＰＵ及び当該ＣＰＵにて解析実行されるプログラムにて実現され、あるいは、ワイヤードロジックによるハードウェアとして実現され得る。 [System configuration, etc.]
Each component of each of the illustrated devices is a functional concept and does not necessarily have to be physically configured as shown in the figure. That is, the specific form of distribution / integration of each device is not limited to the one shown in the figure, and all or part of the device is functionally or physically distributed in arbitrary units according to various loads and usage conditions. Can be integrated and configured. Further, each processing function performed by each device may be realized by a CPU and a program analyzed and executed by the CPU, or may be realized as hardware by wired logic.

また、本実施の形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部又は一部を手動的におこなうこともでき、あるいは、手動的に行なわれるものとして説明した処理の全部又は一部を公知の方法で自動的におこなうこともできる。この他、上記文書中や図面中で示した処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。 Further, among the processes described in the present embodiment, all or part of the processes described as being automatically performed can be manually performed, or the processes described as being manually performed. It is also possible to automatically perform all or part of the above by a known method. In addition, the processing procedure, control procedure, specific name, and information including various data and parameters shown in the above document and drawings can be arbitrarily changed unless otherwise specified.

［プログラム］
図８は、プログラムが実行されることにより、学習装置１０が実現されるコンピュータの一例を示す図である。コンピュータ１０００は、例えば、メモリ１０１０、ＣＰＵ１０２０を有する。また、コンピュータ１０００は、ハードディスクドライブインタフェース１０３０、ディスクドライブインタフェース１０４０、シリアルポートインタフェース１０５０、ビデオアダプタ１０６０、ネットワークインタフェース１０７０を有する。これらの各部は、バス１０８０によって接続される。 [program]
FIG. 8 is a diagram showing an example of a computer in which the learning device 10 is realized by executing a program. The computer 1000 has, for example, a memory 1010 and a CPU 1020. The computer 1000 also has a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070. Each of these parts is connected by a bus 1080.

メモリ１０１０は、ＲＯＭ１０１１及びＲＡＭ１０１２を含む。ＲＯＭ１０１１は、例えば、ＢＩＯＳ（Basic Input Output System）等のブートプログラムを記憶する。ハードディスクドライブインタフェース１０３０は、ハードディスクドライブ１０９０に接続される。ディスクドライブインタフェース１０４０は、ディスクドライブ１１００に接続される。例えば磁気ディスクや光ディスク等の着脱可能な記憶媒体が、ディスクドライブ１１００に挿入される。シリアルポートインタフェース１０５０は、例えばマウス１１１０、キーボード１１２０に接続される。ビデオアダプタ１０６０は、例えばディスプレイ１１３０に接続される。 Memory 1010 includes ROM 1011 and RAM 1012. The ROM 1011 stores, for example, a boot program such as a BIOS (Basic Input Output System). The hard disk drive interface 1030 is connected to the hard disk drive 1090. The disk drive interface 1040 is connected to the disk drive 1100. For example, a removable storage medium such as a magnetic disk or an optical disk is inserted into the disk drive 1100. The serial port interface 1050 is connected to, for example, a mouse 1110 and a keyboard 1120. The video adapter 1060 is connected to, for example, the display 1130.

ハードディスクドライブ１０９０は、例えば、ＯＳ（Operating System）１０９１、アプリケーションプログラム１０９２、プログラムモジュール１０９３、プログラムデータ１０９４を記憶する。すなわち、学習装置１０の各処理を規定するプログラムは、コンピュータにより実行可能なコードが記述されたプログラムモジュール１０９３として実装される。プログラムモジュール１０９３は、例えばハードディスクドライブ１０９０に記憶される。例えば、学習装置１０における機能構成と同様の処理を実行するためのプログラムモジュール１０９３が、ハードディスクドライブ１０９０に記憶される。なお、ハードディスクドライブ１０９０は、ＳＳＤ（Solid State Drive）により代替されてもよい。 The hard disk drive 1090 stores, for example, an OS (Operating System) 1091, an application program 1092, a program module 1093, and program data 1094. That is, the program that defines each process of the learning device 10 is implemented as a program module 1093 in which a code that can be executed by a computer is described. The program module 1093 is stored in, for example, the hard disk drive 1090. For example, a program module 1093 for executing a process similar to the functional configuration in the learning device 10 is stored in the hard disk drive 1090. The hard disk drive 1090 may be replaced by an SSD (Solid State Drive).

また、上述した実施の形態の処理で用いられる設定データは、プログラムデータ１０９４として、例えばメモリ１０１０やハードディスクドライブ１０９０に記憶される。そして、ＣＰＵ１０２０が、メモリ１０１０やハードディスクドライブ１０９０に記憶されたプログラムモジュール１０９３やプログラムデータ１０９４を必要に応じてＲＡＭ１０１２に読み出して実行する。 Further, the setting data used in the processing of the above-described embodiment is stored as program data 1094 in, for example, a memory 1010 or a hard disk drive 1090. Then, the CPU 1020 reads the program module 1093 and the program data 1094 stored in the memory 1010 and the hard disk drive 1090 into the RAM 1012 as needed, and executes the program.

なお、プログラムモジュール１０９３やプログラムデータ１０９４は、ハードディスクドライブ１０９０に記憶される場合に限らず、例えば着脱可能な記憶媒体に記憶され、ディスクドライブ１１００等を介してＣＰＵ１０２０によって読み出されてもよい。あるいは、プログラムモジュール１０９３及びプログラムデータ１０９４は、ネットワーク（ＬＡＮ、ＷＡＮ（Wide Area Network）等）を介して接続された他のコンピュータに記憶されてもよい。そして、プログラムモジュール１０９３及びプログラムデータ１０９４は、他のコンピュータから、ネットワークインタフェース１０７０を介してＣＰＵ１０２０によって読み出されてもよい。 The program module 1093 and the program data 1094 are not limited to those stored in the hard disk drive 1090, but may be stored in, for example, a removable storage medium and read by the CPU 1020 via the disk drive 1100 or the like. Alternatively, the program module 1093 and the program data 1094 may be stored in another computer connected via a network (LAN, WAN (Wide Area Network), etc.). Then, the program module 1093 and the program data 1094 may be read by the CPU 1020 from another computer via the network interface 1070.

以上、本発明者によってなされた発明を適用した実施の形態について説明したが、本実施の形態による本発明の開示の一部をなす記述及び図面により本発明は限定されることはない。すなわち、本実施の形態に基づいて当業者等によりなされる他の実施の形態、実施例及び運用技術等は全て本発明の範疇に含まれる。 Although the embodiment to which the invention made by the present inventor is applied has been described above, the present invention is not limited by the description and the drawings which form a part of the disclosure of the present invention according to the present embodiment. That is, other embodiments, examples, operational techniques, and the like made by those skilled in the art based on the present embodiment are all included in the scope of the present invention.

１検知システム
２ＩｏＴ機器
３ネットワーク
１０学習装置
１１収集部
１２前処理部
１３学習部
２０検知装置
１３１ＶＡＥ 1 Detection system 2 IoT equipment 3 Network 10 Learning device 11 Collection unit 12 Preprocessing unit 13 Learning unit 20 Detection device 131 VAE

Claims

A collection unit that collects multiple normal communication data to be learned by receiving normal communication data from multiple external communication devices.
For a protocol with a small number of communication data, a preprocessing unit that duplicates the communication data of the protocol with a small number of communication data so that the number of communication data of each protocol is uniform with respect to the normal communication data.
A learning unit that learns about the normal communication data processed by the preprocessing unit, estimates the probability density of the normal communication data, and updates the parameters of the model representing the characteristics of the probability density of the normal communication data. When,
A learning device characterized by having.

The learning device according to claim 1, wherein the preprocessing unit adds noise when duplicating the communication data.

A learning method performed by a learning device,
A collection process that collects multiple normal communication data to be learned by receiving normal communication data from multiple external communication devices.
For a protocol having a small number of communication data, a preprocessing step of adding a copy of the communication data of the protocol having a small number of communication data so that the number of communication data of each protocol is the same as that of the normal communication data.
A learning step of learning about the normal communication data processed in the preprocessing step, estimating the probability density of the normal communication data, and updating the parameters of the model representing the characteristics of the probability density of the normal communication data. When,
A learning method characterized by including.

A collection step that collects multiple normal communication data to be learned by receiving normal communication data from multiple external communication devices,
For a protocol having a small number of communication data, a preprocessing step of adding a copy of the communication data of the protocol having a small number of communication data so that the number of communication data of each protocol is the same as that of the normal communication data.
A learning step in which learning is performed on the normal communication data processed in the preprocessing step, the probability density of the normal communication data is estimated, and the parameters of the model representing the characteristics of the probability density of the normal communication data are updated. When,
A learning program that lets a computer run.