JP2023031591A

JP2023031591A - Anomaly determination system, anomaly determination program, and anomaly determination method

Info

Publication number: JP2023031591A
Application number: JP2021137181A
Authority: JP
Inventors: 康太土江; Kota Tsuchie; 信之中村; Nobuyuki Nakamura
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2021-08-25
Filing date: 2021-08-25
Publication date: 2023-03-09

Abstract

To reduce a false-positive rate while maintaining detection accuracy.SOLUTION: An anomaly determination system which determines an anomaly on a network from analysis target data includes: multiple pieces of detection means; filter processing means which executes filter processing based on filter condition data, on each of results detected by the detection means; feature quantity processing means which acquires feature quantity data from each of the results of the detection means; determination means which determines an anomaly on the network by supplying the feature quantity data to a discriminator equipped with a machine learned learning model; feedback receiving means which receives false-positive feedback for a determination result; and filter condition generation means which generates filter condition data to be set on the filter processing means in accordance with the contents of the false-positive feedback.SELECTED DRAWING: Figure 1

Description

この発明は、異常判定システム、異常判定プログラム、及び異常判定方法に関し、例えば、セキュリティ機器のログを分析し、マルウェア感染や情報漏洩といった異常を検出するシステムに適用し得る。 The present invention relates to an anomaly determination system, an anomaly determination program, and an anomaly determination method, and can be applied, for example, to a system that analyzes logs of security devices and detects anomalies such as malware infection and information leakage.

近年、年々増加・進化するサイバー攻撃が社会問題となっている。サイバー攻撃に対しては、プロキシやファイアウォールなどのネットワーク機器での防御が従来有効であるが、近年ではネットワーク機器の機能だけでは防ぎきれない攻撃も増えている。このような攻撃に対しては、ネットワーク機器のログを分析し攻撃の見逃しを防ぐことが重要である。しかし、従業員数の多い企業ではネットワーク機器のログの量は膨大であり、人手で分析するのは非常に困難である。こういった膨大なログに対しては機械的に異常な通信ログを絞り込むアプローチが有効である。 In recent years, cyberattacks, which are increasing and evolving year by year, have become a social problem. Network devices such as proxies and firewalls have traditionally been effective against cyber-attacks, but in recent years, the number of attacks that cannot be prevented by the functions of network devices alone is increasing. Against such attacks, it is important to analyze the logs of network devices to prevent attacks from being overlooked. However, in a company with a large number of employees, the amount of network device logs is enormous, and it is very difficult to analyze them manually. An effective approach is to mechanically narrow down abnormal communication logs for such huge logs.

ログ分析を用いた異常判定方法の一つとして、攻撃通信の挙動や振舞いといった特徴を検知エンジンとして落とし込み、特徴に合致するログを異常として検出する方法が考えられる。しかしながら、攻撃の特徴は目的や手段によって多種多様であり、単一の検知エンジンでは攻撃を網羅し切れない。また単一の検知エンジンでは、正常な通信であるが攻撃と同様の特徴を持つ通信を異常と判定する誤判定も避けられない。そこで複数の検知エンジンを導入し、それら複数エンジンの検知結果から総合的に判断して、本当に異常と思われる通信のみを異常と検出する方法をとることで、検知率と誤判定率の両方を改善できる。 As one of the anomaly determination methods using log analysis, a method is conceivable in which the behavior and behavior of the attacking communication are used as a detection engine, and logs that match the characteristics are detected as anomalies. However, the characteristics of attacks vary widely depending on the purpose and method, and a single detection engine cannot cover all attacks. In addition, with a single detection engine, erroneous determinations of normal communications that have similar characteristics to attacks as abnormalities cannot be avoided. Therefore, by introducing multiple detection engines and comprehensively judging from the detection results of those multiple engines, adopting a method to detect only the communication that seems to be really abnormal, both the detection rate and the misjudgment rate are improved. can.

複数の検知エンジンの検知結果から総合的に異常を判断するために、各検知エンジンの検知結果（例えば０～１００点の異常スコアなど）を特徴量とし、過去の異常な通信ログと正常な通信ログにラベル付けしたものを教師データとして機械学習する方法が考えられる。機械学習では、検知エンジン間の相関も見て異常な通信を絞り込むため、人の目では気付けない攻撃の特徴から異常な通信を検知できることが期待される。機械学習では、異常な通信ログの見落としを減らすのか、誤判定を減らすのかといった目的に応じてハイパーパラメータと呼ばれるパラメータを調整する。サイバー攻撃検知では、異常の見逃しを防ぐことが重要であるため、機械学習器の学習時に異常な通信ログの検知率が高くなるようにパラメータを設定することが多い。このようにした場合、異常な通信ログの特徴量に近い正常な通信ログも異常検知されやすくなるため誤判定が発生する。誤判定に対しては、システム利用者が異常検知されたログが本当に異常か調査し結果誤判定であった時に、システムにフィードバックを行い、システムが誤判定フィードバックされたログを正常通信としてラベル付けして機械学習器を再学習させることで誤判定率を減らす効果が期待される。しかし、機械学習器は異常を重視して学習されるため、一度の誤判定フィードバックでは機械学習器の異常検知精度に影響を与えない場合がある。システム利用者の立場で考えると、一度誤判定フィードバックしたログは以後検知されない、または異常検知結果が異常の度合いを示すスコアで表示される場合は検知スコアが減少することが望ましい。 In order to comprehensively judge anomalies from the detection results of multiple detection engines, the detection results of each detection engine (for example, anomaly scores of 0 to 100 points) are used as feature quantities, and past abnormal communication logs and normal communication A method of performing machine learning using labeled logs as teacher data is conceivable. Machine learning narrows down abnormal communications by looking at correlations between detection engines, so it is expected to be able to detect abnormal communications from characteristics of attacks that are invisible to the human eye. In machine learning, parameters called hyperparameters are adjusted according to the purpose of reducing oversight of abnormal communication logs or reducing erroneous judgments. In cyberattack detection, it is important to prevent anomalies from being overlooked, so parameters are often set to increase the detection rate of anomalous communication logs during machine learning. In this case, an erroneous determination occurs because even a normal communication log that is close to the characteristic amount of an abnormal communication log is likely to be detected as abnormal. For erroneous judgments, the system user investigates whether the log in which an abnormality was detected is really abnormal, and when the result is an erroneous judgment, feedback is provided to the system, and the system labels the log with the erroneous judgment feedback as normal communication. It is expected that the misjudgment rate will be reduced by retraining the machine learning device. However, since the machine learner learns with an emphasis on anomalies, there are cases in which a single misjudgment feedback does not affect the anomaly detection accuracy of the machine learner. From the system user's point of view, it is desirable that logs that have once been fed back an erroneous judgment will not be detected again, or that the detection score will decrease if the anomaly detection result is displayed with a score indicating the degree of anomaly.

誤判定フィードバックされたログを以後利用者に提示しない方法として、例えば誤判定ログの通信先ＦＱＤＮ（ＦｕｌｌｙＱｕａｌｉｆｉｅｄＤｏｍａｉｎＮａｍｅ）をホワイトリストで管理し条件に合致するログをフィルタして利用者に表示させない方法が考えられる。この方法では、誤判定ログは異常検知結果として現れなくなるが、例えばホワイトリストに設定した通信先ホストのＷｅｂサイトが攻撃者によって改ざんされたり、乗っ取られたりした場合に異常検知結果に表示されないという問題が発生し得る。また、誤判定フィードバックの回数が増えれば、誤判定されたログの誤判定率は減少していくと考えられるが、今度は異常検知されていた異常通信ログが正常と判定される見逃しの問題が発生し得る。 As a method of not presenting logs that have received incorrect judgment feedback to the user, for example, the communication destination FQDN (Fully Qualified Domain Name) of the incorrect judgment log is managed with a white list, and logs that match the conditions are filtered and not displayed to the user. I can think of a way. With this method, erroneous judgment logs do not appear as anomaly detection results. can occur. Also, if the number of false positive feedbacks increases, the false positive rate of false negative logs will decrease. can.

以上より、異常検知精度は維持したまま、誤判定フィードバックされたログを異常検知させない方法が求められる。 From the above, there is a demand for a method that does not detect anomalies in logs that are fed back erroneous judgments while maintaining anomaly detection accuracy.

従来の異常判定システムとしては、例えば、特許文献１の記載技術が存在する。 As a conventional abnormality determination system, for example, the technology described in Patent Document 1 exists.

特許文献１の記載技術は、検知ルールが異なる複数の異常検知手段を多層化して異常検知する場合に、誤判定や見逃しを抑えることを目的とした技術である。そして、特許文献１に記載されたシステムでは、異常通信識別ルールに基づいてルールに合致する通信をブロックする第１の異常通信検知システムと、正常通信識別ルール（正常状態を学習させた判定器）に基づいてルールに合致しない通信をブロックする第２の異常通信検知システムの２つからなる。また、特許文献１に記載されたシステムでは、誤判定（正常だが第１の異常判定システムで異常と識別）された通信に対して、当該通信を異常と識別した異常識別ルールを削除する。さらに、特許文献１では、見逃し（正解は異常だが第２の異常判定システムで正常通信と識別）した通信に対して、当該通信を第１の異常判定システムで異常と識別するように異常識別ルールを追加する。これにより、特許文献１に記載されたシステムでは、誤判定と見逃しの両方を抑えることができる。 The technique described in Patent Literature 1 is a technique aimed at suppressing erroneous determinations and oversights when anomaly detection is performed by multilayering a plurality of anomaly detection means with different detection rules. Then, in the system described in Patent Document 1, a first abnormal communication detection system that blocks communication that matches the rule based on the abnormal communication identification rule, and a normal communication identification rule (determination device that learns the normal state) A second abnormal communication detection system that blocks communication that does not match the rules based on Further, in the system described in Patent Literature 1, for a communication erroneously determined (normal but identified as abnormal by the first abnormality determination system), the abnormality identification rule that identified the communication as abnormal is deleted. Furthermore, in Patent Literature 1, an abnormality identification rule is provided so that the first abnormality determination system identifies an overlooked communication (the correct answer is abnormal, but the second abnormality determination system identifies the communication as normal) as an abnormality. Add As a result, the system described in Patent Literature 1 can suppress both erroneous determinations and oversights.

特開２０１９－４２６０号公報Japanese Patent Application Laid-Open No. 2019-4260

しかしながら、特許文献１に記載された第１の異常判定システムは、異常検知をルールベースで行うため、誤判定の際にどのルールが誤判定の原因なのか判断することは容易だが、本発明が対象とする複数の異常検知エンジンを使って機械学習で総合的に異常検知するシステムでは、検知結果から検知に寄与した検知エンジンのルールまでを特定するのは従来技術の構成では困難である。また、特許文献１に記載された第２の異常判定システムでは、機械学習により正常パターンを学習させて異常検知することを想定しているが、正常パターンを学習させて異常検知する方式は、異常な通信パターンを学習させる方式に比べ、異常な通信パターンを用意する必要が無い一方で一般に誤判定が多いという課題がある。したがって、特許文献１において、第２の異常判定システムでの誤判定の抑制も必要となると考えられる。 However, since the first abnormality determination system described in Patent Document 1 performs abnormality detection based on rules, it is easy to determine which rule is the cause of an erroneous determination when an erroneous determination is made. In a system that uses machine learning to comprehensively detect anomalies using multiple target anomaly detection engines, it is difficult to identify the rules of the detection engines that contributed to the detection from the detection results using conventional technology. In addition, in the second abnormality determination system described in Patent Document 1, it is assumed that normal patterns are learned by machine learning and abnormality detection is performed. Compared to the method of learning an abnormal communication pattern, there is a problem that there is generally a large number of erroneous judgments while there is no need to prepare an abnormal communication pattern. Therefore, in Patent Document 1, it is considered necessary to suppress erroneous determination in the second abnormality determination system.

以上のような問題に鑑みて、検知精度を維持したまま誤判定率を低減することができる異常判定システム、異常判定プログラム、及び異常判定方法が望まれている。 In view of the above problems, an abnormality determination system, an abnormality determination program, and an abnormality determination method capable of reducing the erroneous determination rate while maintaining the detection accuracy are desired.

第１の本発明は、ネットワーク上に配置された１又は複数のネットワーク装置から収集した解析対象データから、前記ネットワーク上の異常を判定する異常判定システムにおいて、それぞれ異なる観点で前記解析対象データから異常を検知する複数の検知手段と、前記検知手段ごとのフィルタ条件が記述されたフィルタ条件データを保持し、前記検知手段ごとの検知結果について前記フィルタ条件データに基づいたフィルタ処理を行い、それぞれの前記検知手段について前記フィルタ処理後の検知結果を出力するフィルタ処理手段と、前記フィルタ処理手段が出力するそれぞれの前記検知手段の検知結果に基づき機械学習に適した特徴量データを取得する特徴量処理手段と、教師データに基づき機械学習した学習モデルを備える判定器に、前記特徴量データを供給して前記ネットワーク上の異常を判定する判定手段と、前記判定手段による判定結果が誤判定である旨の誤判定フィードバックを受け付けるフィードバック受付手段と、前記フィードバック受付手段が受け付けた前記誤判定フィードバックの内容に従って前記フィルタ処理手段に設定する前記フィルタ条件データを生成するフィルタ条件生成手段とを有することを特徴とする。 A first aspect of the present invention provides an anomaly determination system for determining an anomaly on a network from analysis target data collected from one or a plurality of network devices arranged on the network. A plurality of detection means for detecting and filter condition data describing filter conditions for each of the detection means are held, and the detection results of each of the detection means are filtered based on the filter condition data, and each of the Filter processing means for outputting the detection result after the filter processing for the detection means, and feature amount processing means for acquiring feature amount data suitable for machine learning based on the detection results of the respective detection means output by the filter processing means. and determining means for determining an abnormality on the network by supplying the feature amount data to a determining device equipped with a learning model machine-learned based on teacher data, and indicating that the determination result by the determining means is an erroneous determination. and a filter condition generation means for generating the filter condition data to be set in the filter processing means according to the content of the erroneous judgment feedback received by the feedback reception means. .

第２の本発明の異常判定プログラムは、ネットワーク上に配置された１又は複数のネットワーク装置から収集した解析対象データから、前記ネットワーク上の異常を判定する異常判定システムを構成するコンピュータを、それぞれ異なる観点で前記解析対象データから異常を検知する複数の検知手段と、前記検知手段ごとのフィルタ条件が記述されたフィルタ条件データを保持し、前記検知手段ごとの検知結果について前記フィルタ条件データに基づいたフィルタ処理を行い、それぞれの前記検知手段について前記フィルタ処理後の検知結果を出力するフィルタ処理手段と、前記フィルタ処理手段が出力するそれぞれの前記検知手段の検知結果に基づき機械学習に適した特徴量データを取得する特徴量処理手段と、教師データに基づき機械学習した学習モデルを備える判定器に、前記特徴量データを供給して前記ネットワーク上の異常を判定する判定手段と、前記判定手段による判定結果が誤判定である旨の誤判定フィードバックを受け付けるフィードバック受付手段と、前記フィードバック受付手段が受け付けた前記誤判定フィードバックの内容に従って前記フィルタ処理手段に設定する前記フィルタ条件データを生成するフィルタ条件生成手段として機能させることを特徴とする。 According to the abnormality determination program of the second aspect of the present invention, different computers constituting an abnormality determination system for determining an abnormality on the network from analysis target data collected from one or a plurality of network devices arranged on the network. a plurality of detection means for detecting anomalies from the data to be analyzed from a viewpoint, and filter condition data describing filter conditions for each of the detection means are held; filtering means for performing filtering and outputting detection results after the filtering for each of the detecting means; and feature amounts suitable for machine learning based on the detection results of the respective detecting means output by the filtering means. feature amount processing means for acquiring data; determination means for determining abnormality on the network by supplying the feature amount data to a determination device having a learning model machine-learned based on teacher data; and determination by the determination means. feedback receiving means for receiving erroneous determination feedback indicating that a result is an erroneous determination; and filter condition generating means for generating the filter condition data to be set in the filtering means according to the content of the erroneous determination feedback received by the feedback receiving means. It is characterized by functioning as

第３の本発明は、ネットワーク上に配置された１又は複数のネットワーク装置から収集した解析対象データから、前記ネットワーク上の異常を判定する異常判定システムが行う異常判定方法において、前記異常判定システムは、複数の検知手段、フィルタ処理手段、特徴量処理手段、判定手段、フィードバック受付手段、及びフィルタ条件生成手段を備え、それぞれの前記検知手段は、それぞれ異なる観点で前記解析対象データから異常を検知し、前記フィルタ処理手段は、前記検知手段ごとのフィルタ条件が記述されたフィルタ条件データを保持し、前記検知手段ごとの検知結果について前記フィルタ条件データに基づいたフィルタ処理を行い、それぞれの前記検知手段について前記フィルタ処理後の検知結果を出力し、前記特徴量処理手段は、前記フィルタ処理手段が出力するそれぞれの前記検知手段の検知結果に基づき機械学習に適した特徴量データを取得し、前記判定手段は、教師データに基づき機械学習した学習モデルを備える判定器に、前記特徴量データを供給して前記ネットワーク上の異常を判定し、前記フィードバック受付手段は、前記判定手段による判定結果が誤判定である旨の誤判定フィードバックを受け付け、前記フィルタ条件生成手段は、前記フィードバック受付手段が受け付けた前記誤判定フィードバックの内容に従って前記フィルタ処理手段に設定する前記フィルタ条件データを生成することを特徴とする。 A third aspect of the present invention provides an abnormality determination method performed by an abnormality determination system that determines an abnormality on the network from analysis target data collected from one or more network devices arranged on the network, wherein the abnormality determination system , a plurality of detection means, filtering means, feature amount processing means, determination means, feedback reception means, and filter condition generation means, and each of the detection means detects an abnormality from the analysis target data from a different point of view. , the filter processing means holds filter condition data describing filter conditions for each of the detection means, performs filter processing on the detection results of each of the detection means based on the filter condition data, and performs filter processing for each of the detection means The feature amount processing means acquires feature amount data suitable for machine learning based on the detection results of the respective detection means output by the filtering means, and the determination The means supplies the feature amount data to a determiner equipped with a learning model machine-learned based on teacher data to determine an abnormality on the network, and the feedback receiving means determines whether the determination result by the determination means is an erroneous determination. and the filter condition generating means generates the filter condition data to be set in the filtering means according to the content of the erroneous determination feedback received by the feedback receiving means. .

本発明によれば、検知精度を維持したまま誤判定率を低減する異常判定システムを提供することができる。 Advantageous Effects of Invention According to the present invention, it is possible to provide an abnormality determination system that reduces an erroneous determination rate while maintaining detection accuracy.

第１の実施形態に係る異常判定システムの全体構成を示すブロック図である。1 is a block diagram showing the overall configuration of an abnormality determination system according to a first embodiment; FIG. 第１の実施形態に係る、特徴量データの構成例について示した図である。4 is a diagram showing a configuration example of feature amount data according to the first embodiment; FIG. 第１の実施形態に係る、判定結果表示画面の構成例について示した図である。FIG. 7 is a diagram showing a configuration example of a determination result display screen according to the first embodiment; 第１の実施形態に係る、フィルタ条件生成定義情報の構成例について示した図である。4 is a diagram showing a configuration example of filter condition generation definition information according to the first embodiment; FIG. 第１の実施形態に係る、検知処理部の動作について示したフローチャートである。4 is a flowchart showing operations of a detection processing unit according to the first embodiment; 第１の実施形態に係る、特徴量処理部、判定部（判定器）及び判定器作成部の動作の例について示したフローチャートである。7 is a flowchart showing an example of operations of a feature quantity processing unit, a determination unit (determiner), and a determiner creation unit according to the first embodiment; 第１の実施形態に係る、判定結果処理部及びフィルタ条件生成部の動作の例について示したフローチャートである。8 is a flowchart showing an example of operations of a determination result processing unit and a filter condition generation unit according to the first embodiment; 第２の実施形態に係る判定結果処理部が表示する判定結果表示画面の構成例（その１）について示した図である。FIG. 11 is a diagram showing a configuration example (part 1) of a determination result display screen displayed by a determination result processing unit according to the second embodiment; 第２の実施形態に係る判定結果処理部が表示する判定結果表示画面の構成例（その２）について示した図である。FIG. 11 is a diagram showing a configuration example (No. 2) of a determination result display screen displayed by a determination result processing unit according to the second embodiment; 第２の実施形態に係るフィルタ条件生成定義情報の構成例について示した図である。FIG. 11 is a diagram showing a configuration example of filter condition generation definition information according to the second embodiment; 第２の実施形態に係る判定結果処理部及びフィルタ条件生成部の動作について示したフローチャートである。9 is a flow chart showing operations of a determination result processing unit and a filter condition generation unit according to the second embodiment; 第１の実施形態の変形実施例に係る判定結果表示画面について示した図である。It is the figure which showed about the determination result display screen based on the modification example of 1st Embodiment.

（Ａ）第１の実施形態
以下、本発明による異常判定システム、異常判定プログラム、及び異常判定方法の第１の実施形態を、図面を参照しながら詳述する。 (A) First Embodiment Hereinafter, a first embodiment of an abnormality determination system, an abnormality determination program, and an abnormality determination method according to the present invention will be described in detail with reference to the drawings.

（Ａ－１）第１の実施形態の構成
図１は、この実施形態の異常判定システム１の全体構成を示すブロック図である。なお、図１において括弧内の符号は、後述する第２の実施形態でのみ用いられる符号である。 (A-1) Configuration of First Embodiment FIG. 1 is a block diagram showing the overall configuration of an abnormality determination system 1 of this embodiment. Note that the symbols in parentheses in FIG. 1 are symbols used only in the second embodiment, which will be described later.

異常判定システム１は、検知対象であるネットワークＮＥから解析対象のデータ（例えば、ネットワークＮＥ上に配置されたネットワーク機器のログ情報；以下「解析対象データ」とも呼ぶ）を取得し、取得した解析対象データに基づいてネットワークＮＥ上の異常を検知する処理（以下、「異常検知処理」と呼ぶ）を行う。ここでは、異常判定システム１は、ネットワークＮＥ上に配置された１又は複数のネットワーク機器（例えば、プロキシサーバ、ファイアウォール、ルータ、Ｌ４スイッチ等）上で発生した種々のログ情報を解析対象データとして取得し、取得したログ情報に基づいて異常検知処理を行うものとする。 The abnormality determination system 1 acquires analysis target data (for example, log information of network devices arranged on the network NE; hereinafter also referred to as “analysis target data”) from a network NE that is a detection target, and acquires the acquired analysis target. Based on the data, a process of detecting an abnormality on the network NE (hereinafter referred to as "anomaly detection process") is performed. Here, the abnormality determination system 1 acquires various log information generated on one or a plurality of network devices (for example, proxy servers, firewalls, routers, L4 switches, etc.) placed on the network NE as data to be analyzed. and perform anomaly detection processing based on the acquired log information.

ログ情報の内容や形式は限定されないものである。この実施形態の例では、説明を簡易とするため、ログ情報は、ネットワークＮＥ上の図示しないプロキシサーバにおける通信ログであるものとして説明する。この実施形態の例におけるプロキシサーバは、図示しない通信元のクライアント端末から通信先（例えば、所定のＵＲＬにより指定されたインターネット上のサイト）への通信を中継する際に、通信ごと（例えば、通信セッションごと）の情報をログ情報として出力するものとして説明する。 The contents and format of the log information are not limited. In the example of this embodiment, to simplify the explanation, the log information is assumed to be a communication log in a proxy server (not shown) on the network NE. The proxy server in the example of this embodiment relays communication from a communication source client terminal (not shown) to a communication destination (for example, a site on the Internet specified by a predetermined URL) for each communication (for example, communication session) is output as log information.

このようなプロキシサーバが通信ごとに出力するログ情報には、例えば、当該通信における通信先のホストを特定するためのＦＱＤＮ（以下、「通信先ＦＱＤＮ」と呼ぶ）、当該通信において伝送されたデータ量（以下、「送信バイト数」と呼ぶ）、当該通信において通信元で用いられたＵＲＬからＦＱＤＮを除外したパス部分（以下、「ＵＲＬパス」と呼ぶ）等が含まれる。たとえば、ＵＲＬ全体が「ｘｘｘ．ｙｙｙ．ｊｐ／ａ．ｅｘｅ」である場合、「ｘｘｘ．ｙｙｙ．ｊｐ」の部分が通信先ＦＱＤＮで「／ａ．ｅｘｅ」の部分がＵＲＬパスとなる。以下では、１件のログ情報には１つの通信（例えば、１つの通信セッションが開始されてから終了するまでの通信）に関する情報が含まれるものとして説明する。異常判定システム１は、例えば、異常のようなログ情報に基づき、通信元のマルウェア感染や情報漏洩等の異常を検出するものとして説明する。 The log information output by such a proxy server for each communication includes, for example, the FQDN for specifying the host of the communication destination in the communication (hereinafter referred to as "communication destination FQDN"), the data transmitted in the communication amount (hereinafter referred to as "transmission byte count"), a path portion (hereinafter referred to as "URL path") obtained by excluding the FQDN from the URL used at the communication source in the communication, and the like. For example, if the entire URL is "xxx.yyy.jp/a.exe", the "xxx.yyy.jp" part is the communication destination FQDN and the "/a.exe" part is the URL path. In the following description, it is assumed that one piece of log information includes information about one communication (for example, communication from the start to the end of one communication session). The anomaly determination system 1 will be described as a system that detects an anomaly such as a malware infection of a communication source or information leakage based on log information such as an anomaly.

以下では、ログ情報を構成する各項目（例えば、上記の「通信先ＦＱＤＮ」、「ＵＲＬパス」、「送信バイト数」等）をそれぞれ「フィールド」とも呼ぶものとする。 Hereinafter, each item (for example, the above-mentioned "communication destination FQDN", "URL path", "transmission byte count", etc.) constituting the log information is also referred to as a "field".

次に、異常判定システム１の内部構成について説明する。 Next, the internal configuration of the abnormality determination system 1 will be described.

異常判定システム１は、検知処理部１０、特徴量処理部２０、判定部３０、判定結果処理部４０、フィルタ条件生成部５０、判定器作成部６０、及び制御部７０を有している。なお、検知処理部１０は、検知エンジン部１１及びフィルタ処理部１２を有している。 The abnormality determination system 1 has a detection processing unit 10 , a feature amount processing unit 20 , a determination unit 30 , a determination result processing unit 40 , a filter condition generation unit 50 , a determination device generation unit 60 and a control unit 70 . Note that the detection processing unit 10 has a detection engine unit 11 and a filter processing unit 12 .

異常判定システム１は、例えば、コンピュータにプログラム（実施形態に係る異常判定プログラム）をインストールすることにより構築してもよいが、機能的には図１のように示すことができる。 The abnormality determination system 1 may be constructed by, for example, installing a program (an abnormality determination program according to the embodiment) in a computer, and can be functionally shown as shown in FIG.

検知エンジン部１１は、Ｎ個（Ｎは１以上の整数）の検知エンジン１１１（１１１－１～１１１－Ｎ）を有している。また、フィルタ処理部１２は、Ｎ個のフィルタ条件データＤ２（Ｄ２ー１～Ｄ２－Ｎ）を有している。さらに、判定部３０は、判定器３１を有している。さらにまた、フィルタ条件生成部５０は、フィルタ条件生成定義情報５１を保持している。 The detection engine unit 11 has N (N is an integer equal to or greater than 1) detection engines 111 (111-1 to 111-N). The filtering unit 12 also has N pieces of filter condition data D2 (D2-1 to D2-N). Furthermore, the determination unit 30 has a determination device 31 . Furthermore, the filter condition generator 50 holds filter condition generation definition information 51 .

検知エンジン部１１は、ログ情報から、一つの観点（それぞれ異なる観点）で異常検知するＮ個の検知エンジン１１１（１１１－１～１１１－Ｎ）を管理する。各検知エンジン１１１は、ログ情報を所定の観点に関する異常の検知を行い、その検知結果を出力する。以下では、検知エンジン１１１－１～１１１－Ｎの検知結果を、それぞれ検知結果Ｄ１－１～Ｄ１－Ｎと表す。 The detection engine unit 11 manages N detection engines 111 (111-1 to 111-N) that detect anomalies from one viewpoint (different viewpoints) from log information. Each detection engine 111 detects an abnormality in the log information with respect to a predetermined viewpoint, and outputs the detection result. The detection results of the detection engines 111-1 to 111-N are hereinafter referred to as detection results D1-1 to D1-N, respectively.

この実施形態では、各検知エンジン１１１に対して識別子として検知エンジン名が付与されているものとする。各検知エンジン１１１に対する識別子はＩＤ番号でもよいし所定の文字列でも良い。この実施形態では、各検知エンジン１１１には、当該検知エンジン１１１がログ情報に対して検知する基準（検知方法）を示す文字列（タイトルの文字列）であるものとして説明する。 In this embodiment, it is assumed that each detection engine 111 is given a detection engine name as an identifier. An identifier for each detection engine 111 may be an ID number or a predetermined character string. In this embodiment, it is assumed that each detection engine 111 has a character string (title character string) indicating a criterion (detection method) for the detection engine 111 to detect log information.

なお、この実施形態では、検知エンジン１１１－１の検知エンジン名を「URLパスが疑わしい」、検知エンジン１１１－２の検知エンジン名を「実行形式ファイルダウンロード」、検知エンジン１１１－３の検知エンジン名を「アップロードサイズが大きい」であるものとする。 In this embodiment, the detection engine name of the detection engine 111-1 is "URL path is suspicious", the detection engine name of the detection engine 111-2 is "executable file download", and the detection engine name of the detection engine 111-3 is be "large upload size".

「ＵＲＬパスが疑わしい」は、例えば、悪性サイトが利用するディレクトリ構造のパターンの検知を意味するものとする。また、「実行形式ファイルダウンロード」は、実行形式のファイル（例えば、実行可能な形式の拡張子が付与されたファイル）がダウンロードされたことの検知を意味するものとする。実行可能な形式の拡張子が付与されたファイルとは、例えば、「ｅｘｅ」、「ｄｏｃｘ」等のコンピュータ上でダウンロード後に単独又は対応するアプリケーション上で実行可能な形式の拡張子が付与されたファイルを示すものとする。さらに、「アップロードサイズが大きい」は通信元（クライアント）からインターネット側へアップロードされるデータ量（１つの通信セッションでアップロードされるデータサイズ）が大きい（例えば、所定サイズよりも大きいか否か）ことの検知を意味するものとする。 "URL path is suspicious" means, for example, the detection of a directory structure pattern used by a malicious site. Also, "executable file download" means detection that an executable file (for example, a file with an executable file extension) has been downloaded. A file with an executable format extension is, for example, a file with an executable format extension such as "exe" or "docx" after being downloaded on a computer or on a corresponding application. shall indicate Furthermore, "large upload size" means that the amount of data uploaded from the communication source (client) to the Internet side (data size uploaded in one communication session) is large (for example, whether it is larger than a predetermined size). shall mean the detection of

フィルタ処理部１２は、それぞれの検知エンジン１１１（１１１－１～１１１－Ｎ）の検知結果Ｄ１（Ｄ１－１～Ｄ１－Ｎ）に対するフィルタ条件が記述されたフィルタ条件データＤ２（Ｄ２－１～Ｄ２－Ｎ）を管理している。そして、フィルタ処理部１２は、それぞれの検知結果Ｄ１（Ｄ１－１～Ｄ１－Ｎ）に対して、フィルタ条件データＤ２（Ｄ２－１～Ｄ２－Ｎ）に基づくフィルタ処理を施した内容としてフィルタ処理済検知結果Ｄ３（Ｄ３－１～Ｄ３－Ｎ）を出力する。例えば、フィルタ処理部１２は、検知エンジン１１１－ｉ（ｉは１～Ｎのいずれかの整数）の検知結果Ｄ１－ｉに対して、フィルタ条件データＤ２－ｉに基づくフィルタ処理を実行し、フィルタ処理済検知結果Ｄ３－ｉを得る。フィルタ処理部１２の詳細については後述する。 The filter processing unit 12 generates filter condition data D2 (D2-1 to D2 -N). Then, the filter processing unit 12 filters the detection results D1 (D1-1 to D1-N) based on the filter condition data D2 (D2-1 to D2-N). A completed detection result D3 (D3-1 to D3-N) is output. For example, the filter processing unit 12 performs filtering based on the filter condition data D2-i on the detection result D1-i of the detection engine 111-i (where i is an integer from 1 to N). A processed detection result D3-i is obtained. Details of the filter processing unit 12 will be described later.

特徴量処理部２０は、検知エンジン部１１による検知結果（フィルタ条件データＤ２－１～Ｄ２－Ｎ）を用いて、機械学習処理に適用可能な特徴量を表すデータ（以下、「特徴量データ」と呼ぶ）を生成する処理を行う。特徴量データの形式（例えば、ベクトルの形式や次元数等）は限定されないものである。ここでは、特徴量データは、例えば、各検知エンジン１１１の検知結果を一つの特徴量として表したものとする。例えば、各検知エンジン１１１が、０～１００点の数値スコアで検知結果を出力する場合、特徴量データは、図２に示すように検知エンジン単位に数値スコアを並べたものとしてもよい。 The feature amount processing unit 20 uses the detection results (filter condition data D2-1 to D2-N) by the detection engine unit 11 to generate data representing feature amounts applicable to machine learning processing (hereinafter referred to as "feature amount data"). ) is generated. The format of the feature amount data (for example, vector format, number of dimensions, etc.) is not limited. Here, it is assumed that the feature amount data represents, for example, the detection result of each detection engine 111 as one feature amount. For example, when each detection engine 111 outputs a detection result with a numerical score of 0 to 100 points, the feature amount data may be numerical scores arranged for each detection engine as shown in FIG.

図２は、特徴量データの構成例について示した図である。 FIG. 2 is a diagram showing a configuration example of feature amount data.

この実施形態では、特徴量データは、図２に示すように、Ｎ次元のベクトルデータ（Ｎ個のパラメータで構成されるベクトルデータ）であるものとして説明する。以下では、特徴量データにおいて、フィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎ（検知エンジン１１１－１～１１１－Ｎ）に対応するパラメータ（特徴量データを構成するパラメータ）を、それぞれ「Ｘ１、ＸＮ、・・・、ＸＮ」と表すものとする。そうすると、特徴量データＶは以下の（１）式のように示すことができる。
Ｖ＝｛Ｘ１、Ｘ２、・・・、ＸＮ｝ …（１） In this embodiment, as shown in FIG. 2, feature amount data is N-dimensional vector data (vector data composed of N parameters). In the following, in the feature amount data, the parameters (parameters constituting the feature amount data) corresponding to the filtered detection results D3-1 to D3-N (detection engines 111-1 to 111-N) are respectively defined as "X1, XN, . . . , XN”. Then, the feature amount data V can be represented by the following equation (1).
V={X1, X2, . . . , XN} (1)

以下では、Ｘ１～ＸＮは、それぞれ異常度合を０～１００の数値（大きいほど異常度合が大きいことを示す数値）で表されたパラメータであるものとして説明する。この実施形態の例では、検知エンジン１１１－１～１１１－Ｎが、それぞれ検知結果として異常度合を０～１００の数値を出力し、フィルタ処理部１２においてフィルタ条件に応じたフィルタ処理（例えば、フィルタ条件に該当するパラメータを０に変更する処理等）が行われるものとして説明する。なお、以下では、検知結果Ｄ１－１～Ｄ１－Ｎ（フィルタ処理部１２によりフィルタ処理される前の検知結果）を、それぞれＹ１～ＹＮと表すものとする。 In the following explanation, X1 to XN are parameters representing the degrees of abnormality respectively with numerical values from 0 to 100 (the larger the numerical value, the greater the degree of abnormality). In the example of this embodiment, the detection engines 111-1 to 111-N each output a numerical value of 0 to 100 indicating the degree of abnormality as the detection result, and the filter processing unit 12 performs filtering according to the filter conditions (for example, filter It is assumed that processing such as changing the parameter corresponding to the condition to 0) is performed. In the following description, detection results D1-1 to D1-N (detection results before being filtered by the filtering unit 12) are denoted by Y1 to YN, respectively.

図２では、１行で１つの特徴量データＶ（Ｘ１、Ｘ２、・・・ＸＮ）について図示している。図２では、各行に時系列を示すインデックス（１、２、３、・・・）を付している。図２では、インデックスは、解析対象（ログ情報）と対応づけられた識別子となっている。この実施形態では、説明を簡易とするため、解析対象（ログ情報）、特徴量データ、及び判定結果について対応付けて管理するための識別子として上記のインデックス（インデックス番号）を用いるものとして説明するが、その他の識別子（例えば、タイムスタンプ等の他の識別子）を用いるようにしてもよい。 In FIG. 2, one line indicates one feature amount data V (X1, X2, . . . XN). In FIG. 2, each row is given an index (1, 2, 3, . . . ) indicating a time series. In FIG. 2, the index is an identifier associated with an analysis target (log information). In this embodiment, in order to simplify the explanation, it is assumed that the above index (index number) is used as an identifier for managing the analysis target (log information), the feature amount data, and the determination result in association with each other. , other identifiers (eg, other identifiers such as timestamps) may be used.

判定部３０は、特徴量処理部２０から供給された特徴量データＶに対して総合的な異常の判定処理を行う。 The determination unit 30 performs comprehensive abnormality determination processing on the feature amount data V supplied from the feature amount processing unit 20 .

なお、以下では、判定部３０による異常の判定処理を単に「判定処理」又は「異常判定処理」と呼び、判定部３０による異常の判定処理の結果を単に「判定結果」又は「異常判定結果」と呼ぶものとする。 Hereinafter, the abnormality determination processing by the determination unit 30 is simply referred to as “determination processing” or “abnormality determination processing”, and the result of the abnormality determination processing by the determination unit 30 is simply “determination result” or “abnormality determination result”. shall be called

判定部３０が出力する判定結果の形式については限定されないものである。この実施形態の例では、判定部３０は、例えば、判定結果として、正常又は異常（異常の有り又は異常の無し）を２値（例えば、正常を表す０又は異常を表す１のいずれか）で表したデータを出力するようにしてもよいし、異常度の度合を表す数値スコア（例えば、０～１００のいずれかの値）を出力するようにしてもよい。この実施形態では、判定部３０は、判定結果として異常度を表す０～１００のいずれかの値（大きいほど異常度が高いことを示す値）を出力するものとして説明する。 The format of the determination result output by the determination unit 30 is not limited. In the example of this embodiment, the determination unit 30, for example, determines normality or abnormality (abnormality or absence of abnormality) as a binary value (for example, either 0 representing normality or 1 representing abnormality). The represented data may be output, or a numerical score (for example, any value from 0 to 100) representing the degree of abnormality may be output. In this embodiment, the determination unit 30 will be described as outputting a value between 0 and 100 representing the degree of abnormality as a determination result (the larger the value, the higher the degree of abnormality).

この実施形態において、判定部３０は、判定器３１を用いて判定処理を行うものとする。判定部３０は、判定器３１を入力して得られる結果を判定結果として出力する。判定器３１は、予め教師データとしての特徴量データと、当該教師データ（特徴量データ）に対応する教師ラベル（正解データ）とを含むデータ（以下、「教師データセット」と呼ぶ）に基づいた学習処理により取得された学習モデルを用いて判定処理を行う。この実施形態では、判定器３１は、判定器作成部６０の制御に応じて学習処理を行い、学習モデルを取得するものとする。 In this embodiment, the determination unit 30 uses the determiner 31 to perform determination processing. The determination unit 30 outputs a result obtained by inputting the determination device 31 as a determination result. The decision device 31 is based on data (hereinafter referred to as a "teacher data set") including feature data as teacher data and teacher labels (correct data) corresponding to the teacher data (feature data). A determination process is performed using the learning model acquired by the learning process. In this embodiment, the determiner 31 performs learning processing under the control of the determiner generator 60 and acquires a learning model.

判定器３１は、複数の教師データセットが供給されると、供給された教師データセットに基づいて学習処理を行って学習モデルを得る。また、判定器３１は、だけが供給されると、取得した学習モデルを用いて、に対する判定処理を行う。判定器３１に適用する機械学習器についても限定はされないものであるが、例えば、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）やＲａｎｄｏｍＦｏｒｅｓｔなどが適用し得る。 When a plurality of teacher data sets are supplied, the determiner 31 performs learning processing based on the supplied teacher data sets to obtain a learning model. Also, when only is supplied, the determiner 31 performs determination processing for using the acquired learning model. The machine learning device applied to the determination device 31 is not limited, but for example, SVM (Support Vector Machine), Random Forest, etc. can be applied.

教師データセットにおける教師ラベルの取得方法については限定しないが、例えば、過去に実際にあった攻撃通信に係るログ情報の特徴量データに異常の教師ラベルを適用するようにしてもよい。また、例えば、過去においていずれの検知エンジン１１１の判定結果（Ｙ１、Ｙ２、・・・、ＹＮ）についても一定値以下であるようなログ情報に対応する特徴量データについて正常（例えば、一定値以下（例えば０）の数値スコアの判定結果）の教師ラベルを適用するようにしてもよい。 Although the method of acquiring the teacher label in the teacher data set is not limited, for example, the abnormal teacher label may be applied to the feature data of the log information related to the attack communication that actually occurred in the past. Further, for example, the feature quantity data corresponding to the log information for which the judgment results (Y1, Y2, . (for example, a numerical score judgment result of 0)) may be applied.

判定結果処理部４０は、判定部３０から供給された判定結果を出力する。また、判定結果処理部４０は、出力した判定結果に対するフィードバックの情報入力をオペレータＯＰから受け付ける処理も行うものとする。判定結果処理部４０により、判定結果を出力する手段や出力するデータ形式については限定されないものである。判定結果処理部４０は、例えば、通信により他の装置に判定結果を出力するようにしてもよいし、所定のデータ記録媒体に判定結果を記録するようにしてもよい。また、判定結果処理部４０は、判定結果と共に関連するデータ（以下、「関連データ」と呼ぶ）を付加して出力するようにしてもよい。関連データには、例えば、当該判定結果に対応するログ情報等、当該判定結果に関する情報を含むようにしてもよい。この実施形態では、判定結果処理部４０は、判定結果を関連データと共に、オペレータＯＰが使用する監視端末ＴＥに供給（通信により送信）するものとする。 The determination result processing section 40 outputs the determination result supplied from the determination section 30 . In addition, the determination result processing unit 40 also performs a process of receiving feedback information input from the operator OP with respect to the output determination result. The means for outputting the determination result and the output data format by the determination result processing unit 40 are not limited. For example, the determination result processing section 40 may output the determination result to another device through communication, or may record the determination result on a predetermined data recording medium. In addition, the determination result processing unit 40 may add data related to the determination result (hereinafter referred to as “related data”) and output the determination result. The related data may include information related to the determination result, such as log information corresponding to the determination result. In this embodiment, the determination result processing unit 40 supplies (transmits by communication) the determination result together with related data to the monitoring terminal TE used by the operator OP.

判定結果処理部４０が、判定結果を監視端末ＴＥに提示する際の画面（例えば、Ｗｅｂページ等の操作画面；以下、「判定結果表示画面」と呼ぶ）の構成については限定されないものであるが、例えば、異常と判定されたログ情報ごとに、判定結果を表す数値スコア、各検知エンジン１１１の検知結果（Ｘ１～ＸＮ）を表示する構成としてもよい。 The configuration of the screen (for example, an operation screen such as a web page; hereinafter referred to as a "determination result display screen") when the determination result processing unit 40 presents the determination result to the monitoring terminal TE is not limited. For example, for each piece of log information determined to be abnormal, a numerical score representing the determination result and detection results (X1 to XN) of each detection engine 111 may be displayed.

図３は、判定結果表示画面の構成例について示した図である。 FIG. 3 is a diagram showing a configuration example of a determination result display screen.

図３に示す判定結果表示画面では、ログ情報ごとに、インデックス、判定結果（数値スコア）、及び各検知エンジン１１１の検知結果（Ｘ１～ＸＮ）をテーブル形式で表示している。図３に示す判定結果表示画面では、１行で１つのログ情報に関する情報を表示している。 On the determination result display screen shown in FIG. 3, the index, determination result (numerical score), and detection results (X1 to XN) of each detection engine 111 are displayed in a table format for each piece of log information. In the determination result display screen shown in FIG. 3, one line displays information about one piece of log information.

ログ情報は、複数項目（以下、これらの項目を「フィールド」と呼ぶ）の情報により構成されている。 The log information is composed of information of a plurality of items (these items are hereinafter referred to as "fields").

また、判定結果処理部４０は、オペレータＯＰから、誤判定された判定結果（例えば、異常判定されたログ情報のうちオペレータＯＰが誤判定と判断したログ情報；以下、「誤判定結果」と呼ぶ）の内容についてフィードバック（以下、「誤判定フィードバック」と呼ぶ）を受け付ける手段を備えているものとする。なお、以下では、誤判定フィードバックにおいて指定されたログ情報を「誤判定ログ」と呼ぶものとする。 In addition, the determination result processing unit 40 receives an erroneous determination result from the operator OP (for example, log information determined to be an erroneous determination by the operator OP among the log information determined to be abnormal; hereinafter referred to as an “erroneous determination result”). ) (hereafter referred to as “erroneous judgment feedback”). Note that the log information specified in the erroneous determination feedback is hereinafter referred to as an "erroneous determination log".

判定結果処理部４０において、誤判定結果を受け付ける方法については限定されないものである。判定結果処理部４０は、例えば、判定結果表示画面において、オペレータＯＰから、誤判定ログ情報（異常判定されたログ情報のうちオペレータＯＰが誤判定と判断したログ情報）を特定する操作を受け付けることができるようにしてもよい。例えば、判定結果表示画面において、選択操作（例えば、マウスによるクリック操作やポインティングデバイスによる選択操作）された行に対応するログ情報を誤判定ログとして受け付けるようにしてもよい。例えば、図３に示す判定結果表示画面において、インデックスを１とするログ情報の行（破線により囲われた領域Ａ１０１）について選択操作された場合、判定結果処理部４０は、当該ログ情報を誤判定ログと受け付けるようにしてもよい。同様に、判定結果処理部４０は、インデックスを２とするログ情報の行（破線により囲われた領域Ａ１０２）について選択操作された場合、当該ログ情報を誤判定ログと受け付けるようにしてもよい。 In the determination result processing section 40, the method of receiving the erroneous determination result is not limited. For example, the determination result processing unit 40 accepts an operation of specifying erroneous determination log information (log information determined to be erroneous by the operator OP among log information determined to be abnormal) from the operator OP on the determination result display screen. may be made possible. For example, on the determination result display screen, log information corresponding to a row selected by a selection operation (for example, a click operation using a mouse or a selection operation using a pointing device) may be accepted as an erroneous determination log. For example, in the determination result display screen shown in FIG. 3, when a selection operation is performed on a line of log information with an index of 1 (area A101 surrounded by a dashed line), the determination result processing unit 40 erroneously determines the log information. It may be accepted as a log. Similarly, when a row of log information with an index of 2 (area A102 surrounded by a dashed line) is selected, the determination result processing unit 40 may accept the log information as an erroneous determination log.

判定器作成部６０は、判定器３１による学習処理を制御するものである。判定器作成部６０は、教師データセットを蓄積し、蓄積した教師データセットを判定器３１に供給して学習処理（機械学習処理）を実行させ、判定器３１に判定に必要な学習モデルを保持させる。判定器作成部６０で保持する教師用データセットは、オペレータＯＰ（監視端末ＴＥ）により付与された教師ラベル（異常判定システム１が出力した判定結果に対するフィードバック）に基づいた教師データセットとしてもよいし、ユーザや設計者等により疑似的に作成されたログ（例えば、一定時間内に特定の種類のネットワーク装置で発生し得るログ）としてもよい。具体的には、例えば、判定器作成部６０は、過去に実際にあった攻撃的な通信のログ情報（例えば、過去に異常判定システム１で処理したログ情報）に基づく特徴量データを教師データとし、それらの教師データに対して異常を示す判定結果（例えば、オペレータによりフィードバックされた値、又は異常を示す所定の値）の教師ラベル（正解ラベル）を付与した教師データセットを保持するようにしてもよい。また、例えば、判定器作成部６０は、過去に実際にあった正常時な通信のログ情報（例えば、いずれの検知エンジン１１１のでもパラメータが一定値以下であるようなログ情報）の特徴量データを教師データとし、それらの教師データに対して正常を示す判定結果（例えば、オペレータによりフィードバックされた値、又は正常を示す所定の値）の教師ラベル（正解ラベル）を付与した教師データセットを保持するようにしてもよい。 The determiner creation unit 60 controls learning processing by the determiner 31 . The determiner creation unit 60 accumulates a teacher data set, supplies the accumulated teacher data set to the determiner 31 to execute learning processing (machine learning processing), and holds a learning model necessary for determination in the determiner 31. Let The teacher data set held by the determiner creation unit 60 may be a teacher data set based on a teacher label (feedback on the determination result output by the abnormality determination system 1) given by the operator OP (monitoring terminal TE). , a log created artificially by a user or a designer (for example, a log that can occur in a specific type of network device within a certain period of time). Specifically, for example, the determiner creation unit 60 converts feature amount data based on log information of past actual offensive communications (for example, log information processed by the abnormality determination system 1 in the past) into teacher data. Then, a teacher data set is held in which a teacher label (correct label) of a judgment result indicating an abnormality (for example, a value fed back by an operator or a predetermined value indicating an abnormality) is assigned to the teacher data. may In addition, for example, the determiner creation unit 60 generates feature amount data of log information of normal communication that actually occurred in the past (for example, log information in which parameters are equal to or less than a certain value in any detection engine 111). is used as teacher data, and a teacher data set with a teacher label (correct label) of a judgment result indicating normality (for example, a value fed back by an operator or a predetermined value indicating normality) is attached to the teacher data. You may make it

判定器作成部６０に教師データセットが供給されるタイミングについても限定されないものである。例えば、判定結果処理部４０では、オペレータＯＰ（監視端末ＴＥ）により手動で教師データセットの供給を受けるようにしてもよいし、判定結果処理部４０からオペレータＯＰ（監視端末ＴＥ）の操作に基づいたフィードバック（判定結果に対する教師ラベルと、当該教師ラベルに対応する特徴量データを含む教師データセット）を受けて蓄積するようにしてもよい。 The timing at which the teacher data set is supplied to the determiner generator 60 is also not limited. For example, the determination result processing unit 40 may receive the training data set manually by the operator OP (monitoring terminal TE), or the determination result processing unit 40 may receive the training data set based on the operation of the operator OP (monitoring terminal TE). It is also possible to receive and accumulate such feedback (a teacher label for the determination result and a teacher data set including feature amount data corresponding to the teacher label).

また、判定器作成部６０が判定器３１に学習処理させるタイミングについても限定されないものである。例えば、判定器作成部６０は、異常判定システム１の運用開始時にだけ判定器３１に学習処理を実行させるようにしてもよいし、定期又は不定期の間隔で判定器３１に学習処理を実行させるようにしてもよい。例えば、判定器作成部６０は、蓄積する学習用データセットが一定量追加されたタイミングで判定器３１に学習処理を実行させるようにしてもよい。 Also, the timing at which the determiner creating unit 60 causes the determiner 31 to perform learning processing is not limited. For example, the determiner creation unit 60 may cause the determiner 31 to perform the learning process only when the abnormality determination system 1 starts operating, or may cause the determiner 31 to perform the learning process at regular or irregular intervals. You may do so. For example, the determiner creation unit 60 may cause the determiner 31 to perform the learning process at the timing when a certain amount of accumulated learning data sets is added.

次に、フィルタ条件生成部５０の詳細構成について説明する。 Next, a detailed configuration of the filter condition generator 50 will be described.

フィルタ条件生成部５０は、誤判定フィードバックされたログ情報及び当該ログ情報に関連する情報（各検知エンジン１１１の検知結果、判定結果の数値スコア等）に基づいて、誤判定に寄与した検知エンジン１１１の検知結果に適用するフィルタ条件データＤ２を生成する処理（フィルタ条件生成処理）を行う。 The filter condition generation unit 50 selects the detection engine 111 that contributed to the erroneous judgment based on the log information fed back to the erroneous judgment and the information related to the log information (the detection result of each detection engine 111, the numerical score of the judgment result, etc.). A process (filter condition generation process) for generating filter condition data D2 to be applied to the detection result of is performed.

フィルタ条件生成部５０は、まず誤判定フィードバックされたログ情報の特徴量データから、どの検知エンジン１１１が当該ログ情報に対して異常を検知したかを判断する。次に、フィルタ条件生成部５０は、フィルタ条件生成定義情報５１から当該検知エンジン１１１のエントリを取得し、フィールド列に記載のフィールド名から当該ログのフィールドの値を取得し、値条件列に記載の条件式でフィルタ条件を作成する。フィルタ条件生成部５０は、生成したフィルタ条件を対応する検知エンジン１１１のフィルタ条件リストに追加するよう検知エンジン１１１部に要求する。フィルタ条件生成部５０は、誤判定ログと指定されたログ情報（例えば、判定結果表示画面で誤判定ログとして選択されたログ情報；以下、「指定ログ情報」と呼ぶ）に対応する特徴量データを参照し、指定ログ情報を異常として検知した検知エンジン１１１の検知結果を除外する（例えば、異常でないという検知結果に相当する値に変更する）ためのフィルタ条件を生成して、当該検知エンジン１１１に対応するフィルタ条件データＤ２に反映させる。 The filter condition generation unit 50 first determines which detection engine 111 has detected an abnormality in the log information based on the feature amount data of the log information fed back as an erroneous determination. Next, the filter condition generation unit 50 acquires the entry of the detection engine 111 from the filter condition generation definition information 51, acquires the field value of the log from the field name described in the field column, and describes it in the value condition column. Create a filter condition with a conditional expression in . The filter condition generating unit 50 requests the detection engine 111 unit to add the generated filter condition to the corresponding filter condition list of the detection engine 111 . The filter condition generation unit 50 generates feature amount data corresponding to log information designated as an erroneous determination log (for example, log information selected as an erroneous determination log on the determination result display screen; hereinafter referred to as "designated log information"). , generates a filter condition for excluding the detection result of the detection engine 111 that detected the specified log information as abnormal (for example, changing it to a value corresponding to the detection result that it is not abnormal), and generates a filter condition for the detection engine 111 is reflected in the filter condition data D2 corresponding to .

図４は、フィルタ条件生成定義情報５１の構成例について示した図である。 FIG. 4 is a diagram showing a configuration example of the filter condition generation definition information 51. As shown in FIG.

図４に示すようにフィルタ条件生成定義情報５１では、１行で１つのエントリ（フィルタを生成する条件を定義した情報）が記述されている。図４に示すようにフィルタ条件生成定義情報５１では、エントリごとに、検知エンジン１１１を識別するための「検知エンジン名」、ログ情報において当該検知エンジンが検査する項目を示す「フィールド名」、及び「値条件」の情報が登録されている。本明細書では、フィルタ条件生成定義情報５１については、説明を簡易とするため図４のような表形式で示しているが、フィルタ条件生成定義情報５１の形式は限定されないものである。例えば、フィルタ条件生成定義情報５１は、ＪＳＯＮ（ＪａｖａＳｃｒｉｐｔＯｂｊｅｃｔＮｏｔａｔｉｏｎ）やＸＭＬ（ｅＸｔｅｎｓｉｂｌｅＭａｒｋｕｐＬａｎｇｕａｇｅ）等によりフィルタ条件の生成ルールが設定される形式としてもよい。 As shown in FIG. 4, in the filter condition generation definition information 51, one entry (information defining conditions for generating a filter) is described in one line. As shown in FIG. 4, in the filter condition generation definition information 51, for each entry, a "detection engine name" for identifying the detection engine 111, a "field name" indicating an item to be inspected by the detection engine in the log information, and "Value condition" information is registered. In this specification, the filter condition generation definition information 51 is shown in a tabular form as shown in FIG. 4 to simplify the explanation, but the format of the filter condition generation definition information 51 is not limited. For example, the filter condition generation definition information 51 may have a format in which filter condition generation rules are set by JSON (Javascript Object Notation), XML (eXtensible Markup Language), or the like.

「値条件」の項目は、フィールド名の項目に適用する抽出条件を示している。例えば、フィールド名で指定された項目が文字列形式（例えば、パス、通信先ＦＱＤＮ、ファイルの拡張子等）である場合には、値条件に、指定ログ情報とフィールド名の項目で文字列が一致するログ情報を抽出することを示す「一致」が設定される。また、例えば、フィールド名で指定された項目が数値形式（例えば、送信バイト数等）である場合には、値条件に、指定ログ情報のフィールド名の項目の値以下のログ情報を抽出することを示す「以下」や、指定ログ情報のフィールド名の項目の値以上のログ情報を抽出することを示す「以上」等を設定することができる。 The "value condition" item indicates an extraction condition to be applied to the field name item. For example, if the item specified by the field name is in character string format (e.g., path, communication destination FQDN, file extension, etc.), the specified log information and field name items are "Match" is set to indicate that matching log information is to be extracted. Also, for example, if the item specified by the field name is in a numerical format (e.g., the number of bytes sent, etc.), the value condition is to extract log information that is less than or equal to the value of the item of the field name of the specified log information. or "greater than or equal to" indicating that the log information equal to or greater than the value of the field name item of the specified log information is to be extracted.

図１では１～３行目のエントリの検知エンジン名は、それぞれ「ＵＲＬパスが疑わしい」、「実行形式ファイルダウンロード」、「アップロードサイズが大きい」となっている。 In FIG. 1, the detection engine names of the entries on the 1st to 3rd lines are "URL path is suspicious", "executable file download", and "upload size is large", respectively.

フィルタ処理部１２は、フィルタ条件生成定義情報５１の検知エンジン名の列からフィルタ条件を生成する検知エンジン１１１を検索し、フィールド名の列から誤判定ログの当該フィールドの値を取得し、値条件の列に記載の条件式を適用することで行う。 The filter processing unit 12 searches the detection engine 111 that generates the filter condition from the detection engine name column of the filter condition generation definition information 51, acquires the value of the field of the misjudgment log from the field name column, and sets the value condition. This is done by applying the conditional expression described in the column.

この場合、１行目のエントリは、検知エンジン名が「ＵＲＬパスが疑わしい」であり、フィールド名が「ＵＲＬパス」であり、値条件が「一致」である。この場合、１行目のエントリでは、「ＵＲＬパスが疑わしい」という検知エンジン１１１（この場合は、検知エンジン１１１－１）に対して、誤判定ログのＵＲＬパスのフィールドの値と一致するログ情報をフィルタするフィルタ条件が生成されることを示している。 In this case, the entry in the first row has the detection engine name "URL path is suspect", the field name "URL path", and the value condition "match". In this case, in the entry on the first line, for the detection engine 111 (in this case, the detection engine 111-1) "the URL path is suspicious", log information that matches the value of the URL path field of the misjudgment log indicates that a filter condition is generated to filter the .

また、２行目のエントリは、検知エンジン名が「実行形式ファイルダウンロード」であり、フィールド名に「通信先ＦＱＤＮ」と「パスのファイル拡張子」の２つが設定され、値条件が「一致」となっている。この場合、２行目のエントリでは、誤判定ログの「通信先ＦＱＤＮ」と「パスのファイル拡張子」の両方のフィールドの値が一致するログをフィルタするフィルタ条件が生成されることを示している。 In the entry on the second line, the detection engine name is "executable file download", two field names, "communication destination FQDN" and "path file extension" are set, and the value condition is "match". It has become. In this case, the entry in the second line indicates that a filter condition will be generated to filter the false positive logs with matching values in both the Destination FQDN and Path File Extension fields. there is

さらに、３行目のエントリは、検知エンジン名が「アップロードサイズが大きい」であり、フィールド名が「送信バイト数」であり、値条件が「以下」である。この場合、３行目のエントリでは、誤判定ログの送信バイト数フィールドの値以下のログ情報をフィルタするフィルタ条件が生成されることを意味している。 Furthermore, the entry in the third line has the detection engine name "large upload size", the field name "number of bytes sent", and the value condition "less than or equal to". In this case, the entry on the third line means that a filter condition for filtering log information equal to or less than the value of the number of bytes sent field of the erroneous determination log is generated.

以上のように、異常判定システム１では、複数の検知エンジン１１１の検知結果をまとめて特徴量データとし、異常パターンを学習させた判定部３０（判定器３１）で総合的に異常か否かを判定して出力する。そして、異常判定システム１では、誤判定フィードバック時に異常検知に寄与した検知エンジン１１１に対して誤判定ログを検知しない処理（例えば、異常の検知結果を正常に変更する処理）ためのフィルタ条件を生成する仕組みを提供する。具体的には、異常判定システム１では、フィルタ条件生成部５０が、誤判定ログに対して各検知エンジン１１１がどのようなフィルタ条件を生成すればよいかを記載したフィルタ条件生成定義情報５１を保持し、誤判定フィードバックがあったときに、フィルタ条件生成定義情報５１を参照して各検知エンジン１１１のフィルタ条件（フィルタ条件データＤ２）を生成する。 As described above, in the abnormality determination system 1, the detection results of the plurality of detection engines 111 are collected as feature amount data, and the determination unit 30 (determiner 31) that learns the abnormality pattern comprehensively determines whether or not there is an abnormality. judge and output. Then, the abnormality determination system 1 generates a filter condition for processing not to detect an erroneous determination log (for example, processing for changing the abnormality detection result to normal) for the detection engine 111 that contributed to the abnormality detection at the time of the erroneous determination feedback. provide a mechanism to Specifically, in the abnormality determination system 1, the filter condition generation unit 50 generates filter condition generation definition information 51 describing what kind of filter condition each detection engine 111 should generate for the erroneous determination log. When there is erroneous determination feedback, the filter condition generation definition information 51 is referenced to generate filter conditions (filter condition data D2) for each detection engine 111 .

（Ａ－２）第１の実施形態の動作
次に、以上のような構成を有する第１の実施形態の異常判定システム１の動作（実施形態に係る異常判定方法）を説明する。 (A-2) Operation of First Embodiment Next, the operation (abnormality determination method according to the embodiment) of the abnormality determination system 1 of the first embodiment having the configuration as described above will be described.

まず、検知処理部１０の動作の詳細について図５を用いて説明する。 First, details of the operation of the detection processing unit 10 will be described with reference to FIG.

図５は、検知処理部１０の動作について示したフローチャートである。 FIG. 5 is a flow chart showing the operation of the detection processing unit 10. As shown in FIG.

検知処理部１０は、制御部７０の制御にしたがったタイミングで、図５のフローチャートの処理を行う。 The detection processing unit 10 performs the processing of the flowchart of FIG. 5 at the timing according to the control of the control unit 70 .

検知処理部１０では、解析対象のログ情報が供給されると（Ｓ１０１）、検知エンジン部１１の検知エンジン１１１（１１１－１～１１１－Ｎ）により当該ログ情報に対する検知処理が行われ、検知結果Ｄ１（Ｄ１－１～Ｄ１－Ｎ）が取得される（Ｓ１０２）。 In the detection processing unit 10, when log information to be analyzed is supplied (S101), detection processing is performed on the log information by the detection engines 111 (111-1 to 111-N) of the detection engine unit 11, and detection results are obtained. D1 (D1-1 to D1-N) is acquired (S102).

各検知エンジン１１１（１１１－１～１１１－Ｎ）は、それぞれの異常検知の観点で異常と判断したログ情報とその数値スコアを出力する。ここで、各検知エンジン１１１において、異常検知の観点は限定しないが、例えば、上記のように「実行形式のファイル拡張子をダウンロード」のようなルールベースの観点や、「通信先へのアップロードサイズが異常に大きい」のような統計値ベースの観点がある。各検知エンジン１１１の詳細構成については、従来技術を適用することができるので詳しい説明を省略する。 Each detection engine 111 (111-1 to 111-N) outputs log information judged to be abnormal from the viewpoint of respective abnormality detection and its numerical score. Here, in each detection engine 111, the viewpoint of anomaly detection is not limited. is abnormally large”. A detailed description of the detailed configuration of each detection engine 111 is omitted because conventional technology can be applied.

検知結果Ｄ１－１～Ｄ１－Ｎは、それぞれフィルタ処理部１２において、フィルタ条件データＤ２－１～Ｄ２－Ｎに基づくフィルタ条件が適用され、フィルタ条件に合致する場合にフィルタ処理される（Ｓ１０３）。 The detection results D1-1 to D1-N are subjected to filter conditions based on the filter condition data D2-1 to D2-N in the filter processing unit 12, respectively, and are filtered when they match the filter conditions (S103). .

フィルタ処理部１２は、検知結果Ｄ１－１～Ｄ１－Ｎに対してフィルタ条件を適用し必要に応じてフィルタ処理した結果としてフィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎを得る。なお、ログ情報がフィルタ条件に合致しない場合は、検知結果Ｄ１－１～Ｄ１－Ｎは特にフィルタ処理されないため、検知結果Ｄ１－１～Ｄ１－Ｎとフィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎは同じ内容となる。 The filter processing unit 12 applies filter conditions to the detection results D1-1 to D1-N, and obtains filtered detection results D3-1 to D3-N as a result of filtering as necessary. Note that if the log information does not match the filter conditions, the detection results D1-1 to D1-N are not particularly filtered. N has the same content.

検知処理部１０は、全ての検知エンジン１１１についてフィルタ処理が完了して、フィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎの取得が完了（Ｓ１０４）すると、取得したフィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎを出力（特徴量処理部２０に供給）する。このとき、検知処理部１０は、フィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎに解析対象のログ情報等を付加して、特徴量処理部２０に供給する。 When the filtering process for all the detection engines 111 is completed and the acquisition of the filtered detection results D3-1 to D3-N is completed (S104), the detection processing unit 10 acquires the acquired filtered detection result D3-1. ˜D3-N are output (supplied to the feature amount processing unit 20). At this time, the detection processing unit 10 adds log information or the like to be analyzed to the filtered detection results D3-1 to D3-N, and supplies them to the feature amount processing unit 20. FIG.

次に、特徴量処理部２０、判定部３０（判定器３１）及び判定器作成部６０の動作の詳細について図６を用いて説明する。 Next, details of operations of the feature amount processing unit 20, the determination unit 30 (determination device 31), and the determination device creation unit 60 will be described with reference to FIG.

図６は、特徴量処理部２０、判定部３０（判定器３１）及び判定器作成部６０の動作の例について示したフローチャートである。 FIG. 6 is a flow chart showing an example of operations of the feature amount processing unit 20, the determination unit 30 (determination unit 31), and the determination unit creation unit 60. FIG.

特徴量処理部２０、判定部３０（判定器３１）及び判定器作成部６０は、制御部７０の制御に従ったタイミング（学習処理又は判定処理が必要となったタイミング）で、図６のフローチャートの処理を行う。制御部７０において、学習処理又は判定処理の制御タイミングについては限定されないものである。例えば、制御部７０は、スケジューリングによって定期的（例えば、１日１回の周期）なタイミングで学習処理を行うと判断してもよいし、オペレータＯＰ（監視端末ＴＥ）からの誤判定フィードバックがあったタイミングで学習処理を行うと判断するようにしてもよい。また、例えば、制御部７０は、検知処理部１０から検知結果（フィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎ）が出力されたことをトリガとし判定処理を行うと判断してもよいし、送信元ＩＰアドレス単位にある期間の検知結果（フィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎ）を集約したものを特徴量データとする場合などは必要な期間の検知結果が蓄積（例えば、判定部３０や制御部７０で蓄積）されたタイミングで、それらの蓄積された検知結果について判定処理を行うと判断するようにしてもよい。 The feature amount processing unit 20, the determining unit 30 (determining device 31), and the determining device creating unit 60 perform the processing according to the flowchart of FIG. process. The control timing of the learning process or the determination process in the control unit 70 is not limited. For example, the control unit 70 may determine that the learning process is to be performed periodically (for example, once a day) by scheduling, or if there is an erroneous determination feedback from the operator OP (monitoring terminal TE). It may be determined that the learning process is to be performed at this timing. Further, for example, the control unit 70 may determine that the output of the detection results (filtered detection results D3-1 to D3-N) from the detection processing unit 10 triggers the determination process, If feature data is a collection of detection results (filtered detection results D3-1 to D3-N) for a certain period for each source IP address, the detection results for the required period are accumulated (for example, judgment At the timing when the detection results are stored in the unit 30 or the control unit 70, it may be determined that the determination process is to be performed on the stored detection results.

特徴量処理部２０は、フィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎが供給されると、フィルタ処理済検知結果Ｄ３－１～Ｄ３－Ｎに基づいて特徴量データを生成し、判定部３０の判定器３１に供給する（Ｓ２０１）。 When the filtered detection results D3-1 to D3-N are supplied, the feature amount processing unit 20 generates feature amount data based on the filtered detection results D3-1 to D3-N, and the determining unit 30 (S201).

ここで、判定部３０（判定器３１）は、制御部７０の制御に従って、学習処理を実行するか否か判断し（Ｓ２０２）、学習処理を実行する場合には後述するステップＳ２０３に移行し、判定処理を実行する場合には後述するステップＳ２０５に移行する。 Here, the determination unit 30 (determiner 31) determines whether or not to execute the learning process according to the control of the control unit 70 (S202). When executing the determination process, the process proceeds to step S205, which will be described later.

判定部３０は、学習処理を実行する場合、蓄積された教師データセットを保持し（Ｓ２０３）、保持した教師データセットを用いて判定器３１に機械学習処理を実行させ、新たな学習モデルを取得させる（Ｓ２０４）。 When executing the learning process, the determination unit 30 holds the accumulated teacher data set (S203), causes the determiner 31 to execute the machine learning process using the held teacher data set, and acquires a new learning model. (S204).

また、判定部３０は、判定処理を実行する場合、供給された特徴量データを判定器３１にセットして判定処理を実行させ、判定結果を判定結果処理部４０に供給する（Ｓ２０５）。 When executing the determination process, the determination unit 30 sets the supplied feature amount data in the determination unit 31 to perform the determination process, and supplies the determination result to the determination result processing unit 40 (S205).

次に、判定結果処理部４０及びフィルタ条件生成部５０の動作の詳細について図７を用いて説明する。 Next, details of operations of the determination result processing unit 40 and the filter condition generation unit 50 will be described with reference to FIG.

図７は、判定結果処理部４０及びフィルタ条件生成部５０の動作の例について示したフローチャートである。 FIG. 7 is a flow chart showing an example of operations of the determination result processing unit 40 and the filter condition generation unit 50. As shown in FIG.

判定結果処理部４０は、判定部３０（判定器３１）から供給された判定結果について判定結果表示画面としてオペレータＯＰが使用する監視端末ＴＥに提示する（Ｓ３０１）。 The determination result processing unit 40 presents the determination result supplied from the determination unit 30 (determiner 31) as a determination result display screen to the monitoring terminal TE used by the operator OP (S301).

そして、判定結果処理部４０は、オペレータＯＰが使用する監視端末ＴＥに判定結果の情報を含む判定結果表示画面を表示させ、オペレータＯＰから誤判定ログのフィードバック（誤判定ログの指定操作）を受け付ける（Ｓ３０２）。このとき、判定結果処理部４０は、異常となった判定結果（例えば、判定結果の数値が一定値異常の判定結果）のみを出力（判定結果表示画面に表示）するものとする。また、このとき、制御部７０は、判定結果処理部４０から、誤判定フィードバックに係る誤判定ログ等の関連データ（例えば、当該誤判定ログに対応する特徴量データ等）をフィルタ条件生成部５０に供給させ、フィルタ条件生成部５０にフィルタ条件の生成を要求する。 Then, the determination result processing unit 40 causes the monitoring terminal TE used by the operator OP to display a determination result display screen including the information of the determination result, and receives feedback of the erroneous determination log (an operation of designating the erroneous determination log) from the operator OP. (S302). At this time, the determination result processing unit 40 outputs (displays on the determination result display screen) only the abnormal determination result (for example, the determination result that the numerical value of the determination result is a constant value abnormality). At this time, the control unit 70 transmits related data such as an erroneous determination log related to the erroneous determination feedback (for example, feature amount data corresponding to the erroneous determination log) from the determination result processing unit 40 to the filter condition generation unit 50. , and requests the filter condition generator 50 to generate a filter condition.

判定結果処理部４０が、判定結果をオペレータＯＰに通知し、誤判定フィードバック（誤判定ログの指定等）を受け付ける手段については、上記のような判定結果表示画面の提示（例えば、Ｗｅｂページを用いた操作画面の提示）に限定されないものであり、メール送受信や種々のメッセージ送受信によりオペレータＯＰから誤判定フィードバックを受け付けるようにしてもよい。オペレータＯＰは、異常と判定されたログ情報が本当に異常に起因するものであるか否かを調査し、結果誤判定と判断した場合、判定結果処理部４０に当該ログ情報を誤判定ログとして誤判定フィードバックすることができる。 As for means for the judgment result processing unit 40 to notify the operator OP of the judgment result and receive the erroneous judgment feedback (designation of erroneous judgment log, etc.), presentation of the judgment result display screen as described above (for example, using a web page). It is not limited to presenting an operation screen that has been used), and erroneous determination feedback may be received from the operator OP by sending/receiving mail or sending/receiving various messages. The operator OP investigates whether or not the log information determined to be abnormal is actually caused by the abnormality, and if it is determined that the result is an erroneous determination, the log information is sent to the determination result processing unit 40 as an erroneous determination log. Judgment feedback can be provided.

例えば、図３に示すような判定結果表示画面において、１行目の判定結果（インデックス：１とする判定結果）が選択された場合、判定結果処理部４０は、１行目の判定結果（インデックス：１とする判定結果）に対応するログ情報及び特徴量データ（Ｘ１～ＸＮ）を誤判定フィードバックの情報として取得する。 For example, in the determination result display screen as shown in FIG. : 1) is obtained as the information of the erroneous determination feedback.

次に、フィルタ条件生成部５０は、誤判定ログの供給を受けると、誤判定ログとしてフィードバックされた誤判定ログに対応する特徴量データ（Ｘ１、Ｘ２、・・・、ＸＮ）に基づき、フィルタ条件の生成対象となるエントリ（検知エンジン１１１）を選択する（Ｓ３０３）。 Next, when receiving the false determination log, the filter condition generation unit 50 filters based on the feature amount data (X1, X2, . . . , XN) corresponding to the false determination log fed back as the false determination log. An entry (detection engine 111) for which a condition is to be generated is selected (S303).

例えば、フィルタ条件生成部５０は、誤判定ログに対応する特徴量データにおいて、検知結果の数値が０より大きくなっている検知エンジン１１１については異常が検知されたものとし、フィルタ条件の生成対象となる検知エンジン１１１としてエントリ選択するようにしてもよい。 For example, the filter condition generation unit 50 determines that the detection engine 111 having a detection result value greater than 0 in the feature amount data corresponding to the erroneous determination log is detected as being abnormal, and the filter condition is generated. The entry may be selected as the detection engine 111 that is different.

例えば、図３に示す判定結果表示画面において１行目の判定結果（インデックス：１とする判定結果）に対応するログ情報及び特徴量データ（Ｘ１～ＸＮ）を誤判定フィードバックの情報として取得された場合を想定する。この場合、誤判定ログに対応する特徴量データにおいてＸ１＝６０で、Ｘ２～ＸＮが全て０となるとすれば、Ｘ１に係る検知エンジン１１１－１の検知結果のみが異常を示し、それ以外は正常を示していたと判断できる。この場合、フィルタ条件生成部５０は、フィルタ条件の生成対象となるエントリ（検知エンジン１１１）として、検知エンジン１１１－１に対応するエントリを選択する。ここでは、検知エンジン１１１－１に対応するエントリは、図４に示す１行目のエントリ（検知エンジン名：ＵＲＬパスが疑わしい）であったものとする。 For example, log information and feature amount data (X1 to XN) corresponding to the first row of the judgment result (index: 1 judgment result) on the judgment result display screen shown in FIG. 3 are acquired as incorrect judgment feedback information. Assume the case. In this case, if X1=60 and X2 to XN are all 0 in the feature amount data corresponding to the erroneous determination log, only the detection result of the detection engine 111-1 related to X1 indicates an abnormality, and the others are normal. It can be judged that the In this case, the filter condition generator 50 selects the entry corresponding to the detection engine 111-1 as the entry (detection engine 111) for which the filter condition is to be generated. Here, it is assumed that the entry corresponding to the detection engine 111-1 is the entry (detection engine name: URL path is suspicious) on the first line shown in FIG.

次に、フィルタ条件生成部５０は、フィルタ条件生成定義情報５１から、フィルタ条件の生成対象として選択した検知エンジン１１１のエントリを取得し、取得したエントリに基づいて選択した検知エンジン１１１に対応するフィルタ条件を生成する（Ｓ３０４）。 Next, the filter condition generation unit 50 acquires the entry of the detection engine 111 selected as a filter condition generation target from the filter condition generation definition information 51, and filters corresponding to the detection engine 111 selected based on the acquired entry. Conditions are generated (S304).

このとき、フィルタ条件生成部５０は、取得したエントリのフィールド名に対応するフィールドの値を誤判定ログから取得し、当該フィールドの値と取得したエントリの値条件に基づいてフィルタ条件データＤ２にフィードバックするフィルタ条件を生成する。 At this time, the filter condition generator 50 acquires the value of the field corresponding to the field name of the acquired entry from the misjudgment log, and feeds it back to the filter condition data D2 based on the value of the field and the value condition of the acquired entry. Generate a filter condition that

フィルタ条件生成部５０は、生成したフィルタ条件を、検知エンジン部１１（フィルタ処理部１２）に通知する。このとき、フィルタ条件生成部５０は、生成したフィルタ条件と共に、選択した検知エンジン１１１の識別子（例えば、検知エンジン名等の検知エンジン１１１のＩＤ）を通知する。 The filter condition generation unit 50 notifies the detection engine unit 11 (filter processing unit 12) of the generated filter conditions. At this time, the filter condition generator 50 notifies the identifier of the selected detection engine 111 (for example, the ID of the detection engine 111 such as the name of the detection engine) together with the generated filter condition.

フィルタ処理部１２は、フィルタ条件生成部５０から供給されたフィルタ条件を、対応するフィルタ条件データＤ２に追加更新する（Ｓ３０５）。 The filter processing unit 12 additionally updates the filter condition supplied from the filter condition generation unit 50 to the corresponding filter condition data D2 (S305).

例えば、ステップＳ３０２で図３に示すような判定結果表示画面において、１行目の判定結果（インデックス：１とする判定結果）が選択され、ステップＳ３０３で、図４に示すフィルタ条件生成定義情報５１で１行目のエントリ（検知エンジン１１１－１に対応するエントリ）が選択されていた場合を想定する。この場合、当該エントリでは、フィールド名が「ＵＲＬパス」で、値条件が「一致」となっている。この場合、フィルタ条件生成部５０は、ＵＲＬパスが誤判定ログにおける値である「／ａ．ｅｘｅ」と一致するログ情報が供給された場合には判定結果を異常ではなく正常とする値（例えば、０）とするようなフィルタ条件を生成して、検知エンジン１１１－１に対応するフィルタ条件データＤ２－１に追加登録するように更新する。 For example, in the judgment result display screen as shown in FIG. 3 in step S302, the judgment result in the first row (the judgment result with index: 1) is selected, and in step S303, the filter condition generation definition information 51 shown in FIG. , the entry in the first row (the entry corresponding to the detection engine 111-1) is selected. In this case, the entry has a field name of "URL path" and a value condition of "match". In this case, the filter condition generation unit 50 determines that the determination result is normal instead of abnormal when log information whose URL path matches "/a.exe", which is the value in the false determination log, is supplied (for example, , 0) is generated and updated so as to be additionally registered in the filter condition data D2-1 corresponding to the detection engine 111-1.

以後は、ＵＲＬパスが「／ａ．ｅｘｅ」と一致するログ情報が検知エンジン１１１－１に供給された場合、検知結果Ｄ１－１が異常を示す値（１以上の値；例えば、以前と同じく６０）であったとしても、フィルタ処理部１２（フィルタ条件データＤ２－１）のフィルタ処理により正常を示す値（例えば、０）にフィルタ処理され、フィルタ処理済検知結果Ｄ３－１として出力される。 Thereafter, when log information whose URL path matches "/a.exe" is supplied to the detection engine 111-1, the detection result D1-1 will have a value indicating abnormality (a value of 1 or more; 60), the filter processing unit 12 (filter condition data D2-1) filters to a value indicating normality (for example, 0) and outputs it as a filtered detection result D3-1. .

（Ａ－３）第１の実施形態の効果
第１の実施形態によれば、以下のような効果を奏することができる。 (A-3) Effects of First Embodiment According to the first embodiment, the following effects can be obtained.

第１の実施の異常判定システム１では、判定結果処理部４０がオペレータＯＰ（監視端末ＴＥ）からの誤判定フィードバックを受け、フィルタ条件生成部５０が、フィルタ条件生成定義情報５１に基づいて誤判定に寄与した検知エンジン１１１のフィルタ条件を生成し、検知処理部１０（フィルタ処理部１２）に設定する。 In the abnormality determination system 1 of the first embodiment, the determination result processing unit 40 receives erroneous determination feedback from the operator OP (monitoring terminal TE), and the filter condition generation unit 50 generates an erroneous determination based on the filter condition generation definition information 51. is generated, and set in the detection processing unit 10 (filter processing unit 12).

これにより、異常判定システム１では、誤判定フィードバックの対象となる検知エンジン１１１について、誤判定フィードバックされたログ情報を異常として出力させない（フィルタ処理部１２で正常な値にフィルタ処理する）ため、当該検知エンジン１１１に関連する検知事由での誤判定を抑制できる。 As a result, the abnormality determination system 1 prevents the detection engine 111, which is the target of the erroneous determination feedback, from outputting the log information of the erroneous determination feedback as an abnormality (the filter processing unit 12 filters the log information into a normal value). It is possible to suppress erroneous determinations due to detection reasons related to the detection engine 111 .

また、異常判定システム１では、過去に誤判定されたログ情報について異常検知に寄与しない検知エンジン１１１に対しては、フィルタ条件が追加生成されないため、当該検知エンジン１１１に関連する検知事由での異常の見逃しを抑制し、検知精度を維持できる。 Further, in the anomaly determination system 1, additional filter conditions are not generated for the detection engine 111 that does not contribute to anomaly detection for log information that has been erroneously determined in the past. It is possible to suppress the oversight of detection and maintain detection accuracy.

さらに、異常判定システム１では、フィルタ条件生成定義情報５１により、誤判定に寄与した検知エンジン１１１のみにフィルタ条件が生成されるため、誤判定ログに対する正確なホワイトリスト型のフィルタが可能になる。例えば、異常判定システム１では、悪性サイトからの実行ファイルダウンロードを検知したという事由で異常検知された場合、誤判定事由に関連する、悪性サイトを検知する検知エンジン１１１や実行ファイルをダウンロードする検知エンジン１１１にはホワイトリストによるフィルタが適用され、誤判定事由に関連しない正規Ｗｅｂサイトの改ざんや乗っ取りを検知する検知エンジン１１１はホワイトリストのフィルタが適用されない。このようにすることで、異常判定システム１では、誤判定された異常検知事由での異常検知を抑制し、それ以外の事由での検知精度は維持することが可能になる。
（Ｂ）第２の実施形態
以下、本発明による異常判定システム、異常判定プログラム、及び異常判定方法の第２の実施形態を、図面を参照しながら詳述する。 Furthermore, in the abnormality determination system 1, the filter condition generation definition information 51 generates a filter condition only for the detection engine 111 that contributed to the erroneous determination, so it is possible to perform an accurate whitelist type filter for erroneous determination logs. For example, in the anomaly determination system 1, when an anomaly is detected due to the detection of an execution file download from a malicious site, the detection engine 111 that detects the malicious site and the detection engine that downloads the execution file are associated with the reason for the erroneous determination. A whitelist filter is applied to 111, and the whitelist filter is not applied to the detection engine 111 that detects falsification or hijacking of a legitimate website that is not related to an erroneous determination reason. By doing so, in the abnormality determination system 1, it is possible to suppress abnormality detection due to an erroneously determined abnormality detection reason, and to maintain detection accuracy for other reasons.
(B) Second Embodiment Hereinafter, a second embodiment of an abnormality determination system, an abnormality determination program, and an abnormality determination method according to the present invention will be described in detail with reference to the drawings.

（Ｂ－１）第２の実施形態の構成
第２の実施形態の異常判定システム１Ａについても、上述の図１を用いて示すことができる。 (B-1) Configuration of Second Embodiment The abnormality determination system 1A of the second embodiment can also be shown using FIG. 1 described above.

なお、図１において、括弧内の符号は第２の実施形態でのみ用いられる符号である。 In FIG. 1, the symbols in parentheses are symbols used only in the second embodiment.

以下では、第２の実施形態の異常判定システム１Ａについて第１の実施形態との差異を説明する。 Below, difference with 1st Embodiment is demonstrated about the abnormality determination system 1A of 2nd Embodiment.

図１に示すように、第２の実施形態の異常判定システム１Ａでは判定結果処理部４０、及びフィルタ条件生成部５０（フィルタ条件生成定義情報５１）が、判定結果処理部４０Ａ、及びフィルタ条件生成部５０Ａ（フィルタ条件生成定義情報５１Ａ）に置き換わっている点で第１の実施形態と異なる。 As shown in FIG. 1, in the abnormality determination system 1A of the second embodiment, a determination result processing unit 40 and a filter condition generation unit 50 (filter condition generation definition information 51) are configured so that the determination result processing unit 40A and the filter condition generation It differs from the first embodiment in that it is replaced with a section 50A (filter condition generation definition information 51A).

第２の実施形態の判定結果処理部４０Ａは、オペレータＯＰから、ログ情報単位ではなく、ログ情報を構成するフィールドの値を指定した誤判定フィードバックを受け付けることが可能な点で第１の実施形態と異なっている。判定結果処理部４０Ａがログ情報を構成するフィールドの値を指定した誤判定フィードバックを受け付ける方法については限定されないものであるが、例えば、図８、図９に示すような判定結果表示画面の構成を適用するようにしてもよい。 The determination result processing unit 40A of the second embodiment can receive erroneous determination feedback specifying the values of the fields that make up the log information from the operator OP instead of the units of log information. is different from The method by which the judgment result processing unit 40A accepts the erroneous judgment feedback specifying the values of the fields that make up the log information is not limited. may be applied.

図８、図９は、第２の実施形態における判定結果処理部４０Ａが表示する判定結果表示画面の構成例について示した図である。 8 and 9 are diagrams showing configuration examples of the determination result display screen displayed by the determination result processing unit 40A in the second embodiment.

第１の実施形態の判定結果表示画面では、ログ情報単位（行単位）で選択を受け付けることが可能であったが、第２の実施形態の判定結果表示画面では、ログ情報のフィールドの値（フィールド名とその値）を指定した誤判定フィードバックを受け付けることができる。 In the determination result display screen of the first embodiment, it was possible to accept selection in log information units (row units), but in the determination result display screen of the second embodiment, the log information field values ( field name and its value) can be accepted.

第２の実施形態の誤判定結果表示画面において、フィールド名とその値の入力を受け付ける方法については限定されないものであるが、例えば、図８に示すような操作画面（ＧＵＩ）を適用するようにしてもよい。 In the erroneous determination result display screen of the second embodiment, the method of receiving input of the field name and its value is not limited, but for example, an operation screen (GUI) as shown in FIG. 8 is applied. may

図８に示す判定結果表示画面では、フィールド名の入力を受け付けることができるオブジェクトＯＢ２０１、フィールドの値（オブジェクトＯＢ２０１で選択されたフィールドの値）の入力を受けることができるテキストボックス形式のオブジェクトＯＢ２０２、及びオブジェクトＯＢ２０１で指定されたフィールド名とオブジェクトＯＢ２０２で入力された値で誤判定フィードバックを受け付けるためのボタン形式のオブジェクトＯＢ２０３が配置されている。 In the determination result display screen shown in FIG. 8, an object OB201 capable of receiving field name input, a text box type object OB202 capable of receiving field value input (field value selected in object OB201), And a button-type object OB203 is arranged for receiving feedback on an erroneous determination based on the field name designated by the object OB201 and the value input by the object OB202.

オブジェクトＯＢ２０１では、例えば、リストボックス等の形式のオブジェクト（ＧＵＩ）によりフィールド名の選択入力を受け付けるようにしてもよいし、その上に表示された表で選択されたセルに応じたフィールド名を入力するようにしてもよい。また、例えば、判定結果表示画面において、オブジェクトＯＢ２０１には、選択されたセル（行）に応じたフィールド名を自動入力するようにしてもよい。例えば、図８に示す判定結果表示画面では、ログ情報の通信先ＦＱＤＮのセル（インデックス：１の通信先ＦＱＤＮのセル）が選択（ハイライト）された状態について示しており、オブジェクトＯＢ２０１には選択されたフィールドのフィールド名（通信先ＦＱＤＮ）が入力されている。具体的には、例えば、図８に示す判定結果表示画面では、オブジェクトＯＢ２０２には選択されたフィールドの値（ｘｘｘ．ｙｙｙ．ｚｚｚ）が自動入力されている。なお、オブジェクトＯＢ２０２においては、オペレータＯＰによる自由なテキスト編集も受け付け可能とするようにしてもよい。 In the object OB201, for example, an object (GUI) in the form of a list box or the like may accept a field name selection input, or input a field name corresponding to the cell selected in the table displayed thereon. You may make it Further, for example, on the determination result display screen, a field name corresponding to the selected cell (row) may be automatically entered in the object OB201. For example, the determination result display screen shown in FIG. 8 shows a state in which the cell of the communication destination FQDN of the log information (the cell of the communication destination FQDN with index: 1) is selected (highlighted), and the object OB201 is selected (highlighted). The field name (communication destination FQDN) of the specified field is entered. Specifically, for example, in the judgment result display screen shown in FIG. 8, the value (xxx.yyy.zzz) of the selected field is automatically entered in the object OB202. Note that the object OB202 may also accept free text editing by the operator OP.

そして、図８に示すように、フィールド名として「通信先ＦＱＤＮ」、フィールド値として「ｘｘｘ．ｙｙｙ．ｚｚｚ」が指定された状態でオブジェクトＯＢ２０３のボタンが押下されると、判定結果処理部４０Ａは、フィールド名「通信先ＦＱＤＮ」、フィールド値「ｘｘｘ．ｙｙｙ．ｚｚｚ」とする誤判定フィードバックを受け付けることができる。 Then, as shown in FIG. 8, when the button of the object OB 203 is pressed with "communication destination FQDN" specified as the field name and "xxx.yyy.zzz" specified as the field value, the determination result processing unit 40A , field name “communication destination FQDN” and field value “xxx.yyy.zzz”.

さらに、第２の実施形態の判定結果表示画面のオブジェクトＯＢ２０２では、図９に示すように、ワイルドカードを用いたテキスト記述を可能とするようにしてもよい。 Furthermore, in the object OB202 of the determination result display screen of the second embodiment, as shown in FIG. 9, text description using wildcards may be allowed.

例えば、図９では、オブジェクトＯＢ２０２に、「＊．ｙｙｙ．ｚｚｚ」と「＊」をワイルドカードとしたテキスト記述となっている。この場合「＊」は、０文字以上の任意の文字列が該当する（置き換え可能とする）ことを意味する。例えば、誤判定フィードバックされた通信先ＦＱＤＮが「＊．ｙｙｙ．ｚｚｚ」だった場合、「ａａａ．ｙｙｙ．ｚｚｚ」や「ｔｔｔ．ｙｙｙ．ｚｚｚ」等、通信先ＦＱＤＮの末尾が「ｙｙｙ．ｚｚｚ」となるログ情報を全て指定したことと同義となる。例えば、図９に示す判定結果表示画面では、マウスのドラッグ操作等により、いずれかのセルの一部の文字列のみ選択された場合には、選択されなかった部分について、オブジェクトＯＢ２０２で上記のようなワイルドカードを用いた表示に置き換えるようにしてもよい。例えば、図９では、インデックス：１の通信先ＦＱＤＮのセルについて「ｘｘｘ．ｙｙｙ．ｚｚｚ」のうち「ｙｙｙ．ｚｚｚ」のみが選択（ハイライト）された状態になっているので、オブジェクトＯＢ２０２では選択されなかった「ｘｘｘ．」の部分がワイルドカード「＊」に置き換わって表示（自動表示）されている。 For example, in FIG. 9, the object OB202 has a text description with "*.yyy.zzz" and "*" as wildcards. In this case, "*" means that any character string of 0 or more characters is applicable (replaceable). For example, if the FQDN of the communication destination that is fed back with an erroneous judgment is "*.yyy.zzz", the end of the communication destination FQDN such as "aaa.yyy.zzz" or "ttt.yyy.zzz" is "yyy.zzz" It is synonymous with specifying all log information. For example, on the determination result display screen shown in FIG. 9, if only a part of the character string in any cell is selected by a mouse drag operation or the like, the unselected part is displayed in the object OB202 as described above. may be replaced with a display using a wild card. For example, in FIG. 9, only "yyy.zzz" of "xxx.yyy.zzz" is selected (highlighted) for the cell of the communication destination FQDN with index: 1. The portion of "xxx." which was not displayed is replaced with a wild card "*" and displayed (automatically displayed).

以上のように、オペレータＯＰは、図９のように、ワイルドカード形式でフィールド値を誤判定フィードバックすることもできる。ワイルドカード形式の誤判定フィードバックは、例えばホスト名が異なる通信先ＦＱＤＮが複数異常検知されるような場合に、共通のドメイン名に対して誤判定フィードバックすることで、一括でフィードバックしたい場合や、今後発生し得る他の異なるホスト名をまとめて誤判定フィードバックしたい場合に利用できる。 As described above, the operator OP can feed back erroneous determinations of field values in a wildcard format as shown in FIG. The wildcard format misjudgment feedback, for example, when multiple failures are detected for communication destination FQDNs with different host names, by giving misjudgment feedback to a common domain name. It can be used when you want to collectively give feedback on misjudgments of other different host names that may occur.

以上のように、判定結果処理部４０Ａは、フィールド値（フィールド名とその値）について誤判定フィードバックを受け付けると、フィルタ条件生成部５０Ａに受け付けた情報（フィールド名とその値）を通知し、フィルタ条件の生成を要求する。 As described above, when the determination result processing unit 40A receives erroneous determination feedback for a field value (field name and its value), it notifies the filter condition generation unit 50A of the received information (field name and its value), Request the creation of a condition.

フィルタ条件生成部５０Ａは、判定結果処理部４０Ａから通知された情報（フィールド名とその値）をフィルタ条件生成定義情報５１Ａに適用して検知エンジン１１１ごとのフィルタ条件を生成し、フィルタ処理部１２に通知する。 The filter condition generation unit 50A applies the information (the field name and its value) notified from the determination result processing unit 40A to the filter condition generation definition information 51A to generate a filter condition for each detection engine 111, and the filter processing unit 12 to notify.

図１０は、第２の実施形態に係るフィルタ条件生成定義情報５１Ａの構成例について示した図である。 FIG. 10 is a diagram showing a configuration example of filter condition generation definition information 51A according to the second embodiment.

図１０に示すように、フィルタ条件生成定義情報５１Ａも、各エントリについてフィールド名、検知エンジン名、及び値条件の項目で構成されているが、検索時のキー（主キー）がフィールド名となっている点で第１の実施形態と異なる。すなわち、フィルタ条件生成定義情報５１Ａは、フィールド名をキーとするフィードバック条件生成定義情報となっている。 As shown in FIG. 10, the filter condition generation definition information 51A also consists of field name, detection engine name, and value condition items for each entry. It differs from the first embodiment in that That is, the filter condition generation definition information 51A is feedback condition generation definition information with field names as keys.

フィルタ条件生成部５０Ａは、判定結果処理部４０Ａから、誤判定フィードバックの情報（フィールド名とその値）を受けると、当該誤判定フィードバック情報のフィールド名に合致するエントリをフィルタ条件生成定義情報５１Ａから検索し、当該エントリの検知エンジン名と値条件を取得する。そして、フィルタ条件生成部５０Ａは、検出したエントリの検知エンジン名、値条件、及び誤判定フィードバックされた値に基づいて、フィルタ条件を生成する。 When the filter condition generation unit 50A receives the information of the erroneous determination feedback (field name and its value) from the determination result processing unit 40A, the filter condition generation unit 50A selects an entry that matches the field name of the erroneous determination feedback information from the filter condition generation definition information 51A. Search and get the detection engine name and value condition of the entry. Then, the filter condition generator 50A generates a filter condition based on the detection engine name of the detected entry, the value condition, and the erroneous determination feedback value.

図１０では１～３行目のエントリのフィールド名は、それぞれ「ＵＲＬパス」、「通信先ＦＱＤＮ」、「ＵＲＬパス」となっている。 In FIG. 10, the field names of the entries on the first to third lines are "URL path", "communication destination FQDN", and "URL path", respectively.

この場合、１行目のエントリは、フィールド名が「ＵＲＬパス」であり、検知エンジン名が「ＵＲＬパスが疑わしい」であり、値条件が「一致」である。この場合、１行目のエントリでは、誤判定フィードバックのフィールド名であるＵＲＬパスにおいて誤判定フィードバックの値と一致するログ情報をフィルタするフィルタ条件が生成されることを示している。 In this case, the entry in the first row has the field name "URL path", the detection engine name "URL path is suspect", and the value condition "match". In this case, the entry in the first line indicates that a filter condition is generated to filter log information that matches the false positive feedback value in the URL path, which is the false positive feedback field name.

また、２行目のエントリは、フィールド名が「通信先ＦＱＤＮ」であり、検知エンジン名が「通信先ドメインが疑わしい」であり、値条件が「部分一致（ドメイン名）」となっている。この場合、２行目のエントリでは、誤判定フィードバックのフィールド名である通信先ＦＱＤＮにおいて誤判定フィードバックの値と部分一致（通信先ＦＱＤＮのうちドメイン名の部分の文字列が一致）するログ情報をフィルタするフィルタ条件が生成されることを示している。ここでは、ドメイン名は、ＦＱＤＮからホスト名を除外した文字列とする。例えば、ＦＱＤＮが「ｈｏｓｔ１.ｘｘｘ．ｙｙｙ．ｚｚｚ」であった場合、「ｈｏｓｔ１」がホスト名で「ｘｘｘ．ｙｙｙ．ｚｚｚ」がドメイン名となる。 In the entry on the second line, the field name is "communication destination FQDN", the detection engine name is "communication destination domain is suspicious", and the value condition is "partial match (domain name)". In this case, in the entry on the second line, the log information that partially matches the value of the misjudgment feedback (the character string of the domain name part of the communication destination FQDN matches) in the communication destination FQDN, which is the field name of the misjudgment feedback. It shows that a filtering filter condition is generated. Here, the domain name is a character string obtained by excluding the host name from the FQDN. For example, if the FQDN is "host1.xxx.yyy.zzz", "host1" is the host name and "xxx.yyy.zzz" is the domain name.

さらに、３行目のエントリは、フィールド名が「ＵＲＬパス」であり、検知エンジン名が「ダウンロードファイル名が疑わしい」であり、値条件が「部分一致（ファイル名）」となっている。この場合、３行目のエントリでは、誤判定フィードバックのフィールド名であるＵＲＬパスにおいて誤判定フィードバックの値と部分一致（ＵＲＬパスのうちファイル名の部分の文字列が一致）するログ情報をフィルタするフィルタ条件が生成されることを示している。ここでは、ファイル名は、ＵＲＬパスからディレクトリの表記を除外したファイル名の文字列であるものとする。 Furthermore, the entry in the third line has a field name of "URL path", a detection engine name of "suspect download file name", and a value condition of "partial match (file name)". In this case, the entry in the third line filters log information that partially matches the value of the false positive feedback in the URL path, which is the field name of the false positive feedback (the character string of the file name part of the URL path matches). It shows that the filter condition is generated. Here, the file name is assumed to be a character string of the file name excluding the notation of the directory from the URL path.

なお、ここでは、検知エンジン１１１－１の検知エンジン名が「ＵＲＬパスが疑わしい」、検知エンジン１１１－２の検知エンジン名が「通信先ドメインが疑わしい」、検知エンジン１１１－３の検知エンジン名が「ダウンロードファイル名が疑わしい」であるものとする。 Here, the detection engine name of the detection engine 111-1 is "suspicious URL path", the detection engine name of the detection engine 111-2 is "suspicious destination domain", and the detection engine name of the detection engine 111-3 is "suspicious". Assume that "download file name is suspicious".

（Ｂ－２）第２の実施形態の動作
次に、以上のような構成を有する第２の実施形態の異常判定システム１Ａの動作（実施形態に係る異常判定方法）について第１の実施形態との差異のみを説明する。 (B-2) Operation of Second Embodiment Next, the operation (abnormality determination method according to the embodiment) of the abnormality determination system 1A of the second embodiment having the configuration as described above will be described with respect to that of the first embodiment. Only differences between

第２の実施形態では、判定結果処理部４０Ａ及びフィルタ条件生成部５０Ａの動作のみが第１の実施形態と異なっている。 The second embodiment differs from the first embodiment only in the operations of the determination result processing unit 40A and the filter condition generation unit 50A.

次に、判定結果処理部４０Ａ及びフィルタ条件生成部５０Ａの動作の詳細について図１１を用いて説明する。 Next, details of operations of the determination result processing unit 40A and the filter condition generation unit 50A will be described with reference to FIG.

図１１は、判定結果処理部４０Ａ及びフィルタ条件生成部５０Ａの動作の例について示したフローチャートである。 FIG. 11 is a flow chart showing an example of operations of the determination result processing unit 40A and the filter condition generation unit 50A.

判定結果処理部４０Ａは、判定部３０（判定器３１）から供給された判定結果について判定結果表示画面としてオペレータＯＰが使用する監視端末ＴＥに提示し（Ｓ４０１）、誤判定フィードバックの情報入力（誤判定に係るフィールド名とその値の入力）を受け付ける（Ｓ４０２）。このとき、判定結果処理部４０Ａは、図８、図９のような判定結果表示画面を提示して誤判定フィードバックの情報入力を受け付ける点で第１の実施形態と異なっている。判定結果処理部４０Ａは、誤判定フィードバックの情報入力を受け付けると、当該情報（フィールド名とその値）を、フィルタ条件生成部５０Ａに通知すると共にフィルタ条件の更新を要求する。 The determination result processing unit 40A presents the determination result supplied from the determination unit 30 (determiner 31) as a determination result display screen to the monitoring terminal TE used by the operator OP (S401), and inputs information on erroneous determination feedback (error Input of the field name and its value related to determination) is received (S402). At this time, the determination result processing unit 40A presents a determination result display screen as shown in FIGS. 8 and 9, and receives input of incorrect determination feedback information, unlike in the first embodiment. When the determination result processing unit 40A receives information input of incorrect determination feedback, it notifies the information (field name and its value) to the filter condition generation unit 50A and requests update of the filter condition.

誤判定フィードバック情報（フィールド名とその値）が供給されると、フィルタ条件生成部５０Ａは、当該誤判定フィードバック情報のフィールド名に合致するエントリをフィルタ条件生成定義情報５１Ａから検索して選択する（Ｓ４０３）。 When the erroneous determination feedback information (field name and its value) is supplied, the filter condition generation unit 50A searches the filter condition generation definition information 51A for an entry matching the field name of the erroneous determination feedback information and selects it ( S403).

次に、フィルタ条件生成部５０Ａは、フィルタ条件生成定義情報５１Ａから、選択したエントリの情報（検知エンジン名及び値条件）を取得し、取得したエントリごとにフィルタ条件を生成する（Ｓ４０４）。 Next, the filter condition generation unit 50A acquires the selected entry information (detection engine name and value condition) from the filter condition generation definition information 51A, and generates a filter condition for each acquired entry (S404).

フィルタ条件生成部５０Ａは、生成したフィルタ条件を、検知エンジン部１１（フィルタ処理部１２）に通知する。このとき、フィルタ条件生成部５０Ａは、生成したフィルタ条件と共に、選択した検知エンジン１１１の識別子（例えば、検知エンジン名等の検知エンジン１１１のＩＤ）を通知してフィルタ条件の更新を要求する。そして、フィルタ処理部１２は、フィルタ条件生成部５０Ａから供給されたフィルタ条件を、対応するフィルタ条件データＤ２に追加更新する（Ｓ４０５）。 The filter condition generation unit 50A notifies the detection engine unit 11 (filter processing unit 12) of the generated filter conditions. At this time, the filter condition generation unit 50A notifies the generated filter condition and the identifier of the selected detection engine 111 (for example, the ID of the detection engine 111 such as the detection engine name) to request update of the filter condition. Then, the filter processing unit 12 additionally updates the filter condition supplied from the filter condition generation unit 50A to the corresponding filter condition data D2 (S405).

例えば、ステップＳ４０２で図９に示すような判定結果表示画面において、誤判定フィードバックのフィールド名が「通信先ＦＱＤＮ」で、フィールド値が「＊．ｙｙｙ．ｚｚｚ」であったものとする。 For example, in the judgment result display screen shown in FIG. 9 in step S402, it is assumed that the field name of the erroneous judgment feedback is "communication destination FQDN" and the field value is "*.yyy.zzz".

この場合、ステップＳ４０３で、図４に示すフィルタ条件生成定義情報５１で２行目のエントリ（フィールド名：通信先ＦＱＤＮ）が選択されることになる。当該エントリでは、フィールド名が「通信先ＦＱＤＮ」で、値条件が「部分一致（ドメイン名）」となっている。 In this case, in step S403, the second line entry (field name: communication destination FQDN) is selected in the filter condition generation definition information 51 shown in FIG. In this entry, the field name is "communication destination FQDN" and the value condition is "partial match (domain name)".

したがって、この場合、フィルタ条件生成部５０Ａは、通信先ＦＱＤＮを構成するドメイン名（通信先ＦＱＤＮの一部）が、誤判定フィードバックされたフィールド値である「＊．ｙｙｙ．ｚｚｚ」と一致するログ情報が供給された場合には判定結果を異常ではなく正常とする値（例えば、０）とするようなフィルタ条件を生成して、検知エンジン１１１－２に対応するフィルタ条件データＤ２－２に追加登録するように更新する。 Therefore, in this case, the filter condition generation unit 50A creates a log in which the domain name (a part of the communication destination FQDN) that constitutes the communication destination FQDN matches the field value "*.yyy.zzz" that was fed back as an erroneous judgment. Generates a filter condition that sets a value (for example, 0) as normal rather than abnormal when the information is supplied, and adds it to the filter condition data D2-2 corresponding to the detection engine 111-2. Update to register.

以後は、通信先ＦＱＤＮのうちドメイン名の末尾が「ｙｙｙ．ｚｚｚ」と一致するログ情報が検知エンジン１１１－２に供給された場合、検知結果Ｄ１－２が異常を示す値（１以上の値）であったとしても、フィルタ処理部１２（フィルタ条件データＤ２－１）のフィルタ処理により正常を示す値（例えば、０）にフィルタ処理され、フィルタ処理済検知結果Ｄ３－２として出力される。 Henceforth, when the detection engine 111-2 is supplied with log information in which the end of the domain name of the communication destination FQDN matches "yyy.zzz", the detection result D1-2 is set to a value indicating abnormality (a value of 1 or more). ), the filter processing unit 12 (filter condition data D2-1) filters to a value (for example, 0) indicating normality, and is output as a filtered detection result D3-2.

（Ｂ－３）第２の実施形態の効果
第２の実施形態によれば、以下のような効果を奏することができる。 (B-3) Effects of Second Embodiment According to the second embodiment, the following effects can be obtained.

第２の実施形態の異常判定システム１Ａでは、判定結果処理部４０Ａが、オペレータＯＰ（監視端末ＴＥ）から誤判定フィードバックを受け付ける際に、ログ情報のフィールドの値（フィールド名とその値）の指定を受け付ける。そして、異常判定システム１Ａでは、フィルタ条件生成部５０Ａがフィールド名をキーとするフィルタ条件生成定義情報５１Ａをもとにフィルタ条件を生成する。 In the abnormality determination system 1A of the second embodiment, when the determination result processing unit 40A receives false determination feedback from the operator OP (monitoring terminal TE), the value of the log information field (field name and its value) is specified. accept. Then, in the abnormality determination system 1A, the filter condition generation unit 50A generates filter conditions based on the filter condition generation definition information 51A having the field name as a key.

これにより、異常判定システム１Ａでは、ログ情報全体に対して誤判定か否かの判断ができない場合にも、フィールド単位に正常と判断できる項目を指定して誤判定フィードバックできる。これにより、第２の実施形態では、例えば、ログ監視経験の浅いオペレータＯＰでも異常判定システム１Ａを活用することができる。また、異常判定システム１Ａでは、ワイルドカード形式で誤判定フィードバックを受け付けることができるため、例えばＦＱＤＮのドメイン名を誤判定フィードバックすることで、例えば、ホスト名が異なるだけの類似するＦＱＤＮを一括で誤判定フィードバックすることができる。このように、異常判定システム１Ａでは、ログ情報のフィールドの値を指定した誤判定フィードバックに基づきフィルタ条件を生成するが、フィールド名をキーとするフィルタ条件生成定義情報５１Ａに記載される検知エンジン１１１のみに対してフィルタ条件（フィルタ条件データＤ２）が追加生成される。したがって、異常判定システム１Ａでは、例えば、あるログ情報に対してフィルタするのが望ましくない検知エンジン１１１（例えば、上記のような正規Ｗｅｂサイトの改ざんを検知するような検知エンジン１１１）についてはフィルタ条件が生成されないため、検知精度の低下を抑制できる。 As a result, in the abnormality determination system 1A, even if it is not possible to determine whether or not an erroneous determination has been made for the entire log information, it is possible to designate an item that can be determined to be normal for each field and provide erroneous determination feedback. Thus, in the second embodiment, for example, even an operator OP who has little log monitoring experience can utilize the abnormality determination system 1A. In addition, since the abnormality determination system 1A can receive erroneous determination feedback in a wildcard format, for example, by providing erroneous determination feedback of the domain name of the FQDN, for example, similar FQDNs with different host names can be collectively erroneously identified. Judgment feedback can be provided. As described above, in the abnormality determination system 1A, filter conditions are generated based on erroneous determination feedback specifying field values of log information. A filter condition (filter condition data D2) is additionally generated only for . Therefore, in the anomaly determination system 1A, for example, for a detection engine 111 that is not desirable to filter certain log information (for example, a detection engine 111 that detects falsification of a legitimate website as described above), the filter condition is not generated, it is possible to suppress deterioration in detection accuracy.

（Ｃ）他の実施形態
本発明は、上記の各実施形態に限定されるものではなく、以下に例示するような変形実施形態も挙げることができる。 (C) Other Embodiments The present invention is not limited to the above-described embodiments, and modified embodiments such as those illustrated below can be exemplified.

（Ｃ－１）第１の実施形態の判定結果処理部４０が表示する判定結果表示画面では、ログ情報単位（行単位）で誤判定フィードバックを受け付ける構成となっていたが、他の単位で誤判定フィードバックを受け付けるようにしてもよい。例えば、判定結果表示画面において、図１２に示すように、各ログ情報の検知エンジン１１１単位で誤判定フィードバックを受け付けるようにしてもよい。 (C-1) The determination result display screen displayed by the determination result processing unit 40 of the first embodiment was configured to accept erroneous determination feedback in log information units (row units). You may make it receive a judgment feedback. For example, on the determination result display screen, as shown in FIG. 12, erroneous determination feedback may be received for each log information detection engine 111 unit.

図１２は、第２の実施形態に係る判定結果表示画面の変形実施例について示した図である。 FIG. 12 is a diagram showing a modified example of the determination result display screen according to the second embodiment.

図１２では、各ログ情報（各行）について、検知エンジン１１１ごとの検知結果（検知エンジン１１１ごとの検知結果Ｘ１～ＸＮ）のセルに当該セルの選択を受け付けるためのオブジェクトであるチェックボックスＣＢが配置されている。 In FIG. 12, for each piece of log information (each row), a check box CB, which is an object for accepting selection of the cell, is placed in the cell of the detection result for each detection engine 111 (detection results X1 to XN for each detection engine 111). It is

図１２のような判定結果表示画面の提示を受けたオペレータＯＰに検知エンジン１１１の観点毎に、各ログ情報について誤判定か否かを判断させることができる。そして、図１２の判定結果表示画面では、オペレータＯＰにより誤判定と判断されたログ情報と検知エンジン１１１の組合せに該当するセルのチェックボックスＣＢについて操作（チェックをオンとする操作）を受け付けることができる。例えば、図１２に示す判定結果操作画面では、インデックスが１のログ情報に係る第１の検知エンジン１１１－１の検知結果（Ｘ１）と、インデックスが２のログ情報に係る第２の検知エンジン１１１－２の検知結果（Ｘ２）のチェックボックスＣＢにチェックが入った状態となっている。図１２に示す判定結果操作画面では、１又は複数のチェックボックスＣＢについてチェックがＯＮの状態でボタンＯＢ３０１が操作されると、チェックボックスＣＢがＯＮの状態のセルに対応するログ情報（インデックス）と検知エンジン１１１の組合せを指定した誤判定フィードバックを受け付けることができる。 The operator OP, who has been presented with the judgment result display screen as shown in FIG. Then, on the determination result display screen of FIG. 12, it is possible to accept an operation (operation to turn on the check) for the check box CB of the cell corresponding to the combination of the log information and the detection engine 111 determined to be erroneously determined by the operator OP. can. For example, on the determination result operation screen shown in FIG. The check box CB of the detection result (X2) of -2 is checked. On the determination result operation screen shown in FIG. 12, when the button OB301 is operated with one or more check boxes CB checked, log information (index) corresponding to the cell with the check box CB ON is displayed. False positive feedback specifying a combination of detection engines 111 can be received.

この場合、判定結果処理部４０は、誤判定フィードバックの情報としてチェックボックスＣＢがオンとなっていたログ情報（インデックス）に検知エンジン１１１の識別子（例えば、検知エンジン名）を含む情報を対応付けた情報を取得し、フィルタ条件生成部５０に供給する。 In this case, the determination result processing unit 40 associates information including the identifier of the detection engine 111 (for example, the name of the detection engine) with the log information (index) in which the check box CB is turned on as false determination feedback information. Information is acquired and supplied to the filter condition generation unit 50 .

そして、フィルタ条件生成部５０は、誤判定フィードバックで指定された検知エンジン名のエントリをフィルタ条件生成定義情報５１から取得する。そして、フィルタ条件生成部５０は、取得したエントリと誤判定フィードバックで指定されたエントリに対応するログ情報に基づいて、誤判定フィードバックに係る検知エンジンに対するフィルタ条件を生成し、フィルタ処理部１２に通知する。 Then, the filter condition generation unit 50 acquires the entry of the detection engine name specified in the erroneous determination feedback from the filter condition generation definition information 51 . Then, the filter condition generation unit 50 generates a filter condition for the detection engine related to the erroneous judgment feedback based on the acquired entry and the log information corresponding to the entry specified in the erroneous judgment feedback, and notifies the filter processing unit 12 of it. do.

判定結果処理部４０において、上記のような誤判定フィードバックを適用することにより、オペレータＯＰの誤判定フィードバック事由を反映できるため、より正確にフィルタ条件の生成が必要な検知エンジンを選定することができ、さらに誤判定の抑制と検知精度低下の抑制効果を高めることができる。 By applying the erroneous determination feedback as described above in the determination result processing unit 40, it is possible to reflect the cause of the erroneous determination feedback of the operator OP, so that it is possible to more accurately select the detection engine that needs to generate the filter condition. Furthermore, it is possible to enhance the effect of suppressing erroneous determination and suppressing deterioration of detection accuracy.

（Ｃ－２）上記の各実施形態の異常判定システム１、１Ａにおいて、検知処理部１０（フィルタ処理部１２）が、設定されたフィルタ条件（フィルタ条件データＤ２－１～Ｄ２－Ｎ）をリセットする処理（以下、「リセット処理」と呼ぶ）を実行する手段（以下、「リセット手段」と呼ぶ）を備えるようにしてもよい。 (C-2) In the abnormality determination systems 1 and 1A of the above embodiments, the detection processing unit 10 (filter processing unit 12) resets the set filter conditions (filter condition data D2-1 to D2-N). A means (hereinafter referred to as "reset means") for executing the process (hereinafter referred to as "reset process") may be provided.

例えば、正規サイトの改ざんを検知する検知エンジン１１１があった場合に、この検知エンジン１１１で検知したログの通信先ＦＱＤＮを含むログをフィルタするフィルタ条件をフィルタ処理部１２（フィルタ条件データＤ２）から削除することで、「正規のサイトの改ざん」を契機として発生する他の異常検知事由を検出できる可能性がある。このような条件をトリガとしてフィルタ条件を削除する場合、検知処理部１０において、フィルタ条件削除のトリガとなる検知エンジン名と「フィルタ条件を削除する条件」の情報を予め保持させるようにしてもよい。 For example, if there is a detection engine 111 that detects falsification of a legitimate site, the filter condition for filtering logs containing the communication destination FQDN of the log detected by this detection engine 111 is set from the filter processing unit 12 (filter condition data D2). By deleting it, it is possible to detect other anomaly detection reasons triggered by "authorized site defacement". When the filter condition is deleted using such a condition as a trigger, the detection processing unit 10 may store in advance the name of the detection engine that triggers the deletion of the filter condition and the information of the “condition for deleting the filter condition”. .

「フィルタ条件を削除する条件」とは、例えば、上記の「正規サイトの改ざん」を検知する検知エンジン１１１の例では、当該検知エンジン１１１で検知したログの通信先ＦＱＤＮと一致するフィルタ条件を持つフィルタ条件リストのエントリを削除する、というような条件が該当する。 For example, in the example of the detection engine 111 that detects "falsification of a legitimate site", the "condition for deleting a filter condition" is a filter condition that matches the communication destination FQDN of the log detected by the detection engine 111. Conditions such as deleting an entry in a filter condition list apply.

検知エンジン部１１は、フィルタ条件を削除するトリガとなる検知エンジンが検知した場合、当該検知エンジンのフィルタ条件を削除する条件に従って、他の検知エンジンのフィルタ条件をフィルタ条件リストから削除する。 When a detection engine that triggers deletion of a filter condition is detected, the detection engine unit 11 deletes the filter conditions of other detection engines from the filter condition list according to the conditions for deleting the filter condition of the detection engine.

また、フィルタ処理部１２のフィルタ条件（フィルタ条件データＤ２）をリセットする他の方法として、各フィルタ条件データＤ２に、フィルタ条件の生成時刻情報（フィルタ条件を追加した時刻の情報）を付加し、時間経過でフィルタ条件を削除するようにしてもよい。具体的には、例えば、フィルタ処理部１２では、各フィルタ条件データＤ２でフィルタ条件を追加する際に、フィルタ条件の生成時刻情報（フィルタ条件を追加した時刻の情報）を付加し、フィルタ条件データＤ２のフィルタ条件ごとに生存期間（フィルタ条件を追加した時刻からリセット処理までの期間）を管理し、作成日時から生存期間を経過したフィルタ条件を削除するようにしてもよい。 As another method for resetting the filter conditions (filter condition data D2) of the filter processing unit 12, filter condition generation time information (information on the time when the filter condition was added) is added to each filter condition data D2, Filter conditions may be deleted over time. Specifically, for example, when adding a filter condition to each filter condition data D2, the filter processing unit 12 adds filter condition generation time information (information on the time when the filter condition was added), It is also possible to manage the lifetime (period from the time when the filter condition is added until the reset process) for each filter condition of D2, and delete the filter condition whose lifetime has passed since the creation date and time.

異常判定システム１、１Ａでは、上記のようにリセット手段により定期的にフィルタ条件がリセットされると、過去に誤判定フィードバックされた判定結果が定期的にオペレータＯＰに通知されることになるため、過去の時点では誤判定と判断されたが時間経過で異常に変わるような事象の見逃しを防ぐことができる。 In the abnormality determination systems 1 and 1A, when the filter conditions are periodically reset by the resetting means as described above, the operator OP is periodically notified of the determination result that was fed back as an erroneous determination in the past. It is possible to prevent oversight of an event that was judged to be an erroneous judgment at a past point in time but changes abnormally with the passage of time.

１…異常判定システム、１０…検知処理部、１１…検知エンジン部、１１１、１１１－１～１１１－Ｎ…検知エンジン、１２…フィルタ処理部、２０…特徴量処理部、３０…判定部、３１…判定器、４０…判定結果処理部、５０…フィルタ条件生成部、５１…フィルタ条件生成定義情報、６０…判定器作成部、７０…制御部、Ｄ１、Ｄ１－１～Ｄ１－Ｎ…検知結果、Ｄ２、Ｄ２－１～Ｄ２－Ｎ…フィルタ条件データ、Ｄ３、Ｄ３－１～Ｄ３－Ｎ…フィルタ処理済検知結果、ＮＥ…ネットワーク、ＯＰ…オペレータ、ＴＥ…監視端末 Reference Signs List 1... Abnormality determination system 10... Detection processing unit 11... Detection engine unit 111, 111-1 to 111-N... Detection engine 12... Filter processing unit 20... Feature amount processing unit 30... Judging unit 31 Determination device 40 Determination result processing unit 50 Filter condition generation unit 51 Filter condition generation definition information 60 Determination device creation unit 70 Control unit D1, D1-1 to D1-N Detection result , D2, D2-1 to D2-N... filter condition data, D3, D3-1 to D3-N... filtered detection result, NE... network, OP... operator, TE... monitoring terminal

Claims

In an abnormality determination system that determines an abnormality on the network from analysis target data collected from one or more network devices arranged on the network,
a plurality of detection means for detecting anomalies from the analysis target data from different viewpoints;
Filter condition data describing filter conditions for each of the detection means is held, detection results of each of the detection means are subjected to filter processing based on the filter condition data, and detection is performed for each of the detection means after the filter processing. filtering means for outputting results;
feature amount processing means for acquiring feature amount data suitable for machine learning based on the detection results of the respective detection means output by the filtering means;
determining means for determining an abnormality on the network by supplying the feature data to a determining device having a learning model machine-learned based on teacher data;
feedback receiving means for receiving erroneous determination feedback indicating that the determination result by the determining means is an erroneous determination;
and filter condition generation means for generating the filter condition data to be set in the filter processing means according to the contents of the erroneous judgment feedback received by the feedback reception means.

The feedback receiving means receives the erroneous determination feedback specifying the analysis target data,
The filter condition generation means holds filter condition generation definition information describing a filter condition generation rule for each of the detection means, and performs any of the detections based on the detection result of each of the detection means related to the erroneous judgment feedback. 2. The abnormality determination system according to claim 1, wherein said filter condition generation rule of means is selected, and said filter condition data to be set in said filter processing means is generated according to said selected filter condition generation rule.

3. The abnormality determination system according to claim 2, wherein each of said filter condition generation rules describes at least item names of items of said analysis target data and conditions for values of said items.

The feedback receiving means receives an item name and its value of the item of the analysis target data as the content of the erroneous determination feedback,
The filter condition generation means holds filter condition generation definition information describing a filter condition generation rule for each item, selects the filter condition generation rule corresponding to the item related to the erroneous determination feedback, and selects the filter condition generation rule. 2. The abnormality determination system according to claim 1, wherein said filter condition data to be set in said filter processing means is generated according to said filter condition generation rule.

5. The abnormality determination system according to claim 4, wherein each of said filter condition generation rules describes at least an identifier of said detection means and a condition for a value of said item related to said data to be analyzed. .

A computer that constitutes an abnormality determination system that determines an abnormality on the network from analysis target data collected from one or more network devices arranged on the network,
a plurality of detection means for detecting anomalies from the analysis target data from different viewpoints;
Filter condition data describing filter conditions for each of the detection means is held, detection results of each of the detection means are subjected to filter processing based on the filter condition data, and detection is performed for each of the detection means after the filter processing. filtering means for outputting results;
feature amount processing means for acquiring feature amount data suitable for machine learning based on the detection results of the respective detection means output by the filtering means;
determining means for determining an abnormality on the network by supplying the feature data to a determining device having a learning model machine-learned based on teacher data;
feedback receiving means for receiving erroneous determination feedback indicating that the determination result by the determining means is an erroneous determination;
An abnormality determination program functioning as filter condition generation means for generating the filter condition data to be set in the filter processing means according to the content of the erroneous determination feedback received by the feedback reception means.

In the above determination method performed by an abnormality determination system that determines an abnormality on the network from analysis target data collected from one or more network devices arranged on the network,
The abnormality determination system includes a plurality of detection means, filtering means, feature amount processing means, determination means, feedback reception means, and filter condition generation means,
each of the detection means detects an abnormality from the analysis target data from a different point of view,
The filter processing means holds filter condition data describing filter conditions for each of the detection means, performs filter processing on detection results of each of the detection means based on the filter condition data, and performs filter processing on each of the detection means. Outputting the detection result after the filtering process,
The feature amount processing means acquires feature amount data suitable for machine learning based on the detection results of the respective detection means output by the filtering means,
The determination means supplies the feature amount data to a determination device having a learning model machine-learned based on teacher data to determine an abnormality on the network;
The feedback receiving means receives erroneous determination feedback indicating that the determination result by the determining means is an erroneous determination,
The abnormality determination method, wherein the filter condition generation means generates the filter condition data to be set in the filter processing means according to the content of the erroneous determination feedback received by the feedback reception means.