US20250165799A1 - Training data generation apparatus, training data generation method and program - Google Patents

Training data generation apparatus, training data generation method and program Download PDF

Info

Publication number
US20250165799A1
US20250165799A1 US18/840,569 US202218840569A US2025165799A1 US 20250165799 A1 US20250165799 A1 US 20250165799A1 US 202218840569 A US202218840569 A US 202218840569A US 2025165799 A1 US2025165799 A1 US 2025165799A1
Authority
US
United States
Prior art keywords
learning data
generator
vector
learning
data generation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
US18/840,569
Other languages
English (en)
Inventor
Yoichi Matsuo
Keishiro Watanabe
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NTT Inc
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Assigned to NIPPON TELEGRAPH AND TELEPHONE CORPORATION reassignment NIPPON TELEGRAPH AND TELEPHONE CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MATSUO, YOICHI, WATANABE, KEISHIRO
Publication of US20250165799A1 publication Critical patent/US20250165799A1/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/094Adversarial learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0475Generative networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks

Definitions

  • the present disclosure relates to a learning data generation apparatus, a learning data generation method, and a program.
  • NPL 1 and NPL 2 propose schemes for estimating an abnormal portion.
  • NPL 3 proposes a scheme for modeling a relationship between an abnormal portion and a change in data in an ICT system caused in the abnormal portion as a causal model by using a Bayesian network, and estimating the abnormal portion from data observed during abnormality.
  • NPL 4 proposes an abnormality factor identifying scheme by generating fault data by chaos engineering.
  • the first method is a method of defining and modeling an abnormal portion and a rule of a change in data in an ICT system caused by the abnormal portion based on knowledge or the like of an expert operator (for example, NPL 3).
  • the second method is a method of constructing a causal model from an abnormal portion during past abnormality and data at that time.
  • a causal model is constructed by one of the two methods and an abnormal portion is estimated.
  • the collected data can be used for modeling a Bayesian network or can be used for learning data such as a support-vector machine (SVM), and an abnormal portion and a factor can be estimated.
  • SVM support-vector machine
  • the two construction methods of the causal models in the studies of the related art have problems.
  • the second method has a problem that it is difficult to sufficiently collect data during an abnormality necessary to construct the causal model. This is because an ICT system generally rarely generates an abnormality, and even if an abnormality occurs, a recurrence prevention measure is taken so that the same abnormality does not occur again.
  • a causal model is constructed based only on past abnormalities, so that the causal model cannot cope with unknown abnormalities, and an abnormal portion cannot be estimated.
  • Chaotic engineering is likely to partially solve the problem that it is difficult to sufficiently collect data during abnormality necessary to construct a causal model, but cannot be said to suffice. This is because a wide variety of abnormalities occur in an ICT system, but chaos engineering is a method of intentionally inserting a fault, and thus only data related to abnormalities able to be conceived by humans can be obtained.
  • the present disclosure has been devised in view of the foregoing circumstances and provides a technique for generating data used to construct a model for estimating an abnormal portion.
  • x i is a k-dimensional vector representing past abnormal data.
  • k is the number of types of data, such as a traffic amount collected from the ICT system and a central processing unit (CPU) usage rate. That is, each x i represents any of various states such as a traffic amount and a CPU usage rate when the ICT system is abnormal.
  • N is the number of pieces of abnormal data.
  • Each x i may have a data value at a certain time as an element, or may have a statistical value such as an average of data values in a certain time duration as an element.
  • y 1 is an l-dimensional (where, l is a lower case letter of L) vector.
  • l denotes the number of apparatuses in the ICT system.
  • each element of y i corresponds to each apparatus in the ICT system.
  • the present invention is not limited thereto.
  • each element of y i corresponds to an I/F of an apparatus or a device built into the apparatus.
  • each element corresponds to a device built into the apparatus, it is possible to estimate an abnormal portion in units of devices.
  • y i is a one-hot vector in which only a j ⁇ 1, . . . , j ⁇ -th element corresponding to the abnormal portion is 1, and the other elements are 0.
  • the discriminator D ( ⁇ ; ⁇ D ) accepts the k-dimensional vector as an input and outputs a scalar value of 0 or 1.
  • One of the abnormal data x i actually collected from the ICT system or the data ⁇ circumflex over ( ) ⁇ x i generated by the generator G is input to the discriminator D ( ⁇ ; ⁇ D ), and it is determined whether x i or ⁇ circumflex over ( ) ⁇ x i is input.
  • the discriminator D ( ⁇ ; ⁇ D ) outputs 1 when it is determined that x i is input, and outputs 1 when it is determined that ⁇ circumflex over ( ) ⁇ x i is input.
  • the parameter ⁇ D is learned so that discrimination performance is enhanced.
  • a loss function L of the CGAN including the generator G ( ⁇ ; ⁇ G ) and the discriminator D ( ⁇ ; ⁇ D ) is shown in the following Formula (1).
  • E( ⁇ ) is an expected value and z is an m-dimensional vector generated at random. z is also called noise.
  • x ⁇ X and y ⁇ Y are abnormal portion data when abnormality occurs with regard to the abnormal data x ⁇ X.
  • cot (z, y) is an operation of combining z and y to generate an (m+l)-dimensional vector.
  • learning data is generated by the generator G ( ⁇ ; ⁇ G ) having the learned parameter ⁇ G .
  • the (m+l)-dimensional vector obtained by combining an m-dimensional vector z generated at random and an 1-dimensional vector y generated at random is input to the learned generator G ( ⁇ ; ⁇ G ), and the k-dimensional vector ⁇ circumflex over ( ) ⁇ x is obtained as an output.
  • learning data ( ⁇ circumflex over ( ) ⁇ x, y) for constructing a model for estimating an abnormal portion of the ICT system for example, the causal model modeled by a Bayesian network or the like, a machine learning model such as SVM
  • the l-dimensional vector y is, for example, a one-hot vector in which only the j-th vector is set to 1 at random due to a uniform distribution or the like.
  • FIG. 2 illustrates a hardware configuration example of the learning data generation apparatus 10 according to the embodiment.
  • the learning data generation apparatus 10 includes an input device 101 , a display device 102 , an external I/F 103 , a communication I/F 104 , a random access memory (RAM) 105 , a read only memory (ROM) 106 , an auxiliary storage device 107 , and a processor 108 .
  • the hardware is communicatively connected via a bus 109 .
  • the input device 101 is, for example, a keyboard, a mouse, a touch panel, various physical buttons, or the like.
  • the display device 102 is, for example, a display or a display panel.
  • the learning data generation apparatus 10 may not include at least one of the input device 101 and the display device 102 .
  • the external I/F 103 is an interface with an external device such as a recording medium 103 a .
  • the learning data generation apparatus 10 can perform reading and writing from and on the recording medium 103 a via the external I/F 103 .
  • Examples of the recording medium 103 a include a flexible disk, a compact disc (CD), a digital versatile disk (DVD), a secure digital (SD) memory card, and a Universal Serial Bus (USB) memory card.
  • the communication I/F 104 is an interface for connecting the learning data generation apparatus 10 to a communication network.
  • the RAM 105 is a volatile semiconductor memory (storage device) that temporarily stores programs and data.
  • the ROM 106 is a nonvolatile semiconductor memory (storage device) that can hold programs and data even when a power source is turned off.
  • the auxiliary storage device 107 is a storage device such as a hard disk drive (HDD) or a solid state drive (SSD). Examples of the processor 108 include various arithmetic devices such as a CPU and a graphics processing unit (GPU).
  • the learning data generation apparatus 10 can implement various types of processing which will be described below with the hardware configuration illustrated in FIG. 2 .
  • the hardware configuration illustrated in FIG. 2 is merely exemplary, and the hardware configuration of the learning data generation apparatus 10 is not limited thereto.
  • the learning data generation apparatus 10 may include a plurality of auxiliary storage devices 107 and a plurality of processors 108 , or may have various pieces of hardware other than the illustrated hardware.
  • FIG. 3 illustrates a functional configuration example of the learning data generation apparatus 10 according to the embodiment.
  • the learning data generation apparatus 10 includes an observation data collection unit 201 , a generation unit 202 , a discrimination unit 203 , a learning unit 204 , and an output unit 205 .
  • Each of these units is realized through, for example, processing executed by the processor 108 or the like according to one or more programs installed in the learning data generation apparatus 10 .
  • the learning data generation apparatus 10 according to the embodiment includes an observation data DB 206 .
  • the observation data DB 206 is realized by, for example, the auxiliary storage device 107 .
  • the observation data DB 206 may also be realized by, for example, a storage device connected to the learning data generation apparatus 10 via a communication network or the like.
  • the observation data collection unit 201 collects abnormal data x of the ICT system and abnormal portion data y when abnormality occurs.
  • the abnormal data x and the abnormal portion data y are stored in the observation data DB 206 . Accordingly, the data set X formed by the abnormal data x and the data set Y formed by the abnormal portion data y are stored in the observation data DB 206 .
  • the generation unit 202 is realized by the generator G ( ⁇ ; ⁇ G ), and accepts an (m+l)-dimensional vector as an input and outputs a k-dimensional vector.
  • the discrimination unit 203 is realized by the discriminator D ( ⁇ ; ⁇ D ), and accepts the k-dimensional vector as an input and outputs a scalar value of 0 or 1.
  • the learning data generation apparatus 10 can learn the CGAN using the observation data (x, y) during abnormality of the ICT system and can generate the learning data ( ⁇ circumflex over ( ) ⁇ x, y) for constructing a model for estimating an abnormal portion of the ICT system by the generator G included in the CGAN. Accordingly, a sufficient amount of learning data necessary to construct the model can be obtained.
  • the generator G accepts a vector in which a vector z generated at random and a one-hot vector y generated at random are combined as an input, and generates abnormal data ⁇ circumflex over ( ) ⁇ x. Therefore, for example, abnormal data which is difficult to obtain in the chaos engineering can also be generated. Accordingly, by using the learning data generated by the learning data generation apparatus 10 according to the embodiment, it is possible to construct a model capable of estimating an abnormal portion with high accuracy.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Algebra (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Testing And Monitoring For Control Systems (AREA)
US18/840,569 2022-03-29 2022-03-29 Training data generation apparatus, training data generation method and program Pending US20250165799A1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2022/015591 WO2023188017A1 (ja) 2022-03-29 2022-03-29 学習用データ生成装置、学習用データ生成方法及びプログラム

Publications (1)

Publication Number Publication Date
US20250165799A1 true US20250165799A1 (en) 2025-05-22

Family

ID=88200134

Family Applications (1)

Application Number Title Priority Date Filing Date
US18/840,569 Pending US20250165799A1 (en) 2022-03-29 2022-03-29 Training data generation apparatus, training data generation method and program

Country Status (3)

Country Link
US (1) US20250165799A1 (https=)
JP (1) JPWO2023188017A1 (https=)
WO (1) WO2023188017A1 (https=)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7338698B2 (ja) * 2019-11-11 2023-09-05 日本電信電話株式会社 学習装置、検知装置、学習方法、及び異常検知方法
JPWO2021161405A1 (https=) * 2020-02-12 2021-08-19
US11550682B2 (en) * 2020-10-20 2023-01-10 International Business Machines Corporation Synthetic system fault generation

Also Published As

Publication number Publication date
WO2023188017A1 (ja) 2023-10-05
JPWO2023188017A1 (https=) 2023-10-05

Similar Documents

Publication Publication Date Title
US11615343B2 (en) Anomaly detection apparatus, anomaly detection method, and program
US11675641B2 (en) Failure prediction
US8275735B2 (en) Diagnostic system
KR20200123454A (ko) 무인 항공기에 대한 사이버-공격 탐지, 위치 파악, 및 무효화
Tornil-Sin et al. Robust fault diagnosis of nonlinear systems using interval constraint satisfaction and analytical redundancy relations
CN101473283A (zh) 对操作程序的多变量监视
JP7632613B2 (ja) 異常原因推定装置、異常原因推定方法、及びプログラム
US11665165B2 (en) Whitelist generator, whitelist evaluator, whitelist generator/evaluator, whitelist generation method, whitelist evaluation method, and whitelist generation/evaluation method
CN116992376A (zh) 用于检测电网中的异常的装置、系统和方法
Takai et al. A generalized framework for inference-based diagnosis of discrete event systems capturing both disjunctive and conjunctive decision-making
US10936967B2 (en) Information processing system, information processing method, and recording medium for learning a classification model
JP6835702B2 (ja) 異常推定装置、異常推定方法及びプログラム
JP7016450B2 (ja) 異常兆候検知装置、異常兆候検知方法、及び、異常兆候検知プログラム
US20230038463A1 (en) Detection device, detection method, and detection program
US20050065744A1 (en) System and method for detecting and excluding outlier sensors in sensor-based monitoring
JP4862446B2 (ja) 障害原因推定システム、方法、及び、プログラム
US20250165799A1 (en) Training data generation apparatus, training data generation method and program
JPWO2018142694A1 (ja) 特徴量生成装置、特徴量生成方法及びプログラム
KR102132077B1 (ko) 설비 데이터의 이상 정도 평가 방법
US11595244B2 (en) Recovery support apparatus, recovery support method and program
JP6787873B2 (ja) 異常種別判定装置、異常種別判定方法及びプログラム
Bouali et al. Formal verification for model-based development
Hulse et al. Synthetic fault mode generation for resilience analysis and failure mechanism discovery
CN117272207B (zh) 数据中心异常分析方法及系统
TWI824681B (zh) 裝置管理系統、裝置的障礙原因推測方法以及非暫時性地記憶程式的記憶媒體

Legal Events

Date Code Title Description
AS Assignment

Owner name: NIPPON TELEGRAPH AND TELEPHONE CORPORATION, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MATSUO, YOICHI;WATANABE, KEISHIRO;REEL/FRAME:068365/0801

Effective date: 20220420

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION