US20250165799A1 - Training data generation apparatus, training data generation method and program - Google Patents
Training data generation apparatus, training data generation method and program Download PDFInfo
- Publication number
- US20250165799A1 US20250165799A1 US18/840,569 US202218840569A US2025165799A1 US 20250165799 A1 US20250165799 A1 US 20250165799A1 US 202218840569 A US202218840569 A US 202218840569A US 2025165799 A1 US2025165799 A1 US 2025165799A1
- Authority
- US
- United States
- Prior art keywords
- learning data
- generator
- vector
- learning
- data generation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/094—Adversarial learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0475—Generative networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
Definitions
- the present disclosure relates to a learning data generation apparatus, a learning data generation method, and a program.
- NPL 1 and NPL 2 propose schemes for estimating an abnormal portion.
- NPL 3 proposes a scheme for modeling a relationship between an abnormal portion and a change in data in an ICT system caused in the abnormal portion as a causal model by using a Bayesian network, and estimating the abnormal portion from data observed during abnormality.
- NPL 4 proposes an abnormality factor identifying scheme by generating fault data by chaos engineering.
- the first method is a method of defining and modeling an abnormal portion and a rule of a change in data in an ICT system caused by the abnormal portion based on knowledge or the like of an expert operator (for example, NPL 3).
- the second method is a method of constructing a causal model from an abnormal portion during past abnormality and data at that time.
- a causal model is constructed by one of the two methods and an abnormal portion is estimated.
- the collected data can be used for modeling a Bayesian network or can be used for learning data such as a support-vector machine (SVM), and an abnormal portion and a factor can be estimated.
- SVM support-vector machine
- the two construction methods of the causal models in the studies of the related art have problems.
- the second method has a problem that it is difficult to sufficiently collect data during an abnormality necessary to construct the causal model. This is because an ICT system generally rarely generates an abnormality, and even if an abnormality occurs, a recurrence prevention measure is taken so that the same abnormality does not occur again.
- a causal model is constructed based only on past abnormalities, so that the causal model cannot cope with unknown abnormalities, and an abnormal portion cannot be estimated.
- Chaotic engineering is likely to partially solve the problem that it is difficult to sufficiently collect data during abnormality necessary to construct a causal model, but cannot be said to suffice. This is because a wide variety of abnormalities occur in an ICT system, but chaos engineering is a method of intentionally inserting a fault, and thus only data related to abnormalities able to be conceived by humans can be obtained.
- the present disclosure has been devised in view of the foregoing circumstances and provides a technique for generating data used to construct a model for estimating an abnormal portion.
- x i is a k-dimensional vector representing past abnormal data.
- k is the number of types of data, such as a traffic amount collected from the ICT system and a central processing unit (CPU) usage rate. That is, each x i represents any of various states such as a traffic amount and a CPU usage rate when the ICT system is abnormal.
- N is the number of pieces of abnormal data.
- Each x i may have a data value at a certain time as an element, or may have a statistical value such as an average of data values in a certain time duration as an element.
- y 1 is an l-dimensional (where, l is a lower case letter of L) vector.
- l denotes the number of apparatuses in the ICT system.
- each element of y i corresponds to each apparatus in the ICT system.
- the present invention is not limited thereto.
- each element of y i corresponds to an I/F of an apparatus or a device built into the apparatus.
- each element corresponds to a device built into the apparatus, it is possible to estimate an abnormal portion in units of devices.
- y i is a one-hot vector in which only a j ⁇ 1, . . . , j ⁇ -th element corresponding to the abnormal portion is 1, and the other elements are 0.
- the discriminator D ( ⁇ ; ⁇ D ) accepts the k-dimensional vector as an input and outputs a scalar value of 0 or 1.
- One of the abnormal data x i actually collected from the ICT system or the data ⁇ circumflex over ( ) ⁇ x i generated by the generator G is input to the discriminator D ( ⁇ ; ⁇ D ), and it is determined whether x i or ⁇ circumflex over ( ) ⁇ x i is input.
- the discriminator D ( ⁇ ; ⁇ D ) outputs 1 when it is determined that x i is input, and outputs 1 when it is determined that ⁇ circumflex over ( ) ⁇ x i is input.
- the parameter ⁇ D is learned so that discrimination performance is enhanced.
- a loss function L of the CGAN including the generator G ( ⁇ ; ⁇ G ) and the discriminator D ( ⁇ ; ⁇ D ) is shown in the following Formula (1).
- E( ⁇ ) is an expected value and z is an m-dimensional vector generated at random. z is also called noise.
- x ⁇ X and y ⁇ Y are abnormal portion data when abnormality occurs with regard to the abnormal data x ⁇ X.
- cot (z, y) is an operation of combining z and y to generate an (m+l)-dimensional vector.
- learning data is generated by the generator G ( ⁇ ; ⁇ G ) having the learned parameter ⁇ G .
- the (m+l)-dimensional vector obtained by combining an m-dimensional vector z generated at random and an 1-dimensional vector y generated at random is input to the learned generator G ( ⁇ ; ⁇ G ), and the k-dimensional vector ⁇ circumflex over ( ) ⁇ x is obtained as an output.
- learning data ( ⁇ circumflex over ( ) ⁇ x, y) for constructing a model for estimating an abnormal portion of the ICT system for example, the causal model modeled by a Bayesian network or the like, a machine learning model such as SVM
- the l-dimensional vector y is, for example, a one-hot vector in which only the j-th vector is set to 1 at random due to a uniform distribution or the like.
- FIG. 2 illustrates a hardware configuration example of the learning data generation apparatus 10 according to the embodiment.
- the learning data generation apparatus 10 includes an input device 101 , a display device 102 , an external I/F 103 , a communication I/F 104 , a random access memory (RAM) 105 , a read only memory (ROM) 106 , an auxiliary storage device 107 , and a processor 108 .
- the hardware is communicatively connected via a bus 109 .
- the input device 101 is, for example, a keyboard, a mouse, a touch panel, various physical buttons, or the like.
- the display device 102 is, for example, a display or a display panel.
- the learning data generation apparatus 10 may not include at least one of the input device 101 and the display device 102 .
- the external I/F 103 is an interface with an external device such as a recording medium 103 a .
- the learning data generation apparatus 10 can perform reading and writing from and on the recording medium 103 a via the external I/F 103 .
- Examples of the recording medium 103 a include a flexible disk, a compact disc (CD), a digital versatile disk (DVD), a secure digital (SD) memory card, and a Universal Serial Bus (USB) memory card.
- the communication I/F 104 is an interface for connecting the learning data generation apparatus 10 to a communication network.
- the RAM 105 is a volatile semiconductor memory (storage device) that temporarily stores programs and data.
- the ROM 106 is a nonvolatile semiconductor memory (storage device) that can hold programs and data even when a power source is turned off.
- the auxiliary storage device 107 is a storage device such as a hard disk drive (HDD) or a solid state drive (SSD). Examples of the processor 108 include various arithmetic devices such as a CPU and a graphics processing unit (GPU).
- the learning data generation apparatus 10 can implement various types of processing which will be described below with the hardware configuration illustrated in FIG. 2 .
- the hardware configuration illustrated in FIG. 2 is merely exemplary, and the hardware configuration of the learning data generation apparatus 10 is not limited thereto.
- the learning data generation apparatus 10 may include a plurality of auxiliary storage devices 107 and a plurality of processors 108 , or may have various pieces of hardware other than the illustrated hardware.
- FIG. 3 illustrates a functional configuration example of the learning data generation apparatus 10 according to the embodiment.
- the learning data generation apparatus 10 includes an observation data collection unit 201 , a generation unit 202 , a discrimination unit 203 , a learning unit 204 , and an output unit 205 .
- Each of these units is realized through, for example, processing executed by the processor 108 or the like according to one or more programs installed in the learning data generation apparatus 10 .
- the learning data generation apparatus 10 according to the embodiment includes an observation data DB 206 .
- the observation data DB 206 is realized by, for example, the auxiliary storage device 107 .
- the observation data DB 206 may also be realized by, for example, a storage device connected to the learning data generation apparatus 10 via a communication network or the like.
- the observation data collection unit 201 collects abnormal data x of the ICT system and abnormal portion data y when abnormality occurs.
- the abnormal data x and the abnormal portion data y are stored in the observation data DB 206 . Accordingly, the data set X formed by the abnormal data x and the data set Y formed by the abnormal portion data y are stored in the observation data DB 206 .
- the generation unit 202 is realized by the generator G ( ⁇ ; ⁇ G ), and accepts an (m+l)-dimensional vector as an input and outputs a k-dimensional vector.
- the discrimination unit 203 is realized by the discriminator D ( ⁇ ; ⁇ D ), and accepts the k-dimensional vector as an input and outputs a scalar value of 0 or 1.
- the learning data generation apparatus 10 can learn the CGAN using the observation data (x, y) during abnormality of the ICT system and can generate the learning data ( ⁇ circumflex over ( ) ⁇ x, y) for constructing a model for estimating an abnormal portion of the ICT system by the generator G included in the CGAN. Accordingly, a sufficient amount of learning data necessary to construct the model can be obtained.
- the generator G accepts a vector in which a vector z generated at random and a one-hot vector y generated at random are combined as an input, and generates abnormal data ⁇ circumflex over ( ) ⁇ x. Therefore, for example, abnormal data which is difficult to obtain in the chaos engineering can also be generated. Accordingly, by using the learning data generated by the learning data generation apparatus 10 according to the embodiment, it is possible to construct a model capable of estimating an abnormal portion with high accuracy.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Algebra (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- Computational Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Testing And Monitoring For Control Systems (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2022/015591 WO2023188017A1 (ja) | 2022-03-29 | 2022-03-29 | 学習用データ生成装置、学習用データ生成方法及びプログラム |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20250165799A1 true US20250165799A1 (en) | 2025-05-22 |
Family
ID=88200134
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US18/840,569 Pending US20250165799A1 (en) | 2022-03-29 | 2022-03-29 | Training data generation apparatus, training data generation method and program |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20250165799A1 (https=) |
| JP (1) | JPWO2023188017A1 (https=) |
| WO (1) | WO2023188017A1 (https=) |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP7338698B2 (ja) * | 2019-11-11 | 2023-09-05 | 日本電信電話株式会社 | 学習装置、検知装置、学習方法、及び異常検知方法 |
| JPWO2021161405A1 (https=) * | 2020-02-12 | 2021-08-19 | ||
| US11550682B2 (en) * | 2020-10-20 | 2023-01-10 | International Business Machines Corporation | Synthetic system fault generation |
-
2022
- 2022-03-29 JP JP2024510813A patent/JPWO2023188017A1/ja active Pending
- 2022-03-29 WO PCT/JP2022/015591 patent/WO2023188017A1/ja not_active Ceased
- 2022-03-29 US US18/840,569 patent/US20250165799A1/en active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| WO2023188017A1 (ja) | 2023-10-05 |
| JPWO2023188017A1 (https=) | 2023-10-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11615343B2 (en) | Anomaly detection apparatus, anomaly detection method, and program | |
| US11675641B2 (en) | Failure prediction | |
| US8275735B2 (en) | Diagnostic system | |
| KR20200123454A (ko) | 무인 항공기에 대한 사이버-공격 탐지, 위치 파악, 및 무효화 | |
| Tornil-Sin et al. | Robust fault diagnosis of nonlinear systems using interval constraint satisfaction and analytical redundancy relations | |
| CN101473283A (zh) | 对操作程序的多变量监视 | |
| JP7632613B2 (ja) | 異常原因推定装置、異常原因推定方法、及びプログラム | |
| US11665165B2 (en) | Whitelist generator, whitelist evaluator, whitelist generator/evaluator, whitelist generation method, whitelist evaluation method, and whitelist generation/evaluation method | |
| CN116992376A (zh) | 用于检测电网中的异常的装置、系统和方法 | |
| Takai et al. | A generalized framework for inference-based diagnosis of discrete event systems capturing both disjunctive and conjunctive decision-making | |
| US10936967B2 (en) | Information processing system, information processing method, and recording medium for learning a classification model | |
| JP6835702B2 (ja) | 異常推定装置、異常推定方法及びプログラム | |
| JP7016450B2 (ja) | 異常兆候検知装置、異常兆候検知方法、及び、異常兆候検知プログラム | |
| US20230038463A1 (en) | Detection device, detection method, and detection program | |
| US20050065744A1 (en) | System and method for detecting and excluding outlier sensors in sensor-based monitoring | |
| JP4862446B2 (ja) | 障害原因推定システム、方法、及び、プログラム | |
| US20250165799A1 (en) | Training data generation apparatus, training data generation method and program | |
| JPWO2018142694A1 (ja) | 特徴量生成装置、特徴量生成方法及びプログラム | |
| KR102132077B1 (ko) | 설비 데이터의 이상 정도 평가 방법 | |
| US11595244B2 (en) | Recovery support apparatus, recovery support method and program | |
| JP6787873B2 (ja) | 異常種別判定装置、異常種別判定方法及びプログラム | |
| Bouali et al. | Formal verification for model-based development | |
| Hulse et al. | Synthetic fault mode generation for resilience analysis and failure mechanism discovery | |
| CN117272207B (zh) | 数据中心异常分析方法及系统 | |
| TWI824681B (zh) | 裝置管理系統、裝置的障礙原因推測方法以及非暫時性地記憶程式的記憶媒體 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: NIPPON TELEGRAPH AND TELEPHONE CORPORATION, JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MATSUO, YOICHI;WATANABE, KEISHIRO;REEL/FRAME:068365/0801 Effective date: 20220420 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |