JP2021105807A

JP2021105807A - Computer system and method for analyzing operating environment of business system

Info

Publication number: JP2021105807A
Application number: JP2019236299A
Authority: JP
Inventors: ウシンリョウ; Yuxin Liang; 恵木　正史; Masashi Egi; 正史恵木; 伊藤　淳史; Junji Ito; 淳史伊藤
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2019-12-26
Filing date: 2019-12-26
Publication date: 2021-07-26

Abstract

To present information superior in readability for grasping an operating environment of an AI system (business system).SOLUTION: A computer system for analyzing an operating environment of a business system having an inference unit 111 that performs inference includes: an interpretation index calculation unit 510 that calculates an interpretation index for interpreting an inference result 501 obtained by inputting data 500 containing a plurality of feature amounts into the inference unit; and an analysis unit 113 that analyzes the operating environment of the current business system based on the interpretation index and outputs an analysis result. The interpretation index is a grounds vector 502 whose component is a degree of influence of each of the plurality of feature amounts contained in the data input into the inference unit on the inference result.SELECTED DRAWING: Figure 5

Description

本発明は、ＡＩを活用したシステムの運用環境を解析するシステム及び方法に関する。 The present invention relates to a system and a method for analyzing the operating environment of a system utilizing AI.

近年、医療及び金融等の様々な分野でＡＩを活用したシステムが提供されている。例えば、医療分野では、ＡＩを活用して、病気の発症率の予測及び症状の特定等が行われる。また、金融分野では、ＡＩを活用して、与信審査等が行われる。本明細書では、ＡＩを活用したシステムをＡＩシステム（業務システム）と記載する。 In recent years, systems utilizing AI have been provided in various fields such as medical care and finance. For example, in the medical field, AI is used to predict the incidence of diseases and identify symptoms. In the financial field, AI is used to conduct credit screening. In this specification, a system utilizing AI is referred to as an AI system (business system).

ＡＩのモデルは学習データを用いた機械学習により生成される。ＡＩシステムを長期に運用した場合、ターゲットの変化及び世界情勢の変化等、ＡＩシステムの運用環境が変化する。そのため、想定したＡＩシステムの運用環境と、実際のＡＩシステムの運用環境との間のズレが生じて、ＡＩの予測精度が低下する問題がある。 The AI model is generated by machine learning using training data. When the AI system is operated for a long period of time, the operating environment of the AI system changes due to changes in the target and changes in the world situation. Therefore, there is a problem that the expected accuracy of AI prediction is lowered due to a deviation between the assumed operating environment of the AI system and the actual operating environment of the AI system.

したがって、ＡＩシステムを長期に運用するためには、運用環境の変化を検知し、変化の要因を解析し、解析結果を用いた再学習を行う必要がある。特に、運用環境の変化を検知するための仕組みが重要である。 Therefore, in order to operate the AI system for a long period of time, it is necessary to detect changes in the operating environment, analyze the factors of the changes, and perform re-learning using the analysis results. In particular, a mechanism for detecting changes in the operating environment is important.

これに対してＡＩの継続的な学習を実現するＬＭＬ（ＬｉｆｅｌｏｎｇＭａｃｈｉｎｅＬｅａｒｎｉｎｇ）が注目されている。ＬＭＬは、前例のない観測データの出現を環境変化として検知する機能を有する。ＬＭＬを利用した技術としては、例えば、特許文献１のような技術が知られている。 On the other hand, LML (Lifelong Machine Learning), which realizes continuous learning of AI, is attracting attention. LML has a function of detecting the appearance of unprecedented observation data as an environmental change. As a technique using LML, for example, a technique such as Patent Document 1 is known.

特許文献１には、「分離した識別サブシステム及びアプリケーションサブシステムを備えた階層型機械学習システムであって、第１層の第１ノードは第１入力を処理し、第１入力の少なくとも一部を処理して第１特徴ベクトルを生成し、第２層の第２ノードは、第１特徴ベクトルの少なくとも一部を含む第２入力を処理して、第２特徴ベクトルを生成する。第１のノードは、第１の特徴ベクトルから第１のスパース特徴ベクトルを生成し、第２のノードは、第２の特徴ベクトルから第２のスパース特徴ベクトルを生成する。識別サブシステムの第３のノードは、第１のスパース特徴ベクトルまたは第２のスパース特徴ベクトルの少なくとも一方を処理して、出力を決定する。」ことが記載されている。 Patent Document 1 states that "a hierarchical machine learning system including a separate identification subsystem and an application subsystem, in which the first node of the first layer processes the first input and at least a part of the first input. To generate the first feature vector, and the second node of the second layer processes the second input including at least a part of the first feature vector to generate the second feature vector. The node generates the first sparse feature vector from the first feature vector, the second node generates the second sparse feature vector from the second feature vector, and the third node of the identification subsystem , At least one of the first sparse feature vector and the second sparse feature vector is processed to determine the output. "

米国特許出願公開第２０１９／０２７９１０５号明細書U.S. Patent Application Publication No. 2019/0279105

Marco Tulio Ribeiro 他、「“Why Should I Trust You?”: Explaining the Predictions of Any Classifier」、KDD '16 Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining、２０１６年８月、Pages 1135-1144Marco Tulio Ribeiro et al., "Why Should I Trust You?": Explaining the Predictions of Any Classifier, KDD '16 Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2016, Pages 1135-1144 Scott M Lundberg 他、「A Unified Approach to Interpreting Model Predictions」、Advances in Neural Information Processing Systems 30、２０１７年１２月、Pages 4765-4774Scott M Lundberg et al., "A Unified Approach to Interpreting Model Predictions", Advances in Neural Information Processing Systems 30, December 2017, Pages 4765-4774 Ramprasaath R.Selvaraju 他、「Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization」、［令和１年１２月１９日検索］、インターネット＜ＵＲＬ：https://arxiv.org/abs/1610.02391v1＞Ramprasaath R.Selvaraju et al., "Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization", [Searched on December 19, 1991], Internet <URL: https: // arxiv .org / abs / 1610.02391v1 ＞

従来技術では、ユーザが、特徴ベクトル等のモデルの内部情報を解釈し、運用環境の変化を検知する必要がある。モデルの内部情報は、入力したデータとの関連性を直接的に表す情報ではないため解釈方法を定義することが難しい。また、再学習を行った場合、モデルが変化するためこれまで使用してきた解釈方法をそのまま流用できない。 In the prior art, the user needs to interpret the internal information of the model such as the feature vector and detect the change in the operating environment. Since the internal information of the model is not information that directly expresses the relationship with the input data, it is difficult to define the interpretation method. In addition, when re-learning is performed, the model changes, so the interpretation method used so far cannot be used as it is.

すなわち、モデルの内部情報は可読性が低いため、当該情報を利用するための労力及び時間を要するという問題がある。また、従来技術では、モデルの内部情報を取得できるような構造のモデルである必要があり、適用可能なモデルが限定される。 That is, since the internal information of the model is not readable, there is a problem that it takes labor and time to use the information. Further, in the prior art, the model needs to have a structure capable of acquiring the internal information of the model, and the applicable model is limited.

本発明は、モデルの構造に限定されず、かつ、可読性に優れたＡＩシステムの運用環境を把握するための情報を出力する方法及びシステムを提供する。 The present invention provides a method and a system for outputting information for grasping the operating environment of an AI system having excellent readability without being limited to the structure of the model.

本願において開示される発明の代表的な一例を示せば以下の通りである。すなわち、推論を行う推論部を有する業務システムの運用環境を解析する計算機システムであって、プロセッサ、前記プロセッサに接続されるメモリ、及び前記プロセッサに接続されるネットワークインタフェースを有する少なくとも一つの計算機を含み、複数の特徴量を含むデータを前記推論部に入力することによって得られた推論結果を解釈するための解釈指標を算出する解釈指標算出部と、前記解釈指標に基づいて、現在の前記業務システムの運用環境を解析し、前記解析の結果を出力する解析部と、を備え、前記解釈指標は、前記推論部に入力された前記データに含まれる前記複数の特徴量の各々の前記推論結果に対する影響度を成分とする根拠ベクトルである。 A typical example of the invention disclosed in the present application is as follows. That is, it is a computer system that analyzes the operating environment of a business system having an inference unit that performs inference, and includes at least one computer having a processor, a memory connected to the processor, and a network interface connected to the processor. , An interpretation index calculation unit that calculates an interpretation index for interpreting the inference result obtained by inputting data including a plurality of feature quantities into the reasoning unit, and the current business system based on the interpretation index. It is provided with an analysis unit that analyzes the operating environment of the above and outputs the result of the analysis, and the interpretation index is for the inference result of each of the plurality of feature quantities included in the data input to the inference unit. It is a basis vector whose component is the degree of influence.

本発明によれば、モデルの構造に限定されず、かつ、可読性に優れたＡＩシステムの運用環境を把握するための情報を出力することができる。上記した以外の課題、構成及び効果は、以下の実施例の説明により明らかにされる。 According to the present invention, it is possible to output information for grasping the operating environment of an AI system having excellent readability without being limited to the structure of the model. Issues, configurations and effects other than those mentioned above will be clarified by the description of the following examples.

実施例１の計算機システムの構成例を示す図である。It is a figure which shows the configuration example of the computer system of Example 1. FIG. 実施例１の計算機のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware configuration of the computer of Example 1. FIG. 実施例１の事例データ管理情報のデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of the case data management information of Example 1. FIG. 実施例１の根拠ベクトル管理情報のデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of the basis vector management information of Example 1. FIG. 実施例１の計算機システムの処理の流れを示す図である。It is a figure which shows the process flow of the computer system of Example 1. FIG. 実施例１の学習部が実行するモデル生成処理の一例を説明するフローチャートである。It is a flowchart explaining an example of the model generation processing executed by the learning part of Example 1. FIG. 実施例１の根拠ベクトル生成部が実行する根拠ベクトル生成処理の一例を説明するフローチャートである。It is a flowchart explaining an example of the basis vector generation processing executed by the basis vector generation part of Example 1. FIG. 実施例１の解析部が実行する環境解析処理の一例を説明するフローチャートである。It is a flowchart explaining an example of the environment analysis processing executed by the analysis part of Example 1. FIG. 実施例１の結果出力部が生成した環境解析情報に基づいて表示される画面の一例を示す図である。It is a figure which shows an example of the screen displayed based on the environment analysis information generated by the result output part of Example 1. FIG. 実施例１の結果出力部が生成した環境解析情報に基づいて表示される画面の一例を示す図である。It is a figure which shows an example of the screen displayed based on the environment analysis information generated by the result output part of Example 1. FIG. 実施例１の結果出力部が生成した環境解析情報に基づいて表示される画面の一例を示す図である。It is a figure which shows an example of the screen displayed based on the environment analysis information generated by the result output part of Example 1. FIG. 実施例１の結果出力部が生成した環境解析情報に基づいて表示される画面の一例を示す図である。It is a figure which shows an example of the screen displayed based on the environment analysis information generated by the result output part of Example 1. FIG. 実施例２の計算機システムの処理の流れを示す図である。It is a figure which shows the process flow of the computer system of Example 2. FIG. 実施例２の解析部が実行する環境解析処理の一例を説明するフローチャートである。It is a flowchart explaining an example of the environment analysis processing executed by the analysis part of Example 2. 実施例２の結果出力部が生成した環境解析情報に基づいて表示される画面の一例を示す図である。It is a figure which shows an example of the screen displayed based on the environment analysis information generated by the result output part of Example 2. FIG. 実施例３の計算機システムの構成例を示す図である。It is a figure which shows the configuration example of the computer system of Example 3. 実施例３の計算機システムの処理の流れを示す図である。It is a figure which shows the process flow of the computer system of Example 3. FIG. 実施例３の解析部が実行する環境解析処理の一例を説明するフローチャートである。It is a flowchart explaining an example of the environment analysis processing which the analysis part of Example 3 executes. 実施例３のアルゴリズム更新部が実行するアルゴリズム更新処理の一例を説明するフローチャートである。It is a flowchart explaining an example of the algorithm update process executed by the algorithm update part of Example 3. FIG.

以下、本発明の実施例を、図面を用いて説明する。ただし、本発明は以下に示す実施の形態の記載内容に限定して解釈されるものではない。本発明の思想ないし趣旨から逸脱しない範囲で、その具体的構成を変更し得ることは当業者であれば容易に理解される。 Hereinafter, examples of the present invention will be described with reference to the drawings. However, the present invention is not construed as being limited to the description of the embodiments shown below. It is easily understood by those skilled in the art that a specific configuration thereof can be changed without departing from the idea or gist of the present invention.

以下に説明する発明の構成において、同一又は類似する構成又は機能には同一の符号を付し、重複する説明は省略する。 In the configurations of the invention described below, the same or similar configurations or functions are designated by the same reference numerals, and duplicate description will be omitted.

本明細書等における「第１」、「第２」、「第３」等の表記は、構成要素を識別するために付するものであり、必ずしも、数又は順序を限定するものではない。 The notations such as "first", "second", and "third" in the present specification and the like are attached to identify the components, and do not necessarily limit the number or order.

図面等において示す各構成の位置、大きさ、形状、及び範囲等は、発明の理解を容易にするため、実際の位置、大きさ、形状、及び範囲等を表していない場合がある。したがって、本発明では、図面等に開示された位置、大きさ、形状、及び範囲等に限定されない。 The position, size, shape, range, etc. of each configuration shown in the drawings and the like may not represent the actual position, size, shape, range, etc., in order to facilitate understanding of the invention. Therefore, the present invention is not limited to the position, size, shape, range, etc. disclosed in the drawings and the like.

図１は、実施例１の計算機システムの構成例を示す図である。 FIG. 1 is a diagram showing a configuration example of the computer system of the first embodiment.

計算機システムは、複数の計算機１００−１、１００−２、１００−３、及び端末１０１から構成される。複数の計算機１００−１、１００−２、１００−３、及び端末１０１は、ネットワーク１０５を介して互いに接続される。ネットワーク１０５は、例えば、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）及びＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）である。ネットワーク１０５の接続方式は有線及び無線のいずれでもよい。 The computer system is composed of a plurality of computers 100-1, 100-2, 100-3, and a terminal 101. The plurality of computers 100-1, 100-2, 100-3, and the terminal 101 are connected to each other via the network 105. The network 105 is, for example, WAN (Wide Area Network) and LAN (Local Area Network). The connection method of the network 105 may be either wired or wireless.

以下の説明では、計算機１００−１、１００−２、１００−３を区別しない場合、計算機１００と記載する。 In the following description, when computers 100-1, 100-2, and 100-3 are not distinguished, they are referred to as computer 100.

端末１０１は、ユーザが操作する計算機である。端末１０１は、例えば、パーソナルコンピュータ、スマートフォン、及びタブレット端末等である。端末１０１は、ユーザの操作に基づいて、ＡＩ（推論部１１１）による推論に必要な評価対象データ５００（図５参照）等を入力する。評価対象データ５００は複数の項目の値（特徴量）から構成される。 The terminal 101 is a computer operated by the user. The terminal 101 is, for example, a personal computer, a smartphone, a tablet terminal, or the like. The terminal 101 inputs evaluation target data 500 (see FIG. 5) and the like necessary for inference by AI (inference unit 111) based on the user's operation. The evaluation target data 500 is composed of values (features) of a plurality of items.

なお、端末１０１は、プロセッサ、メモリ、ネットワークインタフェース、入力装置、及び出力装置を備える。入力装置は、キーボード、マウス、及びタッチパネル等の装置であり、出力装置は、タッチパネル及びディスプレイ等の装置である。 The terminal 101 includes a processor, a memory, a network interface, an input device, and an output device. The input device is a device such as a keyboard, a mouse, and a touch panel, and the output device is a device such as a touch panel and a display.

計算機１００−１は、ＡＩ（推論部１１１）を構築する計算機である。計算機１００−１は、学習部１１０を備え、また、事例データ管理情報１２０を保持する。 The computer 100-1 is a computer for constructing an AI (inference unit 111). The computer 100-1 includes a learning unit 110, and also holds case data management information 120.

学習部１１０は、推論部１１１に設定するモデル（アルゴリズム）を生成するための学習処理を実行する。なお、本発明はモデルの学習方法に限定されない。また、本発明は、モデルの種別及び構造に限定されない。例えば、モデルはニューラルネットワーク及び決定木等である。 The learning unit 110 executes a learning process for generating a model (algorithm) set in the inference unit 111. The present invention is not limited to the learning method of the model. Further, the present invention is not limited to the type and structure of the model. For example, the model is a neural network and a decision tree.

事例データ管理情報１２０は、学習処理に用いる学習データを管理するための情報である。本実施例の学習データは、過去の事例に基づいて生成されたデータである。以下の説明では、学習データを事例データとも記載する。 The case data management information 120 is information for managing the learning data used in the learning process. The learning data of this embodiment is data generated based on past cases. In the following description, the learning data is also described as case data.

計算機１００−２は、任意のモデル（アルゴリズム）に基づいて、評価対象データ５００を用いた推論を行い、推論結果を出力する計算機である。評価対象データ５００を用いた推論は、例えば、分類及び事象の推論等である。計算機１００−２は、推論を行う推論部１１１を備える。 The computer 100-2 is a computer that performs inference using the evaluation target data 500 based on an arbitrary model (algorithm) and outputs the inference result. The inference using the evaluation target data 500 is, for example, classification and inference of an event. The computer 100-2 includes an inference unit 111 that performs inference.

なお、学習処理では、推論部１１１に事例データが入力され、推定処理では、推論部１１１に評価対象データ５００が入力される。以下の説明では、評価対象データ５００及び事例データを区別しない場合、入力データと記載する。 In the learning process, case data is input to the inference unit 111, and in the estimation process, the evaluation target data 500 is input to the inference unit 111. In the following description, when the evaluation target data 500 and the case data are not distinguished, they are described as input data.

計算機１００−３は、推論部１１１を用いたＡＩシステムの運用環境を解析する計算機である。計算機１００−３は、根拠ベクトル生成部１１２、解析部１１３、及び結果出力部１１４を備え、また、根拠ベクトル管理情報１２１を保持する。 The computer 100-3 is a computer that analyzes the operating environment of the AI system using the inference unit 111. The computer 100-3 includes a basis vector generation unit 112, an analysis unit 113, and a result output unit 114, and also holds the basis vector management information 121.

根拠ベクトル生成部１１２は、推論結果を解釈するための指標となる根拠ベクトル５０２（図５参照）を生成する。根拠ベクトル５０２は、推論部１１１に入力したデータ（事例データ又は評価対象データ５００）に含まれる各特徴量の推論結果に対する影響度を成分とするベクトルである。 The rationale vector generation unit 112 generates the rationale vector 502 (see FIG. 5) as an index for interpreting the inference result. The basis vector 502 is a vector whose component is the degree of influence of each feature amount included in the data (case data or evaluation target data 500) input to the inference unit 111 on the inference result.

解析部１１３は、根拠ベクトル５０２を用いて、ＡＩシステムの運用環境を解析する。 The analysis unit 113 analyzes the operating environment of the AI system using the basis vector 502.

結果出力部１１４は、解析部１１３の解析結果に基づいて表示データを生成し、当該表示データを端末１０１に送信する。 The result output unit 114 generates display data based on the analysis result of the analysis unit 113, and transmits the display data to the terminal 101.

なお、計算機１００−１、１００−２、１００−３のいずれかは、端末１０１からの要求を受けつけるためのＡＰＩ（ＡｐｐｌｉｃａｔｉｏｎＰｒｏｇｒａｍｍｉｎｇＩｎｔｅｒｆａｃｅ）を提供する操作受付部を有する。 In addition, any one of the computers 100-1, 100-2, and 100-3 has an operation reception unit that provides an API (Application Programming Interface) for receiving a request from the terminal 101.

ここで、計算機１００のハードウェア構成について説明する。図２は、実施例１の計算機１００のハードウェア構成の一例を示す図である。 Here, the hardware configuration of the computer 100 will be described. FIG. 2 is a diagram showing an example of the hardware configuration of the computer 100 of the first embodiment.

計算機１００は、プロセッサ２０１、主記憶装置２０２、副記憶装置２０３、及びネットワークインタフェース２０４を有する。各ハードウェアは内部バスを介して互いに接続される。なお、計算機１００は、副記憶装置２０３を有していなくてもよい。また、計算機１００は、入力装置及び出力装置を有してもよい。 The computer 100 has a processor 201, a main storage device 202, a sub storage device 203, and a network interface 204. Each piece of hardware is connected to each other via an internal bus. The computer 100 does not have to have the sub-storage device 203. Further, the computer 100 may have an input device and an output device.

プロセッサ２０１は、主記憶装置２０２に格納されるプログラムを実行する。プロセッサ２０１がプログラムにしたがって処理を実行することによって、根拠ベクトル生成部１１２等、特定の機能を実現する機能部（モジュール）として動作する。以下の説明では、機能部を主語に処理を説明する場合、プロセッサ２０１が当該機能部を実現するプログラムを実行していることを示す。 The processor 201 executes a program stored in the main storage device 202. When the processor 201 executes the process according to the program, it operates as a functional unit (module) that realizes a specific function, such as the basis vector generation unit 112. In the following description, when the process is described with the functional unit as the subject, it is shown that the processor 201 is executing the program that realizes the functional unit.

主記憶装置２０２は、プロセッサ２０１が実行するプログラム及び当該プログラムが使用する情報を格納する。また、主記憶装置２０２は、プログラムが一時的に使用するワークエリアを含む。 The main storage device 202 stores a program executed by the processor 201 and information used by the program. The main memory 202 also includes a work area that the program temporarily uses.

計算機１００−１の主記憶装置２０２は、学習部１１０を実現するためのプログラムを格納する。計算機１００−２の主記憶装置２０２は、推論部１１１を実現するためのプログラムを格納する。計算機１００−３の主記憶装置２０２は、根拠ベクトル生成部１１２、解析部１１３、及び結果出力部１１４を実現するためのプログラムを格納する。また、いずれかの計算機１００−１、１００−２、１００−３の主記憶装置２０２は、操作受付部を実現するためのプログラムを格納する。 The main storage device 202 of the computer 100-1 stores a program for realizing the learning unit 110. The main storage device 202 of the computer 100-2 stores a program for realizing the inference unit 111. The main storage device 202 of the computer 100-3 stores a program for realizing the basis vector generation unit 112, the analysis unit 113, and the result output unit 114. Further, the main storage device 202 of any of the computers 100-1, 100-2, and 100-3 stores a program for realizing the operation reception unit.

副記憶装置２０３は、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）及びＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等、データを永続的に格納する。 The sub-storage device 203 permanently stores data such as an HDD (Hard Disk Drive) and an SSD (Solid State Drive).

計算機１００−１の副記憶装置２０３は事例データ管理情報１２０を格納する。計算機１００−２の副記憶装置２０３は、学習部１１０によって生成されたモデルの情報（図示省略）を格納する。計算機１００−３の副記憶装置２０３は、根拠ベクトル管理情報１２１を格納する。 The sub-storage device 203 of the computer 100-1 stores the case data management information 120. The sub-storage device 203 of the computer 100-2 stores model information (not shown) generated by the learning unit 110. The sub-storage device 203 of the computer 100-3 stores the basis vector management information 121.

なお、各計算機１００が有する各機能部については、複数の機能部を一つの機能部にまとめてもよいし、一つの機能部を機能毎に複数の機能部に分けてもよい。また、各計算機１００が有する機能を一つの計算機１００にまとめてもよい。 Regarding each functional unit included in each computer 100, a plurality of functional units may be combined into one functional unit, or one functional unit may be divided into a plurality of functional units for each function. Further, the functions of each computer 100 may be combined into one computer 100.

図３は、実施例１の事例データ管理情報１２０のデータ構造の一例を示す図である。 FIG. 3 is a diagram showing an example of the data structure of the case data management information 120 of the first embodiment.

事例データ管理情報１２０は、ＩＤ３０１、特徴量３０２、及び正解値３０３を含むエントリを複数格納する。一つのエントリが一つの事例データに対応する。 The case data management information 120 stores a plurality of entries including the ID 301, the feature amount 302, and the correct answer value 303. One entry corresponds to one case data.

ＩＤ３０１は、事例データの識別情報を格納するフィールドである。実施例１のＩＤ３０１には番号が格納される。 ID301 is a field for storing identification information of case data. A number is stored in the ID 301 of the first embodiment.

特徴量３０２は、事例データを構成する項目の値である特徴量を格納するフィールド群である。項目は、例えば、築年数、駅距離、及び建築材等である。 The feature amount 302 is a field group for storing the feature amount which is a value of an item constituting the case data. Items are, for example, age, station distance, building materials, and the like.

正解値３０３は、事例データに対する推論の正解値を格納するフィールドである。正解値３０３に格納される値は予め与えられている。 The correct answer value 303 is a field for storing the correct answer value of the inference for the case data. The value stored in the correct answer value 303 is given in advance.

なお、ユーザ又は計算機１００は、正解値を対応づけた評価対象データ５００を、事例データとして事例データ管理情報１２０に登録してもよい。 The user or the computer 100 may register the evaluation target data 500 associated with the correct answer value in the case data management information 120 as case data.

図４は、実施例１の根拠ベクトル管理情報１２１のデータ構造の一例を示す図である。 FIG. 4 is a diagram showing an example of the data structure of the basis vector management information 121 of the first embodiment.

根拠ベクトル管理情報１２１は、ＩＤ４０１、影響度４０２、推論結果４０３、及びクラスタ４０４を含むエントリを複数格納する。一つのエントリが一つの入力データの根拠ベクトル５０２に対応する。 The rationale vector management information 121 stores a plurality of entries including ID 401, influence degree 402, inference result 403, and cluster 404. One entry corresponds to one input data basis vector 502.

ＩＤ４０１は、入力データの識別情報を格納するフィールドである。 ID 401 is a field for storing the identification information of the input data.

影響度４０２は、推論結果に対する各項目の特徴量の影響の大きさを表す影響度を格納するフィールド群である。実施例１では、影響度４０２に含まれる各フィールドの値を成分とするベクトルが根拠ベクトル５０２として扱われる。 The influence degree 402 is a group of fields for storing the influence degree indicating the magnitude of the influence of the feature amount of each item on the inference result. In the first embodiment, the vector whose component is the value of each field included in the influence degree 402 is treated as the basis vector 502.

推論結果４０３は、推論部１１１が出力した推論結果を格納するフィールドである。 The inference result 403 is a field for storing the inference result output by the inference unit 111.

クラスタ４０４は、入力データが属するクラスタの識別情報を格納するフィールドである。後述するように、根拠ベクトル５０２に基づいてクラスタリングが行われる。 The cluster 404 is a field for storing the identification information of the cluster to which the input data belongs. As will be described later, clustering is performed based on the basis vector 502.

図５は、実施例１の計算機システムの処理の流れを示す図である。 FIG. 5 is a diagram showing a processing flow of the computer system of the first embodiment.

図中の矢印はデータの流れを示す。実線は、事例データの根拠ベクトル５０２を生成するための処理におけるデータの流れを示す。点線は、評価対象データ５００の推論結果を出力するための処理におけるデータの流れを示す。一点鎖線は、ＡＩシステムの運用環境の解析結果を出力するための処理におけるデータの流れを示す。 The arrows in the figure indicate the data flow. The solid line shows the data flow in the process for generating the basis vector 502 of the case data. The dotted line shows the data flow in the process for outputting the inference result of the evaluation target data 500. The alternate long and short dash line indicates the data flow in the process for outputting the analysis result of the operating environment of the AI system.

まず、事例データの根拠ベクトル５０２を算出するための処理の流れについて説明する。 First, the flow of processing for calculating the basis vector 502 of the case data will be described.

操作受付部は、端末１０１から推論部１１１の生成要求を受けつけた場合、計算機１００−２にモデル生成指示を出力する。また、操作受付部は、端末１０１から事例データの根拠ベクトル５０２の生成要求を受けつけた場合、計算機１００−３に事例データの根拠ベクトル５０２の生成指示を出力する。 When the operation reception unit receives the generation request of the inference unit 111 from the terminal 101, the operation reception unit outputs a model generation instruction to the computer 100-2. Further, when the operation reception unit receives the request for generating the basis vector 502 of the case data from the terminal 101, the operation reception unit outputs the generation instruction of the basis vector 502 of the case data to the computer 100-3.

計算機１００−２は、推論部１１１の生成指示を受信した場合、学習処理を実行し、推論部１１１に設定するモデルを生成する。 When the computer 100-2 receives the generation instruction of the inference unit 111, the computer 100-2 executes the learning process and generates the model to be set in the inference unit 111.

計算機１００−３の根拠ベクトル生成部１１２は、事例データの根拠ベクトル５０２の生成指示を受信した場合、事例データに含まれる各特徴量の影響度を算出し、根拠ベクトル５０２を生成する。影響度を算出方法としては、非特許文献１から非特許文献３の技術を用いることが考えられる。 When the basis vector generation unit 112 of the computer 100-3 receives the generation instruction of the basis vector 502 of the case data, it calculates the influence degree of each feature amount included in the case data and generates the basis vector 502. As a method for calculating the degree of influence, it is conceivable to use the techniques of Non-Patent Document 1 to Non-Patent Document 3.

非特許文献２に記載の算出方法を採用した場合、以下のような処理が実行される。指標算出部５１０は、事例データから計算用データを生成し、計算用データを推論部１１１に入力して推論結果を取得する。分配部５１１は、計算用データ及び推論結果を用いて、事例データに含まれる各特徴量の推論結果（正解値）への影響度を算出する。さらに、分配部５１１は、各特徴量の影響度を成分とする根拠ベクトル５０２を生成し、根拠ベクトル管理情報１２１に登録する。 When the calculation method described in Non-Patent Document 2 is adopted, the following processing is executed. The index calculation unit 510 generates calculation data from the case data, inputs the calculation data to the inference unit 111, and acquires the inference result. The distribution unit 511 calculates the degree of influence of each feature amount included in the case data on the inference result (correct answer value) by using the calculation data and the inference result. Further, the distribution unit 511 generates a basis vector 502 having an influence degree of each feature amount as a component and registers it in the basis vector management information 121.

なお、指標算出部５１０及び分配部５１１は、必要に応じて、影響度を算出するための計算アルゴリズム５２０を参照する。計算アルゴリズム５２０は、例えば、非特許文献１から非特許文献３のいずれかを実現するアルゴリズムである。 The index calculation unit 510 and the distribution unit 511 refer to the calculation algorithm 520 for calculating the degree of influence, if necessary. The calculation algorithm 520 is, for example, an algorithm that realizes any one of Non-Patent Document 1 to Non-Patent Document 3.

次に、評価対象データ５００の推論結果を出力するための処理の流れについて説明する。 Next, the flow of processing for outputting the inference result of the evaluation target data 500 will be described.

操作受付部は、端末１０１から評価対象データ５００を含む推論要求を受けつけた場合、計算機１００−２に評価対象データ５００を含む推論指示を出力する。また、操作受付部は、計算機１００−３に評価対象データ５００の根拠ベクトル５０２の生成指示を出力する。 When the operation reception unit receives an inference request including the evaluation target data 500 from the terminal 101, the operation reception unit outputs an inference instruction including the evaluation target data 500 to the computer 100-2. Further, the operation reception unit outputs an instruction to generate the basis vector 502 of the evaluation target data 500 to the computer 100-3.

計算機１００−２の推論部１１１は、推論指示を受信した場合、評価対象データ５００を用いた推論を行い、操作受付部に推論結果５０１を出力する。操作受付部は、推論結果５０１を端末１０１に送信する。 When the inference unit 111 of the computer 100-2 receives the inference instruction, it makes an inference using the evaluation target data 500 and outputs the inference result 501 to the operation reception unit. The operation reception unit transmits the inference result 501 to the terminal 101.

計算機１００−３の根拠ベクトル生成部１１２は、評価対象データ５００の根拠ベクトル５０２の算出指示を受信した場合、評価対象データ５００の根拠ベクトル５０２を生成する。評価対象データ５００の根拠ベクトル５０２の生成方法は、事例データの根拠ベクトル５０２の生成方法と同様であるため説明を省略する。根拠ベクトル生成部１１２は、評価対象データ５００の根拠ベクトル５０２を根拠ベクトル管理情報１２１に登録する。 When the basis vector generation unit 112 of the computer 100-3 receives the calculation instruction of the basis vector 502 of the evaluation target data 500, the basis vector 502 of the evaluation target data 500 is generated. Since the method of generating the basis vector 502 of the evaluation target data 500 is the same as the method of generating the basis vector 502 of the case data, the description thereof will be omitted. The basis vector generation unit 112 registers the basis vector 502 of the evaluation target data 500 in the basis vector management information 121.

次に、ＡＩシステムの運用環境の解析結果を出力するための処理の流れについて説明する。 Next, the flow of processing for outputting the analysis result of the operating environment of the AI system will be described.

計算機１００−３の解析部１１３は、操作受付部から環境解析指示を受けつけた場合、実行条件を満たした場合、又は、周期的に、環境解析処理を実行する。例えば、評価対象データ５００の根拠ベクトル５０２の算出を実行条件として設定できる。解析部１１３は処理結果を結果出力部１１４に出力する。 The analysis unit 113 of the computer 100-3 executes the environment analysis process periodically when receiving an environment analysis instruction from the operation reception unit, satisfying the execution conditions, or periodically. For example, the calculation of the basis vector 502 of the evaluation target data 500 can be set as the execution condition. The analysis unit 113 outputs the processing result to the result output unit 114.

計算機１００−３の結果出力部１１４は、解析処理の結果を表示するための環境解析情報５０３を生成する。結果出力部１１４は、当該環境解析情報５０３を操作受付部に出力する。操作受付部は、環境解析情報５０３を端末１０１に送信する。 The result output unit 114 of the computer 100-3 generates the environment analysis information 503 for displaying the result of the analysis process. The result output unit 114 outputs the environmental analysis information 503 to the operation reception unit. The operation reception unit transmits the environmental analysis information 503 to the terminal 101.

根拠ベクトル５０２は、推論部１１１（モデル）の特性を示す情報として扱うことができる。一方、影響度は推論部１１１の内部から取得される情報ではない。すなわち、根拠ベクトル５０２は、モデルの更新にかかわらず、一義的に解釈できる可読性の高い情報として使用できる。したがって、実施例１の計算機システムは、モデルが更新された場合でも、根拠ベクトル５０２を用いて、ＡＩシステムの運用環境を自動的に解析することができる。 The basis vector 502 can be treated as information indicating the characteristics of the inference unit 111 (model). On the other hand, the degree of influence is not information acquired from the inside of the inference unit 111. That is, the basis vector 502 can be used as highly readable information that can be uniquely interpreted regardless of the model update. Therefore, the computer system of the first embodiment can automatically analyze the operating environment of the AI system by using the basis vector 502 even when the model is updated.

次に、具体的な処理の内容について説明する。まず、推論部１１１に設定するモデルを生成するための処理について説明する。 Next, the specific contents of the processing will be described. First, a process for generating a model to be set in the inference unit 111 will be described.

図６は、実施例１の学習部１１０が実行するモデル生成処理の一例を説明するフローチャートである。 FIG. 6 is a flowchart illustrating an example of a model generation process executed by the learning unit 110 of the first embodiment.

学習部１１０は、事例データ管理情報１２０から事例データを取得する（ステップＳ１０１）。なお、取得する事例データの数及び範囲は任意に設定できる。 The learning unit 110 acquires case data from the case data management information 120 (step S101). The number and range of case data to be acquired can be set arbitrarily.

学習部１１０は、事例データを用いて学習処理を実行する（ステップＳ１０２）。学習処理は公知の技術であるため詳細な説明は省略する。 The learning unit 110 executes a learning process using the case data (step S102). Since the learning process is a known technique, detailed description thereof will be omitted.

学習部１１０は、終了条件を満たすか否かを判定する（ステップＳ１０３）。例えば、評価用のデータを用いた推論の精度が閾値より高い場合、学習部１１０は終了条件を満たすと判定する。また、学習処理の実行回数が閾値より大きい場合、学習部１１０は終了条件を満たすと判定する。 The learning unit 110 determines whether or not the end condition is satisfied (step S103). For example, when the accuracy of the inference using the evaluation data is higher than the threshold value, the learning unit 110 determines that the end condition is satisfied. Further, when the number of executions of the learning process is larger than the threshold value, the learning unit 110 determines that the end condition is satisfied.

終了条件を満たさないと判定された場合、学習部１１０は、ステップＳ１０２に戻り、同様の処理を実行する。 If it is determined that the end condition is not satisfied, the learning unit 110 returns to step S102 and executes the same process.

終了条件を満たすと判定された場合、学習部１１０はモデル生成処理を終了する。 When it is determined that the end condition is satisfied, the learning unit 110 ends the model generation process.

図７は、実施例１の根拠ベクトル生成部１１２が実行する根拠ベクトル生成処理の一例を説明するフローチャートである。ここでは、非特許文献２に記載の算出方法を一例として根拠ベクトル生成処理を説明する。 FIG. 7 is a flowchart illustrating an example of the basis vector generation process executed by the basis vector generation unit 112 of the first embodiment. Here, the basis vector generation process will be described using the calculation method described in Non-Patent Document 2 as an example.

根拠ベクトル生成部１１２は入力データを取得する（ステップＳ２０１）。事例データの根拠ベクトル５０２を算出する場合、指標算出部５１０は事例データ管理情報１２０から事例データを取得する。評価対象データ５００の根拠ベクトル５０２を算出する場合、指標算出部５１０は生成指示に含まれる評価対象データ５００を取得する。 The basis vector generation unit 112 acquires the input data (step S201). When calculating the basis vector 502 of the case data, the index calculation unit 510 acquires the case data from the case data management information 120. When calculating the basis vector 502 of the evaluation target data 500, the index calculation unit 510 acquires the evaluation target data 500 included in the generation instruction.

次に、根拠ベクトル生成部１１２は、入力データから計算用データを生成する（ステップＳ２０２）。 Next, the basis vector generation unit 112 generates calculation data from the input data (step S202).

具体的には、指標算出部５１０は、特徴量のバックグラウンドデータセットに基づいて、入力データの一部の項目の特徴量を変化させることによって計算用データを生成する。ここでは、一つの入力データからＫ個の計算用データが生成されるものとする。各計算用データには識別情報が付与される。 Specifically, the index calculation unit 510 generates calculation data by changing the feature amount of some items of the input data based on the background data set of the feature amount. Here, it is assumed that K pieces of calculation data are generated from one input data. Identification information is added to each calculation data.

計算アルゴリズム５２０には特徴量のバックグラウンドデータセットが格納される。また、変化させる特徴量の数、変化量、及び生成する計算用データの数等を決定するためのルールが計算アルゴリズム５２０に格納される。 The background data set of the feature quantity is stored in the calculation algorithm 520. Further, a rule for determining the number of feature quantities to be changed, the amount of change, the number of calculation data to be generated, and the like are stored in the calculation algorithm 520.

次に、根拠ベクトル生成部１１２は、変数ｋに初期値「１」を設定する（ステップＳ２０３）。 Next, the basis vector generation unit 112 sets the initial value “1” in the variable k (step S203).

次に、根拠ベクトル生成部１１２は、変数ｋに対応する計算用データの推論結果を取得する（ステップＳ２０４）。 Next, the basis vector generation unit 112 acquires the inference result of the calculation data corresponding to the variable k (step S204).

具体的には、指標算出部５１０は、変数ｋに対応する計算用データを推論部１１１に入力することによって推論結果を取得する。 Specifically, the index calculation unit 510 acquires the inference result by inputting the calculation data corresponding to the variable k into the inference unit 111.

次に、根拠ベクトル生成部１１２は、主記憶装置２０２の記憶領域に計算用データ及び推論結果の組を格納する（ステップＳ２０５）。 Next, the basis vector generation unit 112 stores a set of calculation data and an inference result in the storage area of the main storage device 202 (step S205).

次に、根拠ベクトル生成部１１２は、変数ｋがＫに一致するか否かを判定する（ステップＳ２０６）。すなわち、全ての計算用データの推論結果を取得したか否かが判定される。 Next, the basis vector generation unit 112 determines whether or not the variable k matches K (step S206). That is, it is determined whether or not the inference results of all the calculation data have been acquired.

変数ｋがＫに一致しないと判定された場合、根拠ベクトル生成部１１２は、変数ｋの値に１を加算した値を変数ｋに設定する（ステップＳ２０７）。その後、根拠ベクトル生成部１１２は、ステップＳ２０４に戻り、同様の処理を実行する。 When it is determined that the variable k does not match K, the basis vector generation unit 112 sets the value obtained by adding 1 to the value of the variable k in the variable k (step S207). After that, the basis vector generation unit 112 returns to step S204 and executes the same process.

変数ｋがＫに一致すると判定された場合、根拠ベクトル生成部１１２は、根拠ベクトル５０２を生成する（ステップＳ２０８）。 When it is determined that the variable k matches K, the rationale vector generation unit 112 generates the rationale vector 502 (step S208).

具体的には、分配部５１１は、計算用データ及び推論結果の組を用いて、重回帰分析等の統計分析を実行することによって、推論結果に対する入力データの各項目の特徴量の影響度を算出する。さらに、分配部５１１は、各特徴量の影響度を成分とする根拠ベクトル５０２を生成する。 Specifically, the distribution unit 511 performs statistical analysis such as multiple regression analysis using a set of calculation data and inference results to determine the degree of influence of the features of each item of the input data on the inference results. calculate. Further, the distribution unit 511 generates a basis vector 502 whose component is the degree of influence of each feature amount.

次に、根拠ベクトル生成部１１２は、根拠ベクトル５０２を推論結果とともに根拠ベクトル管理情報１２１に登録する（ステップＳ２０９）。その後、根拠ベクトル生成部１１２は、根拠ベクトル生成処理を終了する。この時点では、登録された根拠ベクトル５０２に対応するエントリのクラスタ４０４は空欄である。 Next, the basis vector generation unit 112 registers the basis vector 502 together with the inference result in the basis vector management information 121 (step S209). After that, the basis vector generation unit 112 ends the basis vector generation process. At this point, the cluster 404 of the entry corresponding to the registered rationale vector 502 is blank.

なお、非特許文献３に記載の算出方法では、指標算出部５１０は特徴マップを算出し、分配部５１１は、特徴マップの各座標の平均値に各クラスのチャネルの重要度を乗算した値の総和を影響度として算出する。 In the calculation method described in Non-Patent Document 3, the index calculation unit 510 calculates the feature map, and the distribution unit 511 multiplies the average value of each coordinate of the feature map by the importance of each class channel. Calculate the sum as the degree of influence.

図８は、実施例１の解析部１１３が実行する環境解析処理の一例を説明するフローチャートである。 FIG. 8 is a flowchart illustrating an example of the environment analysis process executed by the analysis unit 113 of the first embodiment.

解析部１１３は、根拠ベクトル管理情報１２１から根拠ベクトル５０２を取得する（ステップＳ３０１）。 The analysis unit 113 acquires the basis vector 502 from the basis vector management information 121 (step S301).

なお、解析部１１３は、クラスタ４０４が空欄のエントリに対応する根拠ベクトル５０２のみを取得してもよい。 Note that the analysis unit 113 may acquire only the basis vector 502 corresponding to the blank entry in the cluster 404.

解析部１１３は、根拠ベクトル５０２を用いたクラスタリングを実行することによって、入力データのクラスタを生成する（ステップＳ３０２）。 The analysis unit 113 generates a cluster of input data by executing clustering using the basis vector 502 (step S302).

このとき、解析部１１３は、クラスタリングの結果に基づいて、根拠ベクトル管理情報１２１の各エントリのクラスタ４０４の値を更新する。また、解析部１１３は、クラスタリングの結果に基づいてクラスタ情報を生成する。クラスタ情報のデータ構造は根拠ベクトル管理情報１２１と同一である。 At this time, the analysis unit 113 updates the value of the cluster 404 of each entry of the basis vector management information 121 based on the result of clustering. Further, the analysis unit 113 generates cluster information based on the result of clustering. The data structure of the cluster information is the same as that of the basis vector management information 121.

次に、解析部１１３は、ソート条件に基づいて入力データをソートする（ステップＳ３０３）。ソート条件は予め設定されているものとする。なお、解析部１１３は、操作受付部を介して端末１０１からソート条件を受けつけてもよい。 Next, the analysis unit 113 sorts the input data based on the sort conditions (step S303). It is assumed that the sort conditions are set in advance. The analysis unit 113 may receive the sort condition from the terminal 101 via the operation reception unit.

ソート条件としては、時系列、特徴量、影響度、推論結果、及び入力データの識別情報等が考えられる。複数の観点を組み合わせたソート条件でもよい。また、クラスタリングの結果を用いたソート条件でもよい。 As the sorting condition, a time series, a feature amount, an influence degree, an inference result, identification information of input data, and the like can be considered. A sort condition that combines a plurality of viewpoints may be used. Further, a sort condition using the result of clustering may be used.

解析部１１３は、過去に生成されたクラスタ情報が存在するか否かを判定する（ステップＳ３０４）。過去に生成されたクラスタ情報が存在しない場合とは、初めて、環境解析処理が実行される場合である。 The analysis unit 113 determines whether or not the cluster information generated in the past exists (step S304). The case where the cluster information generated in the past does not exist is the case where the environment analysis process is executed for the first time.

過去に生成されたクラスタ情報が存在しないと判定された場合、解析部１１３は、各クラスタの特性を解析する（ステップＳ３０５）。 When it is determined that the cluster information generated in the past does not exist, the analysis unit 113 analyzes the characteristics of each cluster (step S305).

例えば、解析部１１３は、クラスタに属する入力データの影響度が大きい特徴量の範囲等を特定する。また、解析部１１３は、クラスタに属する入力データの推論結果の傾向を特定する。 For example, the analysis unit 113 specifies a range of features that have a large influence on the input data belonging to the cluster. Further, the analysis unit 113 identifies the tendency of the inference result of the input data belonging to the cluster.

次に、解析部１１３は、クラスタ情報、ソート結果、及び解析結果を結果出力部１１４に出力する（ステップＳ３０６）。その後、解析部１１３は環境解析処理を終了する。すなわち、初めて環境解析処理が実行された場合、現在のＡＩシステムの運用環境を示す情報が結果出力部１１４から出力される。 Next, the analysis unit 113 outputs the cluster information, the sort result, and the analysis result to the result output unit 114 (step S306). After that, the analysis unit 113 ends the environment analysis process. That is, when the environment analysis process is executed for the first time, information indicating the current operating environment of the AI system is output from the result output unit 114.

ステップＳ３０４において、過去に生成されたクラスタ情報が存在すると判定された場合、解析部１１３は、過去に生成されたクラスタ情報と、今回生成されたクラスタ情報とを比較し、新規クラスタが出現したか否かを判定する（ステップＳ３０７）。 If it is determined in step S304 that the cluster information generated in the past exists, the analysis unit 113 compares the cluster information generated in the past with the cluster information generated this time, and has a new cluster appeared? It is determined whether or not (step S307).

なお、過去に生成されたクラスタ情報が複数存在する場合、解析部１１３は、時系列が最も新しいクラスタ情報と、今回生成されたクラスタ情報とを比較する。 When a plurality of cluster information generated in the past exists, the analysis unit 113 compares the cluster information with the latest time series with the cluster information generated this time.

新規クラスタが出現していないと判定された場合、解析部１１３は、クラスタ情報及びソート結果を結果出力部１１４に出力する（ステップＳ３０８）。その後、解析部１１３は環境解析処理を終了する。この場合、現在のＡＩシステムの運用環境を示す情報が結果出力部１１４から出力される。 When it is determined that no new cluster has appeared, the analysis unit 113 outputs the cluster information and the sort result to the result output unit 114 (step S308). After that, the analysis unit 113 ends the environment analysis process. In this case, information indicating the current operating environment of the AI system is output from the result output unit 114.

なお、解析部１１３は、属する入力データの数が変化したクラスタの特性を解析し、当該解析結果を出力してもよい。 The analysis unit 113 may analyze the characteristics of the cluster in which the number of input data to which it belongs has changed, and output the analysis result.

新規クラスタが出現していると判定された場合、解析部１１３は、新規クラスタに属する入力データの特性を解析する（ステップＳ３０９）。解析方法は、ステップＳ３０５と同一である。 When it is determined that a new cluster has appeared, the analysis unit 113 analyzes the characteristics of the input data belonging to the new cluster (step S309). The analysis method is the same as in step S305.

次に、解析部１１３は、クラスタ情報、ソート結果、解析結果、及び環境変化通知を結果出力部１１４に出力する（ステップＳ３１０）。その後、解析部１１３は環境解析処理を終了する。この場合、現在のＡＩシステムの運用環境を示す情報、及び運用環境の変化が検知されたことを示す情報が結果出力部１１４から出力される。 Next, the analysis unit 113 outputs the cluster information, the sort result, the analysis result, and the environmental change notification to the result output unit 114 (step S310). After that, the analysis unit 113 ends the environment analysis process. In this case, the result output unit 114 outputs information indicating the current operating environment of the AI system and information indicating that a change in the operating environment has been detected.

本実施例では、根拠ベクトル５０２の成分を軸とする特徴量空間上でクラスタリングが行われる。根拠ベクトル５０２は、評価対象データ５００に対する推論部１１１の挙動を表す情報である。したがって、新規クラスタの出現は、推論部１１１がこれまでとは異なる挙動をしていることを示す。すなわち、新規クラスタの出現は、今までの運用環境では想定していなかった評価対象データ５００が入力されたことを示す。このように、根拠ベクトル５０２を用いることによってＡＩシステムの運用環境を的確に把握できる。 In this embodiment, clustering is performed on the feature space centered on the component of the basis vector 502. The basis vector 502 is information representing the behavior of the inference unit 111 with respect to the evaluation target data 500. Therefore, the appearance of a new cluster indicates that the inference unit 111 behaves differently than before. That is, the appearance of a new cluster indicates that the evaluation target data 500, which was not assumed in the operating environment so far, has been input. In this way, by using the basis vector 502, the operating environment of the AI system can be accurately grasped.

また、根拠ベクトル５０２は、推論部１１１の内部情報から生成された情報ではないため、推論部１１１が更新されても取扱いは変わらない。そのため、根拠ベクトル５０２を解釈するための労力及び時間は必要ない。さらに、モデルの構造に限定されない。 Further, since the basis vector 502 is not the information generated from the internal information of the inference unit 111, the handling does not change even if the inference unit 111 is updated. Therefore, no effort or time is required to interpret the rationale vector 502. Furthermore, it is not limited to the structure of the model.

次に、解析部１１３から出力された情報に基づく表示について説明する。 Next, the display based on the information output from the analysis unit 113 will be described.

図９、図１０Ａ、図１０Ｂ、及び図１１は、実施例１の結果出力部１１４が生成した環境解析情報５０３に基づいて表示される画面の一例を示す図である。 9, FIG. 10A, FIG. 10B, and FIG. 11 are diagrams showing an example of a screen displayed based on the environmental analysis information 503 generated by the result output unit 114 of the first embodiment.

図９は、クラスタ情報及び環境変化通知に基づいて生成された環境解析情報５０３により表示される画面９００の一例を示す。 FIG. 9 shows an example of the screen 900 displayed by the environment analysis information 503 generated based on the cluster information and the environment change notification.

画面９００には、根拠ベクトル５０２の成分を軸とする特徴量空間上のクラスタを示すグラフ９１０が表示される。グラフ９１０には根拠ベクトル５０２がプロットされる。また、グラフ９１０には、クラスタ情報に基づいて三つのクラスタ９１１−１、９１１−２、９１１−３が表示される。各クラスタ９１１には、解析結果に基づいて吹出９１２が表示される。 On the screen 900, a graph 910 showing clusters in the feature space centered on the component of the basis vector 502 is displayed. The basis vector 502 is plotted on the graph 910. Further, in the graph 910, three clusters 911-1, 911-2, and 911-3 are displayed based on the cluster information. A blowout 912 is displayed on each cluster 911 based on the analysis result.

クラスタ９１１−１、９１１−２は、既存のクラスタである。クラスタ９１１−３は新規クラスタである。クラスタ９１１−３は環境変化通知に基づいて強調表示される。 Clusters 911-1 and 911-2 are existing clusters. Cluster 911-3 is a new cluster. Cluster 911-3 is highlighted based on the environmental change notification.

なお、クラスタ情報のみから生成された環境解析情報５０３により表示される画面では、全てのクラスタ９１１が同じ表示形式で表示される。 On the screen displayed by the environment analysis information 503 generated only from the cluster information, all the clusters 911 are displayed in the same display format.

図１０Ａ及び図１０Ｂは、ソート結果に基づいて生成された環境解析情報５０３により表示される画面１０００の一例を示す。 10A and 10B show an example of the screen 1000 displayed by the environmental analysis information 503 generated based on the sort result.

画面１０００には、ソート結果を示すグラフ１０１０が表示される。図１０Ａのグラフ１０１０の縦軸は推定結果を示し、横軸はクラスタを示す。図１０Ｂのグラフ１０１０の縦軸は推定結果を示し、横軸は時刻を示す。 A graph 1010 showing the sort result is displayed on the screen 1000. The vertical axis of the graph 1010 in FIG. 10A shows the estimation result, and the horizontal axis shows the cluster. The vertical axis of the graph 1010 in FIG. 10B shows the estimation result, and the horizontal axis shows the time.

図１０Ａのグラフ１０１０には、入力データの根拠ベクトル５０２を表すアイコン１０１１を用いた入力データのソート結果が表示される。アイコン１０１１は、影響度はアイコン１０１１内の領域の大きさとして表される。アイコン１０１１の白部分は正の影響度を表し、斜線部分は負の影響度を表す。アイコン１０１１の大きさは推論結果に基づいて決定される。グラフ１０１０には、解析結果に基づいて吹出１０１２が表示される。また、アイコン１０１１は、属するクラスタを明示する情報を付与してもよい。図１０Ａでは、属するクラスタに応じてアイコン１０１１の外枠を変えている。 In the graph 1010 of FIG. 10A, the sorting result of the input data using the icon 1011 representing the basis vector 502 of the input data is displayed. For the icon 1011 the degree of influence is represented by the size of the area within the icon 1011. The white portion of the icon 1011 represents the degree of positive influence, and the shaded portion represents the degree of negative influence. The size of the icon 1011 is determined based on the inference result. In the graph 1010, the blowout 1012 is displayed based on the analysis result. Further, the icon 1011 may be given information indicating the cluster to which the icon 1011 belongs. In FIG. 10A, the outer frame of the icon 1011 is changed according to the cluster to which it belongs.

ユーザは、図１０Ａの画面１０００に基づいて、クラスタにおける入力データの特性等を把握することができる。 The user can grasp the characteristics of the input data in the cluster based on the screen 1000 of FIG. 10A.

図１０Ｂのグラフ１０１０には、アイコン１０１１及びポインタ１０１３が表示される。ポインタ１０１３は、運用環境が変化した推定時刻を示す。 The icon 1011 and the pointer 1013 are displayed in the graph 1010 of FIG. 10B. The pointer 1013 indicates the estimated time when the operating environment has changed.

ユーザは、図１０Ｂの画面１０００に基づいて、運用環境が変化した時刻を把握することができる。 The user can grasp the time when the operating environment has changed based on the screen 1000 of FIG. 10B.

図１１は、クラスタ情報に基づいて生成された環境解析情報５０３により表示される画面１１００の一例を示す。 FIG. 11 shows an example of the screen 1100 displayed by the environment analysis information 503 generated based on the cluster information.

画面１１００には、入力データの根拠ベクトル５０２を表すアイコン１１０１が表示される。ユーザが所定の領域にポインタ１１０２を合わせた場合、当該領域の入力データの特徴量及び影響度が表示される。 On the screen 1100, the icon 1101 representing the basis vector 502 of the input data is displayed. When the user moves the pointer 1102 to a predetermined area, the feature amount and the degree of influence of the input data in the area are displayed.

なお、図９、図１０Ａ、図１０Ｂ、及び図１１で説明した画面は一例であってこれに限定されない。同様の情報を提示できる画面であればよい。 The screens described with reference to FIGS. 9, 10A, 10B, and 11 are merely examples and are not limited thereto. Any screen that can present similar information will do.

以上で説明したように、実施例１の計算機システムは、根拠ベクトル５０２を用いたクラスタリングによって生成されたクラスタに関する情報を提示できる。根拠ベクトル５０２は、入力データと推論部１１１との関係性を示す情報であり、可読性に優れている。したがって、クラスタに関する情報は、ＡＩシステムの運用環境を把握のための可読性に優れた情報として扱うことができる。 As described above, the computer system of the first embodiment can present information about the cluster generated by the clustering using the basis vector 502. The basis vector 502 is information indicating the relationship between the input data and the inference unit 111, and is excellent in readability. Therefore, the information about the cluster can be treated as highly readable information for grasping the operating environment of the AI system.

また、従来技術では、モデルの更新とともに内部情報の解釈方法も変更する必要があるため、自動的な環境変化の検知が困難であった。一方、実施例１では、モデルの内部情報に依存せず、かつ、モデルの挙動を示す根拠ベクトル５０２を用いているため、自動的な環境変化の検知が可能である。 Further, in the prior art, it is difficult to automatically detect environmental changes because it is necessary to change the interpretation method of internal information as the model is updated. On the other hand, in the first embodiment, since the basis vector 502 that does not depend on the internal information of the model and shows the behavior of the model is used, it is possible to automatically detect the environmental change.

実施例２の計算機システムは、推論部１１１の推論精度の低下を検知する。以下、実施例１との差異を中心に実施例２について説明する。 The computer system of the second embodiment detects a decrease in the inference accuracy of the inference unit 111. Hereinafter, Example 2 will be described with a focus on the differences from Example 1.

実施例２のシステム構成は実施例１と同一である。実施例２の計算機１００のハードウェア構成及びソフトウェア構成は実施例１と同一である。実施例２で扱う情報は実施例１と同一である。また、実施例２の学習部１１０、推論部１１１、根拠ベクトル生成部１１２、及び結果出力部１１４が実行する処理は実施例１と同一である。 The system configuration of the second embodiment is the same as that of the first embodiment. The hardware configuration and software configuration of the computer 100 of the second embodiment are the same as those of the first embodiment. The information handled in the second embodiment is the same as that in the first embodiment. Further, the processing executed by the learning unit 110, the inference unit 111, the basis vector generation unit 112, and the result output unit 114 of the second embodiment is the same as that of the first embodiment.

実施例２では、解析部１１３が実行する処理が一部異なる。図１２は、実施例２の計算機システムの処理の流れを示す図である。 In the second embodiment, the processing executed by the analysis unit 113 is partially different. FIG. 12 is a diagram showing a processing flow of the computer system of the second embodiment.

実施例２では、評価対象データ５００に対応づけられた正解データ５０４が解析部１１３に入力される。正解データ５０４は、評価対象データ５００の識別情報と、評価対象データ５００に対する推論の正解とを含む。 In the second embodiment, the correct answer data 504 associated with the evaluation target data 500 is input to the analysis unit 113. The correct answer data 504 includes the identification information of the evaluation target data 500 and the correct answer of the reasoning for the evaluation target data 500.

正解データ５０４は、環境解析処理の実行前に入力されてもよいし、また、ステップＳ３０７の判定結果がＹＥＳの場合に入力してもよい。 The correct answer data 504 may be input before the execution of the environment analysis process, or may be input when the determination result in step S307 is YES.

図１３は、実施例２の解析部１１３が実行する環境解析処理の一例を説明するフローチャートである。 FIG. 13 is a flowchart illustrating an example of the environment analysis process executed by the analysis unit 113 of the second embodiment.

ステップＳ３０１からステップＳ３０９までの処理の流れは実施例１と同一である。ステップＳ３０９の処理が実行された後、解析部１１３は、新規クラスタに属する評価対象データ５００の推論誤差を算出する（ステップＳ３５１）。 The processing flow from step S301 to step S309 is the same as that in the first embodiment. After the process of step S309 is executed, the analysis unit 113 calculates the inference error of the evaluation target data 500 belonging to the new cluster (step S351).

推論部１１１が出力する推論結果が数値である場合、解析部１１３は、推論結果及び正解の差を推論誤差として算出する。推論部１１１が出力する推論結果が数値以外の値である場合、解析部１１３は、推論結果及び正解が一致する場合は「１」、推論結果及び正解が一致しない場合は「０」を推論誤差として算出する。なお、前述した算出方法は一例であってこれに限定されない。 When the inference result output by the inference unit 111 is a numerical value, the analysis unit 113 calculates the difference between the inference result and the correct answer as an inference error. When the inference result output by the inference unit 111 is a value other than a numerical value, the analysis unit 113 sets "1" if the inference result and the correct answer match, and "0" if the inference result and the correct answer do not match. Calculate as. The above-mentioned calculation method is an example and is not limited to this.

次に、解析部１１３は、推論誤差の平均値が閾値以上であるか否かを判定する（ステップＳ３５２）。閾値は予め設定されているものとする。すなわち、推論部１１１の推論精度が低下しているか否かが判定される。 Next, the analysis unit 113 determines whether or not the average value of the inference error is equal to or greater than the threshold value (step S352). It is assumed that the threshold value is set in advance. That is, it is determined whether or not the inference accuracy of the inference unit 111 is lowered.

推論誤差の平均値が閾値より小さいと判定された場合、解析部１１３はステップＳ３１０に進む。実施例２のステップＳ３１０の処理は実施例１と同一である。 If it is determined that the average value of the inference error is smaller than the threshold value, the analysis unit 113 proceeds to step S310. The process of step S310 of the second embodiment is the same as that of the first embodiment.

推論誤差の平均値が閾値以上であると判定された場合、解析部１１３は、クラスタ情報、ソート結果、解析結果、環境変化通知、及びアラートを結果出力部１１４に出力する（ステップＳ３５３）。その後、解析部１１３は、環境解析処理を終了する。この場合、現在のＡＩシステムの運用環境を示す情報、運用環境の変化が検知されたことを示す情報、及び推論部１１１の推論精度が低下したことを示す情報が結果出力部１１４から出力される。 When it is determined that the average value of the inference error is equal to or greater than the threshold value, the analysis unit 113 outputs the cluster information, the sort result, the analysis result, the environmental change notification, and the alert to the result output unit 114 (step S353). After that, the analysis unit 113 ends the environment analysis process. In this case, the result output unit 114 outputs information indicating the current operating environment of the AI system, information indicating that a change in the operating environment has been detected, and information indicating that the inference accuracy of the inference unit 111 has deteriorated. ..

なお、クラスタの特性の解析において、解析部１１３は、クラスタに属する評価対象データ５００の推論誤差の平均値を算出してもよい。 In the analysis of the characteristics of the cluster, the analysis unit 113 may calculate the average value of the inference errors of the evaluation target data 500 belonging to the cluster.

図１４は、実施例２の結果出力部１１４が生成した環境解析情報５０３に基づいて表示される画面の一例を示す図である。 FIG. 14 is a diagram showing an example of a screen displayed based on the environment analysis information 503 generated by the result output unit 114 of the second embodiment.

図１４は、クラスタ情報、環境変化通知、及びアラートに基づいて生成された環境解析情報５０３により表示される画面９００の一例を示す。 FIG. 14 shows an example of the screen 900 displayed by the environment analysis information 503 generated based on the cluster information, the environment change notification, and the alert.

実施例２では、吹出９１２にクラスタ９１１に属する評価対象データ５００の推論誤差の平均値が含まれる。また、推論誤差の平均値が閾値以上である場合、アラートに基づいて吹出９１２の推論誤差の平均値が強調表示される。 In the second embodiment, the outlet 912 includes the average value of the inference error of the evaluation target data 500 belonging to the cluster 911. If the average value of the inference error is equal to or greater than the threshold value, the average value of the inference error of the outlet 912 is highlighted based on the alert.

実施例２によれば、ＡＩシステムの運用環境を把握のための可読性に優れた情報を出力することができる。また、計算機システムは、推論精度の低下を検知できる。これによって、再学習の自動化、及び、ユーザへの再学習の推奨が可能となる。 According to the second embodiment, it is possible to output highly readable information for grasping the operating environment of the AI system. In addition, the computer system can detect a decrease in inference accuracy. This makes it possible to automate re-learning and recommend re-learning to users.

実施例３の計算機システムは、ＡＩシステムの運用環境の変化に合わせて根拠ベクトル５０２の算出方法（計算アルゴリズム５２０）を更新する。以下、実施例１との差異を中心に実施例３について説明する。 The computer system of the third embodiment updates the calculation method (calculation algorithm 520) of the basis vector 502 in accordance with the change in the operating environment of the AI system. Hereinafter, Example 3 will be described with a focus on the differences from Example 1.

図１５は、実施例３の計算機システムの構成例を示す図である。 FIG. 15 is a diagram showing a configuration example of the computer system of the third embodiment.

図１５に示すように実施例３のシステム構成は実施例１と同一である。実施例３の計算機１００のハードウェア構成は実施例１と同一である。実施例３の計算機１００−１、１００−２のソフトウェア構成は実施例１と同一である。 As shown in FIG. 15, the system configuration of the third embodiment is the same as that of the first embodiment. The hardware configuration of the computer 100 of the third embodiment is the same as that of the first embodiment. The software configurations of the computers 100-1 and 100-2 of the third embodiment are the same as those of the first embodiment.

実施例３では、計算機１００−３のソフトウェア構成が一部異なる。具体的には、実施例３の計算機１００−３は、計算アルゴリズム５２０を更新するアルゴリズム更新部１１５を含む。 In the third embodiment, the software configuration of the computer 100-3 is partially different. Specifically, the computer 100-3 of the third embodiment includes an algorithm update unit 115 that updates the calculation algorithm 520.

実施例３の学習部１１０、推論部１１１、根拠ベクトル生成部１１２、及び結果出力部１１４が実行する処理は実施例１と同一である。 The processing executed by the learning unit 110, the inference unit 111, the basis vector generation unit 112, and the result output unit 114 of the third embodiment is the same as that of the first embodiment.

実施例３では、解析部１１３が実行する処理が一部異なる。図１６は、実施例３の計算機システムの処理の流れを示す図である。 In the third embodiment, the processing executed by the analysis unit 113 is partially different. FIG. 16 is a diagram showing a processing flow of the computer system of the third embodiment.

実施例３では、解析部１１３が、クラスタに属する入力データの解析結果をアルゴリズム更新部１１５に出力する。アルゴリズム更新部１１５は、当該解析結果に基づいて、計算アルゴリズム５２０を更新する。 In the third embodiment, the analysis unit 113 outputs the analysis result of the input data belonging to the cluster to the algorithm update unit 115. The algorithm update unit 115 updates the calculation algorithm 520 based on the analysis result.

図１７は、実施例３の解析部１１３が実行する環境解析処理の一例を説明するフローチャートである。 FIG. 17 is a flowchart illustrating an example of the environment analysis process executed by the analysis unit 113 of the third embodiment.

ステップＳ３０１からステップＳ３０９までの処理の流れは実施例１と同一である。ステップＳ３０９の処理が実行された後、解析部１１３は、解析結果をアルゴリズム更新部１１５に出力する（ステップＳ３６１）。その後、ステップＳ３１０に進む。実施例３のステップＳ３１０の処理は実施例１と同一である。 The processing flow from step S301 to step S309 is the same as that in the first embodiment. After the process of step S309 is executed, the analysis unit 113 outputs the analysis result to the algorithm update unit 115 (step S361). Then, the process proceeds to step S310. The process of step S310 of Example 3 is the same as that of Example 1.

図１８は、実施例３のアルゴリズム更新部１１５が実行するアルゴリズム更新処理の一例を説明するフローチャートである。 FIG. 18 is a flowchart illustrating an example of the algorithm update process executed by the algorithm update unit 115 of the third embodiment.

アルゴリズム更新部１１５は、解析部１１３から解析結果を受けつけた場合、アルゴリズム更新処理を実行する。 When the algorithm update unit 115 receives the analysis result from the analysis unit 113, the algorithm update unit 115 executes the algorithm update process.

アルゴリズム更新部１１５は、根拠ベクトル生成部１１２から計算アルゴリズム５２０の定義情報を取得する（ステップＳ４０１）。 The algorithm update unit 115 acquires the definition information of the calculation algorithm 520 from the basis vector generation unit 112 (step S401).

アルゴリズム更新部１１５は、解析結果に基づいて、計算アルゴリズム５２０の定義情報を更新する（ステップＳ４０２）。 The algorithm update unit 115 updates the definition information of the calculation algorithm 520 based on the analysis result (step S402).

アルゴリズム更新部１１５は、更新された計算アルゴリズム５２０の定義情報を根拠ベクトル生成部１１２に出力する（ステップＳ４０３）。その後、アルゴリズム更新部１１５はアルゴリズム更新処理を終了する。 The algorithm update unit 115 outputs the updated definition information of the calculation algorithm 520 to the basis vector generation unit 112 (step S403). After that, the algorithm update unit 115 ends the algorithm update process.

ここで、具体例を用いて計算アルゴリズム５２０の更新方法について説明する。ここでは、非特許文献２に記載の処理を実現する計算アルゴリズム５２０及び非特許文献３に記載の処理を実現する計算アルゴリズム５２０の更新方法について説明する。 Here, a method of updating the calculation algorithm 520 will be described using a specific example. Here, a method of updating the calculation algorithm 520 that realizes the processing described in Non-Patent Document 2 and the calculation algorithm 520 that realizes the processing described in Non-Patent Document 3 will be described.

（１）非特許文献２
非特許文献２に記載の処理では、バックグラウンドデータセットを用いて計算用データが生成される。非特許文献２に記載の処理を採用する場合、バックグラウンドデータセットが計算アルゴリズム５２０に含まれる。 (1) Non-Patent Document 2
In the process described in Non-Patent Document 2, calculation data is generated using a background data set. When the process described in Non-Patent Document 2 is adopted, the background data set is included in the calculation algorithm 520.

ここでは、各特徴量の平均値がバックグラウンドデータセットとして定義されているものとする。この場合、根拠ベクトル生成部１１２は、各特徴量を平均値の範囲内で増減させることによって計算用データを生成する。例えば、入力データの変数ｘ_１の特徴量が８で、平均値が５である場合、根拠ベクトル生成部１１２は、３から１３の範囲で変数ｘ_１の特徴量を変化させた計算用データを生成する。 Here, it is assumed that the average value of each feature is defined as a background data set. In this case, the basis vector generation unit 112 generates calculation data by increasing or decreasing each feature amount within the range of the average value. For example, the feature quantity of the variable x ₁ of the input data is 8, when the average value is 5, basis vector generation unit 112, the calculation data obtained by changing the characteristics of the variable x ₁ in the range 3 13 Generate.

アルゴリズム更新部１１５は、解析結果に基づいて各特徴量の平均値を算出する。アルゴリズム更新部１１５は、算出結果をバックグラウンドデータセットに反映させることによって計算アルゴリズム５２０を更新する。 The algorithm update unit 115 calculates the average value of each feature amount based on the analysis result. The algorithm update unit 115 updates the calculation algorithm 520 by reflecting the calculation result in the background data set.

（２）非特許文献３
ＣＮＮの出力の特徴マップの各クラスへの分類スコアＳ^ｃは式（１）で算出される。Ｃはクラスを表す。Ａ_ｉｊは特徴マップを表し、ｗ_ｋ ^ｃは重みを表す。また、ｉ、ｊはピクセルの座標を表す。Ｚは特徴マップのサイズを表す。 (2) Non-Patent Document 3
Classification score S ^c to each class of feature map of CNN output is calculated by the equation (1). C represents a class. A _ij represents a feature map and w _k ^c represents a weight. Further, i and j represent the coordinates of the pixel. Z represents the size of the feature map.

重みは入力したデータがクラスＣと分類するための式（２）の重要度と理解される。 The weight is understood as the importance of equation (2) for classifying the input data as class C.

ここで、式（１）の有限和の順序を入れ替えて式（３）のように変形する。 Here, the order of the finite sum of the equation (1) is changed and transformed as in the equation (3).

このとき、式（３）の括弧内の項は、ピクセル（ｉ．ｊ）がクラスＣに分類される影響の大きさと理解できる。ここで、重みは式（４）のように算出できる。ここでｙ^Ｃは推論部１１１の推論結果５０１を表す。 At this time, the term in parentheses in the equation (3) can be understood as the magnitude of the influence that the pixel (i.j) is classified into the class C. Here, the weight can be calculated as in the equation (4). Here, y ^C represents the inference result 501 of the inference unit 111.

非特許文献３では、式（５）に示すような影響度（重要度）を特徴マップの各チャネルの重み付き和として可視化している。 In Non-Patent Document 3, the degree of influence (importance) as shown in the equation (5) is visualized as a weighted sum of each channel of the feature map.

式（４）に示すように、重みは、ＡＩシステムの運用環境、特に、特徴量の分布の変化の影響を受ける。そこで、アルゴリズム更新部１１５は、特徴量の分布の変化に基づいて重みを補正する。 As shown in equation (4), the weight is affected by the operating environment of the AI system, especially the change in the distribution of features. Therefore, the algorithm update unit 115 corrects the weight based on the change in the distribution of the feature amount.

具体的には、アルゴリズム更新部１１５は、解析結果に基づいて各特徴量の平均値を算出する。アルゴリズム更新部１１５は、算出結果に基づいて入力データから計算用データを生成する。アルゴリズム更新部１１５は、計算用データを推論部１１１に入力して推論結果５０１を取得する。 Specifically, the algorithm update unit 115 calculates the average value of each feature amount based on the analysis result. The algorithm update unit 115 generates calculation data from the input data based on the calculation result. The algorithm update unit 115 inputs the calculation data to the inference unit 111 and acquires the inference result 501.

アルゴリズム更新部１１５は、入力データの推論結果５０１（ｙ^ｃ）と計算用データの推論結果５０１（ｙ^’ｃ）を用いて重みを補正する。具体的には、式（６）を用いて重みを算出する。 Algorithm updating unit 115 corrects the weighting using the inference results 501 of the input data ^{(y c)} and the inference result 501 (y ^'c) of the calculation data. Specifically, the weight is calculated using the equation (6).

なお、アルゴリズム更新部１１５は、異なる入力データ及び計算用データのペアを生成し、ｙ^ｃ−ｙ^’ｃ空間にプロットすることによって、式（７）の勾配を算出してもよい。 Incidentally, the algorithm updating unit 115 generates different input data and the calculation data pairs, by plotting the y ^c -y ^'c space may calculate the slope of the equation (7).

実施例３によれば、ＡＩシステムの運用環境の変化に合わせて、根拠ベクトル５０２を算出するための計算アルゴリズム５２０を自動的に更新することができる。これによって、ＡＩシステムの運用環境を監視するためのアルゴリズムの開発及び解釈の定義等を行う必要がない。 According to the third embodiment, the calculation algorithm 520 for calculating the basis vector 502 can be automatically updated according to the change in the operating environment of the AI system. As a result, it is not necessary to develop an algorithm for monitoring the operating environment of the AI system and define an interpretation.

なお、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。また、例えば、上記した実施例は本発明を分かりやすく説明するために構成を詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、各実施例の構成の一部について、他の構成に追加、削除、置換することが可能である。 The present invention is not limited to the above-described examples, and includes various modifications. Further, for example, the above-described embodiment describes the configuration in detail in order to explain the present invention in an easy-to-understand manner, and is not necessarily limited to the one including all the described configurations. In addition, a part of the configuration of each embodiment can be added, deleted, or replaced with another configuration.

また、上記の各構成、機能、処理部、処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。また、本発明は、実施例の機能を実現するソフトウェアのプログラムコードによっても実現できる。この場合、プログラムコードを記録した記憶媒体をコンピュータに提供し、そのコンピュータが備えるプロセッサが記憶媒体に格納されたプログラムコードを読み出す。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施例の機能を実現することになり、そのプログラムコード自体、及びそれを記憶した記憶媒体は本発明を構成することになる。このようなプログラムコードを供給するための記憶媒体としては、例えば、フレキシブルディスク、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、ハードディスク、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）、光ディスク、光磁気ディスク、ＣＤ−Ｒ、磁気テープ、不揮発性のメモリカード、ＲＯＭなどが用いられる。 Further, each of the above configurations, functions, processing units, processing means and the like may be realized by hardware by designing a part or all of them by, for example, an integrated circuit. The present invention can also be realized by a program code of software that realizes the functions of the examples. In this case, a storage medium in which the program code is recorded is provided to the computer, and the processor included in the computer reads out the program code stored in the storage medium. In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the program code itself and the storage medium storing the program code itself constitute the present invention. Examples of the storage medium for supplying such a program code include a flexible disk, a CD-ROM, a DVD-ROM, a hard disk, an SSD (Solid State Drive), an optical disk, a magneto-optical disk, a CD-R, and a magnetic tape. Non-volatile memory cards, ROMs, etc. are used.

また、本実施例に記載の機能を実現するプログラムコードは、例えば、アセンブラ、Ｃ／Ｃ＋＋、ｐｅｒｌ、Ｓｈｅｌｌ、ＰＨＰ、Ｊａｖａ（登録商標）等の広範囲のプログラム又はスクリプト言語で実装できる。 In addition, the program code that realizes the functions described in this embodiment can be implemented in a wide range of programs or script languages such as assembler, C / C ++, perl, Shell, PHP, and Java (registered trademark).

さらに、実施例の機能を実現するソフトウェアのプログラムコードを、ネットワークを介して配信することによって、それをコンピュータのハードディスクやメモリ等の記憶手段又はＣＤ−ＲＷ、ＣＤ−Ｒ等の記憶媒体に格納し、コンピュータが備えるプロセッサが当該記憶手段や当該記憶媒体に格納されたプログラムコードを読み出して実行するようにしてもよい。 Further, by distributing the program code of the software that realizes the functions of the examples via the network, it is stored in a storage means such as a hard disk or memory of a computer or a storage medium such as a CD-RW or a CD-R. , The processor provided in the computer may read and execute the program code stored in the storage means or the storage medium.

上述の実施例において、制御線や情報線は、説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。全ての構成が相互に接続されていてもよい。 In the above-described embodiment, the control lines and information lines show what is considered necessary for explanation, and do not necessarily indicate all the control lines and information lines in the product. All configurations may be interconnected.

１００計算機
１０１端末
１０５ネットワーク
１１０学習部
１１１推論部
１１２根拠ベクトル生成部
１１３解析部
１１４結果出力部
１１５アルゴリズム更新部
１２０事例データ管理情報
１２１根拠ベクトル管理情報
２０１プロセッサ
２０２主記憶装置
２０３副記憶装置
２０４ネットワークインタフェース
５００評価対象データ
５０１推論結果
５０２根拠ベクトル
５０３環境解析情報
５０４正解データ
５１０指標算出部
５１１分配部
５２０計算アルゴリズム
９００、１０００、１１００画面 100 Computer 101 Terminal 105 Network 110 Learning unit 111 Reasoning unit 112 Ground vector generation unit 113 Analysis unit 114 Result output unit 115 Algorithm update unit 120 Case data management information 121 Ground vector management information 201 Processor 202 Main storage device 203 Sub storage device 204 Network Interface 500 Evaluation target data 501 Inference result 502 Basis vector 503 Environmental analysis information 504 Correct answer data 510 Index calculation unit 511 Distribution unit 520 Calculation algorithm 900, 1000, 1100 screens

Claims

A computer system that analyzes the operating environment of a business system that has an inference unit that makes inferences.
Includes a processor, memory connected to the processor, and at least one calculator having a network interface connected to the processor.
An interpretation index calculation unit that calculates an interpretation index for interpreting the inference result obtained by inputting data including a plurality of features into the inference unit, and an interpretation index calculation unit.
It is provided with an analysis unit that analyzes the current operating environment of the business system based on the interpretation index and outputs the result of the analysis.
The computer system is characterized in that the interpretation index is a basis vector whose component is the degree of influence of each of the plurality of feature quantities included in the data input to the inference unit on the inference result.

The computer system according to claim 1.
The analysis unit
Perform clustering using the rationale vector
Based on the result of the clustering, it is determined whether or not a change in the operating environment of the business system has occurred.
A computer system characterized by outputting the result of the clustering and the result of the determination.

The computer system according to claim 2.
The analysis unit
Cluster information is generated based on the result of the clustering.
By comparing the cluster information generated in the past with the newly generated cluster information, it is determined whether or not a new cluster has appeared.
A computer system characterized in that when a new cluster appears, it is determined that a change in the operating environment of the business system has occurred.

The computer system according to claim 3.
The analysis unit
By analyzing the accuracy of the inference result of the data belonging to the new cluster, it is determined whether or not the inference accuracy of the inference unit is lowered.
A computer system characterized in that when it is determined that the inference accuracy of the inference unit is reduced, information for notifying the decrease in the inference accuracy of the inference unit is output.

The computer system according to claim 2.
A computer system characterized in that the analysis unit sorts the data based on sort conditions and outputs the result of the sort.

The computer system according to claim 5.
A computer system characterized in that the sort condition is at least one of the feature amount, the basis vector, the time series of the data, the cluster to which the data belongs, the inference result, and the accuracy of the inference result.

The computer system according to claim 1.
The interpretation index calculation unit calculates the interpretation index based on the calculation algorithm.
The analysis unit is a computer system characterized in that the calculation algorithm is updated based on the result of the analysis.

It is an analysis method of the operating environment of a business system that has an inference unit that performs inference, which is executed by a computer system.
The computer system includes a processor, a memory connected to the processor, and at least one computer having a network interface connected to the processor.
The method of analyzing the operating environment of the business system is as follows.
A first step in which the at least one computer calculates an interpretation index for interpreting an inference result obtained by inputting data including a plurality of feature quantities into the inference unit.
The at least one computer analyzes the current operating environment of the business system based on the interpretation index, and includes a second step of outputting the result of the analysis.
The interpretation index is a basis vector whose component is the degree of influence of each of the plurality of feature quantities included in the data input to the inference unit on the inference result. analysis method.

The method for analyzing the operating environment of the business system according to claim 8.
The second step is
A third step in which the at least one computer performs clustering using the rationale vector,
A fourth step in which the at least one computer determines whether or not a change in the operating environment of the business system has occurred based on the result of the clustering.
A method for analyzing an operating environment of a business system, wherein the at least one computer includes a fifth step of outputting the result of the clustering and the result of the determination.

The method for analyzing the operating environment of the business system according to claim 9.
The third step includes a step in which the at least one computer generates cluster information based on the result of the clustering.
The fourth step is
A step in which the at least one computer determines whether or not a new cluster has appeared by comparing the cluster information generated in the past with the newly generated cluster information.
A method for analyzing the operating environment of a business system, which comprises a step of determining that a change in the operating environment of the business system has occurred when the new cluster appears.

The method for analyzing the operating environment of the business system according to claim 10.
The fourth step is
A step in which the at least one computer analyzes the accuracy of the inference result of the data belonging to the new cluster to determine whether or not the inference accuracy of the inference unit is lowered.
When it is determined that the inference accuracy of the inference unit is low, the at least one computer includes a step of outputting information notifying that the inference accuracy of the inference unit is low. How to analyze the operating environment of the business system to be used.

The method for analyzing the operating environment of the business system according to claim 9.
The second step is
A step in which the at least one computer sorts the data based on the sorting conditions.
A method for analyzing an operating environment of a business system, wherein the at least one computer includes a step of outputting the result of the sort.

The method for analyzing the operating environment of the business system according to claim 12.
The operation of the business system, wherein the sort condition is at least one of the feature amount, the basis vector, the time series of the data, the cluster to which the data belongs, the inference result, and the accuracy of the inference result. How to analyze the environment.

The method for analyzing the operating environment of the business system according to claim 8.
The first step includes a step in which the at least one computer calculates the interpretation index based on a calculation algorithm.
The analysis method is a method for analyzing an operating environment of a business system, wherein the at least one computer includes a step of updating the calculation algorithm based on the result of the analysis.