JP7318383B2

JP7318383B2 - Information processing program, information processing method, and information processing apparatus

Info

Publication number: JP7318383B2
Application number: JP2019134863A
Authority: JP
Inventors: 晃一白幡; アミルハデルバシュ; 安基富田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2019-07-22
Filing date: 2019-07-22
Publication date: 2023-08-01
Anticipated expiration: 2039-07-22
Also published as: EP3779735A1; CN112288124A; US20210027004A1; US11625517B2; JP2021018683A

Description

本発明は、情報処理プログラム、情報処理方法、及び情報処理装置に関する。 The present invention relates to an information processing program, an information processing method, and an information processing apparatus.

近年、製品の構造にかかる応力の検証、気体や液体の振る舞いの検証等に、構造解析や流体解析のシミュレーションが用いられるようになっている。解析シミュレーションでは、非線形解析ではニュートン法などの反復法が用いられ、各反復において線形ソルバを解く。線形ソルバ自体も反復法で解かれることが多い。 In recent years, structural analysis and fluid analysis simulations have come to be used to verify the stress applied to the structure of products and to verify the behavior of gases and liquids. In analytical simulation, an iterative method such as Newton's method is used for nonlinear analysis, solving a linear solver at each iteration. Linear solvers themselves are also often solved iteratively.

デバイス特性に関する所定の条件を解析に用いる方程式により得た解の行列が線形方程式の収束判定を満たすか否かに応じて収束判定の条件を変更することで、当該解の行列の非線形方程式による収束条件を満たすまでの計算時間を短縮する技術等が知られている。 Convergence of the solution matrix by the nonlinear equation by changing the convergence judgment condition according to whether the solution matrix obtained from the equation using the predetermined condition for the analysis of the device characteristics satisfies the convergence judgment of the linear equation Techniques and the like for shortening the calculation time until the conditions are satisfied are known.

特開２００３－１６２５１７号公報JP-A-2003-162517 特開２０１７－１２３１６０号公報JP 2017-123160 A 特開２０１６－１４６１３９号公報JP 2016-146139 A 特開２０００－３３９１７９号公報JP-A-2000-339179

構造解析や流体解析のシミュレーションでは、非線形解析を行う場合がある。非線形解析ではニュートン法などの反復法では収束条件として残差の閾値があり、シミュレーションの演算時間を高速化するためには、残差閾値を最適な値に設定することが有効であるが、事前に最適な残差閾値を得ることができない。 Nonlinear analysis may be performed in simulations of structural analysis and fluid analysis. In nonlinear analysis, iterative methods such as Newton's method have a residual threshold value as a convergence condition. , the optimal residual threshold cannot be obtained.

したがって、１つの側面では、最適な残差閾値の設定を可能とし、非線形解析の反復回数を低減することで、演算時間を短縮することを目的とする。 Accordingly, one aspect of the present invention aims at shortening the computation time by making it possible to set an optimum residual threshold and reducing the number of iterations of the nonlinear analysis.

一態様によれば、１又は複数のコンピュータに、線形解析を反復して非線形解析を行う第１の処理を実行させ、前記第１の処理により複数の実験値で残差閾値ごとに求めた前記線形解析の反復ごとの残差推移と演算時間とに基づいて、ＮＮにより前記線形解析の収束判定に用いる残差閾値を推論させる第２の処理とを実行させ、前記第１の処理と前記第２の処理との間で行われるデータの受け渡しを、メモリに設定された共有メモリを用いたプロセス間通信により行わせることを特徴とする情報処理プログラムが提供される。
According to one aspect, one or more computers are caused to perform a first process of repeating linear analysis to perform nonlinear analysis, and the first process determines for each residual threshold with a plurality of experimental values a second process of inferring a residual threshold value used for determining convergence of the linear analysis by the NN based on the residual transition and the computation time for each iteration of the linear analysis; There is provided an information processing program characterized in that the data exchange between the processes of 2 is performed by inter-process communication using a shared memory set in the memory.

また、本開示によれば、情報処理方法、及び情報処理装置が提供される。 Further, according to the present disclosure, an information processing method and an information processing apparatus are provided.

最適な残差閾値の設定を可能とし、非線形解析の反復回数を低減することで、演算時間を短縮することができる。 By making it possible to set the optimum residual threshold and reducing the number of iterations of the nonlinear analysis, the computation time can be shortened.

情報処理装置のハードウェア構成例を示す図である。It is a figure which shows the hardware structural example of an information processing apparatus. 情報処理装置の機能構成例を示す図である。It is a figure which shows the functional structural example of an information processing apparatus. 学習処理を説明するための図である。It is a figure for demonstrating a learning process. 推論処理を説明するための図である。FIG. 4 is a diagram for explaining inference processing; 検証結果のグラフを示す図である。It is a figure which shows the graph of a verification result. 第１実施例におけるプロセス間通信によるデータ送受信を説明するための図である。FIG. 4 is a diagram for explaining data transmission/reception by inter-process communication in the first embodiment; 異種プログラム言語間におけるプロセス間通信の概要を説明するための図である。1 is a diagram for explaining an overview of interprocess communication between different types of programming languages; FIG. シミュレーションでＡＩが利用される場合のプロセス間通信の概要を説明するための図である。FIG. 4 is a diagram for explaining an overview of inter-process communication when AI is used in simulation; 図６における実装例を示す図である。FIG. 7 is a diagram showing an implementation example in FIG. 6; 図７における実装例を示す図である。FIG. 8 is a diagram showing an implementation example in FIG. 7; シミュレーション処理を説明するためのフローチャート図である。FIG. 10 is a flowchart for explaining simulation processing; シミュレーションとの連携において動作する機械学習処理を説明するためのフローチャート図である。FIG. 10 is a flow chart for explaining machine learning processing that operates in cooperation with a simulation; シミュレーションのメインプロセスにおけるプロセス間通信に係る処理の一例を説明するための図である。FIG. 10 is a diagram for explaining an example of processing related to inter-process communication in the main process of simulation; シミュレーションのメインプロセスにおけるプロセス間通信に係る処理の一例を説明するための図である。FIG. 10 is a diagram for explaining an example of processing related to inter-process communication in the main process of simulation; Python子プロセスにおけるプロセス間通信に係る処理の一例を説明するための図である。FIG. 4 is a diagram for explaining an example of processing related to inter-process communication in a Python child process; プロセス間通信における状態例を示す図である。FIG. 4 is a diagram showing an example of states in inter-process communication; オーバーヘッドの検証結果例を示す図である。It is a figure which shows the verification result example of an overhead. ディスクへの入出力によるデータサイズごとの検証結果を示す図である。FIG. 10 is a diagram showing verification results for each data size by input/output to/from a disk; ディスクへの入出力によるデータサイズごとの検証結果を示す図である。FIG. 10 is a diagram showing verification results for each data size by input/output to/from a disk; 非線形解析を説明するための図である。It is a figure for demonstrating a nonlinear analysis. 第２実施例における情報処理装置の第１の機能構成例において学習部の概要を説明するための図である。FIG. 12 is a diagram for explaining an outline of a learning unit in the first functional configuration example of the information processing apparatus in the second embodiment; 第２実施例における情報処理装置の第１の機能構成例において推論部の概要を説明するための図である。FIG. 11 is a diagram for explaining an outline of an inference unit in the first functional configuration example of the information processing apparatus in the second embodiment; 候補閾値ごとのシミュレーション時間の検証例を示す図である。FIG. 11 is a diagram illustrating an example of verification of simulation time for each candidate threshold; 第１の機能構成例における学習処理を説明するためのフローチャート図である。FIG. 5 is a flowchart for explaining learning processing in the first functional configuration example; 第１の機能構成例における推論処理を説明するためのフローチャート図である。FIG. 4 is a flowchart for explaining inference processing in the first functional configuration example; 第２実施例における情報処理装置の第２の機能構成例において学習部の概要を説明するための図である。FIG. 12 is a diagram for explaining an overview of a learning unit in a second functional configuration example of the information processing apparatus in the second embodiment; 第２実施例における情報処理装置の第１の機能構成例において推論部の概要を説明するための図である。FIG. 11 is a diagram for explaining an outline of an inference unit in the first functional configuration example of the information processing apparatus in the second embodiment; 第２の機能構成例における学習処理の第１の例を説明するためのフローチャート図である。FIG. 11 is a flowchart for explaining a first example of learning processing in the second functional configuration example; 第２の機能構成例における推論処理を説明するためのフローチャート図である。FIG. 11 is a flowchart for explaining inference processing in the second functional configuration example; 第２の機能構成例における学習処理の第２の例を説明するためのフローチャート図である。FIG. 11 is a flowchart for explaining a second example of learning processing in the second functional configuration example; 第２の機能構成例における学習処理の第３の例を説明するためのフローチャート図である。FIG. 11 is a flowchart for explaining a third example of learning processing in the second functional configuration example; 学習処理の第１の例による候補閾値ごとの学習結果を示す図である。FIG. 10 is a diagram showing learning results for each candidate threshold value according to the first example of learning processing; 図３２のラベル付けに基づく学習結果を示す図である。FIG. 33 is a diagram showing learning results based on the labeling of FIG. 32; 学習処理の第２の例による候補閾値ごとの学習結果を示す図である。FIG. 10 is a diagram showing learning results for each candidate threshold value according to a second example of learning processing; 図３４のラベル付けに基づく学習結果を示す図である。FIG. 35 is a diagram showing learning results based on the labeling of FIG. 34; 学習処理の第２の例による候補閾値ごとの学習結果を示す図である。FIG. 10 is a diagram showing learning results for each candidate threshold value according to a second example of learning processing; 実行時間の検証結果を示す図である。It is a figure which shows the verification result of execution time. シミュレーション結果の検証結果を示す図である。It is a figure which shows the verification result of a simulation result. 閾値の変化例を示す図である。It is a figure which shows the example of a change of a threshold value. 処理時間の経過例を示す図である。It is a figure which shows the progress example of processing time. 反復回数の経過例を示す図である。It is a figure which shows the progress example of the number of iterations. 第２実施例における実行ログの例を示す図である。FIG. 11 is a diagram showing an example of an execution log in the second embodiment; FIG.

以下、本発明の実施の形態を図面に基づいて説明する。構造解析や流体解析のシミュレーションでは非線形方程式の解析（「非線形解析」という）を行うこともあり、非線形解析ではNewton法などを用いて線形ソルバを反復的に解くことで解を求めている。線形ソルバ自体も大規模な問題に対しては反復法（ＣＧ（Conjugate Gradient：共役勾配法）など）を用いて解を求めることが多い。 BEST MODE FOR CARRYING OUT THE INVENTION An embodiment of the present invention will be described below with reference to the drawings. In structural analysis and fluid analysis simulations, nonlinear equation analysis (called "nonlinear analysis") is sometimes performed, and in nonlinear analysis, solutions are obtained by iteratively solving a linear solver using Newton's method or the like. The linear solver itself often uses an iterative method (such as CG (Conjugate Gradient)) to obtain a solution for a large-scale problem.

シミュレーションの演算時間を高速化するために、発明者等は、非線形解析における反復処理の収束条件となる残差閾値の調整方向をＡＩ（Artificial Intelligence：人工知能）に学習させて動的に増加又は減少することによりシミュレーションを実行することを見出した。この場合、線形ソルバの反復処理により収束するまでの時間経過による残差の系列（「残差曲線」又は「残差推移」という）を、シミュレーションプログラムから機械学習プログラムへと渡すことで、残差閾値の学習が行われる。この手法は第２実施例で詳述する。 In order to speed up the calculation time of the simulation, the inventors let AI (Artificial Intelligence) learn the adjustment direction of the residual threshold, which is the convergence condition of iterative processing in nonlinear analysis, and dynamically increase or We found that the simulation was performed by reducing In this case, by passing the series of residuals over time until convergence by the iterative process of the linear solver (called "residual curve" or "residual transition") from the simulation program to the machine learning program, the residual Threshold learning is performed. This technique will be described in detail in the second embodiment.

しかしながら、シミュレーションプログラムと機械学習プログラムとは一般的に言語が異なるため、シミュレーションと学習との間でデータをやり取りするためのファイルアクセスに時間が掛かる。この問題を解決するために、発明者等は、第１実施例によるプロセス間通信を見出した。第２実施例では、動的な残差閾値の調整手法について説明する。 However, since simulation programs and machine learning programs generally use different languages, it takes time to access files for exchanging data between simulation and learning. To solve this problem, the inventors found inter-process communication according to the first embodiment. In the second embodiment, a dynamic residual threshold adjustment technique will be described.

後述される第１実施例～第２実施例は、図１に示すようなハードウェア構成を有する情報処理装置に実装可能であり、第１実施例～第２実施例のいずれか又は両方を実装することにより、シミュレーションの処理速度を改善することができる。 The first to second embodiments described later can be implemented in an information processing apparatus having a hardware configuration as shown in FIG. 1, and either or both of the first to second embodiments can be implemented. By doing so, the processing speed of the simulation can be improved.

図１は、情報処理装置のハードウェア構成例を示す図である。図１より、情報処理装置１００は、コンピュータであって、ＣＰＵ（Central Processing Unit）１１と、メインメモリ１２と、ディスク１３と、ＧＰＵ（Graphics Processing Unit）１４ｇと、ＧＰＵメモリ１４ｍと、入力装置１５と、表示装置１６と、通信Ｉ／Ｆ１７と、ドライブ装置１８とを有し、バスＢに接続される。このハードウェア構成は、第２実施例においても同様である。 FIG. 1 is a diagram illustrating a hardware configuration example of an information processing apparatus. 1, the information processing apparatus 100 is a computer, and includes a CPU (Central Processing Unit) 11, a main memory 12, a disk 13, a GPU (Graphics Processing Unit) 14g, a GPU memory 14m, and an input device 15. , a display device 16, a communication I/F 17, and a drive device 18, and are connected to the bus B. This hardware configuration is the same in the second embodiment.

ＣＰＵ１１は、情報処理装置１００全体を制御するプロセッサに相当し、ディスク１３から読み込まれメインメモリ１２（例えば、ＲＡＭ（Random Access Memory））に格納されたシミュレーションプログラムを実行し以下に説明する本実施例における処理を実現する。ＣＰＵ１１は、また、シミュレーション以外の様々な処理を行う。 The CPU 11 corresponds to a processor that controls the entire information processing apparatus 100, and executes a simulation program read from a disk 13 and stored in a main memory 12 (for example, a RAM (random access memory)) to execute the present embodiment described below. Realize the processing in The CPU 11 also performs various processes other than simulation.

ＧＰＵ１４ｇは、ＡＩ推論用のプロセッサに相当し、シミュレーションの実行により得られたシミュレーションデータを用いて、本実施例における残差閾値の調整方向を推定するＡＩ推論を行う。ＧＰＵ１４ｍは、ＧＰＵ１４ｇが利用するローカルメモリでありＡＩ推論を行うＮＮ２７０（図３）のプログラムを記憶している。ＡＩ推論は、ＧＰＵ１４ｇがＧＰＵ１４ｍに記憶されたプログラムを実行することにより、ＮＮ２７０の最適なパラメータ値を学習する。 The GPU 14g corresponds to a processor for AI inference, and performs AI inference for estimating the adjustment direction of the residual threshold in this embodiment using simulation data obtained by executing simulation. The GPU 14m is a local memory used by the GPU 14g and stores a program of the NN 270 (FIG. 3) that performs AI inference. AI inference learns the optimal parameter values of the NN 270 by executing a program stored in the GPU 14m by the GPU 14g.

入力装置１５は、ユーザによって操作され、操作に応じてデータを入力し、表示装置１６は、ユーザーインタフェースとして様々な画面を表示する。通信Ｉ／Ｆ１７は、外部装置との通信を制御する。 The input device 15 is operated by a user to input data according to the operation, and the display device 16 displays various screens as a user interface. Communication I/F 17 controls communication with an external device.

記憶媒体１９（例えば、ＣＤ－ＲＯＭ（Compact Disc Read-Only Memory）等）に記憶された本実施例に係るシミュレーションプログラムは、ドライブ装置１８を介してディスク１３にインストールされ、ＣＰＵ１１によって実行可能となる。また、機械学習プログラムも同様に記憶媒体１９からドライブ装置１８を介してディスク１３にインストールされ、ＧＰＵ１４ｇによって実行可能となる。 A simulation program according to the present embodiment stored in a storage medium 19 (for example, a CD-ROM (Compact Disc Read-Only Memory) or the like) is installed on the disc 13 via the drive device 18 and can be executed by the CPU 11. . Similarly, the machine learning program is installed from the storage medium 19 to the disk 13 via the drive device 18, and can be executed by the GPU 14g.

記憶媒体１９（例えば、ＣＤ－ＲＯＭ（Compact Disc Read-Only Memory）等）に記憶された本実施例に係るシミュレーションプログラム及び機械学習プログラムは、ドライブ装置１８を介して記憶部１３０にインストールされ、ＣＰＵ１１によって実行可能となる。シミュレーションプログラム及び機械学習プログラムはそれぞれ別の記憶媒体１９からインストールされてもよい。 A simulation program and a machine learning program according to the present embodiment stored in a storage medium 19 (for example, a CD-ROM (Compact Disc Read-Only Memory), etc.) are installed in the storage unit 130 via the drive device 18, and the CPU 11 can be executed by The simulation program and the machine learning program may be installed from separate storage media 19, respectively.

尚、本実施例に係るプログラムを格納する記憶媒体１９はＣＤ－ＲＯＭに限定されず、コンピュータが読み取り可能な、構造（structure）を有する１つ以上の非一時的（non-transitory）な、有形（tangible）な媒体であればよい。コンピュータ読取可能な記憶媒体として、ＣＤ－ＲＯＭの他に、ＤＶＤ（Digital Versatile Disk）ディスク、ＵＳＢメモリ等の可搬型記録媒体、フラッシュメモリ等の半導体メモリであっても良い。 Note that the storage medium 19 for storing the program according to the present embodiment is not limited to a CD-ROM, and one or more computer-readable, non-transitory, tangible (tangible) medium. As a computer-readable storage medium, in addition to a CD-ROM, a DVD (Digital Versatile Disk) disk, a portable recording medium such as a USB memory, and a semiconductor memory such as a flash memory may be used.

［第１実施例］
また、第１実施例では、図２に示すような機能構成を有する情報処理装置１００において、プロセス間通信によりシミュレーションを高速化する手法について説明する。図２は、情報処理装置の機能構成例を示す図である。 [First embodiment]
Also, in the first embodiment, a technique for speeding up simulation by inter-process communication in the information processing apparatus 100 having the functional configuration as shown in FIG. 2 will be described. FIG. 2 is a diagram illustrating a functional configuration example of an information processing apparatus.

図２より、情報処理装置１００は、主に、シミュレーション部３０と、機械学習部４０とを処理部として有する。また、問題データ２、閾値Ｔｈ、シミュレーションデータ２０４ｄ、シミュレーション結果５、推論結果７１等は、第１実施例で説明するように一部を共有メモリ１２ａ（図８）として用いることにより、メインメモリ１２に記憶されるものとして示す。 As shown in FIG. 2, the information processing apparatus 100 mainly includes a simulation unit 30 and a machine learning unit 40 as processing units. Further, the problem data 2, the threshold value Th, the simulation data 204d, the simulation result 5, the inference result 71, etc. are stored in the main memory 12 by using a part thereof as the shared memory 12a (FIG. 8) as described in the first embodiment. are shown as being stored in

シミュレーション部３０は、ＣＰＵ１１がシミュレーションプログラムを実行することにより実現される処理部であり、問題データ２に対して所定の解析を行い、主に、非線形解析部３２と、線形解析部３４とを有する。所定の解析とは、構造解析、流体解析等である。シミュレーション部３０によって得られた解析結果は表示装置１６に表示されてもよい。 The simulation unit 30 is a processing unit realized by the CPU 11 executing a simulation program, performs a predetermined analysis on the problem data 2, and mainly has a nonlinear analysis unit 32 and a linear analysis unit 34. . The predetermined analysis includes structural analysis, fluid analysis, and the like. The analysis results obtained by the simulation unit 30 may be displayed on the display device 16 .

非線形解析部３２は、問題データ２を読み込んで非線形解析を行う処理部である。非線形解析ではNewton-Raphson法などの反復法を用いて解を得る。非線形解析部３２は、非線形回数を所定の回数分繰り返し、繰り返しごとに、線形解析部３４に、問題データ２から得られるパラメータ値等と、収束判定に用いる閾値Ｔｈ（即ち、残差閾値）とを与え、線形方程式による解析を行わせる。非線形解析によって得られた非線形解析データは、後述されるnonlin_dataに相当する。 The nonlinear analysis unit 32 is a processing unit that reads the problem data 2 and performs nonlinear analysis. In nonlinear analysis, an iterative method such as the Newton-Raphson method is used to obtain the solution. The nonlinear analysis unit 32 repeats the nonlinear number of times for a predetermined number of times. and let it be analyzed by linear equations. Nonlinear analysis data obtained by nonlinear analysis corresponds to nonlin_data described later.

線形解析部３４は、閾値Ｔｈを満たすまで線形方程式を用いた解析を反復し、反復ごとの解と閾値Ｔｈとの差分を示す残差と、シミュレーション時間等の線形解析データをメインメモリ１２に出力する。閾値Ｔｈを満たすまで線形方程式を用いた解析を反復する処理には、線形ソルバが用いられる。線形解析部３４によって得られた線形解析データは、後述されるlin_dataに相当する。 The linear analysis unit 34 repeats the analysis using the linear equation until the threshold Th is satisfied, and outputs the residual indicating the difference between the solution for each iteration and the threshold Th and the linear analysis data such as the simulation time to the main memory 12. do. A linear solver is used for the process of repeating the analysis using the linear equations until the threshold Th is satisfied. The linear analysis data obtained by the linear analysis unit 34 corresponds to lin_data described later.

非線形解析データは、非線形解析の繰り返しごとに得られるデータであり、また、線形解析データは反復ごとに累積されたデータである。そして、様々なパラメータ値、解等を含む、非線形解析データ及び線形解析データがシミュレーションデータ２０４ｄに相当する。シミュレーションデータ２０４ｄは、ＡＩ推論ごとに機械学習部４０に読み込まれる。 Nonlinear analysis data is data obtained for each iteration of nonlinear analysis, and linear analysis data is data accumulated for each iteration. Nonlinear analysis data and linear analysis data, including various parameter values, solutions, etc., correspond to the simulation data 204d. The simulation data 204d is read into the machine learning unit 40 for each AI inference.

また、シミュレーションの開始から終了までの時間経過で示された実行環境及び実行状態を示すデータの集まりを実行ログ４ａ（図４２）という。実行ログ４ａには、シミュレーションデータ２０４ｄが含まれる。また、問題データ２に対する目的の解をシミュレーション結果５という。 Also, a collection of data indicating the execution environment and the execution state indicated by the passage of time from the start to the end of the simulation is called an execution log 4a (FIG. 42). The execution log 4a includes simulation data 204d. A target solution for the problem data 2 is called a simulation result 5 .

第１実施例では、非線形解析を行うために線形解析を用いる情報処理装置１００において、実験値で閾値Ｔｈごとに求めた反復ごとの残差推移、演算時間などを示すログデータに基づいて残差の閾値Ｔｈを決定する。 In the first embodiment, in the information processing apparatus 100 that uses linear analysis to perform nonlinear analysis, the residual error is calculated based on log data indicating the transition of the residual error for each iteration, the calculation time, etc. obtained for each threshold value Th using experimental values. to determine the threshold Th.

機械学習部４０は、線形解析部３４で用いる閾値Ｔｈを調整するＮＮ２７０のパラメータ値を学習する学習部５０と、訓練済みのＮＮ２７０を用いて閾値Ｔｈの増減（調整方向）を推論する推論部６０とを有する。 The machine learning unit 40 includes a learning unit 50 that learns the parameter values of the NN 270 that adjusts the threshold Th used in the linear analysis unit 34, and an inference unit 60 that infers the increase or decrease (adjustment direction) of the threshold Th using the trained NN 270. and

学習部５０は、図３に示すように、シミュレーションデータ２０４ｄにラベル付けして作成した学習データ６ｇをＮＮ２７０に入力し、ＮＮ２７０が推論したクラスを示す推論結果７１と、学習データ６ｇのラベルとの誤差をＮＮ２７０にフィードバックする。 As shown in FIG. 3, the learning unit 50 inputs the learning data 6g created by labeling the simulation data 204d to the NN 270, and the inference result 71 indicating the class inferred by the NN 270 and the label of the learning data 6g. The error is fed back to NN270.

推論部６０は、訓練済みのＮＮ２７０により閾値Ｔｈの増減を示す推論結果７１を得る。一例として、推論結果７１は、ＮＮ２７０によって分類されるクラス「１」、「２」、及び「３」のいずれかを示すものとする。この場合、クラス「１」は、閾値Ｔｈの増加を指定し（閾値up）、クラス「２」は、閾値Ｔｈの調整不要を指定し（閾値keep）、クラス「３」は、閾値Ｔｈの減少を指定する（閾値down）。 The inference unit 60 obtains an inference result 71 indicating an increase or decrease in the threshold Th from the trained NN 270 . As an example, the inference result 71 indicates one of the classes "1", "2", and "3" classified by the NN270. In this case, class "1" designates an increase in the threshold Th (threshold up), class "2" designates no need to adjust the threshold Th (threshold keep), and class "3" designates a decrease in the threshold Th. (threshold down).

推論部６０は、訓練済みのＮＮ２７０に、シミュレーションデータ２０４ｄに対して残差を小さくするようにクラスを推論する。推論結果７１はメインメモリ１２に記憶され、シミュレーション部３０の非線形解析部３２への戻り値となる。 The inference unit 60 infers a class to the trained NN 270 so as to reduce the residual for the simulation data 204d. The inference result 71 is stored in the main memory 12 and serves as a return value to the nonlinear analysis section 32 of the simulation section 30 .

非線形解析部３２では、推論部６０が推論した推論結果７１に基づいて閾値Ｔｈを更新し線形解析部３４に与える。このような閾値Ｔｈの調整によりシミュレーションに掛かる時間を短縮することが可能となる。 The nonlinear analysis unit 32 updates the threshold value Th based on the inference result 71 inferred by the inference unit 60 and provides it to the linear analysis unit 34 . By adjusting the threshold value Th in this way, it is possible to shorten the time required for the simulation.

閾値Ｔｈは、反復法での収束条件の一つであり、線形解析部３４への閾値Ｔｈの与え方が解の精度と実行時間に影響を与える。閾値Ｔｈは任意の値に設定しても、非線形解析が収束する限り、最終的な解の精度には影響を与えないと考えられる。しかしながら、最適な残差閾値を高速に推定できれば、非線形解析全体を高速化することが可能となる。 The threshold Th is one of the convergence conditions in the iterative method, and how the threshold Th is given to the linear analysis unit 34 affects solution accuracy and execution time. Even if the threshold value Th is set to an arbitrary value, as long as the nonlinear analysis converges, it is considered that the accuracy of the final solution is not affected. However, if the optimal residual threshold can be estimated quickly, the entire nonlinear analysis can be speeded up.

このような観点から、発明者等は、ＮＮ２７０を用いて閾値Ｔｈの増減を学習し、また、訓練済みのＮＮ２７０を用いて、閾値Ｔｈの増減を予測することで、最適値へと閾値Ｔｈを近付けることができ、線形解析が収束するまでの反復回数及び処理時間を短縮できることを見出した。図３及び図４を参照して、情報処理装置１００で行われる処理の概要について説明する。この例では、ＮＮ２７０としてＣＮＮ（Convolutional Neural Network）を用いた場合で説明するが、ＮＮはＣＮＮに限定されるものではない。 From this point of view, the inventors use the NN 270 to learn the increase and decrease of the threshold Th, and also use the trained NN 270 to predict the increase and decrease of the threshold Th, thereby adjusting the threshold Th to the optimum value. It was found that the number of iterations until the linear analysis converges and the processing time can be shortened. An overview of the processing performed by the information processing apparatus 100 will be described with reference to FIGS. 3 and 4. FIG. In this example, a case where a CNN (Convolutional Neural Network) is used as the NN 270 will be described, but the NN is not limited to the CNN.

図３は、学習処理を説明するための図である。図３において、問題データ２に対して異なる候補閾値を用いたシミュレーションごとのシミュレーションデータ２０４ｄを参照して、反復ごとの残差推移、演算時間等に基づいて候補閾値の中から、学習用に用いる基準閾値を決定する。そして、残差曲線データ４ｄそれぞれには、この基準閾値と候補閾値との差分に基づいてラベル付けを行い、学習データ６ｇとして使用される。候補閾値は、実験値に相当する。 FIG. 3 is a diagram for explaining the learning process. In FIG. 3, referring to the simulation data 204d for each simulation using different candidate thresholds for the problem data 2, the candidate thresholds to be used for learning are selected from among the candidate thresholds based on the transition of the residual error for each iteration, the calculation time, etc. Determine a baseline threshold. Then, each residual curve data 4d is labeled based on the difference between the reference threshold and the candidate threshold, and is used as learning data 6g. The candidate thresholds correspond to experimental values.

基準閾値より小さい候補閾値を使用したシミュレーションデータ２０４ｄには、ラベル「１」が付与されている。基準閾値より大きい候補閾値を使用したシミュレーションデータ２０４ｄは、ラベル「３」が付与されている。また、基準閾値と一致する候補閾値を使用したシミュレーションデータ２０４ｄには、ラベル「２」が付与されている。即ち、前述した最短時間でシミュレーションが終了したときのシミュレーションデータ２０４ｄに対しては、ラベル「２」が付与される。 The simulation data 204d using the candidate threshold smaller than the reference threshold is labeled "1". Simulation data 204d using a candidate threshold greater than the reference threshold is labeled "3". The simulation data 204d using the candidate threshold that matches the reference threshold is given the label "2". That is, the label "2" is assigned to the simulation data 204d when the simulation is finished in the shortest time.

シミュレーションデータ２０４ｄをＮＮ２７０に入力することで推論結果７１を得る。一例として、推論結果７１は、閾値Ｔｈを上げるクラス１、閾値Ｔｈを維持するクラス２、及び閾値Ｔｈを下げるクラス３のいずれか１つを示す。推論結果７１は、学習データ６ｇのラベルと比較され、比較結果としての誤差がＮＮ２７０にフィードバックされる。この誤差のフィードバックによりＮＮ２７０のパラメータ値が更新される。訓練済みのＮＮ２７０が、推論部６０で用いられる。 An inference result 71 is obtained by inputting the simulation data 204 d to the NN 270 . As an example, the inference result 71 indicates any one of class 1 that raises the threshold Th, class 2 that maintains the threshold Th, and class 3 that lowers the threshold Th. The inference result 71 is compared with the label of the learning data 6g, and the error as the comparison result is fed back to the NN270. This error feedback updates the NN 270 parameter values. A trained NN 270 is used in the reasoning unit 60 .

上述では、クラス１、２、及び３に分類する場合を例としたが、閾値Ｔｈを上げる場合とそれ以外の場合のみに分類してもよい。その場合には、学習部５０では、学習時に候補閾値３が基準閾値３ｒｅｆより低い場合にラベル０を付与し、それ以外ではラベル１を付与するようにし、推論結果７１として、クラス０又はクラス１を推論するようにしてもよい。 In the above description, the classification into classes 1, 2, and 3 is taken as an example, but the classification may be limited to cases where the threshold value Th is raised and other cases. In that case, the learning unit 50 assigns the label 0 when the candidate threshold 3 is lower than the reference threshold 3ref during learning, and assigns the label 1 otherwise. may be inferred.

図４は、推論処理を説明するための図である。図４では、シミュレーション部３０としてＣＰＵ１１を機能させるシミュレーションプログラムの一例として、疑似コード３１を示している。 FIG. 4 is a diagram for explaining inference processing. FIG. 4 shows pseudo code 31 as an example of a simulation program that causes the CPU 11 to function as the simulation unit 30 .

疑似コード３１において、“call fstr_Newton”により非線形解析が実行されると、非線形解析部３２内で、“call auto_threshold”により、機械学習部４０の推論部６０が呼び出される。推論部６０は、シミュレーションデータ２０４ｄをＮＮ２７０に入力し、推論結果７１を得る。得られた推論結果７１は、“call auto_threshold”の戻り値として非線形解析部３２に渡される。非線形解析部３２は、推論結果７１に基づいて閾値Ｔｈを更新し、“call solve_LINEQ”を実行し、更新された閾値Ｔｈで線形ソルバでの処理が行われる。 In the pseudo code 31, when the nonlinear analysis is executed by "call fstr_Newton", the reasoning section 60 of the machine learning section 40 is called by "call auto_threshold" in the nonlinear analysis section 32. FIG. The inference unit 60 inputs the simulation data 204 d to the NN 270 and obtains an inference result 71 . The obtained inference result 71 is passed to the nonlinear analysis unit 32 as the return value of "call auto_threshold". The nonlinear analysis unit 32 updates the threshold Th based on the inference result 71, executes "call solve_LINEQ", and performs processing with the linear solver with the updated threshold Th.

“call auto_threshold”による推論結果７１の取得は、線形ソルバの実行ごとに行わなくてもよい。予め定めた線形ソルバの実行回数ごとに、推論結果７１を取得し閾値Ｔｈを更新してもよい。 Acquisition of the inference result 71 by “call auto_threshold” does not have to be performed each time the linear solver is executed. The inference result 71 may be acquired and the threshold Th may be updated for each predetermined number of times the linear solver is executed.

上述したような構成において、発明者等は、シミュレーションデータ２０４ｄが１ＧＢのデータサイズの場合でオーバーヘッドを検証した。図５は、検証結果のグラフを示す図である。図５において、縦軸に反復ごとのオーバーヘッドを時間（秒）で示し、横軸に反復回数を示している。シミュレーションデータ２０４ｄの入出力によるオーバーヘッドが反復ごとに線形解析の処理時間を示している。 In the configuration as described above, the inventors verified the overhead when the simulation data 204d has a data size of 1 GB. FIG. 5 is a diagram showing a graph of verification results. In FIG. 5, the vertical axis indicates the overhead per iteration in time (seconds), and the horizontal axis indicates the number of iterations. The overhead due to the input and output of simulation data 204d indicates the processing time of the linear analysis for each iteration.

図５のグラフ３ａでは、シミュ―レーションにおける線形解析の反復が８６回の場合、シミュレーションデータ２０４ｄに対するファイルアクセスを伴う処理時間３ｂを示している。この検証結果から、ＮＮ２７０を用いて閾値Ｔｈを調整することで、シミュレーションを高速化できたことが分かる。 Graph 3a of FIG. 5 shows processing time 3b with file access to simulation data 204d for 86 iterations of linear analysis in the simulation. From this verification result, it can be seen that the speed of the simulation can be increased by adjusting the threshold value Th using the NN270.

ところで、前述したように、シミュレーション部３０を実現するシミュレーションプログラムと、機械学習部４０を実現する機械学習プログラムとは言語が異なっている。一例として、シミュレーションプログラムは、構造解析ソルバであれば、FrontISTR等の科学計算用の手続き型言語のプログラムである。機械学習プログラムについては、Python等のＮＮを構築するスクリプト言語と、Python等から利用可能なライブラリのKeras等では深層学習言語が使用されている。 By the way, as described above, the language of the simulation program that implements the simulation unit 30 and the machine learning program that implements the machine learning unit 40 are different. As an example, if the simulation program is a structural analysis solver, it is a procedural language program for scientific calculation such as FrontISTR. As for machine learning programs, a deep learning language is used in a script language such as Python for constructing a NN, and a library such as Keras that can be used from Python or the like.

そのため、シミュレーション部３０のプログラミング言語と、機械学習部４０のうち、ＮＮ２７０を利用するスクリプト言語とは、ＣＰＵ１１で実行され、ＮＮ２７０はライブラリとしてＧＰＵｍに記憶され、ＧＰＵ１５ｇによって実行される。 Therefore, the programming language of the simulation unit 30 and the script language using the NN 270 in the machine learning unit 40 are executed by the CPU 11, the NN 270 is stored in the GPUm as a library, and executed by the GPU 15g.

プログラミング言語とスクリプト言語とはＣＰＵ１１で実行されるがプログラム言語が異なるため、シミュレーションデータ２０４ｄと、推論結果７１とは、通常、ディスク１３に格納し、シミュレーション部３０と機械学習部４０との間のデータの送受信にはファイルアクセスが行われている。このファイルアクセスは、シミュレーション時間を消費する問題がある。ファイルアクセスとは、具体的には、図４において、推論部６０によるシミュレーションデータ２０４ｄの取得と、非線形解析部３２による推論結果７１の取得の際に行われる。 Although the programming language and the script language are executed by the CPU 11, the programming languages are different. File access is performed for data transmission/reception. This file access has the problem of consuming simulation time. More specifically, file access is performed when the inference unit 60 acquires the simulation data 204d and the nonlinear analysis unit 32 acquires the inference result 71 in FIG.

第１実施例では、このファイルアクセスによる処理時間の消費を改善するための仕組みについて図６及び図７で説明する。シミュレーションデータ２０４ｄ等のＮＮ２７０への入力データ及びＮＮからの戻り値である推論結果７１は、共有メモリを介して転送し、ディスク１３へのアクセスを回避する。また、閾値Ｔｈが更新されるまでシミュレーション部３０による共有メモリへのアクセスをブロックする。 In the first embodiment, a mechanism for improving consumption of processing time due to file access will be described with reference to FIGS. 6 and 7. FIG. Input data to the NN 270 such as the simulation data 204d and inference results 71, which are return values from the NN, are transferred via shared memory to avoid accessing the disk 13. FIG. Also, access to the shared memory by the simulation unit 30 is blocked until the threshold Th is updated.

図６は、第１実施例におけるプロセス間通信によるデータ送受信を説明するための図である。図６中、点線はデータの流れを示し、実線は処理の流れを示す。以下、情報処理装置１００は、UNIX系のOS（一例として、LINUX等）を実装している場合で説明するが、OSを限定するものではない。 FIG. 6 is a diagram for explaining data transmission/reception by inter-process communication in the first embodiment. In FIG. 6, dotted lines indicate the flow of data, and solid lines indicate the flow of processing. In the following, the information processing apparatus 100 will be described as having a UNIX-based OS (LINUX, for example), but the OS is not limited.

図６において、反復ループを含む高性能（High-Performance Computing、HPC）アプリケーション２３０は、処理の反復ごとにシミュレーションデータ２０４ｄが出力される。シミュレーションデータ２３４ｄは、ここでは、シミュレーション処理の高速化に関する値の調整を行うためにＮＮ２７０に入力するデータである。 In FIG. 6, a High-Performance Computing (HPC) application 230 including an iterative loop outputs simulation data 204d for each iteration of processing. The simulation data 234d here is data input to the NN 270 in order to adjust values relating to speeding up of the simulation processing.

高性能アプリケーション２３２は、高性能アプリケーション２３０の一部であり、データ転送機能を有する部分に相当する。高性能アプリケーション２３２は、シミュレーションデータ２０４ｄを共有メモリ１２ａに書き込み（send_data()）、名前付きパイプ１２ｂにデータ転送完了を設定する。具体的には、シミュレーションデータ２０４ｄの先頭アドレスを設定することによりデータ転送完了を示す。 The high performance application 232 is part of the high performance application 230 and corresponds to the part having the data transfer function. The high performance application 232 writes (send_data()) the simulation data 204d to the shared memory 12a and sets data transfer complete to the named pipe 12b. Specifically, the completion of data transfer is indicated by setting the start address of the simulation data 204d.

一方、機械学習メインプログラム２５０は、名前付きパイプ１２ｂから先頭アドレスを読み込み、共有メモリ１２ａにアクセスしてシミュレーションデータ２０４ｄを読み込んで、ＮＮ２７０へ入力する。ＮＮ２７０によるＡＩ推論が実行される。機械学習メインプログラム２５０は、ＮＮ２７０から推論結果７１を得ると、名前付きパイプ１２ｂに推論結果７１を設定する。 On the other hand, the machine learning main program 250 reads the top address from the named pipe 12b, accesses the shared memory 12a, reads the simulation data 204d, and inputs it to the NN 270. FIG. AI inference by NN 270 is performed. When the machine learning main program 250 obtains the inference result 71 from the NN 270, it sets the inference result 71 in the named pipe 12b.

高性能アプリケーション２３２は、名前付きパイプ１２ｂから推論結果７１を取得し、シミュレーションを続行する。即ち、取得した推論結果７１を用いて閾値Ｔｈを増減又は維持し（X<-get_AI_prediction()）、調整後の閾値Ｔｈ（変数X）を用いて、シミュレーションを継続する（continue_simulation(X)）。 The high performance application 232 obtains the inference result 71 from the named pipe 12b and continues the simulation. That is, the acquired inference result 71 is used to increase/decrease or maintain the threshold Th (X<-get_AI_prediction()), and the simulation is continued using the adjusted threshold Th (variable X) (continue_simulation(X)).

図６において、図２のシミュレーション部３０、非線形解析部３２及び線形解析部３４が、それぞれ、高性能アプリケーション２３０、高性能アプリケーション２３２、及びcontinue_simulation(X)に相当する。また、図２の機会学習部４０は、機械学習メインプログラム２５０により実現される。以下の説明において、同様の図において同様の対応付けとなる。 6, the simulation unit 30, the nonlinear analysis unit 32, and the linear analysis unit 34 in FIG. 2 respectively correspond to the high performance application 230, the high performance application 232, and continue_simulation(X). Also, the machine learning unit 40 in FIG. 2 is realized by the machine learning main program 250 . In the following description, similar correspondences are used in similar figures.

図７は、異種プログラム言語間におけるプロセス間通信の概要を説明するための図である。図７において、シミュレーション３８は、シミュレーション部３０によって実行される処理であり、仮想メモリアドレス３８ａｄを用いてＯＳ仮想メモリ１２ｖを介してメインメモリ１２にアクセスする。また、機械学習処理４８は、機械学習部４０によって実行される処理であり、仮想メモリアドレス４８ａｄを用いてＯＳ仮想メモリ１２ｖを介してメインメモリ１２にアクセスする。 FIG. 7 is a diagram for explaining an overview of interprocess communication between different types of programming languages. In FIG. 7, the simulation 38 is processing executed by the simulation unit 30, and accesses the main memory 12 via the OS virtual memory 12v using the virtual memory address 38ad. The machine learning process 48 is a process executed by the machine learning unit 40, and accesses the main memory 12 via the OS virtual memory 12v using the virtual memory address 48ad.

メインメモリ１２には、シミュレーション３８により仮想メモリアドレス３８ａｄを用いて、ディスク１３からＮＮ２７０の入力データとなるシミュレーションデータ２０４ｄが書き込まれる。また、機械学習処理４８により仮想メモリアドレス４８ａｄを用いてシミュレーションデータ２０４ｄがメインメモリ１２から読み込まれる。 Simulation data 204d, which serves as input data for the NN 270, is written from the disk 13 to the main memory 12 by the simulation 38 using the virtual memory address 38ad. Also, the machine learning process 48 reads the simulation data 204d from the main memory 12 using the virtual memory address 48ad.

名前付きパイプ１２ｂは、シミュレーションデータ２０４ｄの送受信用に用いる名前付きパイプ１２ｂ－１と、推論結果７１（即ち、戻り値）の出力用に名前付きパイプ１２ｂ－２とを有することが好ましい。 The named pipe 12b preferably has a named pipe 12b-1 used for transmitting and receiving the simulation data 204d and a named pipe 12b-2 for outputting the inference result 71 (ie return value).

図８は、シミュレーションでＡＩが利用される場合のプロセス間通信の概要を説明するための図である。図８において、メインメモリ１２は、ＣＭ領域１２ｍ－１と、共有メモリ１２ａと、名前付きパイプ領域１２ｐと、ＭＬ領域１２ｍ－２とを有する。 FIG. 8 is a diagram for explaining an overview of interprocess communication when AI is used in simulation. In FIG. 8, the main memory 12 has a CM area 12m-1, a shared memory 12a, a named pipe area 12p, and an ML area 12m-2.

ＣＭ領域１２ｍ－１には、シミュレーションプログラムの命令（以下、「シミュレーション命令」という）と処理にデータとが記憶され、ＭＬ領域１２ｍ－２には、機械学習プログラムの命令（以下、「スクリプト命令」という）とデータとが記憶される。また、共有メモリ１２ａには、ディスク１３からＤＭＡ（Direct Memory Access）によりデータ転送されたシミュレーションデータ２０４ｄが記憶される。名前付きパイプ領域１２ｐは、名前付きパイプ１２ｂ－１と名前付きパイプ１２ｂ－２として利用される領域である。 The CM area 12m-1 stores simulation program instructions (hereinafter referred to as "simulation instructions") and processing data, and the ML area 12m-2 stores machine learning program instructions (hereinafter referred to as "script instructions"). ) and data are stored. The shared memory 12a also stores simulation data 204d transferred from the disk 13 by DMA (Direct Memory Access). The named pipe area 12p is an area used as the named pipe 12b-1 and the named pipe 12b-2.

ＣＰＵ１１は、ＣＭ領域１２ｍ－１から順にシミュレーション命令を実行することでシミュレーションを行い、シミュレーションにより得られたシミュレーションデータ２０４ｄをディスク１３に記憶し、データ転送指示を行う。最新のシミュレーションデータ２０４ｄがＤＭＡにより共有メモリ１２ａへとデータ転送させる。一方、ＣＰＵ１１は、名前付きパイプ領域１２内において名前付きパイプ１２ｂ－１に先頭アドレスを書き込む。 The CPU 11 performs a simulation by sequentially executing simulation commands from the CM area 12m-1, stores the simulation data 204d obtained by the simulation in the disk 13, and issues a data transfer instruction. The latest simulation data 204d is transferred to the shared memory 12a by DMA. On the other hand, the CPU 11 writes the leading address to the named pipe 12b-1 in the named pipe area 12. FIG.

ＣＰＵ１１は、機械学習による閾値Ｔｈの調整の学習時及び推論時には、ＭＬ領域１２、－２から順にスクリプト命令を実行する。ＣＰＵ１１は、ＮＮ２７０に与えるシミュレーションデータ２０４ｄを共有メモリ１２ａから読み出して、入力データとしてＮＮ２７０に与える。ＮＮ２７０は、ＧＰＵ１４ｍを用いるＧＰＵ１４ｇにより実行される。 The CPU 11 executes script commands in order from the ML regions 12 and -2 when learning to adjust the threshold value Th by machine learning and when making an inference. The CPU 11 reads the simulation data 204d to be given to the NN 270 from the shared memory 12a and gives it to the NN 270 as input data. NN 270 is executed by GPU 14g using GPU 14m.

次に、実装例を図９及び図１０に示す。図９は、図６における実装例を示す図である。図９では、図６に対応付けて、図２の機能構成において実装され得るプログラム言語の例を示している。 Next, mounting examples are shown in FIGS. 9 and 10. FIG. FIG. 9 is a diagram showing an implementation example in FIG. FIG. 9 shows an example of a program language that can be implemented in the functional configuration of FIG. 2 in association with FIG.

高性能アプリケーション２３０及び２３２は、Fortran言語であるFrontISTRでプログラムされ、高性能アプリケーション２３０により繰り返し処理がｎ回行われた場合、シミュレーションデータ２０４ｄは、非線形解析の繰り返し（iteration 0 to n-1）ごとの線形解析の残差推移を示すログを含む。また、機械学習メインプログラム２５０は、スクリプト言語のPythonでプログラムされ、Keras等のＡＰＩ（Application Programming Interface）を介してGoogle社製のTensorflowなどによるＮＮ２７０を利用する。 The high-performance applications 230 and 232 are programmed in FrontISTR, which is a Fortran language. Contains a log showing the residual evolution of the linear analysis of . The machine learning main program 250 is programmed in the script language Python, and uses the NN 270 by Tensorflow or the like manufactured by Google via an API (Application Programming Interface) such as Keras.

図１０は、図７における実装例を示す図である。図１０では、シミュレーション３８は、ニュートン・ラフソン法に基づく処理（ニュートン・ラフソン処理）がFortran、C、又はC++等によりプログラムされ実装される。機械学習処理４８は、深層学習処理がPython等によりプログラムされ実装される。 FIG. 10 is a diagram showing an implementation example in FIG. In FIG. 10, the simulation 38 is implemented by programming processing based on the Newton-Raphson method (Newton-Raphson processing) in Fortran, C, C++, or the like. Machine learning processing 48 is implemented by programming deep learning processing in Python or the like.

次に、プロセス間通信による情報処理装置１００の処理について図１１及び図１２で説明する。図１１及び図１２において、名前付きパイプ１２ｂ－１を“sync”で指定し、名前付きパイプ１２ｂ－２を“return”で指定する。図１１は、シミュレーション処理を説明するためのフローチャート図である。 Next, processing of the information processing apparatus 100 through interprocess communication will be described with reference to FIGS. 11 and 12. FIG. 11 and 12, the named pipe 12b-1 is specified by "sync" and the named pipe 12b-2 is specified by "return". FIG. 11 is a flowchart for explaining simulation processing.

図１１より、シミュレーション部３０は、シミュレーションプロセスを開始すると、共有メモリ１２ａを設定し（ステップＳ３１１）、Pythonプロセスを起動する（fork）（ステップＳ３１２）。共有メモリ１２ａは、メモリマップトファイルによりＯＳ仮想メモリ１２ｖに展開される。そして、機械学習部４０による機械学習が開始する（図１２）。 11, when the simulation process is started, the simulation unit 30 sets the shared memory 12a (step S311), and starts (forks) the Python process (step S312). The shared memory 12a is expanded in the OS virtual memory 12v by a memory-mapped file. Then, machine learning by the machine learning unit 40 starts (FIG. 12).

そして、非線形解析部３２は、非線形解析部３２により非線形解析ループを開始し（ステップＳ３１３）、ＡＩ推論を行うか否かを判定する（ステップＳ３１４）。第１実施例において、ＡＩ推論とは、残差の閾値Ｔｈの増減を予測する機械学習処理に相当する。また、ＡＩ推論の要否判定の一例として、図４に例示したような“call auto_threshold”による機械学習処理の呼び出しのイベントがあったか否かを判定すればよい。 Then, the nonlinear analysis unit 32 starts a nonlinear analysis loop (step S313), and determines whether or not to perform AI inference (step S314). In the first embodiment, AI inference corresponds to machine learning processing for predicting an increase or decrease in the residual threshold Th. Also, as an example of determining the necessity of AI inference, it may be determined whether or not there is an event of calling machine learning processing by “call auto_threshold” as illustrated in FIG.

ＡＩ推論を行わない場合（ステップＳ３１４のＮＯ）、シミュレーション部３０は、線形ソルバを実行する（ステップＳ３１５）。線形解析部３４により線形解析が行われる。その後、シミュレーション部３０は、ステップＳ３１３へと戻り、非線形解析を繰り返す。一方、ＡＩ推論を行う場合（ステップＳ３１４のＹＥＳ）、非線形解析部３２は、名前付きパイプ“sync”に接続して、ロック解除を待つ（ステップＳ３１７）。 When AI inference is not performed (NO in step S314), the simulation unit 30 executes a linear solver (step S315). A linear analysis is performed by the linear analysis unit 34 . After that, the simulation unit 30 returns to step S313 and repeats the nonlinear analysis. On the other hand, if AI inference is to be performed (YES in step S314), the nonlinear analysis unit 32 connects to the named pipe "sync" and waits for unlock (step S317).

非線形解析部３２は、ロック解除を検出すると、シミュレーションデータ２０４ｄを共有メモリ１２ａにコピーする（書き込む）（ステップＳ３１８）。共有メモリ１２ａにシミュレーションデータ２０４ｄがＤＭＡデータ転送により書き込まれる。一方で、非線形解析部３２は、機械学習部４０を呼び出して調整要求を行う。 Upon detecting unlocking, the nonlinear analysis unit 32 copies (writes) the simulation data 204d to the shared memory 12a (step S318). The simulation data 204d is written to the shared memory 12a by DMA data transfer. On the other hand, the nonlinear analysis unit 32 calls the machine learning unit 40 and makes an adjustment request.

その後、非線形解析部３２は、名前付きパイプ“return”から推論結果７１を読み込んで取得し（ステップＳ３２１）、得られた推論結果７１を用いて更新した閾値Ｔｈで線形ソルバを実行する（ステップＳ３２２）。線形ソルバの処理が終了すると、非線形解析部３２は、ステップＳ３１３へと戻り、上述した同様の処理を繰り返す。 After that, the nonlinear analysis unit 32 reads and acquires the inference result 71 from the named pipe “return” (step S321), and uses the obtained inference result 71 to execute the linear solver with the updated threshold value Th (step S322). ). When the linear solver process ends, the nonlinear analysis unit 32 returns to step S313 and repeats the same process as described above.

図１２は、シミュレーションとの連携において動作する機械学習処理を説明するためのフローチャート図である。図１２において、機械学習部４０は、Pythonでプログラムされているとする。シミュレーションプロセスからのPythonプロセスの開始に応じて、機械学習部４０による機械学習処理が開始される（ステップＳ４１０）。また、以下の説明では推論部６０による推論処理が行われる場合で説明する。学習部５０の学習処理については、第２実施例で詳述する。 FIG. 12 is a flowchart for explaining machine learning processing that operates in cooperation with simulation. In FIG. 12, it is assumed that the machine learning unit 40 is programmed in Python. In response to the start of the Python process from the simulation process, machine learning processing by the machine learning unit 40 is started (step S410). Also, in the following description, the case where the inference processing by the inference unit 60 is performed will be described. The learning process of the learning section 50 will be described in detail in the second embodiment.

機械学習部４０は、共有メモリ１２ａを設定し（ステップＳ４１１）、推論部６０は、訓練済みモデルをロードする（ステップＳ４１２）。共有メモリ１２ａは、シミュレーションプロセスと同じメモリマップトファイルにより設定される。練済みモデルとは、学習部５０により訓練済みのＮＮ２７０に相当する。 The machine learning unit 40 sets the shared memory 12a (step S411), and the inference unit 60 loads the trained model (step S412). The shared memory 12a is configured with the same memory mapped file as the simulation process. A trained model corresponds to the NN 270 trained by the learning unit 50 .

そして、推論部６０は、無限ループを開始する（ステップＳ４１３）。無限ループの開始により、名前付きパイプ“sync”に接続する（ステップＳ４１４）。名前付きパイプ“sync”への接続により、非線形解析部３２へロック解除が通知される。推論部６０は、新たな調整要求があるか否かを判断する（ステップＳ４１５）。 Then, the inference unit 60 starts an infinite loop (step S413). The start of an infinite loop connects to the named pipe "sync" (step S414). Unlocking is notified to the nonlinear analysis unit 32 by connecting to the named pipe "sync". The inference unit 60 determines whether or not there is a new adjustment request (step S415).

推論部６０は、ＮＮ２７０にするためにシミュレーションデータ２０４ｄから入力データを構築し（ステップＳ１７）、入力データをＮＮ２７０へと入力して閾値Ｔｈの調整方向を推論する（ステップＳ４１８）。 The inference unit 60 constructs input data from the simulation data 204d for the NN 270 (step S17), inputs the input data to the NN 270, and infers the adjustment direction of the threshold Th (step S418).

次に、プロセス間通信を例を用いて説明する。図１３及び図１４は、シミュレーションのメインプロセスにおけるプロセス間通信に係る処理の一例を説明するための図である。 Inter-process communication will now be described using an example. 13 and 14 are diagrams for explaining an example of processing related to inter-process communication in the main process of simulation.

図１３より、シミュレーション部３０は、シミュレーションの開始時に共有メモリ１２ａを設定する（ステップＳ３５１）。そして、シミュレーション部３０は、Pythonプロセスを起動（fork）する（ステップＳ３５２）。Python子プロセスが起動し、Python子プロセスと共有メモリ１２ａを共有するために、共有メモリアドレスとタイミングとが通知される。タイミングは、調整要求が行われるタイミングを指定する。タイミングが２を示す場合、非線形解析を２回行うごとに調整要求がなされることを示す。 From FIG. 13, the simulation unit 30 sets the shared memory 12a at the start of the simulation (step S351). Then, the simulation unit 30 starts (forks) the Python process (step S352). A Python child process is activated and notified of the shared memory address and timing for sharing the shared memory 12a with the Python child process. Timing specifies when the reconciliation request is made. When the timing indicates 2, it indicates that an adjustment request is made every two nonlinear analyses.

シミュレーション部３０は、線形解析用の閾値lin_thを初期化する（ステップＳ３５３）。閾値lin_thは、図２の閾値Ｔｈに相当する変数である。閾値lin_thの初期設定値は、シミュレーション部３０のプログラムに組み込まれていても良いし、シミュレーション開始時に、ユーザにより設定されてもよい。 The simulation unit 30 initializes the threshold lin_th for linear analysis (step S353). The threshold lin_th is a variable corresponding to the threshold Th in FIG. The initial set value of the threshold lin_th may be incorporated in the program of the simulation unit 30, or may be set by the user at the start of the simulation.

閾値lin_thが初期設定されると、シミュレーションループが開始される（ステップＳ３５４）。即ち、非線形解析部３２による非線形解析処理が、定めた回数（#nonlin_iter）分繰り返されるように、繰り返し回数を示すイテレーションの値に#nonlin_iterが設定される。#nonlin_iterの初期値は０であるため、初期設定時にはイテレーションの値に０が設定される。そして、非線形解析部３２は、前処理を実行する（ステップＳ３５５）。具体的な前処理の内容は、ステップＳ３５６からＳ３６１で説明する。 When the threshold lin_th is initialized, a simulation loop is started (step S354). That is, #nonlin_iter is set to the iteration value indicating the number of repetitions so that the nonlinear analysis processing by the nonlinear analysis unit 32 is repeated a predetermined number of times (#nonlin_iter). Since the initial value of #nonlin_iter is 0, the iteration value is set to 0 at the time of initialization. Then, the nonlinear analysis unit 32 executes preprocessing (step S355). Specific contents of the preprocessing will be described in steps S356 to S361.

イテレーションの値が０であるか否かが判断される（ステップＳ３５６）。同時に、機械学習部４０へ閾値Ｔｈの調整要求が送信される。イテレーションの値が０である場合（ステップＳ３５６のＹＥＳ）、非線形解析部３２は、ステップＳ３６２へと進む。 It is determined whether or not the iteration value is 0 (step S356). At the same time, a request for adjusting the threshold value Th is sent to the machine learning unit 40 . If the iteration value is 0 (YES in step S356), the nonlinear analysis unit 32 proceeds to step S362.

一方、イテレーションの値が０でない場合（ステップＳ３５６のＮＯ）、非線形解析部３２は、ライトモードで名前付きパイプ“sync”をオープンする（ステップＳ３５７）。非線形解析部３２は、機械学習部４０からのロック解除を待って、共有メモリ１２ａへの書き込みを開始する。 On the other hand, if the iteration value is not 0 (NO in step S356), the nonlinear analysis unit 32 opens the named pipe "sync" in write mode (step S357). The nonlinear analysis unit 32 waits for unlocking from the machine learning unit 40 and starts writing to the shared memory 12a.

ロック解除の検出に応じて、非線形解析部３２は、共有メモリ１２ａにシミュレーションデータ２０４ｄを書き込む（ステップＳ３５８）。ディスク１３に蓄積されたシミュレーションデータ２０４ｄが共有メモリ１２ａにＤＭＡデータ転送によりコピーされる。 Upon detection of unlocking, the nonlinear analysis unit 32 writes the simulation data 204d to the shared memory 12a (step S358). The simulation data 204d stored on the disk 13 is copied to the shared memory 12a by DMA data transfer.

次に、非線形解析部３２は、以下のＷ１～Ｗ４の手順で共有メモリ１２ａへの書き込みを行う。以下の説明において、共有メモリ１２ａが２４（＝４×６）セルの例で説明するが、このメモリサイズに限定するものではない。 Next, the nonlinear analysis unit 32 writes to the shared memory 12a in the following procedures W1 to W4. In the following description, an example in which the shared memory 12a has 24 (=4×6) cells will be described, but the memory size is not limited to this.

また、addresss_1は#nonlin_iter（非線形解析の繰り返し回数）が記憶されるセルのアドレスを示し、addresss_2は#lin_iter（線形解析の反復回数）が記憶されるセルのアドレスを示し、addresss_3は非線形解析データの先頭アドレスを示す。非線形解析データは、非線形解析を繰り返すごとに蓄積されるため、セル数が累積される。そのため、線形解析データの開始アドレスは、addresss_3に#nonlin_iterを加算した値で示される。 Also, addresses_1 indicates the address of the cell where #nonlin_iter (number of iterations of nonlinear analysis) is stored, addresses_2 indicates the address of the cell where #lin_iter (number of iterations of linear analysis) is stored, and addresses_3 indicates the address of the nonlinear analysis data. Indicates the starting address. Since the nonlinear analysis data is accumulated each time the nonlinear analysis is repeated, the number of cells is accumulated. Therefore, the start address of the linear analysis data is indicated by adding #nonlin_iter to addresses_3.

Ｗ１：addresss_3から非線形データを書き込む。
Ｗ２：addresss_3に#nonlin_iterを加算したアドレスから線形データを書き込む。
Ｗ３：addresss_1に#nonlin_iterを記憶する。#nonlin_iterが更新される。
Ｗ４：addresss_2に#lin_iterを記憶する。#lin_iterが更新される。
非線形データ及び線形データを書き込み後に、カウンタとなる#nonlin_iter及び#lin_iterを更新する。Ｗ１～Ｗ４の終了で、新たなデータの書き込みが完了したこととなる。 W1: Write non-linear data from addresses_3.
W2: Write linear data from an address obtained by adding #nonlin_iter to addresses_3.
W3: Store #nonlin_iter in addresses_1. #nonlin_iter is updated.
W4: Store #lin_iter in addresses_2. #lin_iter is updated.
After writing the nonlinear data and linear data, #nonlin_iter and #lin_iter, which are counters, are updated. At the end of W1 to W4, writing of new data is completed.

図１４より、非線形解析部３２は、リードモードで名前付きパイプ“return”をオープンする（ステップＳ３５９）。機械学習部４０によるロック解除に応じて、非線形解析部３２は、名前付きパイプ“return”から値を読み出して今回の推論結果７１を取得する（ステップＳ３６０）。 14, the nonlinear analysis unit 32 opens the named pipe "return" in read mode (step S359). In response to the unlocking by the machine learning unit 40, the nonlinear analysis unit 32 reads the value from the named pipe "return" to obtain the current inference result 71 (step S360).

そして、非線形解析部３２は、得られた推論結果と前回の閾値lin_thとから、今回の閾値lin_thを取得する（ステップＳ３６１）。一例として、閾値lin_thを推論結果に基づいて更新する処理部（algorithm）を備えるようにすればよい。 Then, the nonlinear analysis unit 32 acquires the current threshold lin_th from the obtained inference result and the previous threshold lin_th (step S361). As an example, a processing unit (algorithm) that updates the threshold lin_th based on the inference result may be provided.

更新により今回の閾値lin_thを取得すると、ステップＳ３５６の条件節を抜けて、非線形解析部３２は、線形解析部３４に線形解析を行わせ、線形解析結果を得る（ステップＳ３６２）。線形解析部３４に対して今回の閾値lin_thが通知される。そして、繰り返し回数iterationを１インクリメントする（ステップＳ３６４）。 When the current threshold lin_th is obtained by updating, the nonlinear analysis unit 32 causes the linear analysis unit 34 to perform linear analysis after exiting the conditional clause of step S356, and obtains a linear analysis result (step S362). The current threshold lin_th is notified to the linear analysis unit 34 . Then, the number of iterations iteration is incremented by 1 (step S364).

非線形解析部３２は、非線形解析が収束条件に達したか否かを判断する（ステップＳ３６４）。非線形解析が収束条件に達していない場合（ステップＳ３６４のＮＯ）、非線形解析部３２は、図１３のステップＳ３５６へと戻り、上述した同様の処理を繰り返す。一方、非線形解析が収束条件に達した場合（ステップＳ３６４のＹＥＳ）、非線形解析部３２は、シミュレーションループを終了する。 The nonlinear analysis unit 32 determines whether the nonlinear analysis has reached a convergence condition (step S364). If the nonlinear analysis does not reach the convergence condition (NO in step S364), the nonlinear analysis unit 32 returns to step S356 in FIG. 13 and repeats the same processing as described above. On the other hand, if the nonlinear analysis reaches the convergence condition (YES in step S364), the nonlinear analysis unit 32 terminates the simulation loop.

シミュレーションループが終了すると、シミュレーション部３０は、ライトモードで名前付きパイプ“sync”をオープンする（ステップＳ３６５）。そして、このメインプロセスが終了する。 When the simulation loop ends, the simulation unit 30 opens the named pipe "sync" in write mode (step S365). Then this main process ends.

図１５は、Python子プロセスにおけるプロセス間通信に係る処理の一例を説明するための図である。図１５において、シミュレーション部３０からの共有メモリアドレスとタイミングとの通知に応じて、Python子プロセスが起動し、Python子プロセス内で、機械学習部４０による閾値Ｔｈの調整に係る機械学習が行われる。 FIG. 15 is a diagram for explaining an example of processing related to inter-process communication in a Python child process; In FIG. 15, a Python child process is activated in response to notification of a shared memory address and timing from the simulation unit 30, and machine learning related to adjustment of the threshold value Th is performed by the machine learning unit 40 within the Python child process. .

機械学習部４０は、シミュレーション部３０から通知された共有メモリアドレスに基づいて、共有メモリ１２ａを設定し（ステップＳ４７１）、訓練済みモデルをロードする（ステップＳ４７２）。そして、機械学習部４０による無限ループが開始される（ステップＳ４７３）。各無限ループにおいて行われる処理内容は、ステップＳ４７４～Ｓ４８３で説明する。 The machine learning unit 40 sets the shared memory 12a based on the shared memory address notified from the simulation unit 30 (step S471), and loads the trained model (step S472). Then, an infinite loop is started by the machine learning unit 40 (step S473). The contents of processing performed in each infinite loop will be explained in steps S474 to S483.

機械学習部４０は、リードモードで名前付きパイプ“sync”をオープンする（ステップＳ４７４）。シミュレーションメインプロセス（シミュレーション部３０）にロック解除が通知される。機械学習部４０は、新たな調整要求の有無判定ループを行う（ステップＳ４７５）。 The machine learning unit 40 opens the named pipe "sync" in read mode (step S474). The unlocking is notified to the simulation main process (simulation unit 30). The machine learning unit 40 executes a new adjustment request presence/absence determination loop (step S475).

即ち、機械学習部４０は、解析回数を示す#nonlin_iterと#lin_iterとを共有メモリ１２ａから読み出して（ステップＳ４７６）、データが書き込まれているか否かを判定する（ステップＳ４７７）。具体的には、以下のような処理を行う。 That is, the machine learning unit 40 reads #nonlin_iter and #lin_iter indicating the number of times of analysis from the shared memory 12a (step S476), and determines whether or not data is written (step S477). Specifically, the following processing is performed.

#nonlin_iterが前回の非線形解析回数にタイミングを加算した値に一致しない（条件Ａ）、又は、#lin_iterが前回の線形解析回数と一致する場合（条件Ｂ）、カウントを１インクリメントする。条件Ａと条件Ｂの両方が成立する場合、又は、カウントの更新後、現在のカウントが設定値（例えば、“1000”）以上の場合、機械学習部４０は、新たな調整要求はないと判断し、Pythonプログラムを終了する。 If #nonlin_iter does not match the previous nonlinear analysis count plus timing (condition A), or if #lin_iter matches the previous linear analysis count (condition B), the count is incremented by one. If both condition A and condition B are satisfied, or if the current count is equal to or greater than a set value (eg, "1000") after updating the count, the machine learning unit 40 determines that there is no new adjustment request. and exit the Python program.

一方、条件Ａ及び条件Ｂが成立しない場合、即ち、データの書き込みが確認できた場合、機械学習部４０は、新たな調整要求があると判断し、この新たな調整要求の有無判定ループから抜けて（ステップＳ４７８）、共有メモリ１２ａからデータを読み込む（ステップＳ４７９）。 On the other hand, if condition A and condition B are not satisfied, that is, if data writing can be confirmed, the machine learning unit 40 determines that there is a new adjustment request, and exits from this new adjustment request presence/absence determination loop. (step S478), the data is read from the shared memory 12a (step S479).

推論部４０は、推論に用いる入力データを構築し（ステップＳ４８０）、ＮＮ２７０を用いて閾値Ｔｈの調整を推論する（ステップＳ４８１）。ＮＮ２７０は、Kerase等であり、ＧＰＵ１４ｇで動作する。推論部４０から推論結果７１が出力される。 The inference unit 40 constructs input data used for inference (step S480), and uses the NN 270 to infer adjustment of the threshold Th (step S481). The NN 270 is Kerase or the like and operates on the GPU 14g. An inference result 71 is output from the inference unit 40 .

機械学習部４０は、ライトモードで名前付きパイプ“return”をオープンし（ステップＳ４８２）、“return”モードで推論結果を名前付きパイプ“return”に書き込む。その後、無限ループを終了させて、機械学習部４０による処理を終了する。Python子プロセスが終了する。 The machine learning unit 40 opens the named pipe "return" in write mode (step S482), and writes the inference result to the named pipe "return" in "return" mode. After that, the infinite loop is terminated and the processing by the machine learning unit 40 is terminated. Python child process exits.

図１６は、プロセス間通信における状態例を示す図である。調整要求が行われるタイミングは２であるとする。図１６（Ａ）では、シミュレーションが４回繰り返された直後の共有メモリ１２ａの状態例を示している。共有メモリ１２ａ内には、#nonlin_iterは４回を示し、非線形解析データは４個のセルに書き込まれている。また、#lin_iterは１２回を示し、線形解析データは１２個のセルに書き込まれている。 FIG. 16 is a diagram showing a state example in interprocess communication. Assume that the timing at which an adjustment request is made is 2. FIG. 16A shows an example of the state of the shared memory 12a immediately after the simulation has been repeated four times. In the shared memory 12a, #nonlin_iter indicates 4 times, and nonlinear analysis data is written in 4 cells. #lin_iter indicates 12 times, and linear analysis data is written in 12 cells.

機械学習部４０の状態は、現在の#nonlin_iterは２回であると認識し、#lin_iterは６回であると認識しているものとする。タイミングは２であるとする。この場合、現在の#nonlin_iter「２回」とタイミング「２」とを加算して得た値「４」が、共有メモリ１２ａの#nonlin_iter「４回」と一致する。また、現在の#lin_iter「６回」と#lin_iter「１２回」とは一致しない。この場合に新たな調整要求があったと判定する。 It is assumed that the machine learning unit 40 recognizes that the current #nonlin_iter is 2 times and #lin_iter is 6 times. Assume that the timing is 2. In this case, the value "4" obtained by adding the current #nonlin_iter "2 times" and the timing "2" matches #nonlin_iter "4 times" in the shared memory 12a. Also, the current #lin_iter "6 times" and #lin_iter "12 times" do not match. In this case, it is determined that there is a new adjustment request.

図１６（Ｂ）では、シミュレーションによりデータ書き込まれたが、カウンタ更新されていない状態例を示している。共有メモリ１２ａ内には、#nonlin_iterは４回を示し、非線形解析データは６個のセルに記憶されている。また、#lin_iterは１２回を示し、線形解析データは１６個のセルに記憶されている。 FIG. 16B shows an example of a state in which data has been written by simulation but the counter has not been updated. In the shared memory 12a, #nonlin_iter indicates 4 times, and nonlinear analysis data are stored in 6 cells. #lin_iter indicates 12 times, and linear analysis data are stored in 16 cells.

非線形解析を２回行って、それぞれの結果が各セルに書き込まれ、合計６個のセルが利用されている状態である。図１６（Ａ）で更新された閾値Ｔｈを用いて、線形解析が２回実行され、実行ごとに２個のセルに線形解析データが書き込まれるため、前回までのデータを含めて全体で１６個のセルが利用された状態である。#lin_iterは未更新の状態である。 The nonlinear analysis is performed twice and each result is written to each cell, so that a total of 6 cells are used. Using the threshold value Th updated in FIG. 16A, linear analysis is executed twice, and linear analysis data is written to two cells each time. cells are in use. #lin_iter is in an unupdated state.

機械学習部４０の状態は、現在の#nonlin_iterは４回であると認識し、#lin_iterは１２回であると認識している。この場合、現在の#nonlin_iter「４回」とタイミング「２」とを加算して得た値「６」が、共有メモリ１２ａの#nonlin_iter「４回」と一致しない。一方、現在の#lin_iter「１２回」と#lin_iter「１２回」とは一致する。この場合には新たな調整要求はないと判定する。 The machine learning unit 40 recognizes that the current #nonlin_iter is 4 times and #lin_iter is 12 times. In this case, the value "6" obtained by adding the current #nonlin_iter "4 times" and the timing "2" does not match the #nonlin_iter "4 times" of the shared memory 12a. On the other hand, the current #lin_iter "12 times" matches #lin_iter "12 times". In this case, it is determined that there is no new adjustment request.

図１６（Ｃ）では、データ書き込み完了後、シミュレーションがカウンタを共有メモリに記憶した状態例を示している。共有メモリ１２ａ内には、#nonlin_iterは６回に更新されている。また、#lin_iterは１６回に更新されいる。 FIG. 16C shows an example of a state in which the simulation has stored the counter in the shared memory after data writing is completed. #nonlin_iter is updated 6 times in the shared memory 12a. Also, #lin_iter is updated to 16 times.

機械学習部４０の状態は、現在の#nonlin_iterは４回であると認識し、#lin_iterは１２回であると認識している。この場合、現在の#nonlin_iter「４回」とタイミング「２」とを加算して得た値「６」が、共有メモリ１２ａの#nonlin_iter「６回」と一致する。また、現在の#lin_iter「１２回」と#lin_iter「１２回」とは一致する。この場合には新たな調整要求があったと判定する。 The machine learning unit 40 recognizes that the current #nonlin_iter is 4 times and #lin_iter is 12 times. In this case, the value "6" obtained by adding the current #nonlin_iter "4 times" and the timing "2" matches #nonlin_iter "6 times" in the shared memory 12a. Also, the current #lin_iter "12 times" and #lin_iter "12 times" match. In this case, it is determined that there is a new adjustment request.

図１７は、オーバーヘッドの検証結果例を示す図である。図１７において、横軸にシミュレーション（即ち、非線形解析）の繰り返し回数を示し、縦軸に繰り返し当たりのオーバーヘッドの時間を示している。 FIG. 17 is a diagram illustrating an example of overhead verification results. In FIG. 17, the horizontal axis indicates the number of iterations of the simulation (that is, the nonlinear analysis), and the vertical axis indicates the overhead time per iteration.

動作環境は、データサイズは１ＧＢの場合に、オーバーヘッド１７ａは、第１実施例による共有メモリ１２ａを用いて、シミュレーション部３０と機械学習部４０との間で共有するデータをメモリマップトファイルとしたことによるオーバーヘッドを示し、オーバーヘッド１７ｂは、ディスク１３への入出力により掛かったオーバーヘッドを示している。 As for the operating environment, when the data size is 1 GB, the overhead 17a is caused by using the shared memory 12a according to the first embodiment and making the data shared between the simulation unit 30 and the machine learning unit 40 a memory-mapped file. Overhead 17b indicates overhead due to input/output to disk 13. FIG.

オーバーヘッド１７ｂの合計と、第１実施例におけるオーバーヘッド１７ａから、第１実施例により、オーバーヘッドを削減できたことが分かる。 From the total overhead 17b and the overhead 17a in the first embodiment, it can be seen that the overhead can be reduced by the first embodiment.

更に、データサイズの違いによりオーバーヘッドを検証した結果を図１８及び図１９で示す。 Furthermore, FIG. 18 and FIG. 19 show the results of verifying the overhead due to the difference in data size.

図１８は、ディスクへの入出力によるデータサイズごとの検証結果を示す図である。図１８では、データサイズごとの、オーバーヘッドとシミュレーションのそれぞれの時間と、オーバーヘッドが全時間に占める割合とが示されている。 FIG. 18 is a diagram showing verification results for each data size by input/output to/from a disk. FIG. 18 shows the respective times of overhead and simulation for each data size, and the ratio of the overhead to the total time.

図１９は、ディスクへの入出力によるデータサイズごとの検証結果を示す図である。図１８では、データサイズごとの、オーバーヘッドとシミュレーションのそれぞれの時間と、オーバーヘッドが全時間に占める割合とが示されている。 FIG. 19 is a diagram showing verification results for each data size in input/output to/from a disk. FIG. 18 shows the respective times of overhead and simulation for each data size, and the ratio of the overhead to the total time.

上述したように、機械学習を用いたシミュレーションにおいて、第１実施例による処理時間を明らかに短縮することができる。 As described above, in simulations using machine learning, the processing time according to the first embodiment can be clearly shortened.

以下に、第２実施例において、機械学習による閾値Ｔｈの調整精度を向上させる仕組みについて詳述する。 A mechanism for improving the adjustment accuracy of the threshold value Th by machine learning in the second embodiment will be described in detail below.

［第２実施例］
第２実施例では、線形解析を反復的に解くことで解を求める非線形解析による、構造解析、流体解析等のシミュレーションにおいて、シミュレーションの実行状況に応じて、線形解析における収束判定に用いる閾値Ｔｈを、機械学習により動的に調整する。 [Second embodiment]
In the second embodiment, in the simulation of structural analysis, fluid analysis, etc. by nonlinear analysis that finds a solution by repeatedly solving linear analysis, the threshold value Th used for determining convergence in the linear analysis is set according to the execution status of the simulation. , dynamically adjusted by machine learning.

図２０は、非線形解析を説明するための図である。線形解析を行う線形ソルバでは反復法にＣＧ法を用いて解を求めることが多い。図２０より、反復用では、収束条件を満たすまで解ベクトルｘを繰り返し更新していく。収束条件の一つに残差の閾値Ｔｈが用いられる。残差は、残差ベクトルｒのノルムで表され、残差ベクトルｒは、ｒ_０＝ｂ－Ａｘで初期化される。この残差が閾値Ｔｈ以下になった時点で解ベクトルｘが収束したとみなされ反復を終了する。 FIG. 20 is a diagram for explaining nonlinear analysis. A linear solver that performs linear analysis often finds a solution using a CG method as an iterative method. From FIG. 20, for iteration, the solution vector x is repeatedly updated until the convergence condition is satisfied. A residual threshold Th is used as one of the convergence conditions. The residual is expressed by the norm of the residual vector r, which is initialized with r ₀ =b−Ax. When the residual becomes equal to or less than the threshold Th, it is considered that the solution vector x has converged, and the iteration ends.

閾値Ｔｈは、解の精度と実行時間に影響を与える。閾値を小さくするほど解の精度が高くなるが、反復回数が増加して実行時間が増加するため、求める精度の解をできるだけ少ない反復回数で得られるような閾値を設定することが好ましい。しかしながら、閾値Ｔｈは、経験則又はヒューリスティックに基づいたユーザの判断によって設定されている。したがって、最適な残差の閾値Ｔｈを機械学習により高速に推定し、非線形解析全体（即ち、シミュレーション全体）を高速化する手法について説明する。 The threshold Th affects solution accuracy and execution time. The smaller the threshold value, the higher the accuracy of the solution, but the number of iterations increases and the execution time increases. Therefore, it is preferable to set the threshold value so that the solution with the required accuracy can be obtained with as few iterations as possible. However, the threshold Th is set by the user's judgment based on empirical rules or heuristics. Therefore, a method of estimating the optimum residual threshold Th at high speed by machine learning and speeding up the entire nonlinear analysis (that is, the entire simulation) will be described.

図２１は、第２実施例における情報処理装置の第１の機能構成例において学習部の概要を説明するための図である。図２１において、情報処理装置１００は、主に、シミュレーション部３０と、機械学習部４０とを有する。メインメモリ１２には、問題データ２、ログデータ４ｃ、候補閾値３、基準閾値３ｒｅｆ、学習データ６ｇ等を記憶する。メインメモリ１２の一部が共有メモリ１２ａとして利用されることは、第１実施例で説明した通りである。 FIG. 21 is a diagram for explaining an outline of a learning unit in the first functional configuration example of the information processing apparatus according to the second embodiment. In FIG. 21, information processing apparatus 100 mainly includes simulation section 30 and machine learning section 40 . The main memory 12 stores question data 2, log data 4c, candidate threshold 3, reference threshold 3ref, learning data 6g, and the like. As described in the first embodiment, part of the main memory 12 is used as the shared memory 12a.

シミュレーション部３０は、図２で説明したように、非線形解析部３２と、線形解析部３４とを有し、線形解析の反復ごとの残差と時刻とを取得し、ログデータ４ｃを出力する。ログデータ４ｃには、閾値Ｔｈ内に残差が収束するまで反復して実行された線形解析時の残差と時間とが非線形解析の所定の繰り返し回数分記録されている。 The simulation unit 30 has the nonlinear analysis unit 32 and the linear analysis unit 34, as described with reference to FIG. 2, acquires the residual error and time for each iteration of the linear analysis, and outputs the log data 4c. In the log data 4c, the residuals and times of the linear analysis repeatedly executed until the residuals converge within the threshold value Th are recorded for a predetermined number of iterations of the nonlinear analysis.

また、シミュレーション部３０は、学習部５０から呼び出された場合には、学習部５０から与えられた閾値Ｔｈ固定でシミュレーションを行う。即ち、機械学習を用いないシミュレーションが行われる。具体的には、図４の疑似コード３１内の“call auto_threshold”を無効にしてシミュレーションが行われる。 Further, when called by the learning unit 50 , the simulation unit 30 performs simulation with the fixed threshold value Th given by the learning unit 50 . That is, simulation is performed without using machine learning. Specifically, the simulation is performed with "call auto_threshold" in the pseudo code 31 of FIG. 4 disabled.

機械学習部４０は、図４に示すようにＮＮ２７０により閾値Ｔｈの調整を学習する学習部５０と、シミュレーション中の閾値Ｔｈの調整を行う推論部６０とを有する。推論部６０については図２２で説明する。 The machine learning unit 40 has a learning unit 50 that learns adjustment of the threshold Th by the NN 270 as shown in FIG. 4, and an inference unit 60 that adjusts the threshold Th during simulation. The reasoning unit 60 will be described with reference to FIG.

学習部５０は、シミュレーション部３０が、問題データ２ごとに、候補閾値３の複数の候補閾値３のそれぞれを与えて得られたログデータ４ｃを用いて、閾値Ｔｈを調整するＮＮ２７０を学習する。複数の候補閾値３のそれぞれをシミュレーション部３０に与えてシミュレーションを行わせてログデータ４ｃ取得しておく。又は、学習部５０が、複数の候補閾値３のそれぞれに対してシミュレーションを行わせる処理部を有するようにしてもよい。 The learning unit 50 learns the NN 270 that adjusts the threshold Th using the log data 4c obtained by the simulation unit 30 giving each of the plurality of candidate thresholds 3 of the candidate threshold 3 for each problem data 2 . Each of the plurality of candidate threshold values 3 is given to the simulation unit 30 to perform simulation, and log data 4c is obtained. Alternatively, the learning unit 50 may have a processing unit that causes each of the plurality of candidate threshold values 3 to be simulated.

学習部５０は、選択した候補閾値３をシミュレーション部３０に与えることで、シミュレーション部３０に、問題データ２に対するシミュレーションを行わせる。シミュレーション部３０は、線形解析の反復ごとに掛かった時間と残差ｒとを表すログデータ４ｃを出力する。候補閾値３ごとに得られたログデータ４ｃを得られる。ログデータ４ｃの入出力には、第１実施例で説明したような共有メモリ１２ａを用いた同様のプロセス間通信で行ってもよい。 The learning unit 50 supplies the selected candidate threshold value 3 to the simulation unit 30 to cause the simulation unit 30 to perform a simulation for the question data 2 . The simulation unit 30 outputs log data 4c representing the time taken for each iteration of the linear analysis and the residual r. Log data 4c obtained for each candidate threshold 3 can be obtained. The input/output of the log data 4c may be performed by the same inter-process communication using the shared memory 12a as described in the first embodiment.

学習部５０は、候補閾値３ごとに得られたログデータ４ｃを参照して、反復ごとの残差推移、演算時間などを用いて、候補閾値３の中から基準閾値３ｒｅｆを決定し、メインメモリ１２に記憶する。 The learning unit 50 refers to the log data 4c obtained for each candidate threshold value 3, determines the reference threshold value 3ref from among the candidate threshold values 3, and saves the reference threshold value 3ref in the main memory by using the transition of the residual error for each iteration, the calculation time, and the like. 12.

学習部５０は、ログデータ４ｃごとに、基準閾値３ｒｅｆに基づいてラベルを付与し、学習データ６ｇを作成する。ラベル付けは、ログデータ４ｃの候補閾値３と基準閾値３ｒｅｆとの比較結果に基づいて行われる。 The learning unit 50 creates learning data 6g by assigning a label to each piece of log data 4c based on the reference threshold value 3ref. Labeling is performed based on the result of comparison between the candidate threshold 3 of the log data 4c and the reference threshold 3ref.

ラベル付けの一例として、基準閾値３ｒｅｆより小さい候補閾値３のログデータ４ｃにはラベル「１」を付与し、基準閾値３ｒｅｆより大きい候補閾値３のログデータ４ｃにはラベル「３」を付与する。学習部５０は、基準閾値３ｒｅｆと一致する候補閾値３を使用したログデータ４ｃには、ラベル「２」を付与する。 As an example of labeling, the log data 4c with the candidate threshold 3 smaller than the reference threshold 3ref is given the label "1", and the log data 4c with the candidate threshold 3 larger than the reference threshold 3ref is given the label "3". The learning unit 50 assigns the label "2" to the log data 4c using the candidate threshold 3 that matches the reference threshold 3ref.

学習部５０では、学習データ６ｇから入力データ６ｇを構築しＮＮ２７０に入力し推論結果７１を得ると、学習データ６ｇに付与されているラベルと比較して得られた誤差がＮＮ２７０にフィードバックされる。学習部５０は、シミュレーション部３０よって得られた全てのログデータ４ｃを用いてＮＮ２７０を学習する。 In the learning unit 50, when the input data 6g is constructed from the learning data 6g and input to the NN 270 to obtain the inference result 71, the error obtained by comparing the label assigned to the learning data 6g is fed back to the NN 270. The learning unit 50 learns the NN 270 using all the log data 4c obtained by the simulation unit 30. FIG.

図２２は、第２実施例における情報処理装置の第１の機能構成例において推論部の概要を説明するための図である。図２２において、情報処理装置１００では、推論時には、主に、シミュレーション部３０と、機械学習部４０の推論部６０とが動作する。メインメモリ１２には、問題データ２、ログデータ４ｃ、シミュレーション結果５、閾値Ｔｈ、推論結果７１等を記憶する。ここでは推論部６０について説明する。 FIG. 22 is a diagram for explaining an outline of an inference unit in the first functional configuration example of the information processing apparatus in the second embodiment. In FIG. 22, in the information processing apparatus 100, the simulation unit 30 and the inference unit 60 of the machine learning unit 40 mainly operate during inference. The main memory 12 stores problem data 2, log data 4c, simulation results 5, threshold Th, reasoning results 71, and the like. Here, the inference unit 60 will be described.

シミュレーション部３０は、未知の問題データ２を解析し、得られたログデータ４ｃを出力する。ログデータ４ｃは、第１実施例におけるシミュレーションデータ２０４ｄに相当し、メモリマップトファイルとして扱えばよい。ログデータ４ｃは、共有メモリ１２ａに記憶され、名前付きパイプ１２ｂ－１にログデータ４ｃの先頭アドレスが指定される。 The simulation unit 30 analyzes the unknown problem data 2 and outputs the obtained log data 4c. The log data 4c corresponds to the simulation data 204d in the first embodiment, and may be treated as a memory-mapped file. The log data 4c is stored in the shared memory 12a, and the top address of the log data 4c is specified in the named pipe 12b-1.

推論部６０は、シミュレーション部３０の非線形解析部３２からの呼び出しに応じて、シミュレーション部３０によって得られたログデータ４ｃを用いて、訓練済みのＮＮ２７０を用いて閾値Ｔｈの増減を推論し、得られた推論結果７１を出力する。推論結果７１は、非線形解析部３２へと戻り値として通知される。戻り値は、名前付きパイプ１２ｂ－２に設定されればよい（図７、図１１）。 In response to a call from the nonlinear analysis unit 32 of the simulation unit 30, the inference unit 60 uses the log data 4c obtained by the simulation unit 30 to infer an increase or decrease in the threshold Th using the trained NN 270, and obtains output the inference result 71 obtained. The inference result 71 is notified to the nonlinear analysis unit 32 as a return value. The return value should be set in the named pipe 12b-2 (FIGS. 7 and 11).

次に、基準閾値３ｒｅｆの決定例について説明する。一例として、予め、候補閾値ごとのシミュレーション時間を検証し、得られた実行時間のうち最短の実行時間であった候補閾値３を基準閾値３ｒｅｆに設定すればよい。 Next, an example of determining the reference threshold value 3ref will be described. As an example, the simulation time for each candidate threshold may be verified in advance, and the candidate threshold 3, which is the shortest execution time among the obtained execution times, may be set as the reference threshold 3ref.

図２３は、候補閾値ごとのシミュレーション時間の検証例を示す図である。 FIG. 23 is a diagram illustrating an example of simulation time verification for each candidate threshold.

それぞれの候補閾値３でシミュレーションを行ったときのシミュレーション時間から、最も短かった候補閾値が基準閾値３ｒｅｆに設定される。このように定めた基準閾値３ｒｅｆを境界として、境界に基づいて閾値の変更を決定する。 The shortest candidate threshold is set as the reference threshold 3 ref from the simulation time when the simulation is performed with each candidate threshold 3 . Using the reference threshold value 3ref determined in this manner as a boundary, a change in the threshold value is determined based on the boundary.

図２４は、第１の機能構成例における学習処理を説明するためのフローチャート図である。図２４において、機械学習部４０では、学習部５０が、複数の候補閾値３を順にシミュレーション部３０に与え、シミュレーションを行わせて、候補閾値３ごとのログデータ４ｃを取得する（ステップＳ１１１０）。ログデータ４ｃは、線形解析の反復ごとの残差とシミュレーションの実行時間とを含む。 FIG. 24 is a flowchart for explaining the learning process in the first functional configuration example. In FIG. 24, in the machine learning unit 40, the learning unit 50 sequentially provides the plurality of candidate threshold values 3 to the simulation unit 30, performs simulation, and acquires the log data 4c for each candidate threshold value 3 (step S1110). The log data 4c includes residuals for each iteration of the linear analysis and the run time of the simulation.

ログデータ４ｃを取得すると、学習部５０は、複数のログデータ４ｃの中から、最短時間でシミュレーションを終了した候補閾値３を特定する（ステップＳ１１２０）。そして、学習部５０は、特定した候補閾値３を基準閾値３ｒｅｆとして設定し、基準閾値３ｒｅｆと、候補閾値３との大小関係に基づいて、ログデータ４ｃにラベル付けを行って学習データ６ｇを生成する（ステップＳ１１３０）。 After obtaining the log data 4c, the learning unit 50 identifies the candidate threshold value 3 for which the simulation was completed in the shortest time from among the plurality of log data 4c (step S1120). Then, the learning unit 50 sets the specified candidate threshold 3 as a reference threshold 3ref, labels the log data 4c based on the magnitude relationship between the reference threshold 3ref and the candidate threshold 3, and generates learning data 6g. (step S1130).

学習部５０は、生成した学習データ６ｇを用いてＮＮ２７０を学習する（ステップＳ１１５０）。学習部５０は、複数の問題データ２に対して、候補閾値３の異なる学習データ６ｇを用いた学習を終えると、この学習処理を終了する。 The learning unit 50 learns the NN 270 using the generated learning data 6g (step S1150). When the learning unit 50 completes learning using the learning data 6g with different candidate threshold values 3 for the plurality of question data 2, the learning process ends.

一方、シミュレーション３０では、閾値候補３の受信に応じてシミュレーションを開始し、問題データ２を１つ読み込んで、閾値候補３を閾値Ｔｈに設定する。そして、非線形解析部３２が非線形解析の前処理を行う（ステップＳ２０１１）。 On the other hand, in the simulation 30, the simulation is started in response to the reception of the threshold candidate 3, one question data 2 is read, and the threshold candidate 3 is set to the threshold Th. Then, the nonlinear analysis unit 32 performs preprocessing for nonlinear analysis (step S2011).

次に、線形解析部３４が前処理を行ったのち（ステップＳ２０１２）、線形解析による近似解を計算し（ステップＳ２０１３）、得られた残差と時刻とをメインメモリ１２に記憶する（ステップＳ２０１４）。線形解析部３４は、閾値Ｔｈ（＝候補閾値３）を用いて線形解析の解が収束したか否かの収束判定を行う（ステップＳ２０１５）。収束していないと判定した場合（ステップＳ２０１５のＮＯ）、線形解析部３４は、ステップＳ２０１３へと戻り上述した同様の処理を繰り返す。 Next, after the linear analysis unit 34 performs preprocessing (step S2012), an approximate solution is calculated by linear analysis (step S2013), and the obtained residual and time are stored in the main memory 12 (step S2014). ). The linear analysis unit 34 uses the threshold Th (=candidate threshold 3) to perform convergence determination as to whether or not the solution of the linear analysis has converged (step S2015). If it is determined that the convergence has not occurred (NO in step S2015), the linear analysis unit 34 returns to step S2013 and repeats the same processing as described above.

収束したと判定した場合（ステップＳ２０１５のＹＥＳ）、非線形解析部３２は、非線形解析の後処理（近似解の計算）を行い（ステップＳ２０１６）、非線形解析の解が収束したか否かの収束判定を行う（ステップＳ２０１７）。非線形解析の収束判定では、非線形解析用の閾値を用いて判定する。 If it is determined that convergence has occurred (YES in step S2015), the nonlinear analysis unit 32 performs post-processing of the nonlinear analysis (calculation of an approximate solution) (step S2016), and determines whether or not the solution of the nonlinear analysis has converged. (step S2017). The determination of convergence of nonlinear analysis uses a threshold value for nonlinear analysis.

その結果、収束していないと判定した場合（ステップＳ２０１７のＮＯ）、非線形解析部３２は、ステップＳ２０１１へと戻り上述した同様の処理を繰り返す。一方、収束したと判定した場合（ステップＳ２０１７のＹＥＳ）、シミュレション部３０は、シミュレーション終了時刻をメインメモリ１２に記憶して、このシミュレーションを終了する。シミュレーション終了時には、残差を０とし、シミュレーション終了時刻を記憶してもよい。 As a result, when it is determined that the convergence has not occurred (NO in step S2017), the nonlinear analysis unit 32 returns to step S2011 and repeats the same processing as described above. On the other hand, if it is determined that convergence has occurred (YES in step S2017), the simulation unit 30 stores the simulation end time in the main memory 12 and ends this simulation. At the end of the simulation, the residual may be set to 0 and the simulation end time may be stored.

シミュレーション部３０は、機械学習部４０の学習部５０に、問題データ２それぞれのシミュレーションの終了ごとに終了を通知してもよいし、最後の候補閾値３に対するシミュレーションが終了してから終了を通知してもよい。或いは、シミュレーション部３０は、問題データ２と候補閾値３の全ての組み合せに対してログデータ４ｃを出力してから、学習部５０にシミュレーションの終了を通知してもよい。第２実施例における他の機能構成例についても同様である。 The simulation unit 30 may notify the learning unit 50 of the machine learning unit 40 of the end each time the simulation for each of the question data 2 ends, or notify the end after the simulation for the last candidate threshold value 3 ends. may Alternatively, the simulation unit 30 may output the log data 4c for all combinations of the problem data 2 and the candidate threshold values 3, and then notify the learning unit 50 of the end of the simulation. The same applies to other functional configuration examples in the second embodiment.

図２５は、第１の機能構成例における推論処理を説明するためのフローチャート図である。図２５において、シミュレーション部３０の起動時に、ミュレーション時間はリセットされ、時間計測が開始される。 FIG. 25 is a flowchart for explaining inference processing in the first functional configuration example. In FIG. 25, when the simulation unit 30 is activated, the simulation time is reset and time measurement is started.

シミュレーション部３０は、シミュレーション開始時に閾値Ｔｈを初期設定し、非線形解析部３２は、非線形解析の前処理を行う（ステップＳ３０１１）。そして、線形解析部３４は、線形解析の前処理を行い（ステップＳ３０１２）、線形解析の近似解を計算し（ステップＳ３０１３）、閾値Ｔｈを用いて線形解析の解が収束したか否かの収束判定を行う（ステップＳ３０１４）。収束していないと判定した場合（ステップＳ３０１４のＮＯ）、線形解析部３４は、ステップＳ３０１３へと戻り上述した同様の処理を繰り返す。 The simulation unit 30 initializes the threshold value Th at the start of the simulation, and the nonlinear analysis unit 32 performs preprocessing for nonlinear analysis (step S3011). Then, the linear analysis unit 34 performs preprocessing of the linear analysis (step S3012), calculates an approximate solution of the linear analysis (step S3013), and uses a threshold Th to determine whether the solution of the linear analysis has converged. A determination is made (step S3014). If it is determined that the convergence has not occurred (NO in step S3014), the linear analysis unit 34 returns to step S3013 and repeats the same processing as described above.

収束したと判定した場合（ステップＳ３０１４のＹＥＳ）、非線形解析部３２は、非線形解析の後処理（近似解の計算）を行い（ステップＳ３０１５）、非線形解析の解が収束したか否かの収束判定を行う（ステップＳ３０１６）。非線形解析の収束判定では、非線形解析用の閾値を用いて判定する。 If it is determined that convergence has occurred (YES in step S3014), the nonlinear analysis unit 32 performs post-processing of the nonlinear analysis (calculation of an approximate solution) (step S3015), and determines whether or not the solution of the nonlinear analysis has converged. (step S3016). The determination of convergence of nonlinear analysis uses a threshold value for nonlinear analysis.

その結果、収束していないと判定した場合（ステップＳ３０１６のＮＯ）、非線形解析部３２は、調整要求を発行し、推論部６０に閾値Ｔｈの調整を推論させて、得られた推論結果７１を用いて閾値Ｔｈを更新し（ステップＳ３０１７）、ステップＳ３０１１へと戻り上述同様の処理を繰り返す。収束したと判定した場合（ステップＳ３０１６のＹＥＳ）、非線形解析部３２は、ステップＳ３０１３へと戻り上述した同様の処理を繰り返す。 As a result, when it is determined that convergence has not occurred (NO in step S3016), the nonlinear analysis unit 32 issues an adjustment request, causes the inference unit 60 to infer adjustment of the threshold Th, and obtains an inference result 71 as is used to update the threshold Th (step S3017), the process returns to step S3011, and the same processing as described above is repeated. If it is determined that convergence has occurred (YES in step S3016), the nonlinear analysis unit 32 returns to step S3013 and repeats the same processing as described above.

一方、推論部６０は、調整要求に応じて、直近で得られたログデータ４ｃから現在の線形解析の閾値Ｔｈが基準閾値３ｒｅｆより低いか高いかを訓練済みＮＮ２７０を用いて推論する（ステップＳ４０１０）。推論部６０は、得られた推論結果７１を出力し（ステップＳ４０２０）、この推論処理を終了する。 On the other hand, in response to the adjustment request, the inference unit 60 uses the trained NN 270 to infer whether the current linear analysis threshold Th is lower or higher than the reference threshold 3ref from the most recently obtained log data 4c (step S4010). ). The inference unit 60 outputs the obtained inference result 71 (step S4020), and ends this inference processing.

上記第１の機能構成例では、ログデータ４ｃをそのまま用いたが、ログデータ４ｃを定めた区間ごとに区切って学習させることによりデータ拡張を行い、第１の機能構成例における閾値Ｔｈの調整精度を向上させ得ることを可能とする。 In the first functional configuration example, the log data 4c is used as it is, but the log data 4c is divided into predetermined sections and learned to extend the data, and the adjustment accuracy of the threshold Th in the first functional configuration example can be improved.

第１の機能構成例では、クラス１、２、及び３に分類する場合を例としたが、閾値Ｔｈを上げる場合とそれ以外の場合のみに分類してもよい。その場合には、学習部４０では、学習時に候補閾値３が基準閾値３ｒｅｆより低い場合にラベル０を付与し、それ以外ではラベル１を付与するようにし、推論結果７１として、クラス０又はクラス１を推論するようにしてもよい。また、非線形解析部３２は、内部に線形ソルバを含むような定常解析であってもよい。以下の第２の機能構成例においても同様である。 In the first functional configuration example, classification into classes 1, 2, and 3 was taken as an example. In that case, the learning unit 40 assigns the label 0 when the candidate threshold 3 is lower than the reference threshold 3ref during learning, and assigns the label 1 otherwise. may be inferred. Also, the nonlinear analysis unit 32 may be a stationary analysis including a linear solver therein. The same applies to the following second functional configuration example.

図２６は、第２実施例における情報処理装置の第２の機能構成例において学習部の概要を説明するための図である。図２６中、図２１と同様の処理部及びデータには同様の符号を付し、それらの説明を省略する。第２実施例では、残差曲線を一定区間ごとに区切ることで複数の入力データを作成し学習する。ログデータ４ｃに含まれる残差曲線を表すデータを残差曲線データ４ｄというものとする。 FIG. 26 is a diagram for explaining an outline of a learning unit in the second functional configuration example of the information processing apparatus according to the second embodiment. In FIG. 26, the same processing units and data as those in FIG. 21 are denoted by the same reference numerals, and the description thereof is omitted. In the second embodiment, a plurality of pieces of input data are created and learned by dividing the residual curve into certain sections. The data representing the residual curve included in the log data 4c is referred to as residual curve data 4d.

学習部５０は、第１の機能構成例と同様に、シミュレーション部３０が、問題データ２ごとに、候補閾値３の複数の候補閾値３のそれぞれを与えて得られた複数の残差曲線データ４ｄのから基準閾値３ｒｅｆを決定する。学習部５０は、最短時間でシミュレーションを終了した候補閾値３を特定し、基準閾値３ｒｅｆとする。また、学習部５０は、残差曲線データ４ｄの候補閾値３が基準閾値３ｒｅｆより小さいか否かに基づいてラベルを決定する。 Similar to the first functional configuration example, the learning unit 50 calculates a plurality of residual curve data 4d obtained by the simulation unit 30 giving each of the plurality of candidate thresholds 3 of the candidate thresholds 3 for each problem data 2. , the reference threshold 3ref is determined. The learning unit 50 identifies the candidate threshold value 3 for which the simulation is completed in the shortest time, and sets it as the reference threshold value 3ref. Also, the learning unit 50 determines a label based on whether or not the candidate threshold 3 of the residual curve data 4d is smaller than the reference threshold 3ref.

その後、第２の機能構成例では、学習部５０は、残差曲線データ４ｄを定めた区間ごとに区切って複数の入力データ６ａを作成し、複数の入力データ６ａのそれぞれに、残差曲線データ４ｄに対して決定したラベルを付与して、複数の学習データ６ｇを生成する。ラベル付けは上述した通りである。 After that, in the second functional configuration example, the learning unit 50 creates a plurality of input data 6a by dividing the residual curve data 4d into predetermined sections, and stores the residual curve data in each of the plurality of input data 6a. 4d is given the determined label to generate a plurality of learning data 6g. Labeling is as described above.

学習部５０は、生成した一つ一つの学習データ６ｇをＮＮ２７０に入力して得られた推測結果７１と学習データ６ｇに付与されたラベルとの誤差を、ＮＮ２７０にフィードバックし、ＮＮ２７０の正解精度を改善する。 The learning unit 50 feeds back to the NN 270 the error between the estimation result 71 obtained by inputting each generated learning data 6g to the NN 270 and the label assigned to the learning data 6g, and calculates the correct accuracy of the NN 270. Improve.

図２７は、第２実施例における情報処理装置の第１の機能構成例において推論部の概要を説明するための図である。図２７において、情報処理装置１００では、推論時には、主に、シミュレーション部３０と、機械学習部４０の推論部６０とが動作する。メインメモリ１２には、問題データ２、残差曲線データ４ｄ、閾値Ｔｈ、推論結果７１等を記憶する。メインメモリ１２の一部が共有メモリ１２ａとして利用されることは、第１実施例で説明した通りである。ここでは推論部６０について説明する。 FIG. 27 is a diagram for explaining an outline of an inference unit in the first functional configuration example of the information processing apparatus according to the second embodiment. In FIG. 27, in the information processing apparatus 100, the simulation unit 30 and the inference unit 60 of the machine learning unit 40 mainly operate during inference. The main memory 12 stores the problem data 2, the residual curve data 4d, the threshold value Th, the inference result 71, and the like. As described in the first embodiment, part of the main memory 12 is used as the shared memory 12a. Here, the inference unit 60 will be described.

シミュレーション部３０は、残差曲線データ４ｄを出力し、推論部６０は、シミュレーション部３０の非線形解析部３２からの調整要求に応じて、シミュレーション部３０によって得られた残差曲線データ４ｄを用いて、訓練済みのＮＮ２７０により閾値Ｔｈの増減を推論し、得られた推論結果７１を出力する。推論結果７１は、非線形解析部３２へと戻り値として通知される。 The simulation unit 30 outputs residual curve data 4d, and the inference unit 60 uses the residual curve data 4d obtained by the simulation unit 30 in response to an adjustment request from the nonlinear analysis unit 32 of the simulation unit 30. , the trained NN 270 infers the increase or decrease of the threshold Th, and outputs the obtained inference result 71 . The inference result 71 is notified to the nonlinear analysis unit 32 as a return value.

訓練済みのＮＮ２７０を用いる際に、推論部６０は、シミュレーション部３０の非線形解析部３２からの調整要求に応じて、シミュレーション部３０によって得られた残差曲線データ４ｄを用いて、訓練済みのＮＮ２７０により閾値Ｔｈの増減を推論し、得られた推論結果７１を出力する。推論結果７１は、非線形解析部３２へと戻り値として通知される。 When using the trained NN 270, the inference unit 60 uses the residual curve data 4d obtained by the simulation unit 30 in response to an adjustment request from the nonlinear analysis unit 32 of the simulation unit 30, and uses the trained NN 270 increases or decreases the threshold value Th, and outputs the obtained inference result 71 . The inference result 71 is notified to the nonlinear analysis unit 32 as a return value.

図２８は、第２の機能構成例における学習処理の第１の例を説明するためのフローチャート図である。図２８中、図２４と同様の内容には同一ステップ番号を付し、その説明を省略する。図２８において、機械学習部４０では、学習部５０が、複数の候補閾値３を順にシミュレーション部３０に与え、シミュレーションを行わせて、候補閾値３ごとの残差曲線データ４ｄを取得する（ステップＳ１１１０）。残差曲線データ４ｄは、線形解析の反復ごとの残差とシミュレーションの実行時間とを含む。 FIG. 28 is a flowchart for explaining a first example of learning processing in the second functional configuration example. In FIG. 28, the same step numbers are given to the same contents as in FIG. 24, and the description thereof is omitted. In FIG. 28, in the machine learning unit 40, the learning unit 50 sequentially provides the plurality of candidate threshold values 3 to the simulation unit 30, causes the simulation unit 30 to perform simulation, and obtains the residual curve data 4d for each candidate threshold value 3 (step S1110). ). The residual curve data 4d includes the residuals for each iteration of the linear analysis and the simulation execution time.

残差曲線データ４ｄを取得すると、学習部５０は、複数の残差曲線データ４ｄの中から、最短時間でシミュレーションを終了した候補閾値３を特定する（ステップＳ１１２０）。そして、学習部５０は、特定した候補閾値３を基準閾値３ｒｅｆとして設定し、基準閾値３ｒｅｆと、候補閾値３との大小関係に基づいて、残差曲線データ４ｄに対するラベルを決定する（ステップＳ１１３１）。 After obtaining the residual curve data 4d, the learning unit 50 identifies the candidate threshold value 3 for which the simulation is completed in the shortest time from among the plurality of residual curve data 4d (step S1120). Then, the learning unit 50 sets the identified candidate threshold 3 as the reference threshold 3ref, and determines a label for the residual curve data 4d based on the magnitude relationship between the reference threshold 3ref and the candidate threshold 3 (step S1131). .

学習部５０は、残差曲線を一定区間ごとに区切って複数の入力データ６ａを作成し、作成した複数の入力データのそれぞれにＳ１１３０で決定したラベルを付与して、学習データ６ｇを生成する（ステップＳ１１４０）。 The learning unit 50 creates a plurality of input data 6a by dividing the residual curve into certain intervals, assigns the labels determined in S1130 to each of the created plurality of input data, and generates learning data 6g ( step S1140).

シミュレーション３０による処理は、第１の機能構成例における処理（図２４）と同様であるため、その説明を省略する。 The processing by the simulation 30 is the same as the processing (FIG. 24) in the first functional configuration example, so the description thereof is omitted.

図２９は、第２の機能構成例における推論処理を説明するためのフローチャート図である。図２９中、図２５と同様の内容には同一ステップ番号を付し、その説明を省略する。第２の機能構成例では、シミュレーション部３０による、推論結果７１を用いた、閾値Ｔｈの調整例について説明する。推論部６０による処理は第１の機能構成例と同様であるため、その説明を省略する。 FIG. 29 is a flowchart for explaining inference processing in the second functional configuration example. In FIG. 29, the same step numbers are given to the same contents as in FIG. 25, and the description thereof is omitted. In the second functional configuration example, an example of adjusting the threshold Th using the inference result 71 by the simulation unit 30 will be described. Since the processing by the inference unit 60 is the same as in the first functional configuration example, the description thereof is omitted.

図２９において、非線形解析部３２は、残差曲線データ４ｄを指定して調整要求を調整部６０に行うことで、調整部６０から得た推論結果７１に従って、線形解析の残差閾値の変化率ｎ又は１／ｎで閾値Ｔｈを調整し（ステップＳ３０１８）、ステップＳ３０１１へと戻り上述同様の処理を繰り返す。 In FIG. 29, the nonlinear analysis unit 32 designates the residual curve data 4d and makes an adjustment request to the adjustment unit 60, and according to the inference result 71 obtained from the adjustment unit 60, the residual threshold change rate The threshold value Th is adjusted by n or 1/n (step S3018), the process returns to step S3011, and the same processing as described above is repeated.

ステップＳ３０１８では、具体的には、変化率ｎ（ｎは２以上の自然数）を用いて、推論結果７１が閾値Ｔｈを上げることを示す場合、非線形解析部３２は、閾値Ｔｈをｎ倍する。一方、下げる場合、非線形解析部３２は、閾値Ｔｈを１／ｎ倍（変化率の逆数倍）する。他の例として、推論結果７１が閾値Ｔｈを上げることを示さない場合に閾値Ｔｈを１／ｎ倍してもよい。 Specifically, in step S3018, the nonlinear analysis unit 32 multiplies the threshold Th by n when the inference result 71 indicates that the threshold Th is to be increased using the rate of change n (n is a natural number of 2 or more). On the other hand, when lowering, the nonlinear analysis unit 32 multiplies the threshold Th by 1/n (reciprocal times the rate of change). As another example, the threshold Th may be multiplied by 1/n if the inference result 71 does not indicate that the threshold Th should be raised.

図３０は、第２の機能構成例における学習処理の第２の例を説明するためのフローチャート図である。図３０中、図２８と同様の内容には同一ステップ番号を付し、その説明を省略する。学習処理の第２の例では、候補閾値３から基準閾値３ｒｅｆを特定するのではなく、問題データ２ごとに、複数の候補閾値３間のシミュレーション結果５の比較から基準閾値３ｒｅｆを作成する。 FIG. 30 is a flowchart for explaining a second example of learning processing in the second functional configuration example. In FIG. 30, the same step numbers are given to the same contents as in FIG. 28, and the description thereof is omitted. In the second example of the learning process, instead of specifying the reference threshold 3ref from the candidate thresholds 3, the reference threshold 3ref is created by comparing simulation results 5 among a plurality of candidate thresholds 3 for each problem data 2. FIG.

図３０において、機械学習部４０では、学習部５０が、複数の候補閾値３を順にシミュレーション部３０に与え、シミュレーションを行わせて、候補閾値３ごとの残差曲線データ４ｄとシミュレーション結果５とを取得する（ステップＳ１１１２）。残差曲線データ４ｄは、線形解析の反復ごとの残差とシミュレーションの実行時間とを含む。 In FIG. 30 , in the machine learning unit 40, the learning unit 50 sequentially provides a plurality of candidate threshold values 3 to the simulation unit 30, performs a simulation, and obtains the residual curve data 4d for each candidate threshold value 3 and the simulation result 5. Acquire (step S1112). The residual curve data 4d includes the residuals for each iteration of the linear analysis and the simulation execution time.

学習部５０は、問題データ２ごとに、それぞれの候補閾値３でのシミュレーション結果５に対して、最も候補閾値３が低いときのシミュレーション結果５との結果誤差を計算する（ステップＳ１１２３）。結果誤差の算出例として、学習部５０は、シミュレーション結果５の各要素ごとに、同一問題で最も候補閾値３が低いときの要素の値との差を算出して平均絶対誤差（ＭＡＥ：Mean Absolute Error）を算出して、結果誤差を求める。 The learning unit 50 calculates the result error between the simulation result 5 with each candidate threshold 3 and the simulation result 5 with the lowest candidate threshold 3 for each problem data 2 (step S1123). As an example of calculating the result error, the learning unit 50 calculates the difference between each element of the simulation result 5 and the value of the element when the candidate threshold 3 is the lowest in the same problem, and calculates the mean absolute error (MAE: Mean Absolute error) to obtain the resulting error.

そして、学習部５０は、問題データ２ごとに、結果誤差と実行時間とを、候補閾値３ごとに比較した結果に基づいて、基準閾値３ｒｅｆを設定し、基準閾値３ｒｅｆと、候補閾値３との大小関係に基づいて、残差曲線データ４ｄに対するラベルを決定する（ステップＳ１１４５）。基準閾値３ｒｅｆの決定方法の一例として、学習部５０は、結果誤差（即ち精度）と実行時間（即ち速度）の両方を参照し、予め定めた条件に最も近い候補閾値３を求める。条件の例として、ユーザ使用を満たす精度の中で最速となる候補閾値３を基準閾値３ｒｅｆとしてもよい。或いは、結果誤差と実行時間との積を最小化するときの候補閾値３を基準閾値３ｒｅｆに適用してもよい。 Then, the learning unit 50 sets the reference threshold 3ref based on the result of comparing the result error and the execution time for each candidate threshold 3 for each problem data 2, and sets the reference threshold 3ref and the candidate threshold 3. A label for the residual curve data 4d is determined based on the magnitude relationship (step S1145). As an example of a method for determining the reference threshold 3ref, the learning unit 50 refers to both the result error (ie precision) and the execution time (ie speed) to obtain the candidate threshold 3 that is closest to the predetermined condition. As an example of the condition, the candidate threshold 3 that is the fastest among the accuracies that satisfy the user's usage may be set as the reference threshold 3ref. Alternatively, the candidate threshold 3 when minimizing the product of the resulting error and the execution time may be applied to the reference threshold 3ref.

学習部５０は、各問題データ２の残差曲線データ４ｄそれぞれに対して、残差曲線を一定区間ごとに区切って複数の入力データ６ａを作成し、作成した複数の入力データ６ａのそれぞれにＳ１１４６で決定したラベルを付与して、学習データ６ｇを生成する（ステップＳ１１４７）。そして、学習部５０は、生成した学習データ６ｇを用いてＮＮ２７０を学習する（ステップＳ１１５０）。 For each of the residual curve data 4d of each question data 2, the learning unit 50 creates a plurality of input data 6a by dividing the residual curve into certain intervals, and applies S1146 to each of the created plurality of input data 6a. The label determined in step S1147 is added to generate learning data 6g. The learning unit 50 then learns the NN 270 using the generated learning data 6g (step S1150).

図３１は、第２の機能構成例における学習処理の第３の例を説明するためのフローチャート図である。図３１中、図３０と同様の内容には同一ステップ番号を付し、その説明を省略する。学習処理の第３の例では、ユーザが所望するシミュレーションの精度を考慮して候補閾値３から基準閾値３ｒｅｆを特定する。シミュレーション部３０の処理は、図３０と同等であるため簡略化し、その説明を省略する。 FIG. 31 is a flowchart for explaining a third example of learning processing in the second functional configuration example. In FIG. 31, the same step numbers are given to the same contents as in FIG. 30, and the description thereof is omitted. In the third example of the learning process, the reference threshold value 3ref is specified from the candidate threshold value 3 in consideration of the simulation accuracy desired by the user. Since the processing of the simulation unit 30 is the same as that of FIG. 30, it is simplified and the description thereof is omitted.

図３０において、機械学習部４０では、ユーザが所望するシミュレーションの精度を結果誤差の基準値に設定する（ステップＳ１１５０）。一例としてMAE<0.0001などである。そのため、図３０のステップＳ１１４５の代わりに、ステップＳ１１４６を行う。 In FIG. 30, the machine learning unit 40 sets the simulation accuracy desired by the user as the reference value of the result error (step S1150). An example is MAE<0.0001. Therefore, step S1146 is performed instead of step S1145 of FIG.

ステップＳ１１４６では、学習部５０は、問題データ２ごとに、結果誤差がユーザの所望する精度を満たすものの中で、実行時間が最も短いときの候補閾値３を基準閾値３ｒｅｆに設定し、基準閾値３ｒｅｆと、候補閾値３との大小関係に基づいて、残差曲線データに対するラベルを決定する。以下、ステップＳ１１４７及びＳ１１５０は、図３０と同様であるため説明を省略する。 In step S1146, the learning unit 50 sets the candidate threshold value 3 when the execution time is the shortest among results error satisfying the accuracy desired by the user for each question data 2 as the reference threshold value 3ref. and the candidate threshold 3, the label for the residual curve data is determined. Since steps S1147 and S1150 are the same as those in FIG. 30, description thereof will be omitted.

上述した第１実施例及び第２実施例における学習環境として、
・ＮＮ２７０
一例として、AlexNetを用いてＣＮＮを構成すればよい。
・ＡＩフレームワーク
TensorFlow及びTensorFlowへのＡＰＩとしてKerasなどを用いればよい。このような学習環境を持つ情報処理装置１００において、発明者等が開発した線形解析の閾値Ｔｈを動的に調整する手法をシミュ―レーションに適用し、当該手法を適用した結果として得られた様々な情報を、以下に提示する。 As a learning environment in the first and second embodiments described above,
・NN270
As an example, a CNN may be configured using AlexNet.
・AI framework
Keras or the like may be used as TensorFlow and an API to TensorFlow. In the information processing apparatus 100 having such a learning environment, a method for dynamically adjusting the threshold value Th for linear analysis developed by the inventors was applied to the simulation, and various results obtained as a result of applying the method were applied. Information is presented below.

次に、第２の機能構成例における学習処理の第１の例の学習結果について図３２及び図３３に示す。図３２は、学習処理の第１の例による候補閾値ごとの学習結果を示す図である。この例では、図２３で説明した線形解析の残差閾値の範囲にある１５個の候補閾値３を用いている。 Next, FIG. 32 and FIG. 33 show learning results of the first example of the learning process in the second functional configuration example. FIG. 32 is a diagram showing learning results for each candidate threshold value according to the first example of the learning process. In this example, 15 candidate thresholds 3 within the residual threshold range of the linear analysis described in FIG. 23 are used.

図３２では、候補閾値３のそれぞれについて、線形残差曲線を収束２回ごとの残差曲線が示されている。また、基準閾値３ｒｅｆより小さい候補閾値３に対してラベル「０」を設定し、それ以外の候補閾値３に対してはラベル「１」を設定する。 In FIG. 32, for each of candidate threshold values 3, residual curves are shown every two times the linear residual curve converges. Also, the label "0" is set for candidate threshold values 3 smaller than the reference threshold value 3ref, and the label "1" is set for other candidate threshold values 3.

このようにラベル付けした学習データ６ｇを用いた場合の学習結果を図３３に示す。図３３は、図３２のラベル付けに基づく学習結果を示す図である。 FIG. 33 shows the learning result when using the learning data 6g labeled in this way. FIG. 33 is a diagram showing learning results based on the labeling in FIG.

上述した検証より、図３３（Ａ）に損失グラフ３３ａでは、学習時損失値３３ａ－１と検証時損失値３３ａ－２とを示し、図３３（Ｂ）に示す精度グラフ３３ｂでは、学習時精度値３３ｂ－１と検証時精度値３３ｂ－２とを示している。損失は、エポックの少ない時期に急速に改善し、検証用のデータにおいても学習時と同様の傾向を示している。また、精度についても、エポックの少ない時期に急速に改善し、検証用のデータにおいても学習時と同様の傾向を示している。適切な学習が行われたと言える。 From the above verification, the loss graph 33a in FIG. 33(A) shows the learning loss value 33a-1 and the verification loss value 33a-2. A value 33b-1 and a verification accuracy value 33b-2 are shown. The loss improves rapidly in the period with few epochs, and the validation data shows the same tendency as during training. In addition, the accuracy also improved rapidly when there were few epochs, and the verification data showed the same tendency as during learning. It can be said that appropriate learning was carried out.

次に、第２の機能構成例における学習処理の第２の例の学習結果について図３４及び図３５を参照して説明する。図３４は、学習処理の第２の例による候補閾値ごとの学習結果を示す図である。この例では、線形解析の残差閾値１５個の候補閾値３を用いている。 Next, the learning result of the second example of the learning process in the second functional configuration example will be described with reference to FIGS. 34 and 35. FIG. FIG. 34 is a diagram showing learning results for each candidate threshold value according to the second example of the learning process. In this example, 15 candidate thresholds 3 for residual thresholds of linear analysis are used.

図３４では、候補閾値３のそれぞれについて、線形残差曲線を収束２回ごとの残差曲線が示されている。ＣＧとＡＭＧ（Algebraic Multigrid）とを用いた構造解析のモデル用いている。また、シミュレーション結果の精度が十分で最速時の候補閾値３より小さい場合にラベル「０」を設定し、それ以外の候補閾値３に対してはラベル「１」を設定する。 In FIG. 34, for each of the candidate threshold values 3, the residual curve is shown every two times the linear residual curve converges. A structural analysis model using CG and AMG (Algebraic Multigrid) is used. Also, if the accuracy of the simulation result is sufficient and is smaller than the candidate threshold 3 for the fastest time, the label "0" is set, and the label "1" is set for the other candidate threshold 3.

このようにラベル付けした学習データ６ｇを用いた場合の学習結果を図３５に示す。図３５は、図３４のラベル付けに基づく学習結果を示す図である。 FIG. 35 shows the learning result when using the learning data 6g labeled in this way. FIG. 35 is a diagram showing learning results based on the labeling of FIG.

上述した検証より、図３５（Ａ）に示す損失グラフ３５ａと、図３５（Ｂ）に示す精度グラフ３５ｂとを得た。図３５（Ａ）に損失グラフ３５ａでは、学習時損失値３５ａ－１と検証時損失値３５ａ－２とを示し、図３５（Ｂ）に示す精度グラフ３５ｂでは、学習時精度値３５ｂ－１と検証時精度値３５ｂ－２とを示している。損失は、エポックの少ない時期に急速に改善し、検証用のデータにおいても学習時と同様の傾向を示している。また、精度についても、エポックの少ない時期に急速に改善し、検証用のデータにおいても学習時と同様の傾向を示している。学習処理の第２の例では、特に、検証時損失値３５ａ－２と検証時精度値３５ｂ－２とが、非常に滑らかに推移しており、学習処理の第１の例以上に高精度な学習が行われたと言える。 Through the verification described above, a loss graph 35a shown in FIG. 35(A) and an accuracy graph 35b shown in FIG. 35(B) were obtained. The loss graph 35a in FIG. 35(A) shows a learning loss value 35a-1 and a verification loss value 35a-2, and the accuracy graph 35b shown in FIG. 35(B) shows a learning accuracy value 35b-1 and A verification accuracy value 35b-2 is shown. The loss improves rapidly in the period with few epochs, and the validation data shows the same tendency as during training. In addition, the accuracy also improved rapidly when there were few epochs, and the verification data showed the same tendency as during learning. In the second example of the learning process, in particular, the verification loss value 35a-2 and the verification accuracy value 35b-2 transition very smoothly, and the accuracy is higher than in the first example of the learning process. It can be said that learning has taken place.

更に、第２実施例における学習処理の第１の例と学習処理の第２の例とを比較検証した結果を以下に示す。検証環境は、以下の通りである。
学習部５０（ＡＩ部）の呼び出し間隔: 非線形ループ２回ずつ
訓練済みモデル:
・第１の例のラベル付け:
線形解析の残差閾値が最速時の候補閾値３より小さいときラベル０を設定し、
それ以外のときラベル１を設定する。 Furthermore, the result of comparing and verifying the first example of the learning process and the second example of the learning process in the second embodiment is shown below. The verification environment is as follows.
Call interval of learning unit 50 (AI unit): 2 nonlinear loops each Trained model:
- Labeling for the first example:
set the label 0 when the residual threshold of the linear analysis is less than the candidate threshold 3 at the fastest time,
Otherwise, set label 1.

・第２の例のラベル付け:
シミュレーション結果の精度が十分であり、かつ、
最速時の候補閾値３より小さいときラベル０を設定し、
それ以外のときラベル１を設定する。 - Labeling for the second example:
The accuracy of the simulation results is sufficient, and
Set label 0 when less than candidate threshold 3 at fastest time,
Otherwise, set label 1.

線形解析の残差閾値の変化率:２倍又は１／２倍
としたときの結果を説明する。 Rate of change of residual threshold for linear analysis: results when doubled or halved.

図３６は、学習データの生成例を説明するための図である。図３６では、第２実施例において学習部５０が残差曲線データ４ｄ－１、４ｄ－２のそれぞれを定めた区間ごとに区切って、入力データ６ａを作成する。 FIG. 36 is a diagram for explaining an example of generating learning data. In FIG. 36, in the second embodiment, the learning unit 50 divides the residual curve data 4d-1 and 4d-2 into predetermined sections to create the input data 6a.

残差曲線データ４ｄの候補閾値３が基準閾値３ｒｅｆ以上の場合に区切った全ての入力データ６ａにラベル「１」を付与し、それ以外の場合にラベル「０」を付与した例をそれぞれ示している。一例として、１つの残差曲線データ４ｄに対して１２個の学習データ６ｇが生成されたものとする。 An example is shown in which the label "1" is given to all the divided input data 6a when the candidate threshold value 3 of the residual curve data 4d is equal to or greater than the reference threshold value 3ref, and the label "0" is given otherwise. there is As an example, it is assumed that 12 pieces of learning data 6g are generated for one residual curve data 4d.

学習部５０は、１つの残差曲線データ４ｄに対して、同一ラベルが付与された複数の学習データ６ｇを用いてＮＮ２７０を学習する。ＮＮ２７０に対して誤差はフィードバックされ、第２実施例では、１つの残差曲線データ４ｄから同一ラベルの複数の学習データ６ｇを得られるため、ＮＮ２７０にラベル「１」又は「０」を精度良く学習させることができる。そのため、推論部６０において、閾値Ｔｈを最適な値へと適切に変化させ、シミュレーション時間を高速化することができる。 The learning unit 50 learns the NN 270 using a plurality of learning data 6g assigned the same label for one residual curve data 4d. The error is fed back to the NN 270, and in the second embodiment, a plurality of learning data 6g with the same label can be obtained from one residual curve data 4d. can be made Therefore, the inference unit 60 can appropriately change the threshold value Th to an optimum value, thereby speeding up the simulation time.

シミュレーションの実行時間とシミュレーション結果５の精度とについて検証した結果を以下に示す。 The result of verifying the simulation execution time and the accuracy of the simulation result 5 is shown below.

図３７は、実行時間の検証結果を示す図である。図３７において、縦軸を時間（秒）で示し、左から
・初期設定に用いる規定値「1.0e-08」でシミュレーションを実行した場合
・第２の例の場合、
・第１の例の場合、
・図２３に例示したような実行時間が最速時の閾値でシミュレーションを実行した場合
のそれぞれの実行時間を示している。 FIG. 37 is a diagram showing verification results of the execution time. In FIG. 37, the vertical axis indicates time (seconds), and from the left: When the simulation is executed with the default value “1.0e-08” used for initial setting. In the case of the second example,
・In the case of the first example,
Each execution time is shown when the simulation is executed with the threshold for the fastest execution time as illustrated in FIG. 23 .

初期設定、第２の例、第１の例、そして最速設定の順に実行時間が減少している。最速設定が最も実行時間が短いが、第２の例及び第１の例は、最も実行時間が長い初期設定に対して高速化を実現している。 The execution time decreases in the order of initial setting, second example, first example, and fastest setting. The fastest setting has the shortest execution time, but the second example and the first example achieve speed-up with respect to the initial setting with the longest execution time.

図３８は、シミュレーション結果の検証結果を示す図である。図３８では、縦軸にシミュレーション結果の誤差を示し、図３７同様に、初期設定、第２の例、第１の例、そして最速設定の順に誤差を示している。 FIG. 38 is a diagram showing verification results of simulation results. In FIG. 38, the vertical axis indicates the error of the simulation result, and similarly to FIG. 37, the error is indicated in the order of the initial setting, the second example, the first example, and the fastest setting.

初期設定は、有効数字が平均４桁一致の基準３８ａに到達している。第２の例では、基準３８ａを上回る誤差を示すものの基準３８ａに近い精度を示している。一方、第１の例及び最速設定の精度では、有効数字が平均３桁一致の基準３８ｂを超えているが、略同程度の誤差を示している。 The initial setting has reached the criterion 38a of 4 significant digits match on average. The second example shows an error that exceeds the criterion 38a but an accuracy that is close to the criterion 38a. On the other hand, in the accuracy of the first example and the fastest setting, although the significant digits exceed the standard 38b of coincidence of 3 digits on average, the errors are approximately the same.

上述した検証より、シミュレーション結果５の精度を維持しつつ高速化を試みる観点において、第１の例及び第２の例では、実行速度を改善しつつ精度を保つことを実現しているといえる。 From the above verification, from the viewpoint of trying to speed up while maintaining the accuracy of the simulation result 5, in the first example and the second example, it can be said that the accuracy is maintained while improving the execution speed.

図３９は、閾値の変化例を示す図である。図３９では、縦軸に線形解析の閾値を示し、横軸にニュートン法のステップ数を示す。本実施例におけるシミュレーションの閾値変化３９ａと、本実施例を用いない既存のシミュレーションの閾値変化３９ｂとを所定ステップごとに示している。 FIG. 39 is a diagram showing an example of threshold change. In FIG. 39, the vertical axis indicates the threshold value for linear analysis, and the horizontal axis indicates the number of Newton's method steps. A threshold change 39a of the simulation in this embodiment and a threshold change 39b of an existing simulation that does not use this embodiment are shown at predetermined steps.

本実施例におけるシミュレーションとは、第１実施例及び第２実施例において例示した機能構成を有する情報処理装置１００によって行われたシミュレーションに相当する。また、ステップとは、ニュートン法に基づく線形ソルバにおけるステップに相当し、所定ステップとは、この例では２ステップである。 The simulation in this embodiment corresponds to the simulation performed by the information processing apparatus 100 having the functional configuration illustrated in the first and second embodiments. The steps correspond to steps in a linear solver based on Newton's method, and the predetermined steps are two steps in this example.

図３９より、情報処理装置１００における閾値変化３９ａは、閾値Ｔｈは、ステップ１で初期値に設定されてからステップ４１まで上昇し、その後、動的に変化しながら線形解析が行われていることが分かる。一方、既存のシミュレーションの閾値変化３９ｂは、初期値を維持した状態を示す。 From FIG. 39 , the threshold change 39 a in the information processing apparatus 100 is that the threshold Th is set to the initial value in step 1 and rises up to step 41 , and then the linear analysis is performed while dynamically changing. I understand. On the other hand, the existing simulation threshold change 39b indicates a state in which the initial value is maintained.

図４０は、処理時間の経過例を示す図である。図４０では、縦軸に実行時間を示し、横軸にニュートン法のステップ数を示す。実行時間は、ニュートン法１ステップに掛かった時間を示している。本実施例における処理時間経過４０ａと、本実施例を用いない既存の処理時間経過４０ｂとをステップごとに計測した時間を示している。 FIG. 40 is a diagram showing an example of elapsed processing time. In FIG. 40, the vertical axis indicates the execution time, and the horizontal axis indicates the number of Newton's method steps. The execution time indicates the time taken for one step of Newton's method. The processing time lapse 40a in this embodiment and the existing processing time lapse 40b that does not use this embodiment are shown, which are measured for each step.

既存の処理時間経過４０ｂは、実行時間が一定の変動範囲で推移を繰り返している。一方、本実施例における処理時間経過４０ａでは、ステップを反復するごとに実行時間が短縮され、凡そ３５ステップ以降では、一定の変動範囲で推移を繰り返している。本実施例における処理時間経過４０ａの変動範囲は、既存の処理時間経過４０ｂにおける変動範囲より明らかに高速な範囲である。 In the existing processing time lapse 40b, the execution time repeats transition within a constant variation range. On the other hand, in the processing time lapse 40a in the present embodiment, the execution time is shortened each time the step is repeated, and after about the 35th step, the transition is repeated within a constant variation range. The variation range of the processing time lapse 40a in this embodiment is clearly faster than the variation range of the existing processing time lapse 40b.

図４１は、反復回数の経過例を示す図である。図４１では、縦軸に線形解析の反復回数を示し、横軸にニュートン法のステップ数を示す。本実施例のシミュレーションにおける線形解析の反復回数の経過４１ａと、本実施例を用いない既存のシミュレーションにおける線形解析の反復回数の経過４１ｂとを示している。 FIG. 41 is a diagram showing an example of progress of the number of iterations. In FIG. 41, the vertical axis indicates the number of iterations of the linear analysis, and the horizontal axis indicates the number of Newton's method steps. The progress 41a of the number of iterations of linear analysis in the simulation of this embodiment and the progress 41b of the number of iterations of linear analysis in an existing simulation that does not use this embodiment are shown.

反復回数の経過４１ｂは、ステップ数が増加しても略同様の反復回数で線形解析を行い続け、この例では、６０回以下で線形解析が収束することがない。一方、反復回数の経過４１ａは、減少し続け、一定の反復回数の間で変動しながら推移する。 In the iteration number progress 41b, even if the number of steps increases, the linear analysis continues to be performed with approximately the same number of iterations, and in this example, the linear analysis does not converge at 60 times or less. On the other hand, the progress 41a of the number of iterations continues to decrease and fluctuates between constant number of iterations.

上述した第２実施例に係る検証は、以下のような言語及びライブラリを用いた例として示している。
・シミュレーション部３０
構造解析ＯＳＳ（Open Source Software）であるFrontISTRを用いる。
・機械学習部４０
学習部５０及び推論部６０のメインプログラムは、多くのアプリケーションに組み込み可能なPython等の汎用スクリプト言語を用いればよい。 The verification according to the second embodiment described above is shown as an example using the following languages and libraries.
・Simulation unit 30
FrontISTR, which is structural analysis OSS (Open Source Software), is used.
・Machine learning unit 40
The main programs of the learning unit 50 and the inference unit 60 may use a general-purpose script language such as Python that can be embedded in many applications.

学習部５０及び推論部６０から呼び出すＡＩ推論（ＮＮ２７０に相当）を行うプログラムには、機械学習用ライブラリであるTensorFlowを用い、TensorFlowを呼び出し容易なKerasのＡＰＩを用いればよい。 A program that performs AI inference (corresponding to NN270) called from the learning unit 50 and the inference unit 60 may use TensorFlow, which is a library for machine learning, and use the Keras API that easily calls TensorFlow.

このように、シミュレーション部３０と機械学習部４０とではプログラム言語の構造が異なっている。シミュレーション部３０と機械学習部４０との間でのデータの受け渡しは、既存のディスク１３（図１）を介して行うことが可能であるので、第１実施例の実装は必須ではない。しかしながら、図３７及び図３８で示したように実行速度とシミュレーション結果５の精度とは、トレードオフの関係にある。シミュレーションの精度を維持しつつより高速化する観点において、第１実施例で説明したようなプロセス間通信を実装することがより好ましい。 Thus, the simulation unit 30 and the machine learning unit 40 have different program language structures. Data can be exchanged between the simulation unit 30 and the machine learning unit 40 via the existing disk 13 (FIG. 1), so implementation of the first embodiment is not essential. However, as shown in FIGS. 37 and 38, the execution speed and the accuracy of the simulation result 5 are in a trade-off relationship. From the viewpoint of speeding up the simulation while maintaining the accuracy of the simulation, it is more preferable to implement inter-process communication as described in the first embodiment.

また、上記言語及びライブラリを用いた第２実施例において、シミュレーションを実行した場合には、図４２のような実行ログを得ることができる。図４２は、第２実施例における実行ログの例を示す図である。 Also, in the second embodiment using the language and library described above, when a simulation is executed, an execution log such as that shown in FIG. 42 can be obtained. FIG. 42 is a diagram showing an example of an execution log in the second embodiment.

図４２に例示される実行ログ４ａは、FrontISTRでプログラムされたシミュレーション部３０が実行されると出力されるログに相当する。実行ログ４ａでは、ログ記述４２ａにより、ＮＮ２７０に相当するＡＩ処理を実行するプロセッサが呼び出されている。プロセッサは、ＧＰＵ１４の製品名等で示される。ログ記述４２ｂではＡＩフレームワークが呼び出され、この例ではTensorflowが呼び出されたことが示されている。 The execution log 4a illustrated in FIG. 42 corresponds to a log output when the simulation section 30 programmed with FrontISTR is executed. In the execution log 4a, a processor that executes AI processing corresponding to the NN 270 is called by the log description 42a. A processor is indicated by the product name of the GPU 14 or the like. The log description 42b indicates that the AI framework was called, and that Tensorflow was called in this example.

ログ記述４２ｃ及び４２ｄから、残差の閾値Ｔｈが略1e-08から略2e-08に変化したことが分かる。第２実施例の適用のない既存のシミュレーションでは、このように閾値Ｔｈが変動することはなく、例えば、閾値Ｔｈが1e-08であれば、常に1e-08を示すログが記録される。 From the log descriptions 42c and 42d, it can be seen that the residual threshold Th has changed from approximately 1e-08 to approximately 2e-08. In existing simulations to which the second embodiment is not applied, the threshold Th does not fluctuate in this way. For example, if the threshold Th is 1e-08, a log indicating 1e-08 is always recorded.

なお、第１実施例及び第２実施例を実現する情報処理装置１００では、
・ＡＩ処理を行うプログラム用として、メインプログラムがPythonであれば、
python_main_path=/path/to/ai_main.py
などの設定ファイルが作成され特定の記憶領域に保存される。また、この特定の記憶領域には、
・訓練済みＡＩモデルが生成されると、推論部６０により利用可能なように、一例として、
trained_model_path=/path/to/trained_model.h5
などの設定ファイルが作成され保存される。 Note that in the information processing apparatus 100 that implements the first and second embodiments,
・For programs that perform AI processing, if the main program is Python,
python_main_path=/path/to/ai_main.py
A setting file such as is created and saved in a specific storage area. Also, in this particular storage area,
- Once the trained AI model is generated, so that it can be used by the reasoning unit 60, for example:
trained_model_path=/path/to/trained_model.h5
A configuration file such as is created and saved.

上述において、残差曲線は残差推移を表すデータの一例であり、実行時間は、シミュレーションの演算時間の一例であり、ＮＮ２７０は、推論モデルの一例である。 In the above description, the residual curve is an example of data representing residual transition, the execution time is an example of simulation computation time, and the NN 270 is an example of an inference model.

本発明は、具体的に開示された実施例に限定されるものではなく、特許請求の範囲から逸脱することなく、主々の変形や変更が可能である。 The invention is not limited to the specifically disclosed embodiments, which are capable of major variations and modifications without departing from the scope of the claims.

以上の第１実施例～第２実施例を含む実施形態に関し、更に以下の付記を開示する。
（付記１）
１又は複数のコンピュータに、
線形解析を反復して非線形解析を行う第１の処理を実行させ、
前記第１の処理により複数の実験値で残差閾値ごとに求めた前記線形解析の反復ごとの残差推移と演算時間とに基づいて、推論モデルにより前記線形解析の収束判定に用いる残差閾値を推論させる第２の処理とを実行させ、
前記第１の処理と前記第２の処理との間で行われるデータの受け渡しを、メモリに設定された共有メモリを用いたプロセス間通信により行わせる
ことを特徴とする情報処理プログラム。
（付記２）
前記第１の処理は、
前記共有メモリに前記線形解析の反復ごとの前記残差推移と前記演算時間とを前記共有メモリに記憶して、第１の名前付きパイプにアドレスを設定し、
第２の名前付きパイプから前記第２の処理において書き込まれた前記推論モデルの推論結果を取得し、
前記第２の処理は、
前記共有メモリ内の前記第１の名前付きパイプで指定される前記アドレスから前記残差推移と前記演算時間とを取得し、
前記残差推移と前記演算時間とから定めた基準閾値と該残差推移とから前記推論モデルに前記残差閾値を推論する学習を行わせ、該推論モデルにより出力された前記推論結果を前記第２の名前付きパイプに書き込む
ことを特徴とする付記１記載の情報処理プログラム。
（付記３）
前記第２の処理は、
前記演算時間が最も短いときの前記実験値を前記基準閾値に設定し、
前記残差推移ごとに用いた実験値と前記基準閾値とを比較した結果に応じて、該残差推移にラベル付けして学習データを生成し、
前記学習データを用いて前記推論モデルを行う
ことを特徴とする付記２記載の情報処理プログラム。
（付記４）
前記第２の処理は、
前記演算時間が最も短いときの前記実験値を前記基準閾値に設定し、
前記残差推移ごとに用いた前記実験値と前記基準閾値とを比較した結果に応じて、該残差推移に対するラベルを決定し、
前記残差推移を一定区間ごとに区切って複数の入力データを作成し、作成した複数の入力データのそれぞれに決定した前記ラベルを付与して複数の学習データを生成し、
前記複数の学習データのそれぞれに対して前記推論モデルを行う
ことを特徴とする付記２記載の情報処理プログラム。
（付記５）
前記第２の処理は、
前記非線形解析の問題データごとに、前記演算時間が最も短いときの前記実験値を特定し、特定した該実験値を用いたときの前記第１の処理で得られた第１の結果と、他の実験値を用いたときの該第１の処理で得られた第２の結果それぞれとの結果誤差を計算し、
前記問題データごとに、前記結果誤差と実行時間とを、前記実験値ごとに比較した結果に基づいて、前記基準閾値を設定し、
前記残差推移ごとに用いた前記実験値と前記基準閾値とを比較した結果に応じて、該残差推移に対するラベルを決定し、
前記残差推移を一定区間ごとに区切って複数の入力データを作成し、作成した複数の入力データのそれぞれに決定した前記ラベルを付与して複数の学習データを生成し、
前記複数の学習データのそれぞれに対して前記推論モデルを行う
ことを特徴とする付記２記載の情報処理プログラム。
（付記６）
前記第２の処理は、
前記第１の処理からの呼び出しに応じて、前記第１の処理によって得られた前記残差推移を用いて、訓練済みモデルにより前記残差閾値を推論させた前記推論結果を取得して、前記第１の処理への戻り値として前記第２の名前付きパイプに書き込み、
前記第１の処理は、前記第２の名前付きパイプから前記推論結果を読み出し、読み出した該推論結果と予め定めた変化率とを用いて前記残差閾値を更新する付記３乃至５のいずれか一項に記載の情報処理プログラム。
（付記７）
前記第１の処理は、
前記第１の名前付きパイプに接続し、前記第２の処理からの第１のロック解除を待ち、
前記第１のロック解除の検出に応じて、前記共有メモリに、前記非線形解析により得られた非線形解析データと前記線形解析により得られた線形解析データとを書き込み、
前記共有メモリに、前記非線形解析の繰り返し回数と、前記線形解析の反復回数とを書き込み、
前記第２の名前付きパイプに接続し、前記第２の処理からの第２のロック解除を待ち、
前記第２のロック解除に応じて、前記第２の名前付きパイプから前記推論結果を取得する
ことを特徴とする付記２乃至６のいずれか一項に記載の情報処理プログラム。
（付記８）
前記第２の処理は、
前記第１の名前付きパイプに接続し、該第１の名前付きパイプのロックを解除し、
前記第１の処理からの要求に応じて、前記共有メモリから前記線形解析データと前記非線形解析データとを読み込み、それぞれから前記残差推移と前記演算時間とを取得し、訓練済みの前記推論モデルに前記残差閾値を推論させ、
前記第２の名前付きパイプに接続し、訓練済みの前記推論モデルによる推論結果を該第２の名前付きパイプに書き込んで、該第２の名前付きパイプをロック解除する
ことを特徴とする付記７に記載の情報処理プログラム。
（付記９）
前記第１の処理は、科学計算用の手続き型言語による第１のプログラムを第１のプロセッサが実行することにより行われ、
前記第２の処理は、前記第１のプログラムとは異なるスクリプト言語による第２のプログラムを前記第１のプロセッサが実行することにより行われ、
前記推論モデルは、前記第１のプロセッサとは異なる前記第２のプロセッサが、深層学習言語によるプログラムを実行することにより行われる
ことを特徴とする付記１乃至８のいずれか一項記載の情報処理プログラム。
（付記１０）
１又は複数のコンピュータが、
線形解析を反復して非線形解析を行う第１の処理を実行し、
前記第１の処理により複数の実験値で残差閾値ごとに求めた前記線形解析の反復ごとの残差推移と演算時間とに基づいて、推論モデルにより前記線形解析の収束判定に用いる残差閾値を推論させる第２の処理とを実行し、
前記第１の処理と前記第２の処理との間で行われるデータの受け渡しは、メモリに設定された共有メモリを用いたプロセス間通信により行う
ことを特徴とする情報処理方法。
（付記１１）
メモリと、
前記メモリに接続された１又は複数のプロセッサとを有し、該１又は複数のコンピュータが、
線形解析を反復して非線形解析を行う第１の処理と、
前記第１の処理により複数の実験値で残差閾値ごとに求めた前記線形解析の反復ごとの残差推移と演算時間とに基づいて、推論モデルにより前記線形解析の収束判定に用いる残差閾値を推論させる第２の処理とを実行し、
前記第１の処理と前記第２の処理との間で行われるデータの受け渡しは、前記メモリに設定された共有メモリを用いたプロセス間通信により行うことを特徴とする情報処理装置。 Further, the following additional remarks are disclosed regarding the embodiments including the above-described first to second examples.
(Appendix 1)
on one or more computers,
Execute a first process of repeating linear analysis to perform nonlinear analysis,
Residual threshold used for determining convergence of the linear analysis by the inference model based on the residual transition and computation time for each iteration of the linear analysis obtained for each residual threshold with a plurality of experimental values in the first process and a second process for inferring
An information processing program, characterized in that data exchange between the first process and the second process is performed by inter-process communication using a shared memory set in a memory.
(Appendix 2)
The first processing is
storing in the shared memory the residual transition and the computation time for each iteration of the linear analysis in the shared memory, and setting an address to a first named pipe;
obtaining an inference result of the inference model written in the second process from a second named pipe;
The second processing is
obtaining the residual transition and the computation time from the address specified by the first named pipe in the shared memory;
causing the inference model to learn to infer the residual threshold from the residual transition and a reference threshold determined from the residual transition and the computation time; 2. The information processing program according to claim 1, wherein the information processing program writes to 2 named pipes.
(Appendix 3)
The second processing is
setting the experimental value when the calculation time is the shortest as the reference threshold;
generating learning data by labeling the residual transition according to the result of comparing the experimental value used for each residual transition with the reference threshold;
The information processing program according to appendix 2, wherein the inference model is performed using the learning data.
(Appendix 4)
The second processing is
setting the experimental value when the calculation time is the shortest as the reference threshold;
determining a label for the residual transition according to a result of comparing the experimental value used for each residual transition with the reference threshold;
creating a plurality of input data by dividing the residual transition into certain intervals, assigning the determined label to each of the plurality of created input data to generate a plurality of learning data;
The information processing program according to appendix 2, wherein the inference model is applied to each of the plurality of learning data.
(Appendix 5)
The second processing is
The experimental value when the computation time is the shortest is specified for each problem data of the nonlinear analysis, and the first result obtained in the first process when using the specified experimental value; calculating the result error with each of the second results obtained in the first process when using the experimental value of
setting the reference threshold based on the result of comparing the result error and the execution time for each experimental value for each problem data;
determining a label for the residual transition according to a result of comparing the experimental value used for each residual transition with the reference threshold;
creating a plurality of input data by dividing the residual transition into certain intervals, assigning the determined label to each of the plurality of created input data to generate a plurality of learning data;
The information processing program according to appendix 2, wherein the inference model is applied to each of the plurality of learning data.
(Appendix 6)
The second processing is
In response to a call from the first process, using the residual transition obtained by the first process, obtaining the inference result of inferring the residual threshold by a trained model, writing to the second named pipe as a return value to the first process;
Any one of Appendices 3 to 5, wherein the first process reads the inference result from the second named pipe, and uses the read inference result and a predetermined change rate to update the residual threshold. 1. The information processing program according to item 1.
(Appendix 7)
The first processing is
connecting to said first named pipe and waiting for a first unlock from said second process;
writing the nonlinear analysis data obtained by the nonlinear analysis and the linear analysis data obtained by the linear analysis into the shared memory in response to detection of the first unlocking;
writing the number of iterations of the nonlinear analysis and the number of iterations of the linear analysis into the shared memory;
connecting to said second named pipe and waiting for a second unlock from said second process;
7. The information processing program according to any one of appendices 2 to 6, wherein the inference result is obtained from the second named pipe in response to the second unlocking.
(Appendix 8)
The second processing is
connecting to the first named pipe and unlocking the first named pipe;
reading the linear analysis data and the nonlinear analysis data from the shared memory in response to a request from the first process, obtaining the residual transition and the operation time from each, and obtaining the trained inference model; to infer the residual threshold,
Appendix 7, connecting to the second named pipe, writing inference results from the trained inference model to the second named pipe, and unlocking the second named pipe. The information processing program described in .
(Appendix 9)
The first process is performed by a first processor executing a first program in a procedural language for scientific computing,
The second process is performed by the first processor executing a second program in a script language different from the first program,
9. The information processing according to any one of appendices 1 to 8, wherein the inference model is performed by the second processor, which is different from the first processor, executing a program written in a deep learning language. program.
(Appendix 10)
one or more computers
performing a first process of repeating a linear analysis to perform a nonlinear analysis;
Residual threshold used for determining convergence of the linear analysis by the inference model based on the residual transition and computation time for each iteration of the linear analysis obtained for each residual threshold with a plurality of experimental values in the first process and a second process for inferring
An information processing method, wherein data is transferred between the first process and the second process by inter-process communication using a shared memory set in a memory.
(Appendix 11)
memory;
and one or more processors connected to the memory, the one or more computers comprising:
a first process of repeating linear analysis to perform nonlinear analysis;
Residual threshold used for determining convergence of the linear analysis by the inference model based on the residual transition and computation time for each iteration of the linear analysis obtained for each residual threshold with a plurality of experimental values in the first process and a second process for inferring
An information processing apparatus, wherein data is transferred between the first process and the second process by inter-process communication using a shared memory set in the memory.

２問題データ
３候補閾値
３ｒｅｆ基準閾値
４ａ実行ログ
４ｄ残差曲線データ
５シミュレーション結果
６ａ入力データ
６ｇ学習データ
１１ＣＰＵ
１２メインメモリ
１２ａ共有メモリ
１２ｍ－１ＣＭ領域
１２ｍ－２ＭＬ領域
１２ｐ名前付きパイプ領域
１２ｖＯＳ仮想メモリ
１３ディスク
１４ｇＧＰＵ
１４ｍＧＰＵメモリ
１５入力装置
１６表示装置
１７通信Ｉ／Ｆ
１８ドライブ装置
１９記憶媒体
３０シミュレーション部
３２非線形解析部
３４線形解析部
４０機械学習部
５０学習部
６０推論部
２３０高性能アプリケーション
２３２高性能アプリケーション
２０４ｄシミュレーションデータ
１００情報処理装置 2 problem data 3 candidate threshold 3ref reference threshold 4a execution log 4d residual curve data 5 simulation result 6a input data 6g learning data 11 CPU
12 Main memory 12a Shared memory 12m-1 CM area 12m-2 ML area 12p Named pipe area 12v OS virtual memory 13 Disk 14g GPU
14m GPU memory 15 input device 16 display device 17 communication I/F
18 drive unit 19 storage medium 30 simulation unit 32 nonlinear analysis unit 34 linear analysis unit 40 machine learning unit 50 learning unit 60 inference unit 230 high performance application 232 high performance application 204d simulation data 100 information processing device

Claims

on one or more computers,
Execute a first process of repeating linear analysis to perform nonlinear analysis,
Based on the residual transition and the computation time for each iteration of the linear analysis obtained for each residual threshold with a plurality of experimental values in the first process, the NN determines the residual threshold used to determine the convergence of the linear analysis. Execute a second process for inferring,
An information processing program, characterized in that data exchange between the first process and the second process is performed by inter-process communication using a shared memory set in a memory.

The first processing is
storing the residual transition and the computation time for each iteration of the linear analysis in the shared memory and setting an address to a first named pipe;
obtaining the inference result of the NN written in the second process from a second named pipe;
The second processing is
obtaining the residual transition and the computation time from the address specified by the first named pipe in the shared memory;
making the NN learn to infer the residual threshold from the residual transition and a reference threshold determined from the residual transition and the computation time; 2. The information processing program according to claim 1, writing to a named pipe.

The second processing is
setting the experimental value when the calculation time is the shortest as the reference threshold;
generating learning data by labeling the residual transition according to the result of comparing the experimental value used for each residual transition with the reference threshold;
3. The information processing program according to claim 2, wherein learning of said neural network is performed using said learning data.

The second processing is
setting the experimental value when the calculation time is the shortest as the reference threshold;
determining a label for the residual transition according to a result of comparing the experimental value used for each residual transition with the reference threshold;
creating a plurality of input data by dividing the residual transition into certain intervals, assigning the determined label to each of the plurality of created input data to generate a plurality of learning data;
3. The information processing program according to claim 2, wherein learning of said neural network is performed using said plurality of learning data.

The second processing is
The experimental value when the computation time is the shortest is specified for each problem data of the nonlinear analysis, and the first result obtained in the first process when using the specified experimental value; calculating the result error with each of the second results obtained in the first process when using the experimental value of
setting the reference threshold based on the result of comparing the result error and the execution time for each experimental value for each problem data;
determining a label for the residual transition according to a result of comparing the experimental value used for each residual transition with the reference threshold;
creating a plurality of input data by dividing the residual transition into certain intervals, assigning the determined label to each of the plurality of created input data to generate a plurality of learning data;
3. The information processing program according to claim 2, wherein learning of said neural network is performed using said plurality of learning data.

The second processing is
Acquiring the inference result of inferring the residual threshold by the trained NN using the residual transition obtained by the first process in response to a call from the first process. , writing to said second named pipe as a return value to said first process;
6. The first process reads the inference result from the second named pipe, and uses the read inference result and a predetermined rate of change to update the residual threshold. 1. The information processing program according to 1.

The first processing is
connecting to said first named pipe and waiting for a first unlock from said second process;
writing the nonlinear analysis data obtained by the nonlinear analysis and the linear analysis data obtained by the linear analysis into the shared memory in response to detection of the first unlocking;
writing the number of iterations of the nonlinear analysis and the number of iterations of the linear analysis into the shared memory;
connecting to said second named pipe and waiting for a second unlock from said second process;
7. The information processing program according to any one of claims 2 to 6, wherein the inference result is obtained from the second named pipe in response to the second unlocking.

The second processing is
connecting to the first named pipe and unlocking the first named pipe;
In response to a request from the first process, read the linear analysis data and the nonlinear analysis data from the shared memory, acquire the residual transition and the operation time from each, and load the trained NN inferring the residual threshold;
8. Connecting to said second named pipe, writing inference results by said trained neural network to said second named pipe, and unlocking said second named pipe. The information processing program described in .

one or more computers
performing a first process of repeating a linear analysis to perform a nonlinear analysis;
Based on the residual transition and the computation time for each iteration of the linear analysis obtained for each residual threshold with a plurality of experimental values in the first process, the NN determines the residual threshold used to determine the convergence of the linear analysis. performing a second process for inferring,
An information processing method, wherein data is transferred between the first process and the second process by inter-process communication using a shared memory set in a memory.

memory;
and one or more processors connected to the memory, the one or more computers comprising:
a first process of repeating linear analysis to perform nonlinear analysis;
Based on the residual transition and the computation time for each iteration of the linear analysis obtained for each residual threshold with a plurality of experimental values in the first process, the NN determines the residual threshold used to determine the convergence of the linear analysis. performing a second process for inferring,
An information processing apparatus, wherein data is transferred between the first process and the second process by inter-process communication using a shared memory set in the memory.