JP2017130111A - Dataflow log generation device, relational database, dataflow log generation method, program, and monitoring system

Info

Publication number: JP2017130111A
Application number: JP2016010003A
Authority: JP
Inventors: Toshio Ito (伊藤 俊夫)
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2016-01-21
Filing date: 2016-01-21
Publication date: 2017-07-27

Abstract

PROBLEM TO BE SOLVED: To generate a dataflow log that enables comprehensive monitoring of data crossing various types of components while suppressing the load applied to a computing system or the like as much as possible. SOLUTION: A dataflow log generation device comprises: a first acquisition part for, when a dataflow occurs between a first object in a first component that is a process and a second component, acquiring first position information indicating a start point or end point of the dataflow in the first component; a second acquisition part for acquiring second position information indicating the start point or end point of the dataflow in the second component; and a log generation part for generating a dataflow log in which the first position information and the second position information are recorded. SELECTED DRAWING: Figure 1

Description

Embodiments of the present invention relate to a data flow log generation device, a relational database, a data flow log generation method, a program, and a monitoring system.

In recent years, with improvements in computer processing efficiency and the growing sophistication of information services, computing systems in which a plurality of computers and programs operate in cooperation have become common. In such computing systems, it is known to observe and monitor the flow of data (data flow) across a plurality of programs in order to realize detection of abnormalities in computation, security diagnosis, debugging, performance monitoring, analysis of performance degradation factors, and the like.

A monitoring method that monitors data flows as described above is required (1) to be able to monitor data comprehensively even when the data crosses various types of components such as processes, files, and databases. However, when, for example, the standard monitoring functions of an OS are used, data flows based on files or inter-process communication can be monitored, but it is difficult to monitor data flows inside a process or data flows between a process and a database.

On the other hand, it is also required (2) that the load (overhead) on the computing system and changes to its configuration be kept as small as possible. For example, adding a mechanism that attaches identifying metadata in order to observe data flows across components increases the load on the computing system. It is therefore important to reconcile comprehensive monitoring with a low load on the computing system.

JP 2013-242633 A

Chun Hui Suen, Ryan K L Ko, Yu Shyang Tan, Peter Jagadpramana, Bu Sung Lee, "S2Logger: End-to-End Data Tracking Mechanism for Cloud Data Provenance," In Proceedings of 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom '13), p. 594-602.

An embodiment of the present invention generates a data flow log that enables comprehensive monitoring of data crossing various types of components while keeping the load on the computing system as small as possible.

A data flow log generation device according to one aspect of the present invention includes: a first acquisition unit that, when a data flow occurs between a first object in a first component that is a process and a second component, acquires first position information indicating a start point or an end point of the data flow in the first component; a second acquisition unit that acquires second position information indicating a start point or an end point of the data flow in the second component; and a log generation unit that generates a data flow log in which the first position information and the second position information are recorded.

A block diagram showing an example of the schematic configuration of the entire computer system according to the first embodiment.
A diagram showing an example of the access destination table.
A flowchart of the data flow log generation process inside a Java program.
A flowchart of the data flow log generation process when data is read from a database.
A diagram showing an example of the contents stored in the temporary binding storage unit.
A flowchart of the process of recording data in the temporary binding storage unit.
A flowchart of the data flow log generation process when data is written to a database.
A diagram showing an example of the contents of the temporary log table.
A block diagram showing an example of the schematic configuration of the entire computer system according to the second embodiment.
A diagram showing an example of the information stored in the access destination file name temporary storage unit.
A flowchart of the data flow log generation process when data is read from a file.
A flowchart of the data flow log generation process when data is written to a file.
A block diagram showing an example of the schematic configuration of the entire computer system according to the third embodiment.
A flowchart of the data flow log generation process inside a Java program according to the third embodiment.
A flowchart of the data flow log generation process between Java programs according to the third embodiment.
A block diagram showing an example of a hardware configuration in one embodiment of the present invention.

Hereinafter, embodiments of the present invention will be described with reference to the drawings.

(First embodiment)
FIG. 1 is a block diagram showing an example of the schematic configuration of the entire computer system according to the first embodiment. The computer system according to the first embodiment includes a process 1 run by a program, a relational database (RDB) 2, and a log receiving unit 3.

In the computer system according to the first embodiment, it is assumed that a predetermined procedure is carried out by the process (program) 1, which is the first component. In carrying out the predetermined procedure, various data are exchanged. Here, the flow of this exchange of data is referred to as a data flow. The data exchanged is not particularly limited. A data flow is also not limited to data transfer; for example, an event in which an object in a program generates another object is also a data flow.

Here, it is assumed that Java (registered trademark) is used for the program, and the program is hereinafter described as the Java program 1. A program written in a language other than Java, such as an object-oriented program having the same functions as the Java program 1, may also be used.

The Java program 1 generates a log in which data flows are recorded. This log is referred to as a data flow log. The data flow log includes information (position information) indicating the start point or end point of a data flow. Generating such a data flow log makes it possible to monitor data flows.

In the computer system according to the first embodiment, data flows inside the Java program 1 and data flows between the Java program 1 and the RDB 2 can occur. Therefore, data flow logs are generated for the following three types of data flows.
(1) A data flow inside the Java program 1
(2) A data flow in which the Java program 1 reads data from the RDB 2
(3) A data flow in which the Java program 1 writes data to the RDB 2
With the data flow logs for (2) and (3), comprehensive monitoring is possible even when a data flow crosses from the first component into another component (the second component).

Next, the Java program 1 will be described. The Java program 1 shown in FIG. 1 includes Java objects 101 and 102, a System class 103, a Logger object 104, a temporary binding storage unit 105, a JDBC (Java Database Connectivity) driver 106, and a Logger hook 107.

A Java object is an object generated in order to execute a predetermined procedure, and is the unit that holds data within a Java program. A Java program operates through the exchange of data between Java objects.

The arrow between the Java object 101 and the Java object 102 shown in FIG. 1 represents the data flow inside the Java program 1 of (1) above. The arrow between the Java object 102 and the RDB 2 represents the data flows between the Java program 1 and the RDB 2 of (2) and (3) above.

Note that the Java objects used differ from procedure to procedure. The Java objects used are introduced in the description of each procedure below.

The System class 103 assigns an object identifier to each Java object and manages those object identifiers. In the present embodiment, an object identifier is used to identify a Java object that has taken part in an exchange of data. That is, an object identifier is used to indicate the start point or end point of a data flow.

The Logger object 104 monitors data flows inside the Java program 1. The Logger object 104 also generates the Logger hook 107, described later, and monitors data flows between the Java program 1 and the RDB 2. Furthermore, the Logger object 104 generates data flow logs and sends them to the log receiving unit 3.

The temporary binding storage unit 105 temporarily stores data that the Logger object 104 needs when generating a data flow log.

The JDBC driver 106 performs the communication involved in exchanging data between the Java program 1 and the RDB 2. The JDBC driver 106 also performs communication for acquiring, from the RDB 2, the information needed to generate a data flow log.

The Logger hook 107 is generated by the Logger object 104 and attached to the JDBC driver 106. As a result, every data flow between the Java program 1 and the RDB 2 passes through the Logger hook 107. Because the Logger hook 107 detects the data flows between the Java program 1 and the RDB 2, those data flows can be monitored comprehensively.

Next, the RDB 2 will be described. The RDB 2 includes a database interface (DB_I/F) 201, an access destination table 202, a system catalog 203, and a temporary log table 204.

The DB_I/F 201 is the database operation interface exposed to the Java program 1. Data is exchanged between the Java program 1 and the RDB 2 through communication between the JDBC driver 106 and the DB_I/F 201.

The access destination table 202 is a table accessed from the Java object 102 in the predetermined procedure executed by the Java program 1. FIG. 2 is a diagram showing an example of the access destination table 202. The description here assumes that the access destination table 202 has the structure shown in FIG. 2. The table may also contain information not shown in FIG. 2.

The access destination table 202 is assumed to have the schema name sample_schema and the table name sample_table. sample_table has three columns: signal_id, history_time, and signal_value. Here, the pair (combination) of signal_id and history_time serves as the primary key that uniquely identifies a row in the table.

The system catalog 203 is a table that holds information about all tables in the RDB 2 (including the access destination table 202).

The temporary log table 204 is a table for temporarily storing logs of data writes to the access destination table 202.

Next, the log receiving unit 3 will be described. The log receiving unit 3 receives the generated data flow logs. Any communication means, such as TCP/IP communication, may be used to transmit and receive the data flow logs. The log receiving unit 3 may store the data flow logs for later analysis, or may output them to another system or the like.

In FIG. 1, the log receiving unit 3 is shown outside the Java program 1, but it may instead be located inside the Java program 1 or inside the RDB 2.

Next, a specific recording method for generating the data flow logs of the above three types of data flows will be described.
(1) Data flow inside the Java program 1
As shown in FIG. 1, data flows inside the Java program 1 include events such as a data transfer from the Java object 101 to the Java object 102, or the generation of the Java object 102 by the Java object 101. For example, the code shown below produces data flows inside the Java program 1.
Object copied_data = original_data.clone();
Object processed_data = processData(copied_data);
In the first line, a data flow occurs from the original_data object to the copied_data object. In this case, the original_data object corresponds to the Java object 101 and the copied_data object corresponds to the Java object 102. In the second line, a data flow occurs from the copied_data object to the processed_data object.

As described above, when a data flow occurs inside the Java program 1, the Java object 101 that is the start point of the data flow and the Java object 102 that is its end point are passed to the Logger object 104, which allows the Logger object 104 to detect that the data flow has occurred.

More specifically, the Java objects 101 and 102 are passed to the Logger object 104 by calling the flow method on the Logger object 104. The calls to the flow method for the two data flows above look like the following code.
logger.flow(original_data, copied_data);
logger.flow(copied_data, processed_data);
In the above code, logger is the Logger object 104. Writing logger.flow calls the flow method, and the objects in the parentheses are passed to the Logger object 104. In this example, the first argument is passed to the Logger object 104 as the Java object 101, the start point of the data flow, and the second argument is passed as the Java object 102, the end point of the data flow.

On detecting that a data flow has occurred, the Logger object 104 generates a data flow log for that data flow. FIG. 3 is a flowchart of the data flow log generation process inside the Java program. As described above, this flow starts when the flow method is called.

The Logger object 104 first calls the identityHashCode method of the System class 103 to acquire the object identifiers of the Java objects 101 and 102 (S101 and S102). Next, the Logger object 104 generates a data flow log based on the two acquired identifiers (S103). The generated data flow log is sent to the log receiving unit 3 (S104). Any communication means, such as TCP/IP communication, may be used to send the log. This completes the flow of the data flow log generation process.

Note that this flowchart is an example, and the process is not limited to it. For example, S101 and S102 may be performed in the opposite order. The same applies to the flowcharts of the embodiments described below.

The data flow log is generated, for example, as text data in JSON (JavaScript (registered trademark) Object Notation) format. An example of a data flow log generated as JSON text data is shown below.
{
  "src": {
    "type": "java_internal",
    "object_hash": "1487365346"
  },
  "dst": {
    "type": "java_internal",
    "object_hash": "1786281489"
  },
  "time": "1439172160987"
}
src holds information about the start point of the data flow, and dst holds information about its end point. The type field contained in src and dst indicates the type of component. The value java_internal of the type field indicates a Java object. The object_hash field holds the identifier of the object. The Java object at the start point of the data flow and the Java object at its end point can therefore be determined from this data flow log. The time field indicates the time at which the data flow log was generated.
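Steps S101 to S104 and the JSON layout above can be sketched in Java as follows. This is a minimal illustration under stated assumptions, not the implementation of the embodiment: the class name FlowLogger, the helper toJson, and the in-memory list that stands in for the log receiving unit 3 are all introduced here for illustration only.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for the Logger object 104 (illustrative names only). */
public class FlowLogger {
    /** Assumption: an in-memory list plays the role of the log receiving unit 3. */
    private final List<String> sink = new ArrayList<>();

    /** Records one data flow from src (start point) to dst (end point). */
    public void flow(Object src, Object dst) {
        int srcId = System.identityHashCode(src);                       // S101
        int dstId = System.identityHashCode(dst);                       // S102
        String log = toJson(srcId, dstId, System.currentTimeMillis());  // S103
        sink.add(log);                                                  // S104
    }

    /** Builds the JSON text by hand; the values are numeric, so no escaping is needed. */
    public static String toJson(int srcId, int dstId, long timeMillis) {
        return "{\"src\":{\"type\":\"java_internal\",\"object_hash\":\"" + srcId + "\"},"
             + "\"dst\":{\"type\":\"java_internal\",\"object_hash\":\"" + dstId + "\"},"
             + "\"time\":\"" + timeMillis + "\"}";
    }

    /** Returns the logs recorded so far. */
    public List<String> logs() {
        return sink;
    }
}
```

Note that System.identityHashCode is not guaranteed to be unique across all live objects, so a sketch like this treats it as an identifier only in the loose sense used in the description above.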

The data flow log may include various information in addition to the above. For example, the data flow log may include the host name of the computer on which the Java program 1 is running, the process ID of the Java program 1, the thread ID of the Java program 1, the class name of the Java object at the start point or end point, the file name of the code that calls the flow method together with the line number within that file, or the name of the class and method that call the flow method.
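As a rough sketch of how such additional fields might be gathered with standard Java APIs (Java 9 or later), consider the following. The class name FlowContext and the map keys are hypothetical and not taken from the source, and the caller recorded here is the caller of capture(), whereas the description above refers to the caller of the flow method.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper that gathers optional context fields for a data flow log. */
public class FlowContext {
    public static Map<String, String> capture() {
        Map<String, String> ctx = new LinkedHashMap<>();
        try {
            ctx.put("host", InetAddress.getLocalHost().getHostName());
        } catch (UnknownHostException e) {
            ctx.put("host", "unknown"); // assumption: fall back instead of failing
        }
        ctx.put("pid", Long.toString(ProcessHandle.current().pid()));
        ctx.put("thread_id", Long.toString(Thread.currentThread().getId()));
        // Frame 0 is capture() itself, so skip it to reach the calling code.
        StackWalker.getInstance()
            .walk(frames -> frames.skip(1).findFirst())
            .ifPresent(frame -> {
                ctx.put("caller_class", frame.getClassName());
                ctx.put("caller_method", frame.getMethodName());
                ctx.put("caller_file", String.valueOf(frame.getFileName()));
                ctx.put("caller_line", Integer.toString(frame.getLineNumber()));
            });
        return ctx;
    }
}
```

Collecting the caller on every call adds overhead, so whether to include these fields is a trade-off against the low-load goal stated earlier.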

In this way, the Logger object 104 acquires, from the System class 103, the object identifier indicating the start point of the data flow and the object identifier indicating its end point. The Logger object 104 then generates a data flow log in which both object identifiers are recorded. The data flows inside the Java program can thus be monitored on the basis of the generated data flow logs.

(2) Data flow in which the Java program 1 reads data from the RDB 2
An example of a data flow in which the Java program 1 reads data from the RDB 2 is, as shown in FIG. 1, the Java object 102 acquiring data from the access destination table 202 of the RDB 2. Information about data flows between the Java program 1 and the RDB 2 is collected by the Logger hook 107.

The Logger hook 107 is an additional function attached to the JDBC driver 106. The Logger hook 107 can be implemented, for example, using the functionality of the java.lang.reflect.Proxy class. The Logger object 104 calls the wrap method, as in the code shown below, to wrap (attach) the Logger hook 107 around the JDBC driver 106 in advance.
connection = logger.wrap(connection);
Here, connection is an object of type java.sql.Connection, which is part of the JDBC driver 106. The wrap method attaches the Logger hook 107 to the given connection and returns the result as its return value. As a result, the Logger hook 107 is attached to connection, and all operations performed on connection and on objects generated from connection are monitored by the Logger hook 107.
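One way such a wrap method might be built on java.lang.reflect.Proxy is sketched below. The class name HookSketch, the generic signature of wrap, and the observed list are assumptions made for illustration; the source only states that the Proxy class can be used. Wrapping interface-typed return values is one possible way to keep objects generated from the wrapped object under observation.

```java
import java.lang.reflect.InvocationHandler;
import java.lang.reflect.InvocationTargetException;
import java.lang.reflect.Proxy;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of a hook built with dynamic proxies. */
public class HookSketch {
    /** Observed calls in order; a real hook would notify the Logger object instead. */
    public static final List<String> observed = new ArrayList<>();

    @SuppressWarnings("unchecked")
    public static <T> T wrap(Class<T> iface, T target) {
        InvocationHandler handler = (proxy, method, args) -> {
            observed.add(iface.getSimpleName() + "." + method.getName());
            Object result;
            try {
                result = method.invoke(target, args);
            } catch (InvocationTargetException e) {
                throw e.getCause(); // rethrow the exception thrown by the target itself
            }
            // Wrap interface-typed results as well, so that objects created from the
            // wrapped object (for example a Statement or a ResultSet) stay observed.
            Class<?> returnType = method.getReturnType();
            if (result != null && returnType.isInterface()) {
                return wrap((Class<Object>) returnType, result);
            }
            return result;
        };
        return (T) Proxy.newProxyInstance(
                iface.getClassLoader(), new Class<?>[] {iface}, handler);
    }
}
```

With this sketch, a call such as HookSketch.wrap(java.sql.Connection.class, connection) would return a proxy whose prepareStatement, executeQuery, and getObject calls are all recorded, because PreparedStatement and ResultSet are interfaces and are wrapped in turn.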

When data is read from the RDB 2 using the JDBC driver 106, a ResultSet object is mainly used. The Logger hook 107 detects data read operations on this ResultSet object. The Logger object 104, on being notified of the detection, generates a data flow log for the detected data read operation.

The following is an example of code that reads data using a ResultSet object. In the code below, data in the RDB 2 is read by the rs.getObject method on the ninth line, so the Logger hook 107 detects the data flow at the point when the rs.getObject method is called.
PreparedStatement ps = connection.prepareStatement(
    "SELECT signal_id, history_time, signal_value "
    + "FROM sample_schema.sample_table "
    + "WHERE signal_id = ? ORDER BY history_time ASC LIMIT 1"
);
ps.setInt(1, 1);
ResultSet rs = ps.executeQuery();
while (rs.next()) {
    Double value = (Double) rs.getObject("signal_value");
}

Note that in the above code, the Java object 102 that is the destination of the read data is not the ResultSet object itself but the object obtained as the return value of calling a data reading method such as getObject on the ResultSet object.

FIG. 4 is a flowchart of the data flow log generation process when data is read from the database. As described above, this flow starts when a data reading method such as getObject or getString is called on the ResultSet object.

The Logger hook 107 checks the auto-commit setting of the Connection object that generated the ResultSet object and, if auto-commit is enabled, disables it (S201). Auto-commit is disabled so that an accurate data flow log can be generated.

Auto-commit is disabled by calling the getAutoCommit method on the Connection object to check the current setting and then calling the setAutoCommit method. The reason for doing so is described later.

Next, when data is read from the RDB 2 by the Java object 102 (S202), the Logger hook 107 uses the ResultSet object to acquire information indicating the location (position) in the RDB 2 where the read data was stored (S203 to S205). The information indicating the location (position) in the RDB 2 consists of the database name, catalog name, schema name, table name, column name, and the names and values of the table's primary key. Here, this information is referred to as database related information. Because the names and values of the table's primary key are included, it is possible to identify the row of the access destination table 202 of the RDB 2 in which the data is stored.

First, the Logger hook 107 acquires the database name (S203). The database name can be obtained from the URL string that identifies the connected RDB 2. The URL string can be obtained by calling rs.getStatement().getConnection().getMetaData().getURL(). Here, rs is the ResultSet object, as shown on the seventh line of the above code.

Next, the Logger hook 107 acquires the catalog name, schema name, table name, and column name (S204). These names can be obtained by calling the getCatalogName, getSchemaName, getTableName, and getColumnName methods, respectively, on the ResultSetMetaData object returned by rs.getMetaData().

Next, the Logger hook 107 acquires the names of the table's primary key columns (S205). The primary key names can be obtained by passing the acquired catalog name, schema name, and table name as arguments to rs.getStatement().getConnection().getMetaData().getPrimaryKeys(). The Logger hook 107 then acquires, from the data contained in the ResultSet object rs, the values of the columns whose names are part of the primary key, and stores them as pairs of primary key name and value (S206).

Note that the methods executed in S203 to S205 ultimately query the system catalog 203 of the RDB 2 via the ResultSet object.
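The metadata lookups of S203 to S206 can be sketched with the standard JDBC metadata interfaces as follows. The class name DbLocation and the map keys are hypothetical. The sketch also assumes that the primary key columns are part of the query's select list (as signal_id and history_time are in the sample query), and it stores the connection URL as is rather than extracting a database name from it.

```java
import java.sql.DatabaseMetaData;
import java.sql.ResultSet;
import java.sql.ResultSetMetaData;
import java.sql.SQLException;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper that collects database related information for one column. */
public class DbLocation {
    /** Describes where the value in the given column (1-based) of the current row lives. */
    public static Map<String, Object> describe(ResultSet rs, int column) throws SQLException {
        Map<String, Object> info = new LinkedHashMap<>();
        DatabaseMetaData dbMeta = rs.getStatement().getConnection().getMetaData();
        info.put("url", dbMeta.getURL());                              // S203

        ResultSetMetaData rsMeta = rs.getMetaData();                   // S204
        String catalog = rsMeta.getCatalogName(column);
        String schema = rsMeta.getSchemaName(column);
        String table = rsMeta.getTableName(column);
        info.put("catalog", catalog);
        info.put("schema", schema);
        info.put("table", table);
        info.put("column", rsMeta.getColumnName(column));

        Map<String, Object> primaryKey = new LinkedHashMap<>();
        try (ResultSet keys = dbMeta.getPrimaryKeys(catalog, schema, table)) {  // S205
            while (keys.next()) {
                String keyColumn = keys.getString("COLUMN_NAME");
                primaryKey.put(keyColumn, rs.getObject(keyColumn));    // S206
            }
        }
        info.put("primary_key", primaryKey);
        return info;
    }
}
```

How much of this metadata a given JDBC driver actually reports varies; some drivers return empty strings for the catalog or schema name, so a practical hook would need to tolerate missing values.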

The database-related information acquired by the Logger hook 107 is passed to the Logger object 104. If the Logger hook 107 had disabled the auto-commit setting of the Connection, it re-enables the setting (S207). The Logger object 104 acquires, from the System class 103, the object identifier of the Java object 102 into which the acquired data is read (S208).

The Logger object 104 generates a data flow log using the database-related information acquired from the Logger hook 107 and the object identifier acquired from the System class 103 (S209). The Logger object 104 then transmits the generated data flow log to the log receiving unit 3 (S210). This completes the flow of data flow log generation when data is read from the RDB 2.

The Logger hook 107 disables the auto-commit setting of the Connection so that the reading of data from the ResultSet object and the subsequent query from the Logger hook 107 to the system catalog 203 are executed in the same transaction. If these operations were performed in separate transactions, a problem could arise when another program issues a command to the RDB 2 immediately after the data is read from the ResultSet. For example, if another program renames the table and the system catalog 203 is read afterwards, the information obtained no longer correctly describes the data read from the ResultSet. By manipulating the auto-commit setting so that these operations run in the same transaction, this problem is avoided and correct information can always be obtained from the system catalog 203.
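The auto-commit handling described above might look like the following minimal sketch. The class name AutoCommitGuard, the interface SqlWork, and the method runInOneTransaction are hypothetical names introduced for illustration; only the standard Connection.getAutoCommit and Connection.setAutoCommit methods are used.

```java
import java.sql.Connection;
import java.sql.SQLException;

// Hypothetical helper: runs a body with auto-commit disabled and restores
// the previous setting afterwards.
final class AutoCommitGuard {
    /** Work that may throw SQLException, e.g. reading a row and then querying the system catalog. */
    interface SqlWork<T> {
        T run(Connection connection) throws SQLException;
    }

    static <T> T runInOneTransaction(Connection connection, SqlWork<T> work) throws SQLException {
        boolean wasAutoCommit = connection.getAutoCommit();
        if (wasAutoCommit) {
            // Disable only if it was enabled, so that the data read and the
            // catalog queries share one transaction.
            connection.setAutoCommit(false);
        }
        try {
            return work.run(connection);
        } finally {
            if (wasAutoCommit) {
                // Re-enable only if this helper disabled it (corresponds to S207).
                connection.setAutoCommit(true);
            }
        }
    }
}
```

Restoring the setting in a finally block keeps the Connection in its original mode even if the body throws. Note that in JDBC, changing the auto-commit mode during a transaction commits that transaction.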

Although the auto-commit setting of the Connection is manipulated above, the transaction isolation level may likewise be changed temporarily using the getTransactionIsolation and setTransactionIsolation methods. For example, temporarily raising the transaction isolation level to TRANSACTION_SERIALIZABLE, the strongest isolation level, makes it more certain that a correct data flow log is obtained than when the level is left unchanged.

The following is an example of a generated data flow log. It is the data flow log produced when the Java object 102 reads the signal_value column (value 100.4) of the second row from the top (excluding the column name row) of the access destination table 202 shown in FIG. 2.
{
  "src": {
    "type": "rdb",
    "db_name": "sample_db",
    "db_catalog": "",
    "db_schema": "sample_schema",
    "db_table": "sample_table",
    "db_pk_signal_id": "1",
    "db_pk_history_time": "2015-09-07 10:05:48.392+09:00",
    "db_column": "signal_value"
  },
  "dst": {
    "type": "java_internal",
    "object_hash": "337422011"
  },
  "time": "1439172160332"
}

The src field, which starts on the second line of the data flow log, contains information about the start point of the data flow. Here the start point is inside the RDB 2, and the src field contains database-related information such as where the data is stored in the RDB 2 (catalog name, schema name, table name, and column name).

The value "rdb" of the type field inside the src field denotes a database. The db_name, db_catalog, db_schema, db_table, and db_column fields hold the database name, catalog name, schema name, table name, and column name, respectively. For example, the database name is sample_db.

The db_pk_signal_id and db_pk_history_time fields are the pair that makes up the primary key of sample_table. Because the primary key is recorded in the log, the row of the access destination table 202 in which the data was stored can be identified. The time field is the time at which this data flow log was generated.

The dst field, which starts on the 12th line of the data flow log, contains information about the end point of the data flow. Here the end point is the Java object 102. The fields inside the dst field are the same as those described above for data flows inside the Java program 1.

The data flow log may contain various other information in addition to the above. For example, the IP address, listening port number, or product name and version number of the RDB 2 may be included. The transaction ID used when accessing the RDB 2, the row ID that the RDB 2 assigns to the row being read, and the values of columns other than the primary key in that row may also be included.
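As an illustration of the log layout described above, the following sketch assembles a read-direction log (src of type rdb, dst of type java_internal) as a JSON string. The class and method names are hypothetical, and the use of System.identityHashCode as the object identifier is an assumption suggested by the object_hash field name rather than something stated in this passage.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical formatter for the read-direction log (src = rdb, dst = java_internal).
final class DataFlowLogSketch {
    // Renders a flat map as a JSON object whose values are all strings.
    static String toJson(Map<String, String> fields) {
        return fields.entrySet().stream()
                .map(e -> quote(e.getKey()) + ":" + quote(e.getValue()))
                .collect(Collectors.joining(",", "{", "}"));
    }

    // Minimal escaping (backslash and double quote only); enough for this sketch.
    static String quote(String s) {
        return "\"" + s.replace("\\", "\\\\").replace("\"", "\\\"") + "\"";
    }

    static String readLog(Map<String, String> rdbSource, Object target, long timeMillis) {
        Map<String, String> dst = new LinkedHashMap<>();
        dst.put("type", "java_internal");
        // Assumption: the object identifier is the identity hash code.
        dst.put("object_hash", Integer.toString(System.identityHashCode(target)));
        return "{\"src\":" + toJson(rdbSource)
                + ",\"dst\":" + toJson(dst)
                + ",\"time\":" + quote(Long.toString(timeMillis)) + "}";
    }
}
```

A LinkedHashMap is used so that the fields appear in insertion order, which keeps the output close to the listing above. A production implementation would more likely use a JSON library.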

In S206 shown in FIG. 4, the Logger hook 107 obtains the primary key values from the ResultSet object. In general, however, the ResultSet object does not necessarily contain all primary key columns of the table being read. To guarantee that the ResultSet object contains the primary key values, when the Java program 1 issues a data read command (a SELECT statement in SQL) to the RDB 2, the Logger hook 107 may capture and analyze the command, rewrite it so that the primary key of the target table is read as well, and then send the rewritten command to the RDB 2. The read command can be captured by monitoring method calls on JDBC classes such as Connection, Statement, and PreparedStatement. This makes it possible to record an accurate data flow log regardless of which data the Java program 1 reads from the RDB 2.

In this way, the Logger object 104 acquires from the System class 103 the object identifier of the Java object 102, which indicates the end point of the data flow, and the Logger hook 107 queries the RDB 2 to acquire the database-related information that indicates the start point of the data flow. The Logger object 104 then generates a data flow log in which the information indicating the start point and the information indicating the end point are recorded. Based on the generated data flow log, the data flow in which the Java program 1 reads data from the RDB 2 can be monitored.

(3) Data flow in which data is written from the Java program 1 to the RDB 2
When data is written from the Java program 1 to the RDB 2, information about the data flow is collected by the Logger hook 107, just as when the Java program 1 reads data from the RDB 2.

When data is written to the RDB 2 using the JDBC driver 106, a PreparedStatement object is mainly used. The Logger hook 107 detects data write operations performed through this PreparedStatement object.

The following is an example of code that writes data using a PreparedStatement object.
PreparedStatement ps = connection.prepareStatement(
    "UPDATE sample_schema.sample_table "
    + "SET signal_value = ? "
    + "WHERE signal_id = ?"
);
ps.setObject(1, write_value);
ps.setInt(2, 2);
ps.execute();

The write code shown above consists of three steps. First, as shown in the first line, (i) connection.prepareStatement creates a PreparedStatement object that holds the SQL command for the RDB 2. In the above code, the second to fourth lines are the SQL command. Next, as shown in the sixth and seventh lines, (ii) the Java object 102 and a parameter are bound to the PreparedStatement. Finally, as shown in the eighth line, (iii) ps.execute writes the data to the RDB 2.

Step (ii) above associates (binds) the Java object 102 with a placeholder in the SQL command issued to the RDB 2. This allows the Java program 1 to write data to the RDB 2.

Note that in the above code, the Java object 102 that is the source of the written data is not the PreparedStatement object itself but the object bound to the PreparedStatement object.

Data flow log generation for the above code is divided into three processes. First, when step (i) is performed, the Logger hook 107 stores information about the data write in the bind temporary storage unit 105. Next, when step (ii) is performed, the Logger hook 107 acquires the identifier of the bound Java object 102 and associates it with the data stored in the bind temporary storage unit 105. Then, when step (iii) is performed, the Logger hook 107 acquires the primary key information needed to identify the row of the access destination table 202 to which the data was written. Finally, the Logger object 104 generates the data flow log.

FIG. 5 is a diagram showing an example of the contents stored in the bind temporary storage unit 105. The bind temporary storage unit 105 has a data structure that manages four kinds of data in association with one another: the PreparedStatement object, the information on the access destination table 202 in the RDB 2 (catalog name, schema name, table name, and column name), the placeholder index, and the bind object.

The PreparedStatement object identifier is the identifier of the PreparedStatement object. The placeholder index is the ordinal position at which the placeholder appears in the SQL command. The bind object identifier is the identifier of the bound Java object 102.

In general, when data is written from the Java program 1 to the RDB 2, the SQL command contains the name of the destination column and a placeholder that stands for the data to be written. For example, in the above code, the string "signal_value = ?" is the pair of the column name "signal_value" and the placeholder "?".

The placeholder paired with signal_value is the first placeholder in the SQL command, so its index is 1. Accordingly, 1 is stored as the placeholder index in the first row of FIG. 5. In general, a single SQL command may contain multiple pairs of column name and placeholder index. In that case, a separate row is created in the bind temporary storage unit 105 for each pair.

The bind temporary storage unit 105 may be allocated in the memory of the Java program 1, in the RDB 2, or in any other storage area. Each item of data (each row) stored in the bind temporary storage unit 105 may be deleted once the corresponding PreparedStatement object is no longer used.

FIG. 6 is a flowchart of the process of recording data in the bind temporary storage unit 105. Recording is performed in two stages: the processing performed when the connection.prepareStatement method of (i) is executed, and the processing performed when the Java object 102 is bound in (ii). FIG. 6(A) shows the flow of the former, and FIG. 6(B) shows the flow of the latter.

First, the processing performed when the prepareStatement method is executed, shown in FIG. 6(A), is described. This flow starts when the Java program 1 calls the prepareStatement method on the Connection object. The Logger hook 107 acquires the PreparedStatement object that is the return value of the prepareStatement method (S301). Next, the Logger hook 107 parses the SQL command string and obtains the catalog name, schema name, and table name of the write destination, together with the pairs of placeholder index and corresponding column name (S302). This yields all the information in the data structure of the bind temporary storage unit 105 except the bind object identifier. The Logger hook 107 records this information in the bind temporary storage unit 105 (S303).
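The SQL analysis in S302 is not specified in detail. As a rough illustration only, the following sketch extracts the target table and the placeholder-index-to-column pairs from a simple UPDATE statement using regular expressions. The class and method names are hypothetical, and the approach is deliberately naive: it ignores string literals, comments, INSERT statements, and subqueries, which a real implementation would need a proper SQL parser to handle.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical, simplified stand-in for the SQL analysis in S302.
final class UpdatePlaceholderParser {
    // Matches either "column = ?" (group 1 is the column) or a bare "?".
    private static final Pattern PLACEHOLDER = Pattern.compile("(\\w+)\\s*=\\s*\\?|\\?");
    private static final Pattern WHERE = Pattern.compile("\\bWHERE\\b", Pattern.CASE_INSENSITIVE);
    private static final Pattern TARGET =
            Pattern.compile("^\\s*UPDATE\\s+([\\w.]+)", Pattern.CASE_INSENSITIVE);

    /** Returns the (possibly schema-qualified) table name, or null if this is not an UPDATE. */
    static String targetTable(String sql) {
        Matcher m = TARGET.matcher(sql);
        return m.find() ? m.group(1) : null;
    }

    /** Maps each placeholder index in the SET clause to the column it is assigned to. */
    static Map<Integer, String> writtenColumns(String sql) {
        Matcher where = WHERE.matcher(sql);
        int whereStart = where.find() ? where.start() : sql.length();

        Map<Integer, String> result = new LinkedHashMap<>();
        Matcher m = PLACEHOLDER.matcher(sql);
        int index = 0;
        while (m.find()) {
            index++; // JDBC placeholder indexes start at 1 and count every "?"
            if (m.group(1) != null && m.start() < whereStart) {
                result.put(index, m.group(1));
            }
        }
        return result;
    }
}
```

For the UPDATE statement in the example above, this sketch yields the table sample_schema.sample_table and the single pair (1, signal_value). The placeholder in the WHERE clause still counts toward the index but is not treated as a write target.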

Note that the PreparedStatement object identifier can be obtained from the System class 103.

Next, the processing performed when the Java object 102 is bound, shown in FIG. 6(B), is described. This flow starts when the Java program 1 calls a method such as setObject on the PreparedStatement object. The Logger hook 107 acquires the placeholder index and the Java object 102 to be bound, both of which are passed as arguments to the method, as well as the object identifier of the PreparedStatement object on which the method was called (S401). The Logger hook 107 searches the rows of the bind temporary storage unit 105 using the placeholder index and the PreparedStatement object identifier (S402). If a row with the same placeholder index and PreparedStatement object identifier exists (YES in S403), the Logger hook 107 records the object identifier of the Java object 102 being bound as the bind object identifier of that row (S404). If no such row exists (NO in S403), the Logger hook 107 does nothing and the flow ends. These two procedures complete the recording in the bind temporary storage unit 105.
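The two recording stages (S303 and S401 to S404) can be pictured with a small in-memory structure such as the one below. This is a minimal sketch under stated assumptions: the class BindTemporaryStore and its methods are hypothetical, the table is reduced to a single string, and object identifiers are taken from System.identityHashCode, which is an assumption about what the System class 103 provides.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory version of the bind temporary storage unit 105.
final class BindTemporaryStore {
    static final class Row {
        final String table;
        final String column;
        Integer bindObjectId; // stays null until an object is bound

        Row(String table, String column) {
            this.table = table;
            this.column = column;
        }
    }

    // Key: (PreparedStatement object identifier, placeholder index).
    private final Map<List<Integer>, Row> rows = new HashMap<>();

    // S303: register one row per (placeholder index, column name) pair.
    void register(int statementId, int placeholderIndex, String table, String column) {
        rows.put(Arrays.asList(statementId, placeholderIndex), new Row(table, column));
    }

    // S401 to S404: record the bound object if a matching row exists.
    boolean recordBinding(int statementId, int placeholderIndex, Object bound) {
        Row row = find(statementId, placeholderIndex);     // S402
        if (row == null) {
            return false;                                  // S403: NO, do nothing
        }
        row.bindObjectId = System.identityHashCode(bound); // S404
        return true;
    }

    Row find(int statementId, int placeholderIndex) {
        return rows.get(Arrays.asList(statementId, placeholderIndex));
    }
}
```

Keying the map by the pair of statement identifier and placeholder index turns the search in S402 into a single lookup. Identity hash codes are not guaranteed to be unique, so a real implementation might need a stronger identifier.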

FIG. 7 is a flowchart of the data flow log generation processing performed when data is written to the database. As described above, this flow starts when, in step (iii), a method that executes the write, such as execute or executeUpdate, is called on the PreparedStatement object.

First, the Logger hook 107 temporarily disables the auto-commit setting of the Connection object (S501). This is done by the same method and for the same reason as in the data flow log generation processing for reading data from the RDB 2 shown in FIG. 4. The Logger hook 107 also clears the entire contents of the temporary log table 204 (S502).

Then, when the data is written to the RDB 2 (S503), a trigger set on the access destination table 202 records information about the write destination in the temporary log table 204 (S504).

In this embodiment, a trigger is set on the access destination table 202 in advance. A "trigger" is a procedure that is executed inside the RDB 2 when some operation is performed on a table. Here, the trigger is configured so that, when a data write operation is executed on the access destination table 202, it records information about the write destination in the temporary log table 204. Many RDB products provide a function for defining and executing triggers, and that function may be used.

FIG. 8 is a diagram showing an example of the contents of the temporary log table 204. The temporary log table 204 stores the catalog name, schema name, and table name of the table on which the write operation was actually performed, the name and value of the primary key of the written row, and the name of the written column. If a single write operation writes to multiple tables, multiple rows, or multiple columns, a corresponding number of rows is written to the temporary log table 204. Information such as the name of the written column is read from the system catalog 203 as needed and written to the temporary log table 204.

Returning to the flow in FIG. 7: after the trigger writes the destination database information to the temporary log table 204, the Logger hook 107 reads the contents of the temporary log table 204 from the RDB 2 and transfers them to the Logger object 104 (S505). The Logger hook 107 then restores the auto-commit setting of the Connection (S506).

The Logger object 104 acquires the contents of the temporary log table 204 from the Logger hook 107 (S507), searches the bind temporary storage unit 105 based on those contents, and acquires the bind object identifier (S508).
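The search in S508 can be thought of as a join between the rows of the temporary log table 204 and the rows of the bind temporary storage unit 105. The following sketch shows one way this might work, assuming the match is made on table name and column name. The text does not state the exact matching key, and all class and field names here are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of S508: match temporary log rows against bind rows.
final class WriteFlowMatcher {
    static final class TempLogRow {
        final String table, column, pkName, pkValue;

        TempLogRow(String table, String column, String pkName, String pkValue) {
            this.table = table;
            this.column = column;
            this.pkName = pkName;
            this.pkValue = pkValue;
        }
    }

    static final class BindRow {
        final String table, column;
        final int bindObjectId;

        BindRow(String table, String column, int bindObjectId) {
            this.table = table;
            this.column = column;
            this.bindObjectId = bindObjectId;
        }
    }

    /** One data flow: the bound Java object (start point) and the written cell (end point). */
    static final class WriteFlow {
        final int srcObjectId;
        final TempLogRow dst;

        WriteFlow(int srcObjectId, TempLogRow dst) {
            this.srcObjectId = srcObjectId;
            this.dst = dst;
        }
    }

    // Assumption: rows are matched on table name and column name.
    static List<WriteFlow> match(List<TempLogRow> tempLog, List<BindRow> binds) {
        List<WriteFlow> flows = new ArrayList<>();
        for (TempLogRow written : tempLog) {
            for (BindRow bind : binds) {
                if (bind.table.equals(written.table) && bind.column.equals(written.column)) {
                    flows.add(new WriteFlow(bind.bindObjectId, written));
                }
            }
        }
        return flows;
    }
}
```

Each resulting WriteFlow corresponds to one combination of data from which a data flow log could be generated. If one UPDATE modifies several rows, the same bound object appears as the start point of several flows.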

Then, for each combination of database name, catalog name, schema name, table name, primary key name and value, column name, and bind object identifier, the Logger object 104 generates a data flow log such as the one shown below (S509). If multiple combinations are obtained, a data flow log is generated for each of them.
{
  "src": {
    "type": "java_internal",
    "object_hash": "337422011"
  },
  "dst": {
    "type": "rdb",
    "db_name": "sample_db",
    "db_catalog": "",
    "db_schema": "sample_schema",
    "db_table": "sample_table",
    "db_pk_signal_id": "2",
    "db_pk_history_time": "2015-09-07 10:10:53.334+09:00",
    "db_column": "signal_value"
  },
  "time": "1439172160332"
}
The object_hash field inside the src field of the above data flow log holds the bind object identifier. The fields inside the dst field describe the write destination in the RDB 2 and are the same as those in the data flow log for (2), in which the Java program 1 reads data from the RDB 2. The time field is the time at which this data flow log was generated.

The Logger object 104 transmits the generated data flow log to the log receiving unit 3 (S510). This completes the flow of the data flow reporting process.

The data flow reporting process above uses the "name and value of the primary key", but the row ID that the RDB 2 uses internally to identify each row may be used instead. The data flow log may also include various other related information.

In this way, the Logger hook 107 acquires from the System class 103 the bind object identifier that indicates the start point of the data flow. The Logger hook 107 also acquires the database-related information that indicates the end point of the data flow, based on the information in the bind temporary storage unit 105 and the information in the temporary log table 204. The Logger object 104 then generates a data flow log in which the information indicating the start point and the information indicating the end point are recorded. Based on the generated data flow log, the data flow in which the Java program 1 writes data to the RDB 2 can be monitored.

As described above, according to this embodiment, (1) data flows inside the Java program 1, (2) data flows in which the Java program 1 reads data from the RDB 2, and (3) data flows in which data is written from the Java program 1 to the RDB 2 can all be recorded in a unified format. This makes it possible to record data flows that cross the Java program 1 and the RDB 2 in detail and comprehensively. For example, it is possible to record a data flow in which data is read from the database, transformed inside a Java program, written back to the database, and then read by yet another Java program. In addition, this embodiment does not require metadata such as tags to be attached to the data itself, so it is easy to implement.

(Second Embodiment)
The first embodiment described a method of recording data flows in a system composed of the Java program 1 and the RDB 2. This embodiment describes a method of recording data flows in a system composed of the Java program 1 and a file service 4, and shows that data flows can be traced even when the second component is the file service 4. Descriptions that overlap with the first embodiment are omitted.

FIG. 9 is a block diagram showing an example of the schematic configuration of the entire computer system according to the second embodiment. This computer system differs from that of the first embodiment in that the Java program 1 accesses the file service 4. Accordingly, the Java program 1 has a RandomAccessFile object 108 in place of the JDBC driver 106, and an access destination file name temporary storage unit 109 in place of the bind temporary storage unit 105.

The file service 4 has an access destination file 401, a file service interface (I/F) 402, and a file pointer management unit 403. The file service 4 exchanges files, and the data inside those files, with the Java program 1.

The access destination file 401 holds data, which the Java program 1 reads and writes. The file service I/F 402 is the interface used to exchange files and data with the Java program 1.

The file pointer management unit 403 manages the file pointer. The file pointer is information indicating the location (position) in the access destination file 401 at which data is currently being read or written. It is expressed as the number of bytes from the beginning of the file.

Meanwhile, the RandomAccessFile object 108 of the Java program 1 is a facility provided by the Java standard library. The Java program 1 uses the RandomAccessFile object 108 to exchange data with the file system I/F. The Logger object 104 attaches the Logger hook 107 to the RandomAccessFile object 108 and monitors the data flows that pass through the RandomAccessFile object 108.

The access destination file name temporary storage unit 109 temporarily stores the name of the file that the RandomAccessFile object 108 accesses.

The computer system according to the second embodiment assumes and monitors the following three types of data flow.
(1) Data flows inside the Java program 1
(2) Data flows in which the Java program 1 reads data from a file
(3) Data flows in which data is written from the Java program 1 to a file
The data flows inside the Java program 1 in (1) are the same as in the first embodiment, so their description is omitted.

To record the data flows between the Java program 1 and the file service 4, the Java program 1 creates, through the Logger object 104, a RandomAccessFile object 108 to which the Logger hook 107 is attached. The RandomAccessFile object 108 is created, for example, by calling a method such as openFile on the Logger object 104. The following is an example of code that calls the openFile method.
RandomAccessFile file = logger.openFile("/home/user/sample.dat", "rw");

The openFile method creates the RandomAccessFile object 108 using the arguments it is given. In the above code, "/home/user/sample.dat" and "rw" are the arguments. The openFile method then attaches the Logger hook 107 to the RandomAccessFile object 108. At that time, the Logger object 104 also stores information such as the name of the accessed file in the access destination file name temporary storage unit 109.

図１０は、アクセス先ファイル名一時保存部１０９に保存された情報の一例を示す図である。Ｌｏｇｇｅｒオブジェクト１０４は、ｌｏｇｇｅｒ．ｏｐｅｎＦｉｌｅメソッドの第１引数と、ＲａｎｄｏｍＡｃｃｅｓｓＦｉｌｅオブジェクト１０８のオブジェクト識別子を、アクセス先ファイル名一時保存部１０９に格納する。上記のコードでは、“／ｈｏｍｅ／ｕｓｅｒ／ｓａｍｐｌｅ．ｄａｔ”がアクセス先ファイル４０１名として保存される。これにより、ＲａｎｄｏｍＡｃｃｅｓｓＦｉｌｅオブジェクト１０８と、ＲａｎｄｏｍＡｃｃｅｓｓＦｉｌｅオブジェクト１０８がアクセスしているファイルの名前とが関連付けられる。なお、アクセス先ファイル名一時保存部１０９に保存された情報は、当該ＲａｎｄｏｍＡｃｃｅｓｓＦｉｌｅオブジェクト１０８が用いられなくなった時点で削除してもよい。 FIG. 10 is a diagram illustrating an example of information stored in the access destination file name temporary storage unit 109. The Logger object 104 is a logger. The first argument of the openFile method and the object identifier of the RandomAccessFile object 108 are stored in the access destination file name temporary storage unit 109. In the above code, “/home/user/sample.dat” is stored as the name of the access destination file 401. Thereby, the Random Access File object 108 is associated with the name of the file accessed by the Random Access File object 108. The information stored in the access destination file name temporary storage unit 109 may be deleted when the Random Access File object 108 is no longer used.
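The association kept in the access destination file name temporary storage unit 109 can be sketched as follows. This is only an illustration under assumptions: the class name AccessFileNameStore and the method pathOf are hypothetical, the RandomAccessFile object itself is used as the key in place of a separate object identifier, and weak keys are one possible way to let an entry be dropped once the file object is no longer used.

```java
import java.io.File;
import java.io.IOException;
import java.io.RandomAccessFile;
import java.util.Collections;
import java.util.Map;
import java.util.WeakHashMap;

// Hypothetical stand-in for the access destination file name temporary storage unit 109.
final class AccessFileNameStore {
    // RandomAccessFile does not override equals/hashCode, so keys compare by identity.
    // Weak keys allow an entry to be discarded once the file object is unreachable.
    private final Map<RandomAccessFile, String> pathByFile =
            Collections.synchronizedMap(new WeakHashMap<>());

    // Mirrors logger.openFile(path, mode): open the file, then remember its path.
    RandomAccessFile openFile(String path, String mode) throws IOException {
        RandomAccessFile file = new RandomAccessFile(path, mode);
        pathByFile.put(file, path);
        return file;
    }

    // Lookup used later to recover the file name from the file object.
    String pathOf(RandomAccessFile file) {
        return pathByFile.get(file);
    }

    public static void main(String[] args) throws IOException {
        File tmp = File.createTempFile("sample", ".dat");
        tmp.deleteOnExit();
        AccessFileNameStore store = new AccessFileNameStore();
        try (RandomAccessFile file = store.openFile(tmp.getPath(), "rw")) {
            System.out.println(store.pathOf(file)); // prints the temporary file's path
        }
    }
}
```

Keying on the object rather than on a numeric hash avoids collisions between two distinct file objects that happen to share an identity hash value.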

When data is read using the RandomAccessFile object 108, the data is read into the Java object 102 by calling a method such as read or readLine on the RandomAccessFile object 108.

(2) Data flow in which the Java program 1 reads data from a file
FIG. 11 is a flowchart of the data flow log generation processing performed when data is read from a file. This flow starts when a data reading method of the RandomAccessFile object 108 is called via the Logger hook 107.

In order to obtain the location in the file from which the data is read, the Logger hook 107 acquires the file pointer from the file pointer management unit 403 before the data is read (S601). After the data is actually read from the file (S602), the Logger hook 107 acquires the file pointer from the file pointer management unit 403 again (S603). The file pointers before and after the read are thereby obtained.

The Logger hook 107 can acquire the file pointer from the file pointer management unit 403 by calling the getFilePointer method of the RandomAccessFile object 108.

The Logger hook 107 transfers the RandomAccessFile object 108, the Java object 102 into which the data was read, the file pointer before reading, and the file pointer after reading to the Logger object 104 (S604). On receiving this information from the Logger hook 107, the Logger object 104 searches the access destination file name temporary storage unit 109 using the RandomAccessFile object 108 and acquires the file name (S605). The Logger object 104 then acquires the object identifier of the Java object 102, which is the data reading destination, from the System class 103 (S606).

The Logger object 104 generates a data flow log from the acquired name of the access destination file 401, the file pointer before reading, the file pointer after reading, and the object identifier of the reading destination Java object 102 (S607). The Logger object 104 transmits the generated data flow log to the log receiving unit 3 (S608). This completes the data flow log generation processing for reading data from a file.
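Steps S601 to S607 can be sketched in Java as follows. This is a minimal illustration, not the implementation of the embodiment: the class ReadFlowSketch and the method readAndLog are hypothetical, the file name lookup of S605 is replaced by a path passed in as an argument, the object identifier is assumed to come from System.identityHashCode, and the log is assembled by simple string concatenation without JSON escaping.

```java
import java.io.File;
import java.io.IOException;
import java.io.RandomAccessFile;

// Hypothetical sketch of the read-side hook and log generation.
final class ReadFlowSketch {
    // Fills dst from the file and returns a data flow log entry for the bytes consumed.
    static String readAndLog(RandomAccessFile file, String path, byte[] dst) throws IOException {
        long before = file.getFilePointer();            // S601: pointer before the read
        file.readFully(dst);                            // S602: the actual read
        long after = file.getFilePointer();             // S603: pointer after the read
        int objectHash = System.identityHashCode(dst);  // S606: identifier of the destination object
        // S607: assemble the log entry (path is not escaped; acceptable only for a sketch)
        return "{\"src\":{\"type\":\"file\",\"path\":\"" + path + "\","
                + "\"from\":\"" + before + "\",\"to\":\"" + after + "\"},"
                + "\"dst\":{\"type\":\"java_internal\",\"object_hash\":\"" + objectHash + "\"},"
                + "\"time\":\"" + System.currentTimeMillis() + "\"}";
    }

    public static void main(String[] args) throws IOException {
        File tmp = File.createTempFile("sample", ".dat");
        tmp.deleteOnExit();
        try (RandomAccessFile file = new RandomAccessFile(tmp, "rw")) {
            file.write(new byte[40]); // filler so that there is something to read
            file.seek(10);
            // Reading 22 bytes from offset 10 gives from=10 and to=32.
            System.out.println(readAndLog(file, tmp.getPath(), new byte[22]));
        }
    }
}
```

Taking the pointer on both sides of the call means the hook does not need to know how many bytes a particular read method consumes.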

The following is an example of a data flow log generated when data is read from a file.
{
  "src": {
    "type": "file",
    "path": "/home/user/sample.dat",
    "from": "10",
    "to": "32"
  },
  "dst": {
    "type": "java_internal",
    "object_hash": "337422011"
  },
  "time": "1439172160332"
}
The src field represents the data in the access destination file 401, and the dst field represents the Java object 102 into which the data is read. The value "file" of the type field in the src field indicates that the src field represents data in a file. The path field in the src field is the name of the access destination file 401. The from field in the src field is the file pointer before reading, and the to field in the src field is the file pointer after reading.

In this way, the Logger hook 107 acquires, from the file pointer management unit 403, the file pointers indicating the start point of the data flow. The Logger object 104 acquires, from the System class 103, the object identifier indicating the end point of the data flow. The Logger object 104 then generates a data flow log in which the information indicating the start point and the information indicating the end point are recorded. Based on the generated data flow log, the data flow in which the Java program 1 reads file data from the file service 4 can be monitored.

(3) Data flow in which the Java program 1 writes data to a file
FIG. 12 is a flowchart of the data flow log generation processing performed when data is written to a file.

In order to obtain the location in the file to which the data is written, the Logger hook 107 acquires the file pointer from the file pointer management unit 403 before the data is written (S701). After the data is actually written to the file (S702), the Logger hook 107 acquires the file pointer from the file pointer management unit 403 again (S703). The file pointers before and after the write are thereby obtained.

When data is written using the RandomAccessFile object, the data is written to the file by calling a method such as write or writeUTF on the RandomAccessFile object.

The Logger hook 107 transfers the RandomAccessFile object 108, the Java object 102 from which the data was written, the file pointer before writing, and the file pointer after writing to the Logger object 104 (S704). On receiving this information from the Logger hook 107, the Logger object 104 searches the access destination file name temporary storage unit 109 using the RandomAccessFile object 108 and acquires the name of the access destination file 401 (S705). The Logger object 104 then acquires the object identifier of the Java object 102, which is the data writing source, from the System class 103 (S706).

The Logger object 104 generates a data flow log from the acquired name of the access destination file 401, the file pointer before writing, the file pointer after writing, and the object identifier of the writing source Java object 102 (S707). The Logger object 104 transmits the generated data flow log to the log receiving unit 3 (S708). This completes the data flow log generation processing for writing data to a file.
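The write-side steps S701 to S707 mirror the read side with src and dst swapped. The sketch below is illustrative only: WriteFlowSketch and writeAndLog are hypothetical names, the file name is passed in rather than looked up (S705), the object identifier is assumed to come from System.identityHashCode, and no JSON escaping is performed.

```java
import java.io.File;
import java.io.IOException;
import java.io.RandomAccessFile;

// Hypothetical sketch of the write-side hook and log generation.
final class WriteFlowSketch {
    // Writes src to the file and returns a data flow log entry for the bytes written.
    static String writeAndLog(RandomAccessFile file, String path, byte[] src) throws IOException {
        long before = file.getFilePointer();            // S701: pointer before the write
        file.write(src);                                // S702: the actual write
        long after = file.getFilePointer();             // S703: pointer after the write
        int objectHash = System.identityHashCode(src);  // S706: identifier of the source object
        // S707: the Java object is the start point, the file range is the end point
        return "{\"src\":{\"type\":\"java_internal\",\"object_hash\":\"" + objectHash + "\"},"
                + "\"dst\":{\"type\":\"file\",\"path\":\"" + path + "\","
                + "\"from\":\"" + before + "\",\"to\":\"" + after + "\"},"
                + "\"time\":\"" + System.currentTimeMillis() + "\"}";
    }

    public static void main(String[] args) throws IOException {
        File tmp = File.createTempFile("sample", ".dat");
        tmp.deleteOnExit();
        try (RandomAccessFile file = new RandomAccessFile(tmp, "rw")) {
            file.seek(3);
            // Writing 32 bytes at offset 3 gives from=3 and to=35.
            System.out.println(writeAndLog(file, tmp.getPath(), new byte[32]));
        }
    }
}
```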

The following is an example of a data flow log generated when data is written to a file.
{
  "src": {
    "type": "java_internal",
    "object_hash": "337422011"
  },
  "dst": {
    "type": "file",
    "path": "/home/user/sample.dat",
    "from": "3",
    "to": "35"
  },
  "time": "1439172160332"
}
The src field represents the Java object 102 that is the writing source, and the dst field represents the data in the access destination file 401. The remaining fields are the same as in the data flow log for reading data from a file.

As shown in the above data flow log, the from field and the to field of the dst field are expressed as byte offsets. Consequently, even when a plurality of Java objects write to a single file, the location of each piece of data within the file can be distinguished.
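One way a monitoring side might use these byte offsets is to look up which Java object wrote a given position in the file. The sketch below is an assumption made for illustration and is not described in the embodiment: the class ByteRangeIndex is hypothetical, and each entry is treated as the half-open range from the pointer before the write up to, but not including, the pointer after the write.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical index over write-side data flow log entries for a single file.
final class ByteRangeIndex {
    // One log entry reduced to the fields needed for the lookup.
    private static final class Entry {
        final String objectHash;
        final long from;
        final long to;

        Entry(String objectHash, long from, long to) {
            this.objectHash = objectHash;
            this.from = from;
            this.to = to;
        }
    }

    private final List<Entry> entries = new ArrayList<>();

    // Records one write: bytes [from, to) came from the object with this hash.
    void add(String objectHash, long from, long to) {
        entries.add(new Entry(objectHash, from, to));
    }

    // Returns the hash of the most recently recorded writer covering the offset, or null.
    String writerOf(long offset) {
        for (int i = entries.size() - 1; i >= 0; i--) {
            Entry e = entries.get(i);
            if (e.from <= offset && offset < e.to) {
                return e.objectHash;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        ByteRangeIndex index = new ByteRangeIndex();
        index.add("337422011", 3, 35);
        index.add("558392438", 35, 50);
        System.out.println(index.writerOf(34)); // covered by the first entry
        System.out.println(index.writerOf(35)); // covered by the second entry
    }
}
```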

In this way, the Logger hook 107 acquires, from the file pointer management unit 403, the file pointers indicating the end point of the data flow. The Logger object 104 acquires, from the System class 103, the object identifier indicating the start point of the data flow. The Logger object 104 then generates a data flow log in which the information indicating the start point and the information indicating the end point are recorded. Based on the generated data flow log, the data flow in which the Java program 1 writes file data to the file service 4 can be monitored.

As described above, according to the present embodiment, the data flow between the Java program 1 and the file service 4 can be recorded. Furthermore, because the location of data in the file is recorded as file pointers expressed in bytes, the location of the data within the file can be distinguished.

(Third embodiment)
The first embodiment described a method for recording data flows inside the Java program 1. The present embodiment describes a method for recording a data flow log when two Java programs exchange data directly with each other, rather than inside a single Java program 1. It thereby shows that a data flow can be traced even when the second component is a process. Descriptions that overlap with the first embodiment are omitted.

FIG. 13 is a block diagram illustrating an example of a schematic configuration of the entire computer system according to the third embodiment. Here, the two Java programs are distinguished as a first Java program 1A and a second Java program 1B. Java programs other than the first Java program 1A and the second Java program 1B may also be present. The first Java program 1A and the second Java program 1B have the same configuration. They differ from the Java program 1 of the first embodiment in that, instead of the bind temporary storage unit 105, the JDBC driver 106, and the Logger hook 107, they include an object transmission unit 110, an object reception unit 111, and a process ID management unit 112.

The object transmission unit 110 transmits a Java object in the Java program 1 to another Java program 1. The object reception unit 111 receives Java object data transmitted from another Java program 1. Although FIG. 13 shows the case in which the first Java program 1A transmits (transmission side) and the second Java program 1B receives (reception side), the roles may be reversed.

The process ID management unit 112 holds the process ID assigned to the Java program 1. The process ID management unit 112 can be implemented using, for example, a java.lang.management.RuntimeMXBean object included in the Java standard library.
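A process ID can be obtained from RuntimeMXBean roughly as sketched below. The class ProcessIdSketch is hypothetical, and the sketch relies on an assumption: on common JVMs, RuntimeMXBean.getName() returns a string of the form "pid@hostname", but the API documentation does not guarantee this format, so the parsing here may not hold on every runtime.

```java
import java.lang.management.ManagementFactory;

// Hypothetical stand-in for the process ID management unit 112.
final class ProcessIdSketch {
    // Returns the part of the runtime name before '@', which is the PID on common JVMs.
    static String currentProcessId() {
        String name = ManagementFactory.getRuntimeMXBean().getName();
        int at = name.indexOf('@');
        // Fall back to the whole name if the expected separator is missing.
        return at > 0 ? name.substring(0, at) : name;
    }

    public static void main(String[] args) {
        System.out.println(currentProcessId());
    }
}
```

On Java 9 and later, ProcessHandle.current().pid() provides the same value without parsing.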

In the third embodiment, the Logger object 104 acquires, from the process ID management unit 112, the process ID of the process to which the Java objects 101 and 102 belong. In the third embodiment, a plurality of Java programs 1 operate, and each of them generates data flow logs for its own internal data flows. To identify which Java program 1 generated a given data flow log, the log must therefore contain an identifier of that Java program 1. For this reason, the third embodiment includes the process ID of the Java program 1 in the data flow log.

In the computer system according to the third embodiment, the following two types of data flows are assumed and monitored.
(1) Data flow inside the Java program 1
(2) Data flow between Java programs 1 (between the first Java program 1A and the second Java program 1B)

(1) Data flow inside the Java program 1
FIG. 14 is a flowchart of the data flow log generation processing inside the Java program according to the third embodiment. As in the first embodiment, this flow starts when the Java objects 101 and 102, which are the start point and the end point of the data flow, are passed to the Logger object 104 by a call to the flow method.

At the start of the flow, the Logger object 104 acquires the process ID of the Java program 1 from the process ID management unit 112 (S801). The subsequent steps S101 to S104 are the same as in the data flow log generation processing inside the Java program 1 according to the first embodiment, except that in S103 the Logger object 104 includes the process ID acquired in S801 in the data flow log. The generated data flow log therefore differs from those of the first and second embodiments.

The following is an example of a data flow log inside the Java program 1 according to the third embodiment.
{
  "src": {
    "type": "java_internal",
    "process_id": "3233",
    "object_hash": "1487365346"
  },
  "dst": {
    "type": "java_internal",
    "process_id": "3233",
    "object_hash": "1786281489"
  },
  "time": "1439172160987"
}
In the present embodiment, the src field and the dst field each contain a process_id field. The process_id field indicates the process ID of the Java program 1. Because this is a data flow log inside the Java program 1, the process_id values in the src field and the dst field ("3233" in the above example) are the same. The Java program 1 that generated the log can thereby be identified. The rest is the same as in the first embodiment.

In this way, as information indicating the start point of the data flow, the Logger object 104 acquires the object identifier of the Java object 101 from the System class 103 and the process ID of the Java program 1 from the process ID management unit 112. As information indicating the end point of the data flow, the Logger object 104 acquires the object identifier of the Java object 102 from the System class 103 and the process ID of the Java program 1 from the process ID management unit 112. The Logger object 104 then generates a data flow log in which the information indicating the start point and the end point is recorded. Based on the generated data flow log, the data flow inside each Java program can be monitored.

(2) Data flow between Java programs 1
Next, a method for recording the data flow between Java programs 1 is described. FIG. 15 is a flowchart of the data flow log generation processing between Java programs according to the third embodiment.

The Logger object 104A on the transmission side (first Java program 1A) acquires the object identifier from the System class 103A (S901). The Logger object 104A on the transmission side also acquires the process ID from the process ID management unit 112A (S902). The object transmission unit 110A on the transmission side then transmits the acquired object identifier and process ID, together with the serialized Java object 102A, to the reception side (second Java program 1B) (S903).

When the object transmission unit 110A on the transmission side transmits the Java object 102A, the Java object must be converted into a format that can be sent over the communication path. This conversion is called serialization. The object reception unit 111B on the reception side receives the transmitted data and generates the Java object 101B from the serialized data (S904). Generating a Java object from serialized data is called deserialization. Serialization and deserialization may use the Serializable interface available to the Java program 1.

Communication between the object transmission unit 110A on the transmission side and the object reception unit 111B on the reception side can be realized by various means, such as a pipe in a UNIX (registered trademark) system, a UNIX domain socket, TCP/IP, or communication over some kind of message bus.

The Logger object 104B on the reception side acquires the object identifier of the deserialized Java object 101B from the System class 103B (S905). The Logger object 104B on the reception side also acquires the process ID from the process ID management unit 112B (S906). The Logger object 104B on the reception side then generates a data flow log using the object identifier and process ID received from the transmission side and the object identifier and process ID acquired on the reception side (S907). The Logger object 104B on the reception side transmits the generated data flow log to the log receiving unit 3 (S908). This completes the data flow log generation processing between Java programs 1 according to the third embodiment.
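Steps S901 to S907 can be sketched as follows. This is an illustration under several assumptions rather than the embodiment's implementation: TransferSketch, Envelope, and Received are hypothetical names, an in-memory byte array stands in for the pipe or socket, the object identifier is assumed to come from System.identityHashCode, and the process ID is parsed from RuntimeMXBean.getName(), whose "pid@hostname" format is common but not guaranteed. Because the demo runs in a single process, both process_id values are the same; with two real processes they would differ.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.lang.management.ManagementFactory;

// Hypothetical sketch of sending an object together with its sender-side identifiers.
final class TransferSketch {
    // Hypothetical wire format: sender identifiers travel alongside the payload.
    static final class Envelope implements Serializable {
        private static final long serialVersionUID = 1L;
        final String srcProcessId;
        final int srcObjectHash;
        final Serializable payload;

        Envelope(String srcProcessId, int srcObjectHash, Serializable payload) {
            this.srcProcessId = srcProcessId;
            this.srcObjectHash = srcObjectHash;
            this.payload = payload;
        }
    }

    // Result of the reception side: the deserialized object and the generated log.
    static final class Received {
        final Object payload;
        final String log;

        Received(Object payload, String log) {
            this.payload = payload;
            this.log = log;
        }
    }

    static String processId() {
        String name = ManagementFactory.getRuntimeMXBean().getName();
        int at = name.indexOf('@');
        return at > 0 ? name.substring(0, at) : name;
    }

    // Transmission side: S901 (object identifier), S902 (process ID), S903 (serialize and send).
    static byte[] send(Serializable obj) throws IOException {
        Envelope env = new Envelope(processId(), System.identityHashCode(obj), obj);
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(env);
        }
        return bytes.toByteArray();
    }

    // Reception side: S904 (deserialize), S905 and S906 (local identifiers), S907 (log).
    static Received receive(byte[] wire) throws IOException, ClassNotFoundException {
        Envelope env;
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(wire))) {
            env = (Envelope) in.readObject();
        }
        int dstHash = System.identityHashCode(env.payload);
        String log = "{\"src\":{\"type\":\"java_internal\",\"process_id\":\"" + env.srcProcessId
                + "\",\"object_hash\":\"" + env.srcObjectHash + "\"},"
                + "\"dst\":{\"type\":\"java_internal\",\"process_id\":\"" + processId()
                + "\",\"object_hash\":\"" + dstHash + "\"},"
                + "\"time\":\"" + System.currentTimeMillis() + "\"}";
        return new Received(env.payload, log);
    }

    public static void main(String[] args) throws IOException, ClassNotFoundException {
        Received r = receive(send("hello"));
        System.out.println(r.payload);
        System.out.println(r.log);
    }
}
```

Carrying the sender's identifiers in the same message as the payload lets the reception side produce a complete log entry without a second round trip to the sender.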

The following is an example of a data flow log between Java programs 1 according to the third embodiment.
{
  "src": {
    "type": "java_internal",
    "process_id": "3233",
    "object_hash": "1786281489"
  },
  "dst": {
    "type": "java_internal",
    "process_id": "1832",
    "object_hash": "558392438"
  },
  "time": "1439172173920"
}
In the above data flow log, the process_id field of the src field represents the process ID of the transmission-side Java program 1A, and the process_id field of the dst field represents the process ID of the reception-side Java program 1B. The transmission-side Java program 1A and the reception-side Java program 1B can thereby be distinguished.

In this way, the object reception unit 111B on the reception side acquires the object identifier of the Java object 102A and the process ID of the Java program 1A as information indicating the start point of the data flow. The Logger object 104B on the reception side acquires, as information indicating the end point of the data flow, the object identifier of the deserialized Java object 101B from the System class 103B and the process ID of the Java program 1B from the process ID management unit 112B. The Logger object 104B then generates a data flow log in which the information indicating the start point and the end point is recorded. Based on the generated data flow log, the data flow between Java programs 1 can be monitored.

As described above, according to the present embodiment, when a Java object is serialized and transmitted, the object identifier in the transmission-side Java program 1 and the process ID of the transmission-side Java program 1 are transmitted together with it. Because the reception-side Java program 1 generates the data flow log using this information, the data flow can be recorded in detail even when Java programs 1 exchange data directly with each other. In addition, the Java program 1 that generated a data flow log for an internal data flow can be identified even when a plurality of Java programs 1 are operating.

The data flow log generation device in the embodiments described above can be realized, for example, by using a general-purpose computer device as basic hardware and causing a processor mounted on the computer device to execute the above program.

FIG. 16 is a block diagram illustrating an example of a hardware configuration according to an embodiment of the present invention. The data flow log generation device can be realized as a computer device that includes a processor 501, a main storage device 502, an auxiliary storage device 503, a network interface 504, a device interface 505, an input device 506, and an output device 507, which are connected via a bus 508.

The processor 501 reads the Java program 1 from the auxiliary storage device 503, loads it into the main storage device 502, and executes it, thereby realizing each function inside the Java program 1.

The data flow log generation device according to the present embodiment may be realized by installing the program executed by the device on the computer device in advance, or by storing the program on a storage medium such as a CD-ROM, or distributing it via a network, and installing it on the computer device as appropriate.

The network interface 504 is an interface for connecting to a communication network. When the device is connected to the RDB 2, the log receiving unit 3, the file service 4, and the like by communication, the connection may be made through the network interface 504. Although only one network interface is shown here, a plurality of network interfaces may be mounted.

The device interface 505 is an interface for connecting to a device such as the external storage medium 6. The external storage medium 6 may be any recording medium such as an HDD, a CD-R, a CD-RW, a DVD-RAM, a DVD-R, or a SAN (storage area network). The RDB 2 may also be connected through the device interface 505.

The main storage device 502 is a memory device that temporarily stores instructions executed by the processor 501, various data, and the like, and may be a volatile memory such as a DRAM or a non-volatile memory such as an MRAM. The bind temporary storage unit 105, the access destination file name temporary storage unit 109, the process ID management unit 112, and the like can be realized by the main storage device 502. The auxiliary storage device 503 is a storage device that permanently stores programs, data, and the like, such as an HDD or an SSD.

Although embodiments of the present invention have been described above, these embodiments are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the gist of the invention. These embodiments and their modifications are included in the scope and gist of the invention, and are included in the invention described in the claims and its equivalents.

1 Process (Java program)
1A First Java program
1B Second Java program
101, 101A, 101B, 102, 102A, 102B Java object
103, 103A, 103B System class
104, 104A, 104B Logger object
105 Bind temporary storage unit
106 JDBC driver
107 Logger hook
108 RandomAccessFile object
109 Access destination file name temporary storage unit
110, 110A, 110B Object transmission unit
111, 111A, 111B Object reception unit
112, 112A, 112B Process ID management unit
2 Relational database (RDB)
201 Database interface (DB_I/F)
202 Access destination table
203 System catalog
204 Temporary log table
3 Log receiving unit
4 File service
401 Access destination file
402 File service interface (file service I/F)
403 File pointer management unit
501 Processor
502 Main storage device
503 Auxiliary storage device
504 Device interface
505 Network interface
506 Bus
6 External storage medium

Claims

A data flow log generation device comprising:
a first acquisition unit that, when a data flow occurs between a first object in a first component that is a process and a second component, acquires first position information indicating a start point or an end point of the data flow in the first component;
a second acquisition unit that acquires second position information indicating a start point or an end point of the data flow in the second component; and
a log generation unit that generates a data flow log in which the first position information and the second position information are recorded.

The data flow log generation device according to claim 1, wherein the first acquisition unit, the second acquisition unit, and the log generation unit operate in the process of the first component.

The data flow log generation device according to claim 1 or 2, further comprising a first identifier management unit that manages identifiers of objects in the first component, wherein the first acquisition unit acquires, from the first identifier management unit, the identifier of the first object as a part or all of the first position information.

The data flow log generation device according to any one of claims 1 to 3, wherein
the second component is a relational database;
the data flow includes reading data from the relational database into the first object; and
the second acquisition unit acquires, from the relational database, the position at which the data related to the data reading is stored as the second position information.

The data flow log generation device according to any one of the above, wherein
the second component is a relational database;
the data flow includes writing data from the first object to the relational database;
the device further comprises a storage unit that stores data relating to the data writing; and
the second acquisition unit acquires, from the relational database and based on the data stored in the storage unit, the position at which the data related to the data writing is stored as the second position information.

The data flow log generation device according to claim 4 or 5, wherein the second position information includes any one of a database name, a catalog name, a schema name, a table name, a column name, and a table primary key of the relational database.

The data flow log generation device according to any one of claims 4 to 6, wherein the second acquisition unit issues, to the relational database, an instruction for enabling or disabling an automatic commit setting.

The data flow log generation device according to any one of claims 1 to 3, wherein
the second component is a file service;
the data flow includes reading data from the file service into the first object or writing data from the first object to the file service; and
the second acquisition unit acquires, from the file service, as part or all of the second position information, at least one of a file path related to the data reading or data writing and a file pointer indicating the position, within the file, of the data related to the data reading or data writing.
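
The file path and file pointer named in this claim can be illustrated with the standard `java.io.RandomAccessFile` class. The sketch below assumes the pointer is sampled just before the read; `FilePosition` and `FileReadTracer` are hypothetical names, and in the claimed system the position would be supplied by the file service rather than read locally.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.file.Path;

// Hypothetical holder for file-side position information: path plus byte offset.
record FilePosition(String path, long pointer) {}

final class FileReadTracer {
    // Fills `buffer` starting at byte `offset` of `file` and reports where the read began.
    static FilePosition readAndLocate(Path file, long offset, byte[] buffer) throws IOException {
        try (RandomAccessFile raf = new RandomAccessFile(file.toFile(), "r")) {
            raf.seek(offset);
            long start = raf.getFilePointer(); // file pointer before the read
            raf.readFully(buffer);             // throws EOFException if the file is too short
            return new FilePosition(file.toString(), start);
        }
    }
}
```

The returned `FilePosition` would serve as the second position information, paired with the identifier of the object that received the bytes.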

The data flow log generation device, wherein
the second component is a process;
the data flow includes a data flow from a second object in the second component to the first object;
the second acquisition unit acquires an identifier of the second object and a process ID of the second component as part or all of the second position information; and
the first acquisition unit acquires the identifier of the first object and the process ID of the first component as part or all of the first position information.

The data flow log generation device according to claim 9, wherein
the second acquisition unit acquires the identifier of the second object serialized by the second component; and
the first acquisition unit acquires, as the identifier of the first object, the identifier of the second object deserialized by the first component.
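
A minimal sketch of how an object identifier might travel with a serialized object is given below, using standard Java object serialization over an in-memory byte array in place of a real inter-process channel. The envelope type `TaggedObject` and the helper `ObjectTransport` are hypothetical; the claims do not prescribe a wire format.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;

// Hypothetical envelope: the payload travels together with the sender-side object ID
// and the sender's process ID.
record TaggedObject(long objectId, long senderPid, Serializable payload) implements Serializable {}

final class ObjectTransport {
    // Sender side: serialize the payload together with its identifier.
    static byte[] send(TaggedObject tagged) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(tagged);
        }
        return bytes.toByteArray();
    }

    // Receiver side: the deserialized identifier names the object the data came from.
    static TaggedObject receive(byte[] wire) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(wire))) {
            return (TaggedObject) in.readObject();
        }
    }
}
```

On the receiving side, the `objectId` recovered from the envelope can be reused as the identifier of the newly created object, so that both ends of the flow refer to the same identifier in the log.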

The data flow log generation device according to any one of claims 1 to 10, wherein,
when a data flow occurs between the first object and a different second object in the first component, the first acquisition unit acquires an identifier of the first object as part or all of the first position information, and acquires an identifier of the second object as part or all of the second position information.

The data flow log generation device according to claim 11, wherein the first acquisition unit further acquires a process ID of the first component as a part of the first position information.
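
For a flow that stays inside one process, the position of each object can be expressed as a process ID plus an object identifier. The sketch below obtains the process ID with the standard `ProcessHandle` API (Java 9 and later); the record name `ObjectPosition` is hypothetical, and the object identifier is assumed to come from some identifier manager.

```java
// Hypothetical position of an in-process object: process ID plus object ID.
record ObjectPosition(long pid, long objectId) {
    // Builds a position for an object living in the currently running JVM process.
    static ObjectPosition inCurrentProcess(long objectId) {
        return new ObjectPosition(ProcessHandle.current().pid(), objectId);
    }
}
```

Two such positions with the same `pid` and different `objectId` values would describe the start point and end point of an intra-process flow.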

A relational database comprising:
a second position information storage unit that stores the second position information; and
a transmission unit that transmits the second position information to the data flow log generation device according to any one of claims 4 to 7.

The relational database according to claim 13, further comprising:
a temporary data storage unit that temporarily stores write destination position information of data; and
a trigger that, when data is written from the first component, copies the write destination position information of the written data to the temporary data storage unit.

A data flow log generation method executed by a computer, the method comprising:
a first acquisition step of acquiring, when a data flow occurs between a first object in a first component that is a process and a second component, first position information indicating a start point or an end point of the data flow in the first component;
a second acquisition step of acquiring second position information indicating a start point or an end point of the data flow in the second component; and
a log generation step of generating a data flow log in which the first position information and the second position information are recorded.

A program that causes a computer to execute:
a first acquisition step of acquiring, when a data flow occurs between a first object in a first component that is a process and a second component, first position information indicating a start point or an end point of the data flow in the first component;
a second acquisition step of acquiring second position information indicating a start point or an end point of the data flow in the second component; and
a log generation step of generating a data flow log in which the first position information and the second position information are recorded.

A monitoring system comprising: a first component that is a process; a second component; and a log receiving unit that receives a log relating to a data flow performed between the first component and the second component, wherein
the second component comprises:
a sending unit that sends second position information indicating a start point or an end point of the data flow in the second component; and
the first component comprises:
a first acquisition unit that acquires first position information indicating a start point or an end point of the data flow in the first component;
a second acquisition unit that acquires the second position information from the sending unit; and
a log generation unit that generates a data flow log in which the first position information and the second position information are recorded.