JPWO2020141584A1

JPWO2020141584A1 - Processing system and program

Info

Publication number: JPWO2020141584A1
Application number: JP2020563862A
Authority: JP
Inventors: 豊塚本; 陽子塚本
Original assignee: KABUSIKIGAISYAFUTUREEYE
Current assignee: KABUSIKIGAISYAFUTUREEYE
Priority date: 2019-01-06
Filing date: 2019-12-04
Publication date: 2021-11-11
Anticipated expiration: 2039-12-04
Also published as: JP7446621B2; WO2020141584A1; JP2024045636A

Abstract

【課題】記録された情報の真正の保証とその情報の削除権の保証とが二律背反するジレンマを解消する。【解決手段】２つの片割れ共通鍵ＫＡ及びＫＢで情報を二重暗号化ＥＫＡ(ＥＫＢ(情報））してブロックチェーンに記録する。情報保持者は、情報要求者に対し、一方の片割れ共通鍵ＫＢを配布すると共に二重暗号化された情報を片割れ鍵ＫＡで復号ＤＫＡ(ＥＫＡ(ＥＫＢ（情報）））した情報ＥＫＢ（情報）を送信する。情報要求者はＤＫＢ(ＥＫＢ(情報））を演算して平文の情報を得る。情報保持者が片割れ共通鍵ＫＡをＲに更新することにより、情報要求者に対しＤＲ(ＥＫＡ(ＥＫＢ（情報）））が送信され、情報要求者がＫＢで復号ＤＫＢ(ＤＲ(ＥＫＡ(ＥＫＢ（情報）））しても、平文を得ることができなくする。【選択図】図１４PROBLEM TO BE SOLVED: To solve a dilemma in which the guarantee of authenticity of recorded information and the guarantee of the right to delete the information are in conflict with each other. Information is double-encrypted EKA (EKB (information)) and recorded in a blockchain with two half-split common keys KA and KB. The information holder distributes one half-split common key KB to the information requester and decrypts the double-encrypted information with the half-split key KA (EKA (EKB (information))) Information EKB (information). To send. The information requester calculates DKB (EKB (information)) to obtain plaintext information. When the information holder updates the half-split common key KA to R, a DR (EKA (EKB (information))) is transmitted to the information requester, and the information requester decrypts the DKB (DR (EKA (EKB)) with the KB. Information))) Even if it is not possible to obtain plain text. [Selection diagram] Fig. 14

Description

本発明は、例えば、ブロックチェーン等のような改竄や消去が困難な情報記録方式についての処理システムおよびプログラムに関する。
に関する。The present invention relates to a processing system and a program for an information recording method that is difficult to falsify or erase, such as a blockchain.
Regarding.

改竄が困難な情報記録方式としてブロックチェーンが従来から一般的に知られている。このブロックチェーンを利用して貨物輸送に関する各種情報を記録しているものとして、例えば、特許文献１がある。 Blockchain has been generally known as an information recording method that is difficult to falsify. For example, Patent Document 1 records various information related to freight transportation using this blockchain.

特開２０１８−１２８７２３号JP-A-2018-128723

しかし、このようなブロックチェーンを利用した情報の記録は、改竄が困難なばかりでなく消去も困難である（以下「消去不可能性」という）。その結果、一旦ブロックチェーンを利用して個人情報の記録を行った場合には、その個人情報主が個人情報を消去したくなっても消去できず個人情報消去権（いわゆる忘れられる権利）が損なわれるという欠点がある。 However, recording information using such a blockchain is not only difficult to falsify, but also difficult to erase (hereinafter referred to as "non-erasable"). As a result, once personal information is recorded using the blockchain, even if the personal information owner wants to delete the personal information, it cannot be deleted and the right to erase personal information (so-called right to be forgotten) is impaired. There is a drawback that it is.

つまり、記録された情報の真正の保証とその情報の削除権の保証とが二律背反するジレンマが生じるという欠点が生じる。 In other words, there is a drawback that there is a dilemma in which the guarantee of the authenticity of the recorded information and the guarantee of the right to delete the information are antinomy.

本発明は、かかる実情に鑑み考え出されたものであり、その目的は、記録された情報の真正の保証とその情報の削除権の保証とが二律背反するジレンマを解消することである。 The present invention has been conceived in view of such circumstances, and an object of the present invention is to solve a dilemma in which the guarantee of the authenticity of recorded information and the guarantee of the right to delete the information are antinomy.

本発明は、記録対象の情報を暗号化する暗号化処理を行う暗号化手段と、
前記暗号化処理を経た後の情報を記録する記録手段と、
前記記録手段により記録された情報に対し、第１鍵と第２鍵とを用いて復号処理を行って平文の情報にする復号手段と、
前記記録手段により記録された情報を復号できない復号不能化状態にする復号不能化手段と、を備え、
前記復号手段は、前記第２鍵を秘匿して保持する第２鍵秘匿保持手段を含み、
前記復号不能化手段は、前記第２鍵秘匿保持手段により保持されている前記第２鍵を他のものに更新することにより復号不能化状態にする。The present invention provides an encryption means that performs an encryption process that encrypts information to be recorded.
A recording means for recording information after the encryption process, and
Decoding means that performs decoding processing on the information recorded by the recording means using the first key and the second key to obtain plain text information.
The information recorded by the recording means is provided with an undecryptable means for making the information undecryptable, which cannot be decoded.
The decryption means includes a second key concealment holding means for concealing and holding the second key.
The undecryptable means puts the undecryptable state by updating the second key held by the second key concealment holding means with another key.

好ましくは、前記復号手段は、前記第１鍵を情報の閲覧希望者に配布する第１鍵配布手段をさらに含む。
好ましくは、前記記録手段により記録された情報を平文にすることなく検索する検索手段をさらに備えている。Preferably, the decryption means further includes a first key distribution means that distributes the first key to a person who wants to view the information.
Preferably, the search means for searching the information recorded by the recording means in plain text is further provided.

より好ましくは、前記記録手段により記録された情報は個人情報を含み、
前記復号不能化手段は、個人情報主の要求に応じて当該個人情報主の個人情報を前記復号不能化状態にする。More preferably, the information recorded by the recording means includes personal information.
The undecryptable means puts the personal information of the personal information owner into the undecryptable state at the request of the personal information owner.

本発明の他の局面は、記録対象の情報を暗号化する暗号化処理を行うステップと、
前記暗号化処理を経た後の情報を記録する記録手段により記録された情報に対し、第１鍵と第２鍵とを用いて復号処理を行って平文の情報にする復号ステップと、
前記記録手段により記録された情報を復号できない復号不能化状態にするステップとを、
コンピュータに実行させ、
前記復号ステップは、前記第２鍵を秘匿して保持するステップを含み、
前記復号不能化状態にするステップは、前記保持するステップにより保持されている前記第２鍵を他のものに更新することにより復号不能化状態にする。Another aspect of the present invention is a step of performing an encryption process for encrypting information to be recorded.
A decryption step of decrypting the information recorded by the recording means for recording the information after the encryption process using the first key and the second key to obtain plain text information.
The step of making the information recorded by the recording means undecipherable, which cannot be decoded,
Let the computer run
The decryption step includes a step of keeping the second key secret.
The step of making the decryption impossible state makes the second key held by the holding step update to another key to make the decryption state impossible.

本発明によれば、記録された情報の真正の保証とその情報の削除権の保証とが二律背反するジレンマを極力解消することができる。 According to the present invention, the dilemma in which the guarantee of the authenticity of the recorded information and the guarantee of the right to delete the information conflict with each other can be solved as much as possible.

処理システムの全体構成を示すシステム図である。It is a system diagram which shows the whole structure of a processing system. （Ａ）はブロックチェーンのノードを構成するユーザ端末のＨＤＤに記憶されている情報を説明する図であり、（Ｂ）は認定事業者の個人情報ＤＢに記憶されている情報を説明する図である。(A) is a diagram for explaining the information stored in the HDD of the user terminal constituting the node of the blockchain, and (B) is a diagram for explaining the information stored in the personal information DB of the authorized business operator. be. （Ａ）はパブリックチェーンのユーザ端末のメインルーチンプログラムを示すフローチャートであり、（Ｂ）は個人情報記録処理のサブルーチンプログラムを示すフローチャート及び認定事業者のサーバのフローチャートである。(A) is a flowchart showing a main routine program of a user terminal of a public chain, and (B) is a flowchart showing a subroutine program of personal information recording processing and a flowchart of a server of an authorized business operator. パブリックチェーンのユーザ端末で実行されるスマートコントラクト処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of smart contract processing executed in the user terminal of a public chain. （Ａ）はプライベートチェーンのユーザ端末のメインルーチンプログラムを示すフローチャートであり、（Ｂ）は個人情報検索処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing the main routine program of the user terminal of the private chain, and (B) is the flowchart showing the subroutine program of the personal information search process. プライベートチェーンのユーザ端末で実行されるスマートコントラクト処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of smart contract processing executed in the user terminal of a private chain. （Ａ）はプライベートチェーンのユーザ端末で実行されるスマートコントラクト処理のサブルーチンプログラムを示すフローチャートの続きであり、（Ｂ）はプライベートチェーンのユーザ端末で実行される機械学習処理のサブルーチンプログラムを示すフローチャートである。(A) is a continuation of the flowchart showing the subroutine program of smart contract processing executed on the user terminal of the private chain, and (B) is the flowchart showing the subroutine program of machine learning processing executed on the user terminal of the private chain. be. プライベートチェーンのユーザ端末で実行されるＡＩスマートコントラクト生成処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the AI smart contract generation processing executed in the user terminal of a private chain. （Ａ）はプライベートチェーンのユーザ端末で実行されるシミュレーション学習処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）はプライベートチェーンのユーザ端末で実行されるＡＩスマートコントラクト群生成処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of simulation learning processing executed on a user terminal of a private chain, and (B) is a flowchart showing a subroutine program of AI smart contract group generation processing executed on a user terminal of a private chain. Is. プライベートチェーンのユーザ端末で実行されるスマートコントラクト信託請負い処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of smart contract trust contract processing executed by the user terminal of a private chain. （Ａ）はパーソナライズドＡＩスマートコントラクト学習済みモデルの強化学習処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）はコンソーシアムチェーンのユーザ端末により実行されるメインルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of reinforcement learning processing of a personalized AI smart contract trained model, and (B) is a flowchart showing a main routine program executed by a user terminal of a consortium chain. （Ａ）はコンソーシアムチェーンのユーザ端末により実行されるIoTセンサデータ集計処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）はコンソーシアムチェーンのユーザ端末により実行されるシミュレーション処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of IoT sensor data aggregation processing executed by a user terminal of the consortium chain, and (B) is a flowchart showing a subroutine program of simulation processing executed by a user terminal of the consortium chain. .. （Ａ）はコンソーシアムチェーンのユーザ端末により実行されるＡＩスマートコントラクト群生成処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）はコンソーシアムチェーンのユーザ端末により実行されるスマートコントラクト処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of AI smart contract group generation processing executed by the user terminal of the consortium chain, and (B) is a flowchart showing a subroutine program of smart contract processing executed by the user terminal of the consortium chain. Is. ブロックチェーンを利用して記録された情報を閲覧できない状態にする説明図であり、（Ａ）は閲覧できる通常状態の図であり、（Ｂ）は解読不能化状態にして閲覧できないようにした図である。It is an explanatory diagram that makes the information recorded by using the blockchain inaccessible, (A) is a diagram in a normal state that can be viewed, and (B) is a diagram that makes it unreadable so that it cannot be viewed. Is. ブロックチェーンのノードを構成するユーザ端末のＨＤＤに記憶されている情報を説明する図である。It is a figure explaining the information stored in the HDD of the user terminal which constitutes the node of a blockchain. パブリックチェーンのユーザ端末とプライベートチェーンのユーザ端末とのメインルーチンプログラムを示すフローチャートである。It is a flowchart which shows the main routine program of the user terminal of a public chain and the user terminal of a private chain. （Ａ）はブロックチェーンへの個人情報記録処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は記録解読不能処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of personal information recording processing on the blockchain, and (B) is a flowchart showing a subroutine program of record unbreakable processing. 個人情報入手処理及び個人情報提供処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the personal information acquisition processing and the personal information provision processing. （Ａ）はブロックチェーンのノードを構成するユーザ端末のＨＤＤに記憶されている情報を説明する図であり、（Ｂ）は認定事業者の個人情報ＤＢに記憶されている情報を説明する図である。(A) is a diagram for explaining the information stored in the HDD of the user terminal constituting the node of the blockchain, and (B) is a diagram for explaining the information stored in the personal information DB of the authorized business operator. be. パブリックチェーンのユーザ端末と認定事業者のサーバとプライベートチェーンのユーザ端末とのメインルーチンプログラムを示すフローチャートである。It is a flowchart which shows the main routine program of the user terminal of a public chain, the server of an authorized business operator, and the user terminal of a private chain. ブロックチェーンへの個人情報記録処理及びハッシュ値記録処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the personal information recording processing and the hash value recording processing in a blockchain. 記録解読不能処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the record unbreakable processing. 個人情報提供処理と個人情報入手処理と暗号文送信処理とのサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the personal information provision process, the personal information acquisition process, and the ciphertext transmission process. 処理システムの全体構成を示すシステム図である。It is a system diagram which shows the whole structure of a processing system. パブリックチェーンのユーザ端末と鍵登録センタのサーバとプライベートチェーンのユーザ端末とのメインルーチンプログラムを示すフローチャートである。It is a flowchart which shows the main routine program of a user terminal of a public chain, a server of a key registration center, and a user terminal of a private chain. （Ａ）はブロックチェーンへの個人情報記録処理及び鍵登録処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は記録解読不能化要求処理及び記録解読不能処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of personal information recording processing and key registration processing on the blockchain, and (B) is a flowchart showing a subroutine program of record decipherability request processing and record decipherability processing. 片割れ共通鍵提供処理とデータ入手処理とデータ復号処理とのサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of one-sided common key provision processing, data acquisition processing, and data decoding processing. シミュレーション環境としてのミラーワールドの説明図である。It is explanatory drawing of the mirror world as a simulation environment. ミラーワールドにおける都市デジタルツインの具体例を示す図である。It is a figure which shows a concrete example of an urban digital twin in a mirror world. ミラーワールドサーバとユーザ端末とのメインルーチンのフローチャートである。It is a flowchart of the main routine of a mirror world server and a user terminal. パーソナルＡＩの生成販売処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the generation sale process of personal AI. シミュレーション準備処理及びシミュレーション準備応答処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the simulation preparation processing and the simulation preparation response processing. シミュレーション処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of simulation processing. （Ａ）はマルチ役務ＤＡＯ構築システムの概略図であり、（Ｂ）はミラーワールドサーバとユーザ端末とのメインルーチンのフローチャートである。(A) is a schematic diagram of a multi-service DAO construction system, and (B) is a flowchart of a main routine between a mirror world server and a user terminal. シミュレーション強化学習準備処理及びシミュレーション強化学習準備応答処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the simulation reinforcement learning preparation processing and the simulation reinforcement learning preparation response processing. マルチ役務ＤＡＯデジタルツインをシミュレーション対象としてミラーワールドに登録する説明図である。It is explanatory drawing which registers a multi-service DAO digital twin in a mirror world as a simulation target. マルチ役務ＤＡＯのシミュレーション強化学習の概略システム図である。It is a schematic system diagram of simulation reinforcement learning of multi-service DAO. ＤＡＯエージェント強化学習処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of DAO agent reinforcement learning processing. （Ａ）はＤＡＯエージェントが知識として記憶している報酬テーブルを示す図であり、（Ｂ）はペルソナエージェント強化学習処理のサブルーチンプログラムを示すフローチャートである。(A) is a diagram showing a reward table stored as knowledge by the DAO agent, and (B) is a flowchart showing a subroutine program of the persona agent reinforcement learning process. （Ａ）はアイデア発案役務実行処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は改良案役務実行処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of the idea proposal service execution process, and (B) is a flowchart showing a subroutine program of the improvement plan service execution process. （Ａ）は事業化役務実行処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は侵害対処役務実行処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of commercialization service execution processing, and (B) is a flowchart showing a subroutine program of infringement handling service execution processing. （Ａ）はトークン購入役務実行処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）はペルソナエージェント群によるトークン購入に伴うトークンの変動相場での価格変動を説明する図である。(A) is a flowchart showing a subroutine program of token purchase service execution processing, and (B) is a diagram explaining price fluctuations in a token fluctuation market due to token purchase by a persona agent group. エレメント統合ＤＡＯ構築システムを示す図である。It is a figure which shows the element integrated DAO construction system. （Ａ）はミラーワールドサーバとユーザ端末とのメインルーチンのフローチャートであり、（Ｂ）はシミュレーション強化学習準備処理とシミュレーション強化学習準備応答処理とのサブルーチンプログラムを示すフローチャートである。(A) is a flowchart of the main routine between the mirror world server and the user terminal, and (B) is a flowchart showing a subroutine program of simulation reinforcement learning preparation processing and simulation reinforcement learning preparation response processing. エレメント統合ＤＡＯデジタルツインをシミュレーション対象としてミラーワールドに登録する説明図である。It is explanatory drawing which registers the element integrated DAO digital twin in a mirror world as a simulation target. エレメント統合ＤＡＯデジタルツインのシミュレーション強化学習を示す概略システムを図である。The figure shows the schematic system which shows the simulation reinforcement learning of the element integrated DAO digital twin. （Ａ）は資材調達エレメントエージェントが知識として記憶しているパフォーマンス及び分配率の算出アルゴリズムを示す図であり、（Ｂ）は組立エレメントエージェントが知識として記憶しているパフォーマンス及び分配率の算出アルゴリズムを示す図であり、（Ｃ）は宣伝エレメントエージェントが知識として記憶しているパフォーマンス及び分配率の算出アルゴリズムを示す図である。(A) is a diagram showing a performance and distribution rate calculation algorithm stored as knowledge by the material procurement element agent, and (B) is a diagram showing a performance and distribution rate calculation algorithm stored as knowledge by the assembly element agent. It is a figure which shows, (C) is a figure which shows the calculation algorithm of the performance and distribution rate which the advertising element agent has memorized as knowledge. （Ａ）は販売エレメントエージェントが知識として記憶しているパフォーマンス及び分配率の算出アルゴリズムを示す図であり、（Ｂ）は統括エージェントが知識として記憶している報酬テーブルを示す図である。(A) is a diagram showing a performance and distribution rate calculation algorithm stored as knowledge by the sales element agent, and (B) is a diagram showing a reward table stored as knowledge by the general agent. シミュレーション強化学習処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the simulation reinforcement learning process. 統括エージェント強化学習処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the integrated agent reinforcement learning process. 資材調達エレメントエージェント強化学習処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the material procurement element agent reinforcement learning process. （Ａ）はクローラによる情報収集処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は資材調達ＤＢに記憶されている各種データを示す図である。(A) is a flowchart showing a subroutine program of information collection processing by a crawler, and (B) is a diagram showing various data stored in a material procurement DB. 組立エレメントエージェント強化学習処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the assembly element agent reinforcement learning process. 宣伝エレメントエージェント強化学習処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the promotion element agent reinforcement learning process. （Ａ）はクローラによる情報収集処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は宣伝ＤＢに記憶されている各種データを示す図である。(A) is a flowchart showing a subroutine program of information collection processing by a crawler, and (B) is a diagram showing various data stored in a promotion DB. 販売エレメントエージェント強化学習処理のサブルーチンプログラムを示すフローチャートである。It is a flowchart which shows the subroutine program of the sales element agent reinforcement learning process. （Ａ）は情報収集処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は販売ＤＢに記憶されている各種データを示す図である。(A) is a flowchart showing a subroutine program of information collection processing, and (B) is a diagram showing various data stored in a sales DB. （Ａ）は資材調達担当パーソナルＡＩ強化学習処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は組立担当パーソナルＡＩ強化学習処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of the personal AI reinforcement learning process in charge of material procurement, and (B) is a flowchart showing a subroutine program of the personal AI reinforcement learning process in charge of assembly. （Ａ）は宣伝担当パーソナルＡＩ強化学習処理のサブルーチンプログラムを示すフローチャートであり、（Ｂ）は販売担当パーソナルＡＩ強化学習処理のサブルーチンプログラムを示すフローチャートである。(A) is a flowchart showing a subroutine program of the personal AI reinforcement learning process in charge of advertising, and (B) is a flowchart showing a subroutine program of the personal AI reinforcement learning process in charge of sales. プログラムのインストール方法を示す説明図である。It is explanatory drawing which shows the installation method of a program.

[第１実施形態]
図１〜図１３に基づいて本発明の第１実施形態を説明する。まず、図１の全体システムを参照し、３種類のブロックチェーンネットワークであるプライベートチェーン２、コンソーシアムチェーン３及びパブリックチェーン４が集中型オラクル２１に接続されている。パブリックチェーン４は、完全にオープンな仕組みで、個人や団体が誰でもそこで取引することが可能である。取引はそのブロックチェーンで効果的に確認ができる。マイニング（記帳権の競争）も自由で誰でも参加できる。コンソーシアムチェーン３は、協会や組合に属しているパートナーのみが使うことができるブロックチェーンである。その中の人達（各ノード）は記帳者に指定される。ブロックの生成も事前に決定され、それ以外の他の人達（ノード）は、取引はできるが記帳権はない。プライベートチェーン２は、ブロックチェーン技術で記帳するのみで、記帳権はオープンではなく個人または企業が独占し、内部の取引だけを記録する。ブロックチェーン同士を繋いで各ブロックチェーン間でトークンやデータの交換を行うには、Polkadotを用いる。Polkadotとは、異なるブロックチェーンを繋ぐためのブロックチェーンである。Substrateを用いて開発されたブロックチェーンはPolkadotに接続することができ、Polkadotに接続することで、Polkadotに接続された他のブロックチェーンとトークンやデータの交換ができるようになる。[First Embodiment]
The first embodiment of the present invention will be described with reference to FIGS. 1 to 13. First, referring to the entire system of FIG. 1, three types of blockchain networks, a private chain 2, a consortium chain 3, and a public chain 4, are connected to the centralized oracle 21. Public chain 4 is a completely open mechanism, and any individual or group can trade there. Transactions can be effectively confirmed on the blockchain. Mining (competition of bookkeeping rights) is also free and anyone can participate. The consortium chain 3 is a blockchain that can be used only by partners belonging to associations and unions. The people (each node) among them are designated as bookkeepers. The generation of blocks is also decided in advance, and other people (nodes) can trade but do not have the right to book. Private chain 2 only records with blockchain technology, and the bookkeeping right is not open but monopolized by an individual or a company, and only internal transactions are recorded. Polkadot is used to connect blockchains and exchange tokens and data between blockchains. Polkadot is a blockchain for connecting different blockchains. Blockchains developed using Substrate can be connected to Polkadot, and by connecting to Polkadot, tokens and data can be exchanged with other blockchains connected to Polkadot.

集中型オラクル２１は、ブロックチェーンとインターネット１の間でデータの橋渡しをするシステムであり、インターネット１に接続されてネット上に散在している各種情報を収集してブロックチェーンのスマートコントラクトに情報提供を行う。 The centralized Oracle 21 is a system that bridges data between the blockchain and the Internet 1, collects various information scattered on the Internet connected to the Internet 1, and provides information to the blockchain smart contract. I do.

プライベートチェーン２、コンソーシアムチェーン３及びパブリックチェーン４の各ノード１９は、パーソナルコンピュータ（以下「ＰＣ」という）１６等のユーザ端末で構成されている。このＰＣ（以下「ユーザ端末」ともいう）１６がインターネット１に接続されている。インターネット１には、さらに、ＳＮＳ（Social Networking Service）４０のサーバ２０及びブロックチェーンの認定事業者１７のサーバ１８が接続されている。なお、認定事業者１７のサーバ１８は、ノード１９としてブロックチェーンに参加してもよい。また、ＰＫＩ（Public Key Infrastructure）での電子証明書を発行する認証局のサーバがインターネット１に接続されていてもよい。 Each node 19 of the private chain 2, the consortium chain 3, and the public chain 4 is composed of user terminals such as a personal computer (hereinafter referred to as "PC") 16. This PC (hereinafter, also referred to as "user terminal") 16 is connected to the Internet 1. Further, the server 20 of the SNS (Social Networking Service) 40 and the server 18 of the blockchain authorized business operator 17 are connected to the Internet 1. The server 18 of the certified business operator 17 may participate in the blockchain as a node 19. Further, the server of the certificate authority that issues the digital certificate in PKI (Public Key Infrastructure) may be connected to the Internet 1.

認定事業者１７は、個人情報を預かりその個人準情報主に電子ＩＤを発行すると共に、その個人情報のハッシュ値をブロックチェーンに記録する。預かった個人情報は個人情報データベース（以下「個人情報ＤＢ」という）２９に格納される。なお、認定事業者１７がノード１９としてブロックチェーンに参加してもよい。 The authorized business operator 17 keeps personal information, issues an electronic ID mainly to the personal quasi-information, and records the hash value of the personal information in the blockchain. The entrusted personal information is stored in the personal information database (hereinafter referred to as "personal information DB") 29. The certified business operator 17 may participate in the blockchain as a node 19.

ＰＣ１６は、制御中枢としてのＣＰＵ（Central Processing Unit）１０、ＣＰＵ１０のワークエリアとして機能するＲＡＭ（Random Access Memory）９、データやプログラムを記憶しているＲＯＭ（Read Only Memory）１１、ＨＤＤ（hard disk drive）１２などの記憶部、ディスプレイ、キーボード等の入力操作部７、通信部５、表示部６、インタフェース８、バス１３、その他種々のハードウェアによって構成される。サーバ２０及びサーバ１８等の各種サーバも、ＰＣ１６と同様のハードウェアによって構成されており、ここでは図示及び説明の繰り返しを省略する。なお、記憶部として、上記ＨＤＤに加えてまたはその代わりＳＤＤ（Solid State Drive）を用いてもよい。 The PC 16 includes a CPU (Central Processing Unit) 10 as a control center, a RAM (Random Access Memory) 9 that functions as a work area of the CPU 10, a ROM (Read Only Memory) 11 that stores data and programs, and an HDD (hard disk). It is composed of a storage unit such as drive) 12, an input operation unit 7 such as a display and a keyboard, a communication unit 5, a display unit 6, an interface 8, a bus 13, and various other hardware. Various servers such as the server 20 and the server 18 are also configured by the same hardware as the PC 16, and the repetition of illustration and description is omitted here. As the storage unit, an SDD (Solid State Drive) may be used in addition to or instead of the HDD.

コンソーシアムチェーン３のノード１９には、IoT（Internet of Things）用デバイス１４及びワイヤレスセンサネットワーク１５が接続されている。IoT用デバイス１４及びワイヤレスセンサネットワーク１５からのセンサ信号がノード１９に入力されると共に、ノード１９からIoT用デバイス１４の駆動信号が出力される。IoT用デバイス１４は、IoT用の各種センサやアクチュエータ等である。 An IoT (Internet of Things) device 14 and a wireless sensor network 15 are connected to the node 19 of the consortium chain 3. The sensor signals from the IoT device 14 and the wireless sensor network 15 are input to the node 19, and the drive signal of the IoT device 14 is output from the node 19. The IoT device 14 is various sensors, actuators, and the like for IoT.

ワイヤレスセンサネットワーク１５とは、複数のセンサ付無線端末を空間に散在させ、それらが協調して環境や物理的状況を採取することを可能とする無線ネットワークのことである。例えばエネルギーハーベスティングかM2Mあるいは電池などでセンサ装置を作り、例えば金属疲労の劣化等を圧力センサやゲージセンサで常時モニターしておき、その変化があると知らせる。主に橋梁やトンネル等の建造物に設置される。一般的に、複数のセンサノードとゲートウェイセンサノードとを含む。それらノードは、通常１個以上のセンサ、無線チップ、マイクロプロセッサ、電源（電池など）により構成される。ワイヤレスセンサネットワークは、通常、アドホック(ad hoc)機能と、各ノードから中枢ノードへデータを送るためのルーティング機能(routing algorithm)を持つ。つまり、ノード間の通信に障害がでると別の通信経路を自律的に再構築する機能がある。ノードがグループとして連携するため分散処理の要素もある。加えて、外部から電力供給を受けずに長期間動作する機能もあり、そのために省電力機能または自己発電機能を持つ。 The wireless sensor network 15 is a wireless network in which a plurality of wireless terminals with sensors are scattered in a space, and they can cooperate with each other to collect an environment or a physical condition. For example, a sensor device is made of energy harvesting, M2M, or a battery, and for example, deterioration of metal fatigue is constantly monitored by a pressure sensor or gauge sensor, and the change is notified. It is mainly installed in buildings such as bridges and tunnels. Generally, it includes a plurality of sensor nodes and a gateway sensor node. These nodes usually consist of one or more sensors, a wireless chip, a microprocessor, a power source (such as a battery). Wireless sensor networks typically have an ad hoc function and a routing algorithm for sending data from each node to a central node. In other words, there is a function to autonomously reconstruct another communication path when communication between nodes fails. Since the nodes cooperate as a group, there is also an element of distributed processing. In addition, there is also a function that operates for a long period of time without receiving power supply from the outside, and therefore has a power saving function or a self-power generation function.

本実施形態では、IoT用デバイス１４及びワイヤレスセンサネットワーク１５がノード１９を介してコンソーシアムチェーン３に接続されているが、ノード１９を介することなく、IoT用デバイス１４及びワイヤレスセンサネットワーク１５の一方または双方自体がコンソーシアムチェーン３のノード１９の一部となっていてもよい。 In the present embodiment, the IoT device 14 and the wireless sensor network 15 are connected to the consortium chain 3 via the node 19, but one or both of the IoT device 14 and the wireless sensor network 15 do not go through the node 19. It may itself be part of node 19 of the consortium chain 3.

次に図２（Ａ）を参照し、ＰＣ１６のＨＤＤ１２に記憶されている情報を説明する。ＨＤＤ１２には、ユーザの秘密鍵ＳＫ、公開鍵ＰＫ、共通鍵Ｋ１、トラップドア用共通鍵Ｋ２、ブロックチェーンでのユーザのアドレス、スマートコントラクト、トークン、人工知能（「ＡＩ（Artificial Intelligence）」ともいう）及びブロックチェーンデータ等が記憶されている。なお、ユーザは、自然人ばかりでなく法人をも含む広い概念である。 Next, the information stored in the HDD 12 of the PC 16 will be described with reference to FIG. 2 (A). The HDD 12 has a user's private key SK, a public key PK, a common key K1, a common key K2 for a trap door, a user's address in a blockchain, a smart contract, a token, and artificial intelligence (also referred to as "AI (Artificial Intelligence)". ) And blockchain data etc. are stored. The user is a broad concept that includes not only natural persons but also corporations.

秘密鍵ＳＫと公開鍵ＰＫは、ＰＫＩ(Public key Infratructure）で用いる鍵ペアであり、公開鍵ＰＫで暗号化されたデータを秘密鍵ＳＫで復号する。秘密鍵ＳＫは電子署名にも用いる。共通鍵Ｋ１は、例えばＤＥＳ（Data Encryption Standard）やＡＥＳ（Advanced Encryption Standard）等の共通鍵暗号に用いる鍵である。共通鍵Ｋ１により暗号化されたデータを同じ共通鍵Ｋ１を用いて復号する。本実施形態では、暗号化対象の個人情報毎に異なる共通鍵を用いている。第１実施形態では、暗号化個人情報Ｅ_Ｋ１(個人情報）を暗号文についてキーワード検索用の索引（インデックス）が設けられている。そのインデックスは共通鍵Ｋ２により暗号化されている。キーワード検索するためには、検索に用いるキーワード（検索クエリ）を共通鍵Ｋ２で暗号化した暗号化検索クエリ（これを「トラップドア」という）を用いて検索する。この共通鍵Ｋ２をトラップドア用共通鍵Ｋ２としてＨＤＤ１２に記憶している。The private key SK and the public key PK are a key pair used in PKI (Public key Infratructure), and the data encrypted by the public key PK is decrypted by the private key SK. The private key SK is also used for electronic signatures. The common key K1 is a key used for common key encryption such as DES (Data Encryption Standard) and AES (Advanced Encryption Standard). The data encrypted by the common key K1 is decrypted by using the same common key K1. In this embodiment, a different common key is used for each personal information to be encrypted. In the first embodiment, an index (index) for keyword search is provided for the encrypted personal information _{EK1 (personal information) in the ciphertext.} The index is encrypted with the common key K2. In order to perform a keyword search, an encrypted search query (this is called a "trapdoor") in which the keyword (search query) used for the search is encrypted with the common key K2 is used for the search. This common key K2 is stored in the HDD 12 as a trapdoor common key K2.

ブロックチェーンでのユーザのアドレスは、以下のプロセスを経て生成される。
１秘密鍵からECDSAで公開鍵を生成する。
２公開鍵をハッシュ関数SHA-256に通しハッシュ値を得る。
３そのハッシュ値をさらにハッシュ関数RIPEMD-160に通しハッシュ値を得る。
４ハッシュ値の先頭にプレフィックスとして００を加える。
５ハッシュ関数SHA-256に通す。
６もう一度ハッシュ関数SHA-256に通す。
７４バイトのチェックサムを一番後ろに加える。
８ Base58のフォーマットでエンコーディングする。The user's address on the blockchain is generated through the following process.
1 Generate a public key from the private key with ECDSA.
2 Pass the public key through the hash function SHA-256 to obtain the hash value.
3 The hash value is further passed through the hash function RIPEMD-160 to obtain the hash value.
4 Add 00 as a prefix to the beginning of the hash value.
5 Pass through the hash function SHA-256.
6 Pass it through the hash function SHA-256 again.
Add a 7-byte checksum to the end.
8 Encode in Base58 format.

スマートコントラクトとは、契約のスムーズな検証、条件確認、執行、実行、交渉を意図したコンピュータプロトコルである。トークンとは、企業や個人よりブロックチェーン上で発行された独自通貨のことである。 A smart contract is a computer protocol intended for smooth contract verification, condition confirmation, execution, execution, and negotiation. A token is a unique currency issued on the blockchain by a company or individual.

次に、ブロックチェーンデータを説明する。ブロックチェーンの各ブロック内のデータは、前のブロックのハッシュ値、ナンス及び複数の取引のデータ（トランザクションともいう）を含んでいる。また、図示を省略しているが、タイムスタンプもブロックチェーンに埋め込まれている。このようなブロックチェーンは、各ノード１９がブロックチェーン処理（後述するＳ３、Ｓ１９、Ｓ３０、Ｓ５１、Ｓ１１７、Ｓ１２２、Ｓ１５３等参照）を行うことにより生成されて新たなブロックチェーンとして追加される。ブロックチェーン処理は、主に、取引、伝搬及び記録の３つのフェーズからなる。 Next, blockchain data will be described. The data in each block of the blockchain contains the hash value, nonce and data of multiple transactions (also referred to as transactions) of the previous block. Also, although not shown, the time stamp is also embedded in the blockchain. Such a blockchain is generated by each node 19 performing blockchain processing (see S3, S19, S30, S51, S117, S122, S153, etc., which will be described later), and is added as a new blockchain. Blockchain processing mainly consists of three phases: transaction, propagation and recording.

取引のフェーズは、一般的にトランザクションといわれる行為であり、売買行為、譲渡行為、貸与（貸渡し）行為等の法律行為を意味する。この取引フェーズは、より具体的には、生成→署名→伝搬と３つのフェーズに分けることができる。 The transaction phase is an act generally called a transaction, and means a legal act such as a buying and selling act, a transfer act, and a lending (lending) act. More specifically, this transaction phase can be divided into three phases: generation → signature → propagation.

生成フェーズは、トランザクションの生成を行うことであり、例えば、Ａ氏がＢ氏に対し「休止状態のＰＣ資源（計算資源）を３９００５秒貸与して２５.７８トークンを得る。」と決め、トランザクションを生成したことを電子署名する。この電子著名は、トランザクションデータを所定のハッシュ関数を通すことによりハッシュ値を生成し、トランザクション（取引）の当事者（Ａ氏とＢ氏）の秘密鍵ＳＫを用いてそのハッシュ値を暗号化したものである。また、認証局によりデジタル公開鍵証明書を発行してもらってもよい。図２では、ＰＣ資源（計算資源）を貸与する例を示しているが、貸与対象はそれに限定されるものではなく、例えば、家庭や企業で自己発電した電力、ユーザの専門的知識経験やスキルや人脈（ネット上での人的ネットワークを含む）や信用等の価値、などが考えられる。 The generation phase is to generate a transaction. For example, Mr. A decides to Mr. B that "a dormant PC resource (computational resource) is lent for 39005 seconds to obtain 25.78 tokens." Digitally sign that the transaction was generated. In this electronic prominence, a hash value is generated by passing transaction data through a predetermined hash function, and the hash value is encrypted using the private key SK of the parties (Mr. A and Mr. B) of the transaction (transaction). Is. You may also have a certificate authority issue a digital public key certificate. Figure 2 shows an example of lending PC resources (computational resources), but the lending target is not limited to that, for example, electric power self-generated at home or company, user's specialized knowledge, experience and skills. And the value of personal connections (including human networks on the net) and credit.

伝搬フェーズは、正しくトランザクションの生成・署名が完了したことを、他のノードに確認してもらうことである。正しくトランザクションの生成・署名が行われなかったと判断された場合、そのトランザクションは破棄される The propagation phase is to have other nodes confirm that the transaction has been created and signed correctly. If it is determined that the transaction was not created / signed correctly, the transaction will be discarded.

記録フェーズは、正しくトランザクションの生成・署名が完了したことが確認された場合に、マイナーがマイニングを行って取引の記録を行うことである。正しく生成・署名が完了したことが確認されたトランザクションは、マイニングプールといわれる場所に移動する。その後、マイナーが、マイニングプールから記録するトランザクションを選んでマイニングを行う。 The recording phase is for the miner to perform mining and record the transaction when it is confirmed that the transaction has been generated and signed correctly. Transactions that have been successfully generated and signed are moved to a location called the mining pool. After that, the miner selects the transaction to be recorded from the mining pool and performs mining.

マイニングは、ナンスを算出する作業である。ナンスとは、ブロックのデータをハッシュ関数に通したときに先頭に０がたくさん並ぶような非常に小さなハッシュ値が生成されるように調整するための値のことである。ハッシュ値がターゲット値以下になるようなナンスを計算することができると新しいブロックが生成される。 Mining is the work of calculating nonce. The nonce is a value for adjusting so that a very small hash value such that many 0s are lined up at the beginning is generated when the block data is passed through the hash function. A new block is created if the nonce can be calculated so that the hash value is less than or equal to the target value.

なお、取引のデータは、図２の右側の取引Ｉに示すように、ユーザの個人情報を鍵Ｋ１で暗号化したＥ_Ｋ1(個人情報）のハッシュ値、その電子ＩＤ、及び個人情報のインデックスと個人情報の提供に対する対価（図２では２.４トークンで提供）とを鍵Ｋ２で暗号化したＥ_Ｋ２(インデックス＋２.４トークンで提供）も含んでいる。個人情報の具体例としては、ユーザの心拍数、血圧、体温、脳波等のバイタル情報、購買履歴やウェブサイト閲覧履歴等の行動履歴情報、ＧＰＳ等のユーザの位置情報、人種、信条、社会的身分、病歴、電子カルテデータ、ＩＤ（identification）、ＳＮＳ等への投稿情報等である。 _{The transaction data includes the hash value of EK1} (personal information) in which the user's personal information is encrypted with the key K1, its electronic ID, and the index of personal information, as shown in transaction I on the right side of FIG. and consideration _(provided by the index +2.4 token) E _K2 of the encrypted by key K2 (provided 2.4 token in FIG. 2) also comprise for providing personal information. Specific examples of personal information include vital information such as the user's heart rate, blood pressure, body temperature, and brain waves, behavior history information such as purchase history and website browsing history, user's location information such as GPS, race, creed, and society. Information such as personal status, medical history, electronic medical record data, ID (identification), and information posted on SNS.

ＳＮＳ等への投稿情報は、既にＳＮＳ２５に投稿されてサーバ２０に記憶されている過去の投稿情報をサーバ２０から個人情報ＤＢ２９及びブロックチェーンに移し替えたものである。具体的には、ユーザ自身が自身の過去の投稿情報を全て暗号化して認定事業者の個人情報ＤＢ２９に記憶させると共にそのハッシュ値をブロックチェーンに記録する。以降、ユーザは、ＳＮＳ２５に投稿するのではなく、投稿内容を認定事業者の個人情報ＤＢ２９に暗号化して記憶させると共にそのハッシュ値をブロックチェーンに記録する。これにより、ユーザは、個人情報をＳＮＳ等の事業者から取戻して自己管理下に置くことが可能になる。 The posted information to the SNS or the like is the past posted information already posted on the SNS 25 and stored in the server 20 transferred from the server 20 to the personal information DB 29 and the blockchain. Specifically, the user himself encrypts all the past posted information and stores it in the personal information DB 29 of the authorized business operator, and records the hash value in the blockchain. After that, the user does not post to the SNS 25, but encrypts and stores the posted content in the personal information DB 29 of the authorized business operator, and records the hash value in the blockchain. As a result, the user can retrieve personal information from a business operator such as SNS and put it under self-management.

なお、個人情報の提供に対する対価（図２では２.４トークンで提供）については、暗号化することなく平文のままブロックチェーンに記録してもよい。その場合には、他ユーザが暗号鍵Ｋ２を入手しなくてもブロックチェーンを検索して対価を知ることができる。さらに、個人情報の提供に対する対価やＰＣ資源（計算資源）の貸与の対価（図２ではＰＣ資源（計算資源）を３９００５秒貸与して２５.７８トークンを得る）等の取引条件を、スマートコントラクトとしてコードの形にし、スマートコントラクトにより取引（法律行為）の自動化を行うようにしてもよい。 The consideration for the provision of personal information (provided in 2.4 tokens in FIG. 2) may be recorded in plain text on the blockchain without being encrypted. In that case, even if another user does not obtain the encryption key K2, the blockchain can be searched and the consideration can be known. In addition, smart contracts include transaction conditions such as consideration for providing personal information and consideration for lending PC resources (computational resources) (in Fig. 2, lending PC resources (computational resources) for 39005 seconds to obtain 25.78 tokens). In the form of a code, a smart contract may be used to automate transactions (legal acts).

取引Ｉとして記録されるハッシュ値の対象である暗号化個人情報自体は、認定事業者１７の個人情報ＤＢ２９に記憶されている。具体的には、図２（Ｂ）に示すように、認定事業者１７が暗号化個人情報Ｅ_Ｋ１(個人情報）に対し発行した電子ＩＤに対応付けて暗号化個人情報Ｅ_Ｋ１(個人情報）が個人情報ＤＢ２９に記憶されている。The encrypted personal information itself, which is the target of the hash value recorded as transaction I, is stored in the personal information DB 29 of the authorized business operator 17. Specifically, as shown in FIG. 2 (B), the encrypted personal information _EK1 (personal information) is associated with the electronic ID issued by the _{authorized business operator 17 for the encrypted personal information EK1} (personal information). Is stored in the personal information DB 29.

インデックスとは、暗号化個人情報Ｅ_Ｋ１(個人情報）をキーワード検索するための索引のことである。本実施形態では、インデックスを共通鍵Ｋ２により暗号化する共通鍵暗号方式を採用しているため、キーワード検索するためには、検索に用いるキーワード（検索クエリ）を共通鍵Ｋ２で暗号化した暗号化検索クエリ（これを「トラップドア」という）を用いて検索する。共通鍵Ｋ２は、ユーザ毎に異なる鍵であるが、同じユーザの暗号化インデックスであれば同じ鍵が用いられる。よって、後述するスマートコントラクトにより、例えばユーザＡがユーザＢに対して共通鍵Ｋ２を配布するトランザクションを行えば、ユーザＢは、Ｅ_Ｋ２(検索クエリ）を用いてユーザＡの全ての暗号化インデックスをブロックチェーン上で検索可能となる。The index is an index _{for keyword-searching encrypted personal information EK1} (personal information). In the present embodiment, since the common key encryption method in which the index is encrypted by the common key K2 is adopted, in order to perform a keyword search, the keyword (search query) used for the search is encrypted by the common key K2. Search using a search query (this is called a "trap door"). The common key K2 is a different key for each user, but the same key is used if it is an encrypted index of the same user. Therefore, if, for example, user A performs a transaction to distribute the common key K2 to user B by a smart contract described later, user B uses _EK2 (search query) to perform all the encrypted indexes of user A. It will be searchable on the blockchain.

なお、暗号文を暗号化したまま検索できる準同型暗号や完全準同型暗号等の検索可能暗号を用いてもよい。その際、準同型暗号や完全準同型暗号を用いて個人情報を暗号化し、その暗号化個人情報をブロックチェーンに直接記録してもよい。また、暗号化個人情報Ｅ_Ｋ１(個人情報）を直接ブロックチェーンに記録するようにしてもよい。It should be noted that a searchable cipher such as a homomorphic encryption or a fully homomorphic encryption that can be searched while the ciphertext is encrypted may be used. At that time, personal information may be encrypted by using homomorphic encryption or fully homomorphic encryption, and the encrypted personal information may be recorded directly on the blockchain. Further, the encrypted personal information _EK1 (personal information) may be recorded directly on the blockchain.

次に、図３（Ａ）を参照して、パブリックチェーン１９のユーザ端末のメインルーチンプログラムのフローチャートを説明する。ステップＳ（以下単に「Ｓ」という）１により、個人情報記録処理が行われ、Ｓ２によりスマートコントラクト処理が行われ、Ｓ３によりブロックチェーン処理が行われる。 Next, a flowchart of the main routine program of the user terminal of the public chain 19 will be described with reference to FIG. 3 (A). The personal information recording process is performed by step S (hereinafter simply referred to as “S”) 1, the smart contract process is performed by S2, and the blockchain process is performed by S3.

個人情報記録処理とは、個人情報主が個人情報を暗号化して認定事業者１７に登録してその暗号化個人情報のハッシュ値をブロックチェーンに記録する処理である。スマートコントラクト処理とは、予め決められているルールに従って契約の締結及び執行等の法律行為を自動的に行う処理である。ブロックチェーン処理の具体的内容は、図２（Ａ）に基づいて前述したとおりである。 The personal information recording process is a process in which the personal information owner encrypts the personal information, registers it with the authorized business operator 17, and records the hash value of the encrypted personal information in the blockchain. The smart contract process is a process that automatically performs legal acts such as contract conclusion and execution in accordance with predetermined rules. The specific contents of the blockchain processing are as described above based on FIG. 2 (A).

図３（Ｂ）を参照して個人情報記録処理を説明する。Ｓ５により、パブリックチェーン１９のノード１９を構成しているユーザ端末において個人情報の登録操作があったか否か判定される。ない場合にはこの個人情報記録処理がリターンしてＳ２のスマートコントラクト処理に移行する。個人情報の登録操作があったと判定された場合には、Ｓ６により、ユーザ端末のメモリ（ＨＤＤ１２等）に記憶されている個人情報を鍵Ｋ１で暗号化した上で秘密鍵ＳＫで電子署名し、かつ、インデックスを鍵Ｋ２で暗号化して認定事業者１７のサーバ１８へ送信する。 The personal information recording process will be described with reference to FIG. 3 (B). According to S5, it is determined whether or not the personal information registration operation has been performed on the user terminals constituting the node 19 of the public chain 19. If not, this personal information recording process returns and shifts to the smart contract process of S2. When it is determined that the personal information has been registered, the personal information stored in the memory (HDD12, etc.) of the user terminal is encrypted with the key K1 by S6, and then digitally signed with the private key SK. At the same time, the index is encrypted with the key K2 and transmitted to the server 18 of the certified business operator 17.

それをＳ７で受信した認定事業者１７のサーバ１８は、電子ＩＤを発行すると共に受信した暗号化個人情報であるＥ_Ｋ１(個人情報）のハッシュ値を生成する。次に、発行した電子ＩＤをユーザ端末に返信する（Ｓ９）。それを受信したユーザ端末は、電子ＩＤをメモリ（ＨＤＤ１２等）に記憶する。認定事業者１７のサーバ１８は、Ｓ１０において、電子ＩＤとハッシュ値と暗号化インデックスとをブロックチェーンに記録するための処理を行う。The server 18 of the authorized business operator 17 that received it in S7 issues an electronic ID and generates a hash value of _{EK1 (personal information) that is the received encrypted personal information.} Next, the issued electronic ID is returned to the user terminal (S9). The user terminal that receives it stores the electronic ID in a memory (HDD 12, etc.). In S10, the server 18 of the certified business operator 17 performs a process for recording the electronic ID, the hash value, and the encrypted index on the blockchain.

Ｓ２に示されたスマートコントラクト処理のサブルーチンプログラムのフローチャートを説明する。図４を参照して、Ｓ１３において、共通鍵Ｋ２の配布契約が成立したか否か判定され、成立していない場合にはＳ１５において、ユーザ端末のＰＣ資源（計算資源）の貸与契約が成立したか否か判定され、成立していない場合にはＳ１６において個人情報の提供契約が成立したか否か判定され、成立していない場合にはＳ１７においてオーダーメイド商品等を発注する発注契約が成立したか否か判定され、成立していない場合にはＳ２２において商品等の売買契約が成立したか否か判定され、成立していない場合にはリターンする。これらの判定はスマートコントラクトによって行われる。例えば、前述した対価等をスマートコントラクトとしてコード化している場合には、当事者双方の対価等の条件が合致したか否かをスマートコントラクトが判定し、合致したと判定した場合に、契約の締結及び執行を自動的に行う。 The flowchart of the subroutine program of the smart contract processing shown in S2 will be described. With reference to FIG. 4, it is determined in S13 whether or not the distribution contract for the common key K2 has been established, and if not, in S15, the loan contract for the PC resources (computational resources) of the user terminal has been established. It is determined whether or not it is established, and if it is not established, it is determined whether or not the contract for providing personal information is established in S16, and if it is not established, an ordering contract for ordering a custom-made product or the like is established in S17. Whether or not it is determined, and if it is not established, it is determined in S22 whether or not the sales contract for the product or the like is established, and if it is not established, a return is made. These decisions are made by smart contracts. For example, when the above-mentioned consideration is coded as a smart contract, the smart contract determines whether or not the conditions such as consideration of both parties are met, and if it is determined that the conditions are met, the contract is concluded and the contract is concluded. Execution is performed automatically.

共通鍵Ｋ２の配布契約が成立したと判定された場合には制御がＳ１４へ進み、配布先に共通鍵Ｋ２が送信された後、制御がＳ１９へ進む。Ｓ１９では、その成立した契約をトランザクションとしてブロックチェーンに記録するための処理が行われる。ＰＣ資源（計算資源）の貸与契約が成立したと判定された場合には制御がＳ１８へ進み、ＰＣ資源（計算資源）の貸与処理が行われる。個人情報の提供契約が成立したと判定された場合には制御がＳ２０へ進み、提供する個人情報の電子ＩＤと個人情報の提供に合意する署名とを提供相手に返信すると共に、提供する個人情報の暗号化に使用した共通鍵Ｋ１を提供相手の公開鍵で暗号化して提供相手に返信する。 When it is determined that the distribution contract for the common key K2 has been established, the control proceeds to S14, and after the common key K2 is transmitted to the distribution destination, the control proceeds to S19. In S19, a process for recording the established contract as a transaction in the blockchain is performed. When it is determined that the PC resource (computational resource) loan contract has been established, the control proceeds to S18, and the PC resource (computational resource) loan process is performed. If it is determined that the personal information provision contract has been established, the control proceeds to S20, and the electronic ID of the personal information to be provided and the signature agreeing to provide the personal information are returned to the provider and the personal information to be provided is returned. The common key K1 used for the encryption of the above is encrypted with the public key of the provider and returned to the provider.

発注契約が成立したと判定された場合には制御がＳ２１へ進み、発注処理が行われた後にＳ１９へ進む。売買契約が成立したと判定された場合には制御がＳ２３へ進み、購入対象を入手する処理が実行されてＳ１９へ進む。 If it is determined that the ordering contract has been established, the control proceeds to S21, and after the ordering process is performed, the process proceeds to S19. When it is determined that the sales contract has been established, the control proceeds to S23, the process of obtaining the purchase target is executed, and the process proceeds to S19.

次に、プライベートチェーン２のノード１９を構成しているユーザ端末のメインルーチンプログラムのフローチャートを図５（Ａ）に基づいて説明する。Ｓ２８により個人情報検索処理が行われ、Ｓ２９によりスマートコントラクト処理が行われ、Ｓ３０によりブロックチェーン処理が行われ、Ｓ３１により機械学習処理が行われ、Ｓ３２によりＡＩスマートコントラクト生成処理が行われ、Ｓ３３によりスマートコントラクト信託請負い処理が行われる。ＡＩスマートコントラクトは、「一体化タイプ」と「連携タイプ」との両者を含む概念である。「一体化タイプ」とは、ＡＩとスマートコントラクトとが一体化し契約（法律行為）のデータに基づいて機械学習を行いスマートコントラクト自体をＡＩ化したものである。「連携タイプ」とは、契約（法律行為）のデータに基づいて機械学習を行ったＡＩとスマートコントラクトとが連携したものである。連携タイプの場合、学習済みＡＩ（以下「連携用ＡＩ」という）が状況に応じてスマートコントラクトの追加、変更及び更新等を行う。 Next, a flowchart of the main routine program of the user terminal constituting the node 19 of the private chain 2 will be described with reference to FIG. 5 (A). Personal information search processing is performed by S28, smart contract processing is performed by S29, blockchain processing is performed by S30, machine learning processing is performed by S31, AI smart contract generation processing is performed by S32, and AI smart contract generation processing is performed by S33. Smart contract trust contract processing is performed. AI smart contract is a concept that includes both "integrated type" and "cooperation type". The "integrated type" is an AI version of the smart contract itself by integrating the AI and the smart contract and performing machine learning based on the data of the contract (legal act). The "cooperation type" is a collaboration between an AI that has performed machine learning based on contract (legal act) data and a smart contract. In the case of the cooperation type, the learned AI (hereinafter referred to as "linkage AI") adds, changes, updates, etc. to the smart contract depending on the situation.

個人情報検索処理とは、ブロックチェーンに記録されている暗号化インデックスをトラップドア（暗号化検索クエリ）により検索する処理である。スマートコントラクト処理とは、予め決められているルールに従って契約の締結及び執行等の法律行為を自動的に行う処理である。ブロックチェーン処理の具体的内容は、図２（Ａ）に基づいて前述したとおりである。機械学習処理とは、多数のユーザの個人情報を学習用データとして機械学習して人工知能の学習済みモデルを生成する処理である。より具体的には、個人情報主を特定できない形の膨大な個人情報を学習用データとして機械学習して人工知能の一般的学習済みモデルを生成した後、個人情報主を特定できるデータ（例えばブロックチェーンにおけるアドレス）毎に分類された個人情報を用いて個人情報主毎（ブロックチェーンのアドレス毎）にパーソナライズされたパーソナライズド学習済みモデルを生成する処理である。 The personal information search process is a process of searching an encrypted index recorded in a blockchain by a trapdoor (encrypted search query). The smart contract process is a process that automatically performs legal acts such as contract conclusion and execution in accordance with predetermined rules. The specific contents of the blockchain processing are as described above based on FIG. 2 (A). The machine learning process is a process of generating a trained model of artificial intelligence by machine learning the personal information of a large number of users as learning data. More specifically, after generating a general learned model of artificial intelligence by machine learning a huge amount of personal information in a form that cannot identify the personal information owner as learning data, data that can identify the personal information owner (for example, block) This is a process of generating a personalized trained model that is personalized for each personal information owner (for each blockchain address) using personal information classified for each (address in the chain).

ＡＩスマートコントラクト生成処理とは、契約（法律行為）に関する個人情報を学習用データとして機械学習して人工知能によるスマートコントラクトの学習済みモデルを生成する処理である。より具体的には、個人情報主を特定できない形の膨大な契約（法律行為）に関する個人情報を学習用データとして機械学習して人工知能によるスマートコントラクトの一般的学習済みモデルを生成した後、個人情報主を特定できるデータ（例えばブロックチェーンにおけるアドレス）毎に分類された契約（法律行為）に関する個人情報を用いて個人情報主毎（例えばブロックチェーンにおけるアドレス毎）にパーソナライズされたパーソナライズドＡＩスマートコントラクト学習済みモデルを生成する処理である。 The AI smart contract generation process is a process of machine learning personal information about a contract (legal act) as learning data to generate a trained model of a smart contract by artificial intelligence. More specifically, after generating a general learned model of smart contract by artificial intelligence by machine learning personal information about a huge contract (legal act) in a form that can not identify the personal information owner as learning data, an individual Personalized AI smart contract personalized for each personal information owner (for example, for each address in the blockchain) using personal information about contracts (legal acts) classified for each data (for example, address in the blockchain) that can identify the information owner. This is a process to generate a trained model.

スマートコントラクト信託請負い処理とは、契約の締結及び執行等の法律行為を自動的に行う処理を本人から請負い本人に代わって実行するサービスを行う処理である。より具体的には、信託者用にパーソナライズされたパーソナライズドＡＩスマートコントラクト学習済みモデルを生成し、そのパーソナライズドＡＩスマートコントラクト学習済みモデルを用いて信託者に代わって法律行為を実行する。その実行の結果に基づいてＡＩ用の報酬を決め、その報酬によりパーソナライズドＡＩスマートコントラクト学習済みモデルをさらに強化学習させる。 The smart contract trust contract processing is a processing that provides a service that automatically performs legal acts such as contract conclusion and execution on behalf of the contractor. More specifically, it generates a personalized AI smart contract trained model for the trustee and uses the personalized AI smart contract trained model to perform legal acts on behalf of the trustee. The reward for AI is determined based on the result of the execution, and the personalized AI smart contract trained model is further strengthened and learned by the reward.

次に、図５（Ｂ）に基づいて、Ｓ２８に示された個人情報検索処理のサブルーチンプログラムのフローチャートを説明する。Ｓ３７により共通鍵Ｋ２の記憶があるか判定され、ない場合にはリターンする。後述のＳ４５によりＫ２が記憶されている場合には、Ｓ３７により共通鍵Ｋ２の記憶があると判定されて制御がＳ３８へ進む。Ｓ３８では、Ｋ２で暗号化した検索クエリ（トラップドア）でブロックチェーン上の暗号化インデックスを検索する処理が行われる。その検索を行なった結果、入手したい個人情報があるか否かがＳ３９により判定される。入手したい個人情報がないと判定された場合にはリターンするが、入手したい個人情報があると判定された場合にはＳ４０により、その入手希望個人情報の電子ＩＤが記憶される。 Next, a flowchart of the subroutine program for the personal information search process shown in S28 will be described with reference to FIG. 5 (B). It is determined by S37 whether or not the common key K2 is stored, and if not, it returns. When K2 is stored by S45 described later, it is determined by S37 that the common key K2 is stored, and the control proceeds to S38. In S38, a process of searching the encrypted index on the blockchain with a search query (trapdoor) encrypted with K2 is performed. As a result of the search, it is determined by S39 whether or not there is personal information to be obtained. If it is determined that there is no personal information that is desired to be obtained, a return is made, but if it is determined that there is personal information that is desired to be obtained, the electronic ID of the personal information that is desired to be obtained is stored in S40.

次に、図６及び図７（Ａ）に基づいて、Ｓ２９に示されたスマートコントラクト処理のサブルーチンプログラムのフローチャートを説明する。Ｓ４２により検索希望個人情報があるか否か判定される。この判定は、例えば、未だ検索していない個人情報主のスマートコントラクトと順次交渉して条件が合致した場合にその個人情報主の個人情報を検索希望個人情報と判定する。Ｓ４２により検索希望個人情報がないと判定された場合は、Ｓ４６により入手希望個人情報の記憶があるか否か判定され、ないと判定された場合には制御が図７（Ａ）のＳ５５へ進み、ＰＣ資源（計算資源）の貸与契約が成立しているか否か判定され、成立していないと判定された場合にはＳ５６により発注契約が成立しているか否か判定され、成立していないと判定された場合にはＳ５７により売買契約が成立しているか否か判定され、成立していないと判定された場合にはリターンする。 Next, a flowchart of the smart contract processing subroutine program shown in S29 will be described with reference to FIGS. 6 and 7 (A). It is determined by S42 whether or not there is personal information desired to be searched. In this determination, for example, when the conditions are met by sequentially negotiating with the smart contract of the personal information owner who has not been searched yet, the personal information of the personal information owner is determined as the personal information desired to be searched. If it is determined by S42 that there is no personal information desired to be searched, it is determined by S46 whether or not the personal information desired to be obtained is stored, and if it is determined that there is no personal information desired to be obtained, control proceeds to S55 in FIG. 7 (A). , It is determined whether or not the PC resource (computational resource) loan contract has been established, and if it is determined that it has not been established, it is determined by S56 whether or not the ordering contract has been established, and if it has not been established, it is determined. If it is determined, it is determined by S57 whether or not the sales contract has been established, and if it is determined that the sales contract has not been established, a return is made.

Ｓ４２により検索希望個人情報があると判定された場合には制御がＳ４３へ進み、共通鍵Ｋ２を個人情報主に要求する。具体的には、検索希望個人情報の個人情報主のブロックチェーン上でのアドレス宛てに自身のアドレス及び属性証明書（Attribute Certificate）を送信して共通鍵Ｋ２を要求する。Ｓ４４によりＫ２の返信があったか否か判定し、あるまで待機する。個人情報主または個人情報主のスマートコントラクトは、送信されてきた属性証明書を確認してＫ２を返信してよいか否か判定し、返信して良いと判定した場合にＫ２を返信する。その個人情報主からＫ２の返信があった段階で制御がＳ４５へ進み、返信されたＫ２を記憶した後、制御がＳ５４へ移行する。Ｓ５４では、その成立した契約をトランザクションとしてブロックチェーンに記憶する処理がなされる。この場合は、返信した個人情報主のアドレスから返信を受信したユーザのアドレスへインデックスの暗号化に用いている共通鍵Ｋ２が配布された旨の契約がブロックチェーンに記憶される。 When it is determined by S42 that there is personal information desired to be searched, the control proceeds to S43 and requests the common key K2 from the personal information main. Specifically, it sends its own address and an Attribute Certificate to the address on the blockchain of the personal information owner of the personal information desired to be searched, and requests the common key K2. It is determined by S44 whether or not there is a reply of K2, and the process waits until there is a reply. The personal information owner or the smart contract of the personal information owner confirms the sent attribute certificate, determines whether or not K2 can be returned, and returns K2 when it is determined that the reply is acceptable. When K2 is returned from the personal information owner, the control proceeds to S45, and after storing the returned K2, the control shifts to S54. In S54, a process of storing the established contract as a transaction in the blockchain is performed. In this case, the contract that the common key K2 used for encrypting the index is distributed from the address of the personal information owner who replied to the address of the user who received the reply is stored in the blockchain.

Ｓ４６により入手希望個人情報の記憶があると判定された場合は制御がＳ４７へ進み、入手希望個人情報を個人情報主に要求する処理が成される。具体的には、入手希望個人情報の個人情報主のブロックチェーン上でのアドレス宛てに自身のアドレス及び属性証明書を送信して入手希望個人情報を要求する。それを受けたパブリックチェーンのユーザ端末では、スマートコントラクトが属性証明書を確認すると共に個人情報の提供条件が合致するか否か判定し、個人情報を提供してよいと判定した場合に（Ｓ１６でＹＥＳ）、提供する個人情報の電子ＩＤ及び個人情報の提供に合意する署名を返信すると共に、提供する個人情報の暗号化に使用した共通鍵Ｋ１を提供相手の公開鍵で暗号化して返信する（Ｓ２０参照）。 When it is determined by S46 that the personal information desired to be obtained is stored, the control proceeds to S47, and a process of requesting the personal information desired to be obtained from the personal information is performed. Specifically, the personal information desired to be obtained is requested by sending its own address and attribute certificate to the address on the blockchain of the owner of the personal information desired to be obtained. In the public chain user terminal that receives this, when the smart contract confirms the attribute certificate, determines whether or not the conditions for providing personal information are met, and determines that the personal information may be provided (in S16). YES), the electronic ID of the personal information to be provided and the signature agreeing to provide the personal information are returned, and the common key K1 used for encrypting the personal information to be provided is encrypted with the public key of the provider and returned (YES). See S20).

その返信があった場合には、Ｓ４８により個人情報主からの返信があったと判定されて制御がＳ４９に進み、返信されてきた暗号化共通鍵であるＥ_ＰＫ(Ｋ１）を自身の秘密鍵ＳＫで復号化する演算、すなわち、Ｄ_ＳＫ（Ｅ_ＰＫ(Ｋ１））を行ってＫ１を算出する。次に、Ｓ５０により、個人情報主から返信された電子ＩＤと署名を認定事業者１７のサーバ１８へ送信する処理が行われる。それを受けた認定事業者１７のサーバ１８は、送信されてきた署名を確認した上で、受信した電子ＩＤに基づいて個人情報ＤＢ２９（図２（Ｂ）参照）を検索し、受信した電子ＩＤに対応付けて記憶している暗号化個人情報Ｅ_Ｋ１(個人情報）を読出して返信する。When the reply is received, it is determined by S48 that there is a reply from the personal information owner, the control proceeds to S49, and the returned encryption common key _EPC (K1) is used as the own private key SK. The operation of decoding with, that is, _DSK (E _PK (K1)) is performed to calculate K1. Next, S50 performs a process of transmitting the electronic ID and signature returned from the personal information owner to the server 18 of the authorized business operator 17. Upon receiving this, the server 18 of the authorized business operator 17 confirms the transmitted signature, searches the personal information DB 29 (see FIG. 2B) based on the received electronic ID, and receives the received electronic ID. The encrypted personal information _EK1 (personal information) stored in association with is read and returned.

その返信を受信すればＳ５１によりＹＥＳと判定されて制御がＳ５２へ進み、その返信された暗号化個人情報であるＥ_Ｋ１(個人情報）をＳ４９で算出されたＫ１により復号化する演算、すなわち、Ｄ_Ｋ１(Ｅ_Ｋ１(個人情報）)を行って平文の個人情報を得る。その個人情報をＳ５３により記憶する。その後Ｓ５４に進み、その個人情報の提供契約をトランザクションとしてブロックチェーンに記録するための処理がなされる。If the reply is received, it is determined as YES by S51, the control proceeds to S52, and the returned encrypted personal information _EK1 (personal information) is decrypted by K1 calculated in S49, that is, _{Perform DK1} ( _EK1 (personal information)) to obtain plaintext personal information. The personal information is stored in S53. After that, the process proceeds to S54, and a process for recording the personal information provision contract as a transaction in the blockchain is performed.

次に、パブリックチェーンのユーザ端末との間でＰＣ資源（計算資源）の貸与契約が成立すれば（Ｓ１５でＹＥＳ）、Ｓ５５によりＹＥＳと判定されて制御がＳ５８へ進み、ＰＣ資源（計算資源）の借用処理が行われる。その後制御がＳ５４へ移行し、その貸借契約をトランザクションとしてブロックチェーンに記録するための処理がなされる。パブリックチェーンのユーザ端末との間で発注契約が成立すれば（Ｓ１７でＹＥＳ）、Ｓ５６により発注契約が成立したと判定されて制御がＳ５９へ進み、その発注に対する受注を記憶する処理が行われる。その後制御がＳ５４へ移行し、その受注契約をトランザクションとしてブロックチェーンに記録するための処理がなされる。 Next, if a PC resource (computational resource) loan contract is concluded with the user terminal of the public chain (YES in S15), it is determined as YES by S55 and the control proceeds to S58, and the PC resource (computational resource) Borrowing process is performed. After that, the control shifts to S54, and a process for recording the lease contract as a transaction in the blockchain is performed. If an ordering contract is established with the user terminal of the public chain (YES in S17), it is determined by S56 that the ordering contract has been established, control proceeds to S59, and a process of storing the order for the order is performed. After that, the control shifts to S54, and a process for recording the order contract as a transaction in the blockchain is performed.

パブリックチェーンのユーザ端末との間で売買契約が成立すれば（Ｓ１５でＹＥＳ）、Ｓ５７によりＰＣ資源（計算資源）の貸与契約が成立したと判定されて制御がＳ６０へ進み、販売対象を提供するための処理が行われる。その後制御がＳ５４へ移行し、その売買契約をトランザクションとしてブロックチェーンに記録するための処理がなされる。なお、以上説明したスマートコントラクト処理において、スマートコントラクトに基づいたプライベートチェーンユーザ端末１６での判断（例えば、Ｓ４２、Ｓ４６、Ｓ５５、Ｓ５６、Ｓ５７）のみに基づいて行為（例えば、Ｓ４２、Ｓ４７、Ｓ５８、Ｓ５６、Ｓ６０）を執行するのではなく、当該スマートコントラクトの所有者の同意を得た上で行為を執行するようにしてもよい。この所有者の同意は、以降記載しているスマートコントラクトによる契約の実行に際しても行なうようにしてもよい。 If a sales contract is concluded with a user terminal of a public chain (YES in S15), it is determined by S57 that a PC resource (computational resource) loan contract has been concluded, control proceeds to S60, and a sales target is provided. Processing is performed. After that, the control shifts to S54, and a process for recording the sales contract as a transaction in the blockchain is performed. In the smart contract processing described above, actions (for example, S42, S47, S58, S42, S47, S58, S42, S47, S58, S57) are based only on the judgment (for example, S42, S46, S55, S56, S57) of the private chain user terminal 16 based on the smart contract. Instead of executing S56, S60), the act may be executed with the consent of the owner of the smart contract. This owner's consent may also be given when the contract is executed by the smart contract described below.

次に、図７（Ｂ）に基づいて、Ｓ３１に示された機械学習処理のサブルーチンプログラムのフローチャートを説明する。Ｓ６３により、個人情報主を特定できない形で記憶している膨大な個人情報を学習用データにする処理が行われる。この機械学習に採用されている学習アルゴリズムとしては、例えば、教師あり学習としての回帰や識別、教師なし学習としてのモデル推定やデータマイニング、中間的手法としての強化学習や深層学習等、種々のものが用意されている。 Next, a flowchart of the machine learning processing subroutine program shown in S31 will be described with reference to FIG. 7B. According to S63, a process of converting a huge amount of personal information stored in a form in which the personal information owner cannot be identified into learning data is performed. Various learning algorithms adopted in this machine learning include, for example, regression and identification as supervised learning, model estimation and data mining as unsupervised learning, reinforcement learning and deep learning as intermediate methods, and the like. Is prepared.

次に、Ｓ６４により、借用中のＰＣ資源（計算資源）を利用して学習用データによる機械学習を行う。例えば、教師あり学習としての回帰の場合は、入力情報（ベクトルｘ）と正解情報ｙとからなる大量のデータセットを訓練データ（学習用データ）にする。教師あり学習の場合、入力ｘを正解ｙに写像する関数ｃi（ｘ）（ci：ｘ→ｙ）を学習するものであるため、学習済みモデルは関数ｃiを含むものとなる。なお、機械学習手段３４が行う機械学習は、教師あり学習に限定されるものではなく、モデル推定やパターンマイニング（データマイニング）等の教師なし学習、教師あり学習と教師なし学習との中間的手法である半教師あり学習や強化学習や深層学習等、どのようなものであってもよい。 Next, according to S64, machine learning using learning data is performed using the borrowed PC resources (computational resources). For example, in the case of regression as supervised learning, a large amount of data set consisting of input information (vector x) and correct answer information y is used as training data (learning data). In the case of supervised learning, since the function ci (x) (ci: x → y) that maps the input x to the correct answer y is learned, the trained model includes the function ci. The machine learning performed by the machine learning means 34 is not limited to supervised learning, but is an intermediate method between unsupervised learning such as model estimation and pattern mining (data mining), supervised learning and unsupervised learning. It may be any kind of learning with semi-teacher, intensive learning, deep learning, etc.

Ｓ６４による機械学習によって一般的学習済みモデルが生成され、それを記憶する（Ｓ６５）。この一般的学習済みモデルは、多数のユーザの個人情報であって個人情報主を特定できない形で記憶している膨大な個人情報を学習用データとしたモデルであり、多数のユーザに広く適用できる平均的な学習済みモデルである。 Machine learning by S64 generates a general trained model and stores it (S65). This general learned model is a model in which a huge amount of personal information, which is personal information of a large number of users and is stored in a form in which the personal information owner cannot be identified, is used as learning data, and can be widely applied to a large number of users. It is an average trained model.

次に、Ｓ６６により受注記憶があるか否か判定され、ない場合にはリターンするが、ある場合にはＳ６７により、人工知能の受注か否か判定される。人工知能の受注でない場合にはリターンするが、人工知能の受注である場合には、Ｓ６８により、発注者のアドレス宛てに個人情報を要求する処理がなされる。発注者から個人情報の返信があればＳ６９によりＹＥＳと判定されてＳ７０に進む。Ｓ７０では、返信された個人情報に基づいて一般的学習済みモデルをパーソナライズしてパーソナライズド学習済みモデルを生成する処理がなされる。この一般的学習済みモデルをパーソナライズしてパーソナライズド学習済みモデルを生成する処理については、特許第６４３２８５９号公報に記載されている。パーソナライズに必要な個人情報は、ブロックチェーンを利用して収集されるため、ブロックチェーンの匿名性を担保して収集されることにより、プライバシーの問題を極力回避できる利点がある。次に、Ｓ７１により、発注者のアドレス宛てにパーソナライズド学習済みモデルが送信される。 Next, S66 determines whether or not there is an order storage, and if there is no order memory, it returns, but if there is, S67 determines whether or not the order is for artificial intelligence. If it is not an order for artificial intelligence, it will be returned, but if it is an order for artificial intelligence, S68 will perform a process of requesting personal information to the address of the ordering party. If there is a reply of personal information from the orderer, it is determined as YES by S69 and the process proceeds to S70. In S70, a process of personalizing a general trained model based on the returned personal information to generate a personalized trained model is performed. A process of personalizing this general trained model to generate a personalized trained model is described in Japanese Patent No. 6432859. Since personal information required for personalization is collected using the blockchain, there is an advantage that privacy problems can be avoided as much as possible by collecting the personal information while ensuring the anonymity of the blockchain. Next, S71 transmits the personalized trained model to the address of the ordering party.

次に、図８に基づいて、Ｓ３２に示されたＡＩスマートコントラクト生成処理のサブルーチンプログラムのフローチャートを説明する。Ｓ８０により、個人情報主を特定できない形で記憶している膨大な個人情報の中から契約（法律行為）に関する個人情報を抽出して学習用データにする処理がなされる。次に、Ｓ８１により、借用中のＰＣ資源（計算資源）を利用して学習用データによる機械学習が行われる。Ｓ８０及びＳ８１の処理は、前述のＳ６３及びＳ６４で説明したものと同様であり、ここでは説明の繰り返しを省略する。 Next, a flowchart of the subroutine program for the AI smart contract generation process shown in S32 will be described with reference to FIG. According to S80, a process is performed in which personal information related to a contract (legal act) is extracted from a huge amount of personal information stored in a form in which the personal information owner cannot be identified and used as learning data. Next, according to S81, machine learning using learning data is performed using the borrowed PC resources (computational resources). The processing of S80 and S81 is the same as that described in S63 and S64 described above, and the repetition of the description is omitted here.

次に、Ｓ８２により、一般的ＡＩスマートコントラクトの学習済みモデルを生成して記憶する処理が行われる。次に、Ｓ８３により、シミュレーション学習処理が実行される。このシミュレーション学習処理は、本来多数の人間同士が行っている契約の検証、条件確認、執行、実行、交渉等の法律行為を、多数のＡＩスマートコントラクト群に肩代わりさせてコンピュータ内で仮想的に実行（シミュレーション）させ、各ＡＩスマートコントラクトに対し成果に応じた報酬を与えて強化学習を行うものである。現実世界での強化学習ではなくコンピュータ内でのシミュレーションによる強化学習のため、短時間に膨大な強化学習が可能となる利点がある。強化学習とは、ある環境の状態に置かれたエージェントが、行動を選択したときに与えられる報酬をもとに、初期状態からゴールまでの累積報酬を最大化するような方策を獲得する仕組みのことである。強化学習ではＡＩの一種であるソフトウェアエージェント（以下「エージェント」という）と環境が相互作用することで学習を進めていく。ここにエージェントとはＡＩの一種であり、ユーザやソフトウェアなどと通信しながら自らがある程度の判断能力を持って自律的にふるまい永続的に活動するソフトウェアのことである。エージェントが環境に対して或る行為ａを行うことによりその環境の状態ｓが変化し或る目的状態に達することにより報酬ｒがエージェントに与えられる。エージェントは、この報酬ｒを最大化することを目的として状態ｓを入力として行為ａを出力する関数を学習する。 Next, S82 performs a process of generating and storing a trained model of a general AI smart contract. Next, the simulation learning process is executed by S83. This simulation learning process virtually executes legal acts such as contract verification, condition confirmation, execution, execution, and negotiation, which are originally performed by a large number of human beings, on behalf of a large number of AI smart contract groups in a computer. (Simulation) is performed, and each AI smart contract is rewarded according to the result to perform reinforcement learning. Since reinforcement learning is performed by simulation in a computer instead of reinforcement learning in the real world, there is an advantage that a huge amount of reinforcement learning can be performed in a short time. Reinforcement learning is a mechanism in which an agent placed in a certain environment obtains a measure that maximizes the cumulative reward from the initial state to the goal based on the reward given when an action is selected. That is. In reinforcement learning, learning proceeds by interacting with a software agent (hereinafter referred to as "agent"), which is a type of AI, and the environment. Here, an agent is a kind of AI, and is software that behaves autonomously and acts permanently with a certain degree of judgment ability while communicating with users and software. When the agent performs a certain action a on the environment, the state s of the environment changes, and when a certain target state is reached, the reward r is given to the agent. The agent learns a function that outputs an action a with a state s as an input for the purpose of maximizing the reward r.

強化学習は次の単純なステップを繰り返すことで時間が進行していく。
１エージェントは環境から受け取った観測o（あるいは直接、環境の状態s）を受け取り、方策πに基いて環境に行為aを返す。
２環境はエージェントから受け取った行為aと現在の状態sに基いて、次の状態s′に変化し、その遷移に基いて次の観測o′と、報酬rと呼ばれる直前の行動の良し悪しを示す1つの数（スカラー量）をエージェントに返す。
３時間の進行：t←t+1
ここで←は代入操作を表す。
強化学習として、例えば、アルファゼロ型強化学習アルゴリズムを用いてもよい。このアルファゼロ型強化学習アルゴリズムは、ＤＱＮ(Deep Q-Network)などのアルゴリズムとことなり、探索にモンテカルロ木探索（Monte Carlo tree search,MCTS）を用い、価値（Value）と方策（Policy）をすべてニューラルネットワークに予測させ、木探索によるセルフプレイで得られた経験のみで予測を修正する構成になっている。従来のアルファ碁に比べ、価値を予測するバリューネットワークと方策を予測するポリシーネットワークが１つのニューラルネットワークに統合され、マルチタスク学習により予測精度の向上が図られている。また、ニューラルネットワークの性能向上により、木探索でのプロセッサレイアウト（報酬を貰うまで探索木を伸ばす）処理が不要となり、より高速に探索を行える。さらに、進化計算(evolutionary computation)、遺伝的アルゴリズム（genetic algorithm）、敵対的生成ネットワーク（Generative adversarial networks）を用いるようにしてもよい。Reinforcement learning progresses in time by repeating the following simple steps.
1 The agent receives the observation o (or directly, the state s of the environment) received from the environment, and returns the action a to the environment based on the policy π.
2 The environment changes to the next state s'based on the action a received from the agent and the current state s, and based on the transition, the next observation o'and the good or bad of the immediately preceding action called reward r. Returns the indicated number (scalar amount) to the agent.
3 hours progress: t ← t + 1
Here, ← represents an assignment operation.
As the reinforcement learning, for example, an alpha zero type reinforcement learning algorithm may be used. This alpha zero type reinforcement learning algorithm is different from algorithms such as DQN (Deep Q-Network), and Monte Carlo tree search (MCTS) is used for the search, and all values and policies are used. The configuration is such that a neural network is made to make a prediction, and the prediction is corrected only by the experience obtained by self-play by tree search. Compared to the conventional AlphaGo, the value network that predicts the value and the policy network that predicts the policy are integrated into one neural network, and the prediction accuracy is improved by multi-task learning. In addition, due to the improved performance of the neural network, the processor layout (extending the search tree until a reward is received) processing in the tree search becomes unnecessary, and the search can be performed at a higher speed. In addition, evolutionary computation, genetic algorithms, and Generative adversarial networks may be used.

次にＳ８４により、ＡＩスマートコントラクトの受注記憶があるか否か判定され、ない場合にはリターンする。前述のＳ５９により受注が記憶されている場合にはＳ８４によりＹＥＳと判定されて制御がＳ８５へ進み、記憶されている受注がシミュレーション学習済みＡＩスマートコントラクトの受注であるか否か判定される。シミュレーション学習済みＡＩスマートコントラクトの受注でない場合には、パーソナライズドＡＩスマートコントラクト学習済みモデルの受注であり、その場合には制御がＳ８６へ進み、発注者のアドレス宛てに契約（法律行為）に関する個人情報を要求する処理が行われる。発注者から個人情報の返信があった段階でＳ８７によりＹＥＳと判定されて制御がＳ８８へ進む。 Next, S84 determines whether or not there is an order storage for the AI smart contract, and if not, returns. When the order is stored in S59 described above, YES is determined in S84, the control proceeds to S85, and it is determined whether or not the stored order is an order for the simulation-learned AI smart contract. If it is not an order for a simulation-learned AI smart contract, it is an order for a personalized AI smart contract-learned model. In that case, control proceeds to S86 and personal information about the contract (legal act) is addressed to the orderer's address. Is performed. When the orderer returns the personal information, S87 determines YES and the control proceeds to S88.

Ｓ８８では、返信された契約（法律行為）に関する個人情報に基づいて一般的ＡＩスマートコントラクトの学習済みモデルをパーソナライズしてパーソナライズドＡＩスマートコントラクト学習済みモデルを生成する処理が行われる。この一般的ＡＩスマートコントラクトの学習済みモデルをパーソナライズしてパーソナライズドＡＩスマートコントラクト学習済みモデルを生成する処理については、特許第６４３２８５９号公報に記載されている。そのパーソナライズド学習済みモデルがＳ８９により発注者のアドレスへ送信される。 In S88, a process of personalizing a trained model of a general AI smart contract based on personal information regarding the returned contract (legal act) to generate a personalized AI smart contract trained model is performed. A process of personalizing a trained model of this general AI smart contract to generate a personalized AI smart contract trained model is described in Japanese Patent No. 6432859. The personalized trained model is transmitted by S89 to the orderer's address.

一方、シミュレーション学習済みＡＩスマートコントラクトの受注の場合にはＳ８５によりＹＥＳと判定されて制御がＳ９０へ進み、シミュレーション学習済みＡＩスマートコントラクトが発注者のアドレスへ送信される。 On the other hand, in the case of receiving an order for the simulation-learned AI smart contract, S85 determines YES, the control proceeds to S90, and the simulation-learned AI smart contract is transmitted to the orderer's address.

次に、図９（Ａ）に基づいて、Ｓ８３に示されたシミュレーション学習処理のサブルーチンプログラムのフローチャートを説明する。Ｓ３３４により、シミュレーションの入力があったか否か判定され、ない場合にはリターンする。このシミュレーションは、プライベートチェーン２のユーザ端末によって入力されるものであり、例えば、政府が採用しようとしている政策や法律（例えば、消費増税に伴う軽減税率、改正出入国管理法、イギリスのＥＵ（European Union）からの離脱、ベーシックインカムの部分的または全面的採用、日本国憲法９条の改正等）が採用されたと仮定した場合における、株取引や先物取引等の投資市場での取引シミュレーション、会社経営シミュレーション、または消費行動シミュレーション等である。さらには、新商品（金融商品や生命保険を含む）や新サービスの各種メディアによるプロモーションのシミュレーション等でもよい。Ｓ３３４によりシミュレーションの入力があったと判定されれば制御がＳ３３５に進み、ＡＩスマートコントラクト群生成処理が実行される。 Next, a flowchart of the subroutine program for the simulation learning process shown in S83 will be described with reference to FIG. 9A. According to S334, it is determined whether or not there is a simulation input, and if not, a return is made. This simulation is input by a user terminal of private chain 2, for example, policies and laws that the government is trying to adopt (for example, reduced tax rate due to consumption tax increase, revised immigration law, EU (European Union) in the United Kingdom). ), Partial or full adoption of basic income, amendment of Article 9 of the Japanese Constitution, etc.), transaction simulation in the investment market such as stock trading and futures trading, company management simulation , Or consumption behavior simulation, etc. Furthermore, promotion simulations of new products (including financial products and life insurance) and new services by various media may be used. If it is determined by S334 that the simulation has been input, the control proceeds to S335, and the AI smart contract group generation process is executed.

このＡＩスマートコントラクト群生成処理のサブルーチンプログラムのフローチャートを図９（Ｂ）に基づいて説明する。Ｓ３４４により、借用中のＰＣ資源（計算資源）を利用して、入力されたシミュレーションにマッチするペルソナ群を設定する処理が行われる。ペルソナとは、一般的には、企業や商品やサービスの典型的なターゲットとなる人物像を仮想の人物として定義したものである。本実施形態では、ペルソナとは、シミュレーション内容がターゲットとする典型的な人物像を仮想の人物として定義したものである。例えば、前述の消費増税に伴う軽減税率下での消費行動シミュレーションの場合には、一般消費者に相当するペルソナであって、性別、年代別、地域別、年収別等にグルーピングした各グループ毎にペルソナとして設定する。 The flowchart of the subroutine program of the AI smart contract group generation processing will be described with reference to FIG. 9B. According to S344, a process of setting a persona group matching the input simulation is performed by using the borrowed PC resource (computational resource). A persona is generally defined as a virtual person who is a typical target person for a company, product, or service. In the present embodiment, the persona defines a typical person image targeted by the simulation content as a virtual person. For example, in the case of the consumption behavior simulation under the reduced tax rate accompanying the consumption tax increase mentioned above, the persona is equivalent to a general consumer, and each group is grouped by gender, age group, region, annual income, etc. Set as a persona.

設定されるペルソナの数は、グループに属するユーザ数に比例した数にする。例えば、一般消費者の年代別人口分布が１０代５％、２０代５％、３０代１０％、４０代１０％、５０代２０％、６０代２０％、７０代２０％、８０代５％、９０代５％、の場合に、１０代を代表するペルソナ数が１、２０代を代表するペルソナ数が１、３０代を代表するペルソナ数が２、４０代を代表するペルソナ数が２、５０代を代表するペルソナ数が４、６０代を代表するペルソナ数が４、７０代を代表するペルソナ数が４、８０代を代表するペルソナ数が１、９０代を代表するペルソナ数が１と、設定する。 The number of personas to be set is proportional to the number of users belonging to the group. For example, the population distribution of general consumers by age group is 5% in their teens, 5% in their 20s, 10% in their 30s, 10% in their 40s, 20% in their 50s, 20% in their 60s, 20% in their 70s, and 5% in their 80s. In the case of 5% in their 90s, the number of personas representing teens is 1, the number of personas representing 20s is 1, the number of personas representing 30s is 2, and the number of personas representing 40s is 2. The number of personas representing the 50s is 4, the number of personas representing the 60s is 4, the number of personas representing the 70s is 4, the number of personas representing the 80s is 1, and the number of personas representing the 90s is 1. , Set.

次に、Ｓ３４５により、借用中のＰＣ資源（計算資源）を利用して、各ペルソナに属するユーザ群を選定する処理が行われる。次に、Ｓ３４６により、ペルソナ毎に属するユーザ群をグループピングしてグループ毎にユーザ群の取引データをブロックチェーンから収集する処理が行われる。例えば、前述の消費増税に伴う軽減税率下での消費行動シミュレーションの場合には、ユーザ群を、性別、年代別、地域別、年収別等にグルーピングし、そのグループ毎にユーザ群の取引データをブロックチェーンから収集する。このＳ３４５、Ｓ３４６におけるユーザ群の選定及びユーザ群の取引データの収集は、例えば、インターネットアンケート調査会社が保有するアンケート回答用モニター会員のデータベースを利用するのが有用である。インターネットアンケート調査会社では、性別，年代別，居住地，未婚，既婚，職業，世帯年収等の属性に対応付けてアンケート回答用モニター会員の連絡先（メールアドレス等）をデータベースに記憶しており、その属性別のモニター会員データを利用する。後述するＳ１４５及びＳ１４６、Ｓ５８６及びＳ５８７、Ｓ６２２及びＳ６２３等においても同様に、インターネットアンケート調査会社が保有するアンケート回答用モニター会員のデータベースを利用するのが有用である。次に、Ｓ３４７により、借用中のＰＣ資源（計算資源）を利用して、取引データを学習データとして機械学習を行ってペルソナ毎に学習済みのＡＩスマートコントラクトを生成する処理が行われる。このＡＩスマートコントラクトは、対応するペルソナの設定数と同じ数だけ生成される。これにより、シミュレーションを実行するための環境が整い、その環境内でシミュレーションを行う。 Next, according to S345, a process of selecting a user group belonging to each persona is performed using the borrowed PC resource (computational resource). Next, according to S346, a process is performed in which a group of users belonging to each persona is grouped and transaction data of the user group is collected from the blockchain for each group. For example, in the case of the consumption behavior simulation under the reduced tax rate accompanying the consumption tax increase described above, the user groups are grouped by gender, age group, region, annual income, etc., and the transaction data of the user group is collected for each group. Collect from the blockchain. For the selection of the user group and the collection of transaction data of the user group in S345 and S346, for example, it is useful to use the database of the questionnaire response monitor members owned by the Internet questionnaire survey company. Internet questionnaire survey companies store the contact information (email address, etc.) of monitor members for questionnaire response in a database in association with attributes such as gender, age group, place of residence, unmarried, married, occupation, and annual household income. Use the monitor member data for each attribute. Similarly, in S145 and S146, S586 and S587, S622 and S623, which will be described later, it is useful to use the database of questionnaire response monitor members owned by the Internet questionnaire survey company. Next, according to S347, a process is performed in which the borrowed PC resource (computational resource) is used to perform machine learning using the transaction data as learning data to generate a trained AI smart contract for each persona. The number of AI smart contracts generated is the same as the number of corresponding persona settings. As a result, the environment for executing the simulation is prepared, and the simulation is performed in that environment.

図９（Ａ）に戻り、上記のようにして生成された各ＡＩスマートコントラクトが行為ａに従った契約（法律行為）を執行する（Ｓ３３６）。この「行為ａ」とは、Ｓ３３８による強化学習の結果としての行為ａである。次に、Ｓ３３７により、各ＡＩスマートコントラクト同士で成立した契約をブロックチェーンに記録する処理が行われる。 Returning to FIG. 9A, each AI smart contract generated as described above executes a contract (legal act) in accordance with act a (S336). This "action a" is an action a as a result of reinforcement learning by S338. Next, according to S337, a process of recording the contract concluded between the AI smart contracts on the blockchain is performed.

次に、Ｓ３３８により、借用中のＰＣ資源（計算資源）を利用して、成立した契約内容に基づいて報酬ｒを算出し、ＴＤ学習により最適政策π^＊に従った行為ａを求める処理が行われる。例えば、前述の消費増税に伴う軽減税率下での消費行動シミュレーションの場合には、（増税前の出費額−増税後の出費額）が小さい値ほど高い報酬ｒを与える。そして、Ｓ３３９によりシミュレーションが終了したか否か判定され、未だ終了していない場合には制御がＳ３３６へ戻り、Ｓ３３７→Ｓ３３８→Ｓ３３９→Ｓ３３６を繰り返し巡回して強化学習を進行させる。シミュレーションが終了した段階でＳ３３９によりＹＥＳと判定されて制御がＳ３４０へ進み、１番高い報酬ｒを得たＡＩスマートコントラクトを記憶した後リターンする。なお、１番高い報酬ｒを得たＡＩスマートコントラクトに限定されるものだはなく、例えば上位５％のＡＩスマートコントラクトを記憶してもよい。また、Ｓ３３７によるブロックチェーンへの記録を必ずしも行わなくてもよく、その場合には、前述した連携タイプにおいては、Ｓ３３５、Ｓ３３６、Ｓ３４０及びＳ３４７における「ＡＩスマートコントラクト」を「連携用ＡＩ」と変更する。つまり、コンピュータ内でのシミュレーションの場合、現実世界での契約（法律行為）の執行を伴わないため、ブロックチェーンへの記録を行わない場合には、わざわざスマートコントラクトを用いる必要がなく、各連携用ＡＩ同士が行為ａを実行して強化学習を行えば事足りるためである。強化学習を終了し実際の引用段階では、学習済み連携用ＡＩがスマートコントラクトと連携して契約の執行を行ってブロックチェーンに記録するようにすればよい。Next, in S338, using the borrowed PC resources (computational resources), the reward r is calculated based on the contract details that have been concluded, and the process of finding the action a according ^{to the optimal policy π * by TD learning is performed.} Will be. For example, in the case of the consumption behavior simulation under the reduced tax rate accompanying the consumption tax increase described above, the smaller the value (expense amount before tax increase-expenditure amount after tax increase), the higher the reward r is given. Then, it is determined by S339 whether or not the simulation is completed, and if it is not completed yet, the control returns to S336, and the reinforcement learning is advanced by repeatedly patrolling S337 → S338 → S339 → S336. When the simulation is completed, S339 determines YES, the control proceeds to S340, and the AI smart contract with the highest reward r is stored and then returned. It should be noted that the AI smart contract that has received the highest reward r is not limited to that, and for example, the top 5% of AI smart contracts may be stored. Further, it is not always necessary to record to the blockchain by S337. In that case, in the above-mentioned cooperation type, "AI smart contract" in S335, S336, S340 and S347 is changed to "AI for cooperation". do. In other words, in the case of simulation in a computer, it does not involve the execution of contracts (legal acts) in the real world, so if you do not record on the blockchain, you do not have to bother to use smart contracts, and for each cooperation This is because it is sufficient for AIs to perform action a and perform reinforcement learning. At the stage of actual citation after the reinforcement learning is completed, the learned cooperation AI may execute the contract in cooperation with the smart contract and record it on the blockchain.

次に、図１０に基づいて、Ｓ３３に示されたスマートコントラクト信託請負い処理のサブルーチンプログラムのフローチャートを説明する。Ｓ９４により、スマートコントラクト信託の受注記憶があるか否か判定され、受注記憶がないと判定された場合にはリターンする。前述のＳ５９により記憶されている受注記憶の内容を確認してスマートコントラクト信託の受注である場合にはＳ９４によりＹＥＳと判定されて制御がＳ９５へ進む。Ｓ９５では、発注者のアドレス宛てに契約（法律行為）に関する個人情報を要求し、発注者から個人情報が返信された段階でＳ９６によりＹＥＳと判定されて制御がＳ９７へ進む。 Next, a flowchart of the subroutine program for the smart contract trust contract processing shown in S33 will be described with reference to FIG. According to S94, it is determined whether or not there is an order storage for the smart contract trust, and if it is determined that there is no order storage, a return is made. The contents of the order storage stored in S59 described above are confirmed, and if the order is for the smart contract trust, it is determined as YES by S94 and the control proceeds to S95. In S95, personal information regarding the contract (legal act) is requested to the address of the ordering party, and when the personal information is returned from the ordering party, S96 determines YES and the control proceeds to S97.

Ｓ９７では、返信された契約（法律行為）に関する個人情報に基づいて一般的ＡＩスマートコントラクトの学習済みモデルをパーソナライズしてパーソナライズドＡＩスマートコントラクト学習済みモデルを生成する処理が行われる。この一般的ＡＩスマートコントラクトの学習済みモデルをパーソナライズしてパーソナライズドＡＩスマートコントラクト学習済みモデルを生成する処理については、特許第６４３２８５９号公報に記載されている。 In S97, a process of personalizing a trained model of a general AI smart contract based on personal information regarding the returned contract (legal act) to generate a personalized AI smart contract trained model is performed. A process of personalizing a trained model of this general AI smart contract to generate a personalized AI smart contract trained model is described in Japanese Patent No. 6432859.

次に、Ｓ９８により、発注者のアドレスとパーソナライズドＡＩスマートコントラクト学習済みモデルとを対応付けて記憶する処理が行われる。この記憶したパーソナライズドＡＩスマートコントラクト学習済みモデルを用いて発注者に対し信託請負い処理を行う（Ｓ９９）。次に、Ｓ１００により、パーソナライズドＡＩスマートコントラクト学習済みモデルの強化学習処理が実行される。 Next, according to S98, a process of associating and storing the address of the orderer and the personalized AI smart contract trained model is performed. Using this memorized personalized AI smart contract trained model, the orderer is subjected to trust contract processing (S99). Next, S100 executes reinforcement learning processing of the personalized AI smart contract trained model.

図１１（Ａ）に基づいて、Ｓ１００に示されたパーソナライズドＡＩスマートコントラクト学習済みモデルの強化学習処理のサブルーチンプログラムのフローチャートを説明する。強化学習において、状態stで行為atを行なうときの価値をＱ(st,at)とした場合のＱ値を推定する方法として、環境をモデル化する知識、すなわち、状態遷移確率と報酬の確率分布が与えられている場合はモデルベースの手法を用いればよいが、環境モデルが未知の場合、ＴＤ（Temporal Difference）学習を用いる。先ず、環境の探索が必要なため、ε-greedy法を用いる。探索の初期はいろいろな行為を試し、落ち着いてくると最適な行為を多く選ぶように、温度の概念を導入する。温度をTとして、次の式で表される確率に従って行為を選ぶ。 Based on FIG. 11A, a flowchart of a subroutine program for reinforcement learning processing of the personalized AI smart contract trained model shown in S100 will be described. In reinforcement learning, as a method of estimating the Q value when the value when performing the action at in the state st is Q (st, at), the knowledge that models the environment, that is, the state transition probability and the probability distribution of the reward If is given, a model-based method may be used, but if the environmental model is unknown, TD (Temporal Difference) learning is used. First, since it is necessary to search the environment, the ε-greedy method is used. In the early stages of exploration, we try various actions and introduce the concept of temperature so that when we calm down, we choose many of the most suitable actions. Let T be the temperature, and choose the action according to the probability expressed by the following formula.

P(a｜s）＝{exp（Q(s,a)/T）}／{Σexp(Q(s,b)/T} （なおΣの下にb∈Aが記載されており、上記式ではその記載を省略している）
ここに、aは行為、Q(s,a)は状態sで行為aを行なうときの価値、P (a ｜ s) ＝ {exp (Q (s, a) / T)} / {Σexp (Q (s, b) / T} (Note that b ∈ A is described under Σ, and the above equation The description is omitted)
Here, a is an act, Q (s, a) is the value when performing an act a in the state s,

Tをアニーリング（焼き鈍し）における温度と呼び、高ければ行為を等確率に近い確率で選択し、低ければ最適なものに偏らせる。学習が進むにつれて、Tの値を小さくすることで、学習結果が安定する。このようなＱ値の推定方法は、前述した強化学習及び後述する強化学習全てに適用してもよい。また、強化学習等の機械学習を行うコンピュータとしては、ノイマン型の一般的なコンピュータを用いているが、ニューラル・ネット・プロセッサー（ＮＮＰ）を用いてもよい。ＮＮＰのチップ上には本物のニューロンをモデルにした「人工ニューロン」が多数搭載されており、各ニューロンはネットワークでそれぞれ連携し合う。また、「量子アニーリング方式」を採用した量子コンピュータを用いてもよい。特に、「量子アニーリング方式」を採用した量子コンピュータを用いることにより、機械学習における最適化計算の所要時間を大幅に短縮できる。 T is called the temperature in annealing, and if it is high, the action is selected with a probability close to equal probability, and if it is low, it is biased to the optimum one. As the learning progresses, the learning result becomes stable by reducing the value of T. Such a method of estimating the Q value may be applied to all of the reinforcement learning described above and the reinforcement learning described later. Further, as a computer for performing machine learning such as reinforcement learning, a general von Neumann computer is used, but a neural network processor (NNP) may be used. Many "artificial neurons" modeled on real neurons are mounted on the NNP chip, and each neuron cooperates with each other in a network. Further, a quantum computer adopting the "quantum annealing method" may be used. In particular, by using a quantum computer that employs the "quantum annealing method", the time required for optimization calculation in machine learning can be significantly reduced.

Ｓ１０５により、パーソナライズドＡＩスマートコントラクト学習済みモデルによる信託請負い処理を行った結果に対する評価を信託者から受信する。次に、Ｓ１０６により、受信した評価に基づいて報酬ｒを算出する処理が行われる。次に、Ｓ１０７により、借用中のＰＣ資源（計算資源）を利用して、ＴＤ学習により最適政策π^＊に従った行為ａを求める処理が行われる。次に、Ｓ１０８により、行為ａに従ったコントラクトを信託者の代理として実行する。その結果に対する評価を信託者から受信することにより（Ｓ１０５）、Ｓ１０６〜Ｓ１０８の処理が実行される。In S105, the trustee receives an evaluation of the result of the trust contract processing by the personalized AI smart contract trained model. Next, in S106, a process of calculating the reward r based on the received evaluation is performed. Next, according to S107, a process of obtaining an action a according ^{to the optimum policy π *} is performed by TD learning using the borrowed PC resource (computational resource). Next, according to S108, the contract according to the act a is executed on behalf of the trustee. By receiving the evaluation of the result from the trustee (S105), the processes of S106 to S108 are executed.

次に、図１１（Ｂ）に基づいて、コンソーシアムチェーン３のノードを構成するユーザ端末のメインルーチンプログラムのフローチャートを説明する。Ｓ１１４によりIoTセンサデータ集計処理が実行され、Ｓ１１５によりスマートコントラクト処理が実行され、Ｓ１１６によりシミュレーション処理が実行され、Ｓ１１７によりブロックチェーン処理が実行される。ブロックチェーン処理の具体的内容は、図２（Ａ）に基づいて前述したとおりである。 Next, a flowchart of the main routine program of the user terminal constituting the nodes of the consortium chain 3 will be described with reference to FIG. 11 (B). IoT sensor data aggregation processing is executed by S114, smart contract processing is executed by S115, simulation processing is executed by S116, and blockchain processing is executed by S117. The specific contents of the blockchain processing are as described above based on FIG. 2 (A).

次に、図１２（Ａ）に基づいて、Ｓ１１４に示されたIoTセンサデータ集計処理のサブルーチンプログラムのフローチャートを説明する。Ｓ１２０により、IoTセンサデータを種類毎、期間毎、地域毎等に分類してグルーピングする処理が行われる。この処理は、IoTセンサデータばかりでなくワイヤレスセンサネットワークからのデータも含めて処理される。次に、Ｓ１２１により、グルーピングされた各データの価値を決定する処理が行われる。この決定された価値に応じて、データの提供に対する対価（トークンの量）が対応するデータ毎にスマートコントラクトとしてコード化される。次に、Ｓ１２２により、グルーピングされた各データをブロックチェーンに記録する処理が行われる。 Next, a flowchart of the subroutine program for the IoT sensor data aggregation process shown in S114 will be described with reference to FIG. 12 (A). According to S120, IoT sensor data is classified and grouped by type, period, region, and the like. This processing includes not only IoT sensor data but also data from the wireless sensor network. Next, S121 performs a process of determining the value of each grouped data. According to this determined value, the consideration (amount of tokens) for providing the data is encoded as a smart contract for each corresponding data. Next, S122 performs a process of recording each grouped data on the blockchain.

次に、図１３（Ｂ）に基づいて、Ｓ１１５に示されたスマートコントラクト処理のサブルーチンプログラムのフローチャートを説明する。Ｓ１５０により、データの入手希望者との売買契約が成立したか否か判定される。この判定は、スマートコントラクトとしてコード化されたデータの提供に対する対価（トークンの量）について互いの条件が合致することにより、自動的にＳ１５０によりＹＥＳと判定される。Ｓ１５０によりＮＯと判定された場合は制御がＳ１５１へ進み、ＰＣ資源の貸与契約が成立したか否か判定され、成立していない場合にはリターンする。 Next, a flowchart of the smart contract processing subroutine program shown in S115 will be described with reference to FIG. 13 (B). According to S150, it is determined whether or not a sales contract with a person who wants to obtain the data has been established. This determination is automatically determined to be YES by S150 when both conditions are met with respect to the consideration (amount of tokens) for the provision of data encoded as a smart contract. If NO is determined by S150, the control proceeds to S151, it is determined whether or not the PC resource loan contract has been established, and if not, it returns.

Ｓ１５０によりＹＥＳと判定された場合は制御がＳ１５２へ進み、入手希望者のアドレスにデータを送信し、その対価としてのトークンを取得する処理が行われる。その成立した契約がＳ１５３によりトランザクションとしてブロックチェーンに記録される。一方、ＰＣ資源（計算資源）の貸与契約が成立した場合には、制御がＳ１５４へ進み、ＰＣ資源（計算資源）の借用処理を行い、その契約がＳ１５３によりトランザクションとしてブロックチェーンに記録される。 If YES is determined by S150, the control proceeds to S152, data is transmitted to the address of the person who wants to obtain the data, and a process of acquiring a token as a consideration is performed. The established contract is recorded in the blockchain as a transaction by S153. On the other hand, when a PC resource (computational resource) loan contract is concluded, control proceeds to S154, the PC resource (computational resource) is borrowed, and the contract is recorded in the blockchain as a transaction by S153.

次に、図１２（Ｂ）に基づいて、Ｓ１１６に示されたシミュレーション処理のサブルーチンプログラムのフローチャートを説明する。このシミュレーション処理は、本来多数の人間同士が行っている契約の検証、条件確認、執行、実行、交渉等の法律行為を、多数のＡＩスマートコントラクト群に肩代わりさせてコンピュータ内で仮想的に実行（シミュレーション）させ、或る条件下においてどのようなシミュレーション結果になるかを検証するものである。上記「或る条件下」の具体例としては、政府が採用しようとしている政策や法律（例えば、消費増税に伴う軽減税率、改正出入国管理法、イギリスのＥＵ（European Union）からの離脱、ベーシックインカムの部分的または全面的採用、日本国憲法９条の改正等）、マーケティング関連の条件（例えば、新商品（金融商品や生命保険を含む）や新サービスの価格や対価の設定、各種メディアによるプロモーション効果等）、投資市場関連の条件（例えば、先物取引における気象条件、株式市場における金融引き締め政策等）等が、考えられる。 Next, a flowchart of the simulation processing subroutine program shown in S116 will be described with reference to FIG. 12 (B). This simulation process virtually executes legal acts such as contract verification, condition confirmation, execution, execution, and negotiation, which are originally performed by a large number of people, on behalf of a large number of AI smart contracts (in a computer). (Simulation) is performed to verify what kind of simulation result will be obtained under certain conditions. Specific examples of the above "conditions" include policies and laws that the government is trying to adopt (for example, reduced tax rates associated with consumption tax increases, revised immigration control laws, withdrawal from the EU (European Union) in the United Kingdom, and basic income. Partial or full adoption of, revision of Article 9 of the Japanese Constitution, etc.), marketing-related conditions (for example, setting prices and consideration for new products (including financial products and life insurance) and new services, promotion by various media (Effects, etc.), investment market-related conditions (for example, weather conditions in futures trading, monetary tightening policies in the stock market, etc.) can be considered.

Ｓ１３４により、シミュレーションの依頼があったか否か判定され、ない場合にはリターンする。Ｓ１３４によりシミュレーションの依頼があったと判定されれば制御がＳ１３５に進み、ＡＩスマートコントラクト群生成処理が実行される。 S134 determines whether or not there is a request for simulation, and if not, returns. If it is determined by S134 that the simulation request has been made, the control proceeds to S135, and the AI smart contract group generation process is executed.

このＡＩスマートコントラクト群生成処理のサブルーチンプログラムのフローチャートを図１３（Ａ）に基づいて説明する。Ｓ１４４により、借用中のＰＣ資源（計算資源）を利用して、依頼されたシミュレーションにマッチするペルソナ群を設定する処理が行われる。ペルソナとは、一般的には、企業や商品やサービスの典型的なターゲットとなる人物像を仮想の人物として定義したものである。本実施形態では、ペルソナとは、シミュレーション内容がターゲットとする典型的な人物像を仮想の人物として定義したものである。例えば、前述の消費増税に伴う軽減税率のシミュレーションの場合には、一般消費者に相当するペルソナであって、性別、年代別、地域別、年収別等にグルーピングした各グループ毎にペルソナとして設定する。 The flowchart of the subroutine program of the AI smart contract group generation processing will be described with reference to FIG. 13 (A). In S144, a process of setting a persona group matching the requested simulation is performed by using the borrowed PC resource (computational resource). A persona is generally defined as a virtual person who is a typical target person for a company, product, or service. In the present embodiment, the persona defines a typical person image targeted by the simulation content as a virtual person. For example, in the case of the simulation of the reduced tax rate associated with the consumption tax increase described above, a persona corresponding to a general consumer is set as a persona for each group grouped by gender, age group, region, annual income, etc. ..

設定されるペルソナの数は、グループに属するユーザ数に比例した数にする。例えば、一般大衆の年代別人口分布が１０代５％、２０代５％、３０代１０％、４０代１０％、５０代２０％、６０代２０％、７０代２０％、８０代５％、９０代５％、の場合に、１０代を代表するペルソナ数が１、２０代を代表するペルソナ数が１、３０代を代表するペルソナ数が２、４０代を代表するペルソナ数が２、５０代を代表するペルソナ数が４、６０代を代表するペルソナ数が４、７０代を代表するペルソナ数が４、８０代を代表するペルソナ数が１、９０代を代表するペルソナ数が１と、設定する。 The number of personas to be set is proportional to the number of users belonging to the group. For example, the population distribution of the general public by age group is 5% in their teens, 5% in their 20s, 10% in their 30s, 10% in their 40s, 20% in their 50s, 20% in their 60s, 20% in their 70s, and 5% in their 80s. In the case of 5% in the 90s, the number of personas representing the teens is 1, the number of personas representing the 20s is 1, the number of personas representing the 30s is 2, and the number of personas representing the 40s is 2,50. The number of personas representing the teens is 4, the number of personas representing the 60s is 4, the number of personas representing the 70s is 4, the number of personas representing the 80s is 1, and the number of personas representing the 90s is 1. Set.

次に、Ｓ１４５により、借用中のＰＣ資源（計算資源）を利用して、各ペルソナに属するユーザ群を選定する処理が行われる。次に、Ｓ１４６により、ペルソナ毎に属するユーザ群をグループピングしてグループ毎にユーザ群の取引データをブロックチェーンから収集する処理が行われる。例えば、前述の消費増税に伴う軽減税率のシミュレーションの場合には、ユーザ群を、性別、年代別、地域別、年収別等にグルーピングし、そのグループ毎にユーザ群の取引データをブロックチェーンから収集する。次に、Ｓ１４７により、借用中のＰＣ資源（計算資源）を利用して、取引データを学習データとして機械学習を行ってペルソナ毎に学習済みのＡＩスマートコントラクトを生成する処理が行われる。このＡＩスマートコントラクトは、対応するペルソナの設定数と同じ数だけ生成される。これにより、シミュレーションを実行するための環境が整い、その環境内でシミュレーションを行う。 Next, according to S145, a process of selecting a user group belonging to each persona is performed using the borrowed PC resource (computational resource). Next, according to S146, a process is performed in which a group of users belonging to each persona is grouped and transaction data of the user group is collected from the blockchain for each group. For example, in the case of the simulation of the reduced tax rate associated with the consumption tax increase described above, the user groups are grouped by gender, age group, region, annual income, etc., and the transaction data of the user group is collected from the blockchain for each group. do. Next, according to S147, using the borrowed PC resource (computational resource), machine learning is performed using the transaction data as learning data to generate a trained AI smart contract for each persona. The number of AI smart contracts generated is the same as the number of corresponding persona settings. As a result, the environment for executing the simulation is prepared, and the simulation is performed in that environment.

図１２（Ｂ）に戻り、上記のようにして生成された各ＡＩスマートコントラクトが行為ａに従った契約（法律行為）を執行する（Ｓ１３６）。この「行為ａ」とは、Ｓ１３８による強化学習の結果としての行為ａである。次に、Ｓ１３７により、各ＡＩスマートコントラクト同士で成立した契約をブロックチェーンに記録する処理が行われる。さらに、シミュレーションの進行による状況の変遷状況をブロックチェーンに記録する。例えば、前述の消費増税に伴う軽減税率のシミュレーションの場合には、シミュレーションの進行に伴って内需や景気がどのように変化したかをブロックチェーンに記録する。 Returning to FIG. 12B, each AI smart contract generated as described above executes the contract (legal act) according to the act a (S136). This "act a" is an act a as a result of reinforcement learning by S138. Next, according to S137, a process of recording the contract concluded between the AI smart contracts on the blockchain is performed. Furthermore, the transition status of the situation as the simulation progresses is recorded on the blockchain. For example, in the case of the simulation of the reduced tax rate associated with the consumption tax increase described above, how the domestic demand and the economy have changed as the simulation progresses is recorded on the blockchain.

次に、Ｓ１３８により、借用中のＰＣ資源（計算資源）を利用して、成立した契約内容に基づいて報酬ｒを算出し、ＴＤ学習により最適政策π^＊に従った行為ａを求める処理が行われる。そして、Ｓ１３９によりシミュレーションが終了したか否か判定され、未だ終了していない場合には制御がＳ１３６へ戻り、Ｓ１３７→Ｓ１３８→Ｓ１３９→Ｓ１３６を繰り返し巡回して強化学習を進行させる。シミュレーションが終了した段階でＳ１３９によりＹＥＳと判定されて制御がＳ１４０へ進み、シミュレーション結果を導出する処理が行われた後にリターンする。Next, according to S138, the borrowed PC resource (computational resource) is used to calculate the reward r based on the contract details that have been concluded, and the process of obtaining the action a according ^{to the optimum policy π * by TD learning is performed.} Will be. Then, it is determined by S139 whether or not the simulation is completed, and if it is not completed yet, the control returns to S136 and repeats S137 → S138 → S139 → S136 to proceed with reinforcement learning. When the simulation is completed, S139 determines YES, the control proceeds to S140, and the process returns after the process of deriving the simulation result is performed.

シミュレーション結果を導出する処理の具体例としは、例えば、消費増税に伴う軽減税率の採用に伴う景気変動のシミュレーションの場合には、シミュレーションの結果景気動向指数の各項目がどのように変動したかを導出する。株式市場における金融引き締め政策に伴うシミュレーションの場合には、シミュレーションの結果株式市場がどのように変動したかを導出する。また、消費増税に伴う軽減税率の具体的態様（例えば、なにを軽減税率対象品目にするかや各軽減税率対象品目毎の軽減税率等）を複数態様に変動させながら最適な軽減税率の態様を決定するという、シミュレーション最適化の手法を採用してもよい。この場合、最適な軽減税率の態様として、（税収増加率（％）＋景気動向指数としてのディフュージョン・インデックス（DI）／５０）を期待値Eとし、その期待値Eを最大化するシミュレーション結果を得るようにする。シミュレーションにおけるコントロールパラメータ（軽減税率の態様）をθとし、シミュレーションの結果をY(θ)とし、maxE[Y(θ)]におけるθを求める。このようなシミュレーションを最適化する具体的手法として、例えば、メタヒューリスティックなアルゴリズムとして粒子群最適化法(Particle Swarm Optimization：PSO)、目的関数の解析的表現が困難な場合や目的関数の微分に関する情報を用いることが出来ない状況下で最適解を求めるDFO(derivative free optimization)等を用いる。なお、Ｓ１３７によるブロックチェーンへの記録を必ずしも行わなくてもよく、その場合には、前述した連携タイプにおいては、Ｓ１３５、Ｓ１３６及びＳ１４７における「ＡＩスマートコントラクト」を「連携用ＡＩ」と変更する。つまり、コンピュータ内でのシミュレーションの場合、現実世界での契約（法律行為）の執行を伴わないため、ブロックチェーンへの記録を行わない場合には、わざわざスマートコントラクトを用いる必要がなく、各連携用ＡＩ同士が行為ａを実行して強化学習を行えば事足りるためである。
[変形例]As a specific example of the process of deriving the simulation result, for example, in the case of simulating the economic fluctuation accompanying the adoption of the reduced tax rate due to the consumption tax increase, how each item of the business cycle index fluctuated as a result of the simulation. Derived. In the case of a simulation that accompanies a monetary tightening policy in the stock market, we derive how the stock market fluctuated as a result of the simulation. In addition, the optimum mode of the reduced tax rate while changing the specific mode of the reduced tax rate due to the consumption tax increase (for example, what is the item subject to the reduced tax rate and the reduced tax rate for each item subject to the reduced tax rate) in multiple modes. The simulation optimization method of determining the above may be adopted. In this case, as the optimum mode of the reduced tax rate, (tax revenue increase rate (%) + diffusion index (DI) / 50 as an economic trend index) is set as the expected value E, and the simulation result that maximizes the expected value E is used. Try to get. Let θ be the control parameter (mode of reduced tax rate) in the simulation, let Y (θ) be the result of the simulation, and find θ in maxE [Y (θ)]. Specific methods for optimizing such simulations include, for example, particle swarm optimization (PSO) as a metaheuristic algorithm, information on cases where analytical representation of the objective function is difficult, and information on the differentiation of the objective function. Use DFO (derivative free optimization) etc. to find the optimum solution in the situation where can not be used. It is not always necessary to record to the blockchain by S137. In that case, in the above-mentioned cooperation type, "AI smart contract" in S135, S136 and S147 is changed to "AI for cooperation". In other words, in the case of simulation in a computer, it does not involve the execution of contracts (legal acts) in the real world, so if you do not record on the blockchain, you do not have to bother to use smart contracts, and for each cooperation This is because it is sufficient for AIs to perform action a and perform reinforcement learning.
[Modification example]

（１）認定事業者１７は、ユーザの暗号化個人情報Ｅ_Ｋ１(個人情報）を記憶しているが、それに代えて、暗号化個人情報であるＥ_Ｋ１(個人情報）を直接ブロックチェーンに記録するようにしてもよい。(1) The authorized business operator 17 stores the user's encrypted personal information _EK1 (personal information), but instead _{records the encrypted personal information EK1} (personal information) directly on the blockchain. You may try to do so.

（２）図２の取引Ｃや取引Ｆのような個人情報以外の取引データも、個人情報と同様に鍵Ｋ１等で暗号化してブロックチェーンに記録してもよい。 (2) Transaction data other than personal information such as transaction C and transaction F in FIG. 2 may be encrypted with a key K1 or the like and recorded on the blockchain in the same manner as personal information.

（３）前述の説明では各ブロックチェーンネットワーク２、３、４のノード１９の動作処理を示したが、プライベートチェーン２のノード１９について示した動作処理を他のブロックチェーンネットワーク３、４のノード１９で行なってもよく、コンソーシアムチェーン３のノード１９について示した動作処理を他のブロックチェーンネットワーク２、４のノード１９で行なってもよく、パブリックチェーン４のノード１９について示した動作処理を他のブロックチェーンネットワーク２、３のノード１９で行なってもよい。この変形例は、後述する実施態様においても同様に適用してもよい。 (3) In the above description, the operation processing of the nodes 19 of the blockchain networks 2, 3 and 4 is shown, but the operation processing shown for the node 19 of the private chain 2 is applied to the operation processing of the nodes 19 of the other blockchain networks 3 and 4. The operation processing shown for the node 19 of the consortium chain 3 may be performed at the node 19 of the other blockchain networks 2 and 4, and the operation processing shown for the node 19 of the public chain 4 may be performed by another block. It may be performed at the nodes 19 of the chain networks 2 and 3. This modification may be similarly applied to the embodiments described later.

（４）借用したＰＣ資源（計算資源）を利用してブロックチェーンにおけるマイニング（記帳権の競争）を行なうようにしてもよい。その際、第１実施形態のように時間単価でＰＣ資源（計算資源）を貸与するようにしてもよいが、マイニング（記帳権の競争）に成功したマイナーが得られる利益（トークン等）の何割かをＰＣ資源（計算資源）の貸与者に分配（配当）するようにしてもよい。その配当割合（配当量）は、ＰＣ資源（計算資源）の貸与量（ＰＣの貸与台数×貸与時間等）に比例するように制御する。 (4) The borrowed PC resources (computational resources) may be used for mining (competition of bookkeeping rights) on the blockchain. At that time, PC resources (computational resources) may be lent at an hourly rate as in the first embodiment, but what is the profit (token, etc.) obtained by a miner who succeeds in mining (competition of bookkeeping rights)? The percentage may be distributed (dividend) to the lenders of PC resources (computational resources). The dividend ratio (dividend amount) is controlled so as to be proportional to the loan amount of PC resources (computational resources) (number of PCs lent x loan time, etc.).

（５）前述したＰＣ資源（計算資源）や自家発電の電力等の貸与（提供）対象を借用（利用）して何らかのプロジェクトを遂行するようにしてもよい。プロジェクトの具体例としては、研究開発（例えば、人工知能開発、機械学習、人ゲノム解析、新製品開発、新薬開発等）、レアメタルや石油や天然ガスや海洋資源等の探査・発掘、宇宙開発等が、考えられる。その際、第１実施形態のように貸与（提供）対象の貸与量（提供量）に応じた対価（トークン等）を貸与者（提供者）が取得するようにしてもよいが、プロジェクトの成功によってプロジェクト遂行者（個人または法人または団体）が得られる利益の何割かを資源貸与者（資源提供者）に分配（配当）するようにしてもよい。その配当割合（配当量）は、資源の貸与量（提供量）に比例するように制御する。 (5) Some project may be carried out by borrowing (using) the above-mentioned PC resources (computational resources), privately generated electric power, and the like. Specific examples of projects include research and development (for example, artificial intelligence development, machine learning, human genome analysis, new product development, new drug development, etc.), exploration and excavation of rare metals, oil, natural gas, marine resources, etc., space development, etc. However, it is possible. At that time, as in the first embodiment, the lender (provider) may obtain consideration (tokens, etc.) according to the loan amount (provided amount) to be loaned (provided), but the success of the project. A percentage of the profits obtained by the project executor (individual or corporation or group) may be distributed (dividend) to the resource lender (resource provider). The dividend ratio (dividend amount) is controlled so as to be proportional to the loan amount (provided amount) of resources.

さらに、資源貸与者（資源提供者）が配当自体を受け取るのではなく、配当を受け取る権利（以下「配当享受権」という）を取得するようにしてもよい。この配当享受権は、例えば、プロジェクト遂行者が発行したトークンの形で貸与者（提供者）が取得するように制御してもよい。そして、貸与者（提供者）は、取得した配当享受権（トークン）を、そのときの相場に応じた価格（トークン）で他人に譲渡できるように制御してもよい。このように構成することにより、配当享受権（トークン）を、あたかも株式市場におけるセカンダリーマーケットでの株取引のように運用することができる。 Further, the resource lender (resource provider) may acquire the right to receive the dividend (hereinafter referred to as “dividend enjoyment right”) instead of receiving the dividend itself. This dividend enjoyment right may be controlled to be acquired by the lender (provider) in the form of a token issued by the project executor, for example. Then, the lender (provider) may control the acquired dividend enjoyment right (token) so that it can be transferred to another person at a price (token) according to the market price at that time. With this configuration, the dividend enjoyment right (token) can be managed as if it were a stock transaction in the secondary market in the stock market.

（６）Ｓ７１では生成されたパーソナライズド学習済みモデルを発注者のアドレス宛てに送信して納品しているが、それに加えてまたはその代わりに、生成されたパーソナライズド学習済みモデルを活用して発注者にパーソナライズされたサービスを提供するようにしてもよい。 (6) In S71, the generated personalized trained model is sent to the orderer's address for delivery, but in addition to or instead, the generated personalized trained model is used for ordering. A person may be provided with a personalized service.

（７）前述の説明ではスマートコントラクトにより契約の検証、条件確認、執行、実行、交渉を自動化しているが、契約（取引等の法律行為）の締結及び執行を行う前にユーザ本人の承諾を求めるように制御してもよい。また、全ての契約（取引等の法律行為）についてユーザ本人の承諾を求めるのではなく、予め定められた重要な契約（取引等の法律行為）か否かを判定し、重要な契約（取引等の法律行為）であると判定された場合にユーザ本人の承諾を求めるように制御してもよい。さらに、締結及び執行が急がれる契約（取引等の法律行為）であるか否かを判定し、急がれる契約（取引等の法律行為）であると判定された場合には、ユーザ本人の承諾を得ることなく契約（取引等の法律行為）の締結及び執行する制御を行い、後からユーザ本人に報告するように制御してもよい。この変形例は、後述する実施態様においても同様に適用してもよい。 (7) In the above explanation, the verification, condition confirmation, execution, execution, and negotiation of the contract are automated by the smart contract, but the consent of the user himself / herself is required before concluding and executing the contract (legal act such as transaction). It may be controlled as desired. In addition, instead of seeking the consent of the user for all contracts (legal acts such as transactions), it is determined whether or not they are important contracts (legal acts such as transactions) that have been determined in advance, and important contracts (transactions, etc.) It may be controlled to seek the consent of the user himself / herself when it is determined to be a legal act). Furthermore, it is determined whether or not the contract is urgently concluded and executed (legal act such as transaction), and if it is determined that the contract is urgent (legal act such as transaction), the user himself / herself It may be controlled to conclude and execute a contract (legal act such as a transaction) without obtaining consent, and to report to the user himself / herself later. This modification may be similarly applied to the embodiments described later.

（８）各ブロックチェーンのノード１９を構成するユーザ端末１６等及び各種サーバで動作する前述したプログラムは、所定のウェブサイト等からダウンロードしてインストールしてもよいが、例えばＣＤ−ＲＯＭ９９等の記録媒体（非一時的（non-transitory）な記録媒体）に記録させて流通させ、そのＣＤ−ＲＯＭ９９等を購入した者がプログラムをユーザ端末１６及び各種サーバにインストールしてもよい（図６０参照）。 (8) The above-mentioned programs running on the user terminals 16 and the like constituting the node 19 of each blockchain and various servers may be downloaded and installed from a predetermined website or the like, but for example, recording of a CD-ROM 99 or the like. A person who purchases the CD-ROM99 or the like by recording it on a medium (non-transitory recording medium) and distributing it may install the program on the user terminal 16 and various servers (see FIG. 60). ..

（９）前述の説明では集中型オラクル２１を採用しているが、ネットワーク全体で分散して管理される分散型オラクルを採用してもよい。ネットワーク全体で分散している複数のオラクルが収集した情報を寄せ集めて平均的な情報を抽出し、その平均的情報を正しい情報とみなしてブロックチェーン内に取込みスマートコントラクトに用いる。これは、ジェームズ・スロウィッキーが書籍『みんなの意見は案外正しい』で提唱した「集団において情報を寄せ集めることで、その集団が出す結論は集団の中の個人の誰が考えるよりもよい結論を導くことができる」という理論に基づいたものである。そして、平均的情報に近い情報を収集したオラクルに対しトークン等の報酬を与えることにより、分散型オラクルを運用するインセンティブを与える。 (9) Although the centralized oracle 21 is adopted in the above description, the decentralized oracle that is distributed and managed in the entire network may be adopted. Information collected by multiple Oracles distributed throughout the network is collected to extract average information, and the average information is regarded as correct information and incorporated into the blockchain for use in smart contracts. This is what James Surowiecki advocated in the book "Everyone's Opinion Is Unexpectedly Correct", "By gathering information in a group, the conclusions that the group draws lead to better conclusions than anyone in the group thinks. It is based on the theory that "can be done". Then, by giving a reward such as a token to the oracle that collects information close to the average information, an incentive to operate the decentralized oracle is given.

まとめると、ネットワーク上で分散している複数のオラクルが収集した情報を寄せ集めて平均的な情報を抽出する抽出手段と、該抽出手段により抽出された平均的情報を採用する採用手段と、オラクルに対し報酬を与える報酬付与手段とを備え、前記複数のオラクルは、第１オラクルと第２オラクルとを含み、前記報酬付与手段は、前記第１オラクルよりもより前記平均的情報に近い情報を収集した前記第２オラクルにより多くの報酬を付与する。なお、第１オラクルに付与する報酬は０であってもよく、また、マイナスであってもよい。 In summary, an extraction means that collects information collected by multiple Oracles distributed on a network and extracts average information, an adoption means that adopts the average information extracted by the extraction means, and an Oracle. The plurality of oracles include a first oracle and a second oracle, and the reward giving means provides information closer to the average information than the first oracle. Give more rewards to the collected second oracle. The reward given to the first oracle may be 0 or may be negative.

（１０）前述のＳ１３のＫ２配布契約の成立及びＳ１６の個人情報の提供契約の成立のうちの一方または双方の判定について、個人情報主本人の意思を特定する情報（以下「意思特定情報」という）に従って成立させるか否判定する機能を設けてもよい。具体的には、個人情報主が、実店舗や電子ショッピングモール等において自分にマッチする商品やサービスをレコメンドしてもらうときに、自身が所持する携帯端末（スマートフォンまたはＩＣカード等）の特定情報を読取らせると共に契約を成立させてよい旨の暗証番号を入力することによって、特定情報及び暗証番号からなる意思特定情報を当該個人情報主のスマートコントラクトに通知し、その意思特定情報に従ってスマートコントラクが判定する。 (10) Information that identifies the intention of the personal information owner (hereinafter referred to as "intention identification information") with respect to the determination of one or both of the establishment of the K2 distribution contract in S13 and the establishment of the personal information provision contract in S16. ) May be provided. Specifically, when a personal information owner recommends a product or service that matches him / her at a physical store or an electronic shopping mall, he / she obtains specific information of his / her own mobile terminal (smartphone, IC card, etc.). By inputting a password that allows the user to read and conclude a contract, the smart contract of the personal information owner is notified of the intention-specific information consisting of the specific information and the password, and the smart contract is in accordance with the intention-specific information. Judges.

このようにすることにより、ユーザは、ＳＮＳ等の事業者から取戻して自己管理化に置いた個人情報を自分の意思によって自分のために活用することができる。
[第２実施形態]By doing so, the user can utilize the personal information recovered from the business operator such as SNS and placed in self-management for himself / herself by his / her own will.
[Second Embodiment]

次に、第２実施形態を説明する。この第２実施形態は、ブロックチェーンを利用して記録した個人情報に対し、暗号化技術を応用することにより、自己の個人情報を削除する個人情報削除権（いわゆる忘れられる権利）の必要性に応えるものである。ブロックチェーンは一旦記録した情報が改竄不可能または改竄が極めて困難である点が特徴であり、そのために、一旦記録した情報の削除が不可能または極めて困難である（これを以下「削除不可能性」という）。一方、欧州におけるＧＤＰＲ（General Data Protection Regulation）では、記録した個人情報をその個人情報主が削除できる個人情報削除権（いわゆる忘れられる権利）を保証することが要求されている。このＧＤＰＲの個人情報削除権の要請とブロックチェーンにおける削除不可能性とが真っ向から対立し二律背反するジレンマとなっている。つまり、この第２実施形態は、削除したい情報の削除権を保証する要請と削除不可能性とが二律背反するジレンマを解決するものである。その概要を図１４に基づいて説明する。 Next, the second embodiment will be described. This second embodiment addresses the need for a right to delete personal information (so-called right to be forgotten) to delete one's own personal information by applying encryption technology to personal information recorded using a blockchain. It is a response. The blockchain is characterized in that the information once recorded cannot be tampered with or is extremely difficult to tamper with, and therefore it is impossible or extremely difficult to delete the once recorded information (hereinafter referred to as "undeleteable"). "). On the other hand, the GDPR (General Data Protection Regulation) in Europe requires that the personal information deletion right (so-called right to be forgotten) that the personal information owner can delete the recorded personal information is guaranteed. This request for the right to delete personal information of the GDPR and the impossibility of deletion in the blockchain are in direct conflict and are an antinomy dilemma. That is, this second embodiment solves the dilemma in which the request for guaranteeing the right to delete the information to be deleted and the impossibility of deletion are in conflict with each other. The outline will be described with reference to FIG.

図１４（Ａ）は削除権を行使していない通常状態を示しており、図１４（Ｂ）は削除権を行使して情報を解読不能化した状態を示している。図１４（Ａ）を参照して、情報保持者（情報主ともいう）４０が情報を２つの片割れ共通鍵ＫＡとＫＢを用いて二重暗号化する。式で表せばＥ_ＫＡ(Ｅ_ＫＢ(情報））となる。次に、その暗号化情報であるＥ_ＫＡ(Ｅ_ＫＢ(情報））をブロックチェーン等に記録する。なお、片割れ共通鍵ＫＡは秘匿状態で情報保持者４０のユーザ端末等に記憶しておく。FIG. 14 (A) shows a normal state in which the deletion right is not exercised, and FIG. 14 (B) shows a state in which the information cannot be read by exercising the deletion right. With reference to FIG. 14 (A), the information holder (also referred to as the information owner) 40 double-encrypts the information using the two half-split common keys KA and KB. Expressed as an equation, it is E _KA (E _KB (information)). Next, the encrypted information _EKA ( _EKB (information)) is recorded on the blockchain or the like. The one-sided common key KA is stored in the user terminal or the like of the information holder 40 in a secret state.

この状態で、情報要求者４１が情報主４０に対し情報を要求したときには、情報主４０が、記録している暗号化情報であるＥ_ＫＡ(Ｅ_ＫＢ(情報））を鍵ＫＡで復号する。式で表せば、Ｄ_ＫＡ(Ｅ_ＫＡ(Ｅ_ＫＢ(情報））＝Ｅ_ＫＢ(情報）となる。そして、このＥ_ＫＢ(情報）と片割れ共通鍵ＫＢとを情報主４０が情報要求者４１へ送信する。In this state, when the information requester 41 requests information from the information master 40, the information master 40 decrypts the recorded encrypted information E _KA (E _KB (information)) with the key KA. Expressed in an equation, D _KA (E _KA (E _KB (information)) = E _KB (information). Then, the _{information owner 40 sends this E KB} (information) and the half-split common key KB to the information requester 41. Send.

それらを受信した情報要求者４１は、受信したＥ_ＫＢ(情報）を受信した片割れ共通鍵ＫＢで復号する。式で表せば、Ｄ_ＫＢ(Ｅ_ＫＢ(情報））＝情報となる。これにより、情報要求者４１は、平文の情報を得ることができる。They information requester 41 which received the decrypted by halves common key KB that has received the received E _{KB (information).} Expressed in an equation, D _KB (E _KB (information)) = information. As a result, the information requester 41 can obtain the information in plain text.

次に、削除権を行使して情報を解読不能化した状態を図１４（Ｂ）に基づいて説明する。情報主４０は、解読不能化したい情報の暗号化に用いた片割れ鍵ＫＡとＫＢとのうちの一方ＫＡを乱数Ｒ（≠ＫＡ）に更新する。次に、片割れ共通鍵ＫＢを既に記憶している情報要求者４１が情報主４０に対し情報を要求したときには、情報主４０が、記録している暗号化情報であるＥ_ＫＡ(Ｅ_ＫＢ(情報））を鍵Ｒ（乱数）で復号する。式で表せば、Ｄ_Ｒ(Ｅ_ＫＡ(Ｅ_ＫＢ(情報））となる。そして、このＤ_Ｒ(Ｅ_ＫＡ(Ｅ_ＫＢ(情報））を情報主４０が情報要求者４１へ送信する。Next, a state in which the information is made undecipherable by exercising the deletion right will be described with reference to FIG. 14 (B). The information main 40 updates one of the half-split keys KA and KB used for encrypting the information to be made undecipherable to a random number R (≠ KA). Next, when the information requester 41 who has already stored the half-split common key KB requests the information from the information master 40, the information master 40 records the encrypted information E _KA (E _KB (information)). )) Is decrypted with the key R (random number). Expressed by the formula, the _{_{_{D R (E KA (E KB}}} ( information)). Then, the _{_{_{D R (E KA (E KB}}} ( information)) Information main 40 is transmitted to the information requester 41.

それを受信した情報要求者４１は、既に記憶している片割れ共通鍵ＫＢでＤ_Ｒ(Ｅ_ＫＡ(Ｅ_ＫＢ(情報））を復号する。式で表せば、Ｄ_ＫＢ(Ｄ_Ｒ(Ｅ_ＫＡ(Ｅ_ＫＢ(情報））））≠情報となる。このように、解読不能化状態では、既に片割れ共通鍵ＫＢを記憶している情報要求者４１であっても平文の情報を得ることができず、削除したい情報の削除権を保証する要請と削除不可能性とが二律背反するジレンマを解決することができる。なお、ブロックチェーン等に記録する個人情報等の情報をコピー＆ペーストできないようにコピー禁止処理を施しておれば、情報の削除権の保証はより完全なものとなる。なお、２つの鍵ＫＡ及びＫＢを用いた二重暗号に限定する必要はなく、３つ以上の鍵（ｎ個の鍵）を用いた多重暗号を用いてもよい。この場合、ｎ個の鍵のうち、少なくともいずれか１つの鍵を乱数Ｒに置換することにより、解読不能化状態となる。Information requester 41 which has received it, if indicated already. Expression of decoding the memory to have counterpart common key KB by _{_{_{D R (E KA (E KB}}} ( _{_{information)), D KB (D R}} (E KA ( E _KB (information)))) ≠ information. In this way, even the information requester 41 who has already stored the half-split common key KB cannot obtain plain text information in the undecipherable state. It is possible to solve the dilemma in which the request to guarantee the right to delete the information to be deleted and the inability to delete it are in conflict with each other. Copying is prohibited so that information such as personal information recorded on the blockchain cannot be copied and pasted. If the processing is performed, the guarantee of the right to delete the information becomes more complete. It is not necessary to limit the information to double encryption using two keys KA and KB, and three or more keys (n). In this case, at least one of the n keys is replaced with a random number R, so that the decryption state is established.

以上説明した第２実施形態の概要をより詳細に説明する。第１実施形態との共通点については説明の繰り返しを省略し主に相違点について説明する。図１５は第１実施形態における図２に対応するものである。ブロックチェーン内の取引Ｉを参照し、この第２実施形態では、暗号化個人情報であるＥ_ＫＡ(Ｅ_ＫＢ(個人情報））を直接ブロック内に記録している。よって、第２実施形態では認定事業者１７は不要である。ここに、ＫＡとＫＢは片割れ共通鍵である。The outline of the second embodiment described above will be described in more detail. Regarding the common points with the first embodiment, the repetition of the description will be omitted, and the differences will be mainly described. FIG. 15 corresponds to FIG. 2 in the first embodiment. With reference to transaction I in the blockchain, in this second embodiment, _EKA ( _EKB (personal information)), which is encrypted personal information, is recorded directly in the block. Therefore, in the second embodiment, the certified business operator 17 is unnecessary. Here, KA and KB are symmetric keys.

次に、図１６を参照し、プライベートチェーン２のノード１９を構成しているユーザ端末とパブリックチェーン４のノードを構成しているユーザ端末１６とのメインルーチンのフローチャートを説明する。このメインルーチンは、第１実施形態で示した動作処理のフローチャートを省略し、第１実施形態で示した動作処理に対し追加または変更する動作処理のフローチャートのみを示している。パブリックチェーン４のノードを構成しているユーザ端末１６では、Ｓ１６０によりブロックチェーンへの個人情報記録処理が行われ、Ｓ１６１により記録解読不能処理が行われ、Ｓ１６２により個人情報提供処理が行われる。ライベートチェーン２のノード１９を構成しているユーザ端末では、Ｓ１７０により個人情報入手処理が行われる。 Next, with reference to FIG. 16, a flowchart of the main routine of the user terminal constituting the node 19 of the private chain 2 and the user terminal 16 constituting the node of the public chain 4 will be described. This main routine omits the flowchart of the operation process shown in the first embodiment, and shows only the flowchart of the operation process to be added or changed to the operation process shown in the first embodiment. In the user terminal 16 constituting the node of the public chain 4, the personal information recording process to the blockchain is performed by S160, the record unbreakable processing is performed by S161, and the personal information providing processing is performed by S162. At the user terminal constituting the node 19 of the private chain 2, personal information acquisition processing is performed by S170.

ブロックチェーンへの個人情報記録処理とは、個人情報をブロックチェーンに記録する処理である。記録解読不能処理とは、削除権を行使して情報を解読不能化にするための処理である。個人情報提供処理とは、パブリックチェーン４のユーザ端末１６がプライベートチェーン２のユーザ端末に個人情報を提供する処理である。個人情報入手処理とは、プライベートチェーン２のユーザ端末がパブリックチェーンのユーザ端末１６から個人情報を入手する処理である。 The personal information recording process on the blockchain is a process of recording personal information on the blockchain. The record unbreakable process is a process for exercising the deletion right to make the information unreadable. The personal information providing process is a process in which the user terminal 16 of the public chain 4 provides personal information to the user terminal of the private chain 2. The personal information acquisition process is a process in which the user terminal of the private chain 2 acquires personal information from the user terminal 16 of the public chain.

図１７（Ａ）に基づいて、ブロックチェーンへの個人情報記録処理のサブルーチンプログラムのフローチャートを説明する。Ｓ１７４により、２つの乱数を生成する処理が行われる。例えば、ＤＥＳの場合には、５６ビットの乱数を２つ生成して、それら５６ビット乱数を片割れ共通鍵ＫＡ及びＫＢとする。ＡＤＳの場合には、１２８ビットの乱数を２つ生成して、それら１２８ビット乱数を片割れ共通鍵ＫＡ及びＫＢとする。 A flowchart of a subroutine program for recording personal information on the blockchain will be described with reference to FIG. 17 (A). In S174, a process of generating two random numbers is performed. For example, in the case of DES, two 56-bit random numbers are generated, and these 56-bit random numbers are used as the half-split common keys KA and KB. In the case of ADS, two 128-bit random numbers are generated, and these 128-bit random numbers are used as the half-split common keys KA and KB.

次に、Ｓ１７７において、Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））とＥ_Ｋ２(インデックス＋個人情報提供の対価）と暗号文識別子とをブロックチェーンに記録する処理が行われる。この暗号文識別子は、暗号化個人情報であるＥ_ＫＡ(Ｅ_ＫＢ(個人情報））を特定するための識別子であり、第１実施形態における電子ＩＤに相当するものである。Next, in S177, _{a process of recording EKA} ( _EKB (personal information)), _EK2 (index + consideration for providing personal information), and a ciphertext identifier is performed on the blockchain. This ciphertext identifier is an identifier _{for identifying EKA} ( _EKB (personal information)), which is encrypted personal information, and corresponds to an electronic ID in the first embodiment.

次に、Ｓ１８３により、ＫＡ及びＫＢと暗号文識別子とを対応付けてユーザ端末１６のＨＤＤ１２に記憶する。 Next, according to S183, KA and KB are associated with the ciphertext identifier and stored in the HDD 12 of the user terminal 16.

次に、図１７（Ｂ）に基づいて、記録解読不能処理のサブルーチンプログラムのフローチャートを説明する。Ｓ１９０により、ブロックチェーンに記録した暗号文（例えば、Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））等）のうち解読不能にしたい暗号文があるか否か判定される。ない場合にはリターンするが、解読不能にしたい暗号文がある場合には制御がＳ１９１に進み、その暗号文の暗号文識別子に対応付けて記憶している片割れ共通鍵ＫＡをＨＤＤ１２から検索する処理が行われる。Next, a flowchart of a subroutine program for record unbreakable processing will be described with reference to FIG. 17 (B). S190 determines whether or not there is a ciphertext to be made unbreakable among the ciphertexts recorded on the blockchain (for example, _EKA ( _{EKB (personal information)), etc.).} If there is no ciphertext, it returns, but if there is a ciphertext that you want to make undecipherable, the control proceeds to S191, and the process of searching the HDD 12 for the half-split symmetric key KA that is stored in association with the ciphertext identifier of that ciphertext. Is done.

次に、Ｓ１９２により、乱数Ｒが生成される。例えば、ＤＥＳの場合には、５６ビットの乱数を生成する。ＡＤＳの場合には、１２８ビットの乱数を生成する。次に、Ｓ１９３により、生成した乱数Ｒ＝ＫＡであるか否か判定される。生成した乱数ＲがＨＤＤ１２に記憶されている片割れ共通鍵ＫＡと同じであれば制御がＳ１９２に戻り、再度乱数を生成し直す。Ｓ１９３によりＮＯと判定されれば制御がＳ１９４へ進み、ＨＤＤ１２に記憶されている片割れ共通鍵ＫＡをＲに更新する処理が行われる。 Next, a random number R is generated by S192. For example, in the case of DES, a 56-bit random number is generated. In the case of ADS, a 128-bit random number is generated. Next, S193 determines whether or not the generated random number R = KA. If the generated random number R is the same as the one-sided symmetric key KA stored in the HDD 12, the control returns to S192 and the random number is generated again. If NO is determined by S193, the control proceeds to S194, and a process of updating the half-split common key KA stored in the HDD 12 to R is performed.

次に、図１８に基づいて、個人情報提供処理と個人情報入手処理とのサブルーチンプログラムのフローチャートを説明する。プライベートチェーン２のユーザ端末において、Ｓ１９８により、入手希望個人情報の片割れ共通鍵ＫＢが既に記憶されているか否か判定される。パブリックチェーン４のユーザ端末１６からプライベートチェーン２のユーザ端末へ、入手希望個人情報の片割れ共通鍵ＫＢが既に配布されておれば、Ｓ１９８により記憶ありと判定されて制御がＳ２０３へ進むが、未だに記憶されていない場合には制御がＳ１９９へ進む。 Next, a flowchart of the subroutine program of the personal information providing process and the personal information acquisition process will be described with reference to FIG. In the user terminal of the private chain 2, it is determined by S198 whether or not the half-split common key KB of the personal information desired to be obtained is already stored. If the half-split common key KB of the personal information desired to be obtained has already been distributed from the user terminal 16 of the public chain 4 to the user terminal of the private chain 2, it is determined by S198 that there is a memory, and the control proceeds to S203, but it is still stored. If not, the control proceeds to S199.

Ｓ１９９では、入手希望個人情報の暗号文識別子をパブリックチェーン４のユーザ端末１６へ送信して片割れ共通鍵ＫＢを要求する処理が行われる。それをＳ２００で受信したパブリックチェーン４のユーザ端末１６では、暗号文識別子で指定された個人情報を提供する取引を行うか否かをスマートコントラクトにより判定し（Ｓ１６参照）、個人情報を提供する取引を行う場合にはＳ２０１により、個人情報の提供に合意する署名と暗号文識別子に対応する片割れ共通鍵ＫＢとが返信される。 In S199, a process is performed in which the ciphertext identifier of the personal information desired to be obtained is transmitted to the user terminal 16 of the public chain 4 to request the half-split common key KB. The user terminal 16 of the public chain 4 that receives it in S200 determines whether or not to perform a transaction that provides personal information specified by the ciphertext identifier by a smart contract (see S16), and a transaction that provides personal information. In this case, S201 returns a signature that agrees to provide personal information and a half-split common key KB corresponding to the ciphertext identifier.

それをＳ２０２で受信したプライベートチェーン２のユーザ端末では、Ｓ２０３により、署名と入手希望個人情報の暗号文識別子とがパブリックチェーン４のユーザ端末１６へ送信される。それをＳ２０６で受信したパブリックチェーン４のユーザ端末１６では、暗号文識別子で指定された個人情報を提供する取引を行うか否かをスマートコントラクトにより判定し（Ｓ１６参照）、個人情報を提供する取引を行う場合にはＳ２０７により、Ｄ_KＡ(暗号化個人情報）またはＤ_R(暗号化個人情報）を演算して返信する処理が行われる。具体的には、ユーザ端末１６のＨＤＤ１２に記憶されている片割れ共通鍵ＫＡが既に乱数Ｒに更新されている場合にはＤ_R(暗号化個人情報）を演算して返信されるが、未だ乱数Ｒに更新されていない場合にはＤ_KＡ(暗号化個人情報）を演算して返信する処理が行われる。At the user terminal of the private chain 2 that receives it in S202, the signature and the ciphertext identifier of the personal information desired to be obtained are transmitted to the user terminal 16 of the public chain 4 by S203. The user terminal 16 of the public chain 4 that receives it in S206 determines by a smart contract whether or not to carry out a transaction that provides personal information specified by the encryption text identifier (see S16), and a transaction that provides personal information. the S207 when performing the process of replying by calculating D _{KA (encrypted} personal information) or D _R (encrypted personal information) is performed. More specifically, although when the halves common key KA stored in the HDD12 of the user terminal 16 has already been updated to the random number R is returned by calculating the D _R (encrypted personal information), yet random numbers If it has not been updated to R, _{a process of calculating DKA} (encrypted personal information) and returning it is performed.

パブリックチェーン４のユーザ端末１６からの返信をＳ２０８で受信したプライベートチェーン２のユーザ端末では、Ｓ２０９により、Ｄ_KＢ(Ｄ_KＡ(暗号化個人情報））＝平文または、Ｄ_KＢ(Ｄ_R(暗号化個人情報））≠平文を演算する処理が行われる。具体的には、Ｄ_KＡ(暗号化個人情報）を受信した場合には、Ｄ_KＢ(Ｄ_KＡ(暗号化個人情報））＝Ｄ_KＢ(Ｄ_KＡ(Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））））＝平文を演算して平文の個人情報を得る。一方、(Ｄ_R(暗号化個人情報））を受信した場合には、Ｄ_KＢ(Ｄ_Ｒ(暗号化個人情報））＝Ｄ_KＢ(Ｄ_Ｒ(Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））））≠平文が演算されることになり、平文の個人情報を得ることができない。これにより、削除したい情報の削除権を保証する要請と削除不可能性とが二律背反するジレンマを解決することができる。
[変形例]The user terminal of a private chain 2 received in S208 a response from the user terminal 16 of the public chain 4, the S209, D _KB (D _KA (encrypted personal information)) = plaintext or, D _KB (D _R (Encryption Personal information)) ≠ Processing to calculate plaintext is performed. Specifically, when _DKA (encrypted personal information) is received, _DKB ( _DKA (encrypted personal information)) = _DKB ( _DKA ( _EKA ( _EKB (personal information)))) ) = Compute the plaintext to obtain the plaintext personal information. On the other hand, (D _R (encrypted personal information)) when receiving the, _D _KB (D _R (encrypted personal _{_{information)) = D KB (D R}} (E KA (E KB ( personal information)))) ≠ Plaintext will be calculated, and personal information in plaintext cannot be obtained. As a result, it is possible to solve the dilemma in which the request for guaranteeing the right to delete the information to be deleted and the impossibility of deletion are in conflict with each other.
[Modification example]

（１）前述の説明では、暗号化個人情報であるＥ_ＫＡ(Ｅ_ＫＢ(個人情報）をブロックチェーンに直接記録しており、この大量の暗号化個人情報を各ノード（パブリックチェーン４では全てのノード）に記憶させていたのでは、各ノード（ユーザ端末）が膨大な記憶容量を要求されるという不都合が生じる。これを解決する手段として、分割したデータを複数のコンピュータで保存する秘密分散技術を応用する。データを分割して断片化し、各断片化データを複数のノードに分散保存する。また各ノードに保存されるデータを冗長化して（重複して）記憶させる。十分な冗長性を持たせることにより、断片データの一部が失われても復元に支障はなく、かつ、ブロックチェーンとしての改竄の困難性も担保できる。さらに、断片化データの記憶を担う記憶量を各ノードの意思で決めるように制御し、担う記憶量に応じた対価をトークン等の形で各ノードに付与するように制御してもよい。(1) In the above explanation, _EKA ( _EKB (personal information)), which is encrypted personal information, is directly recorded in the blockchain, and this large amount of encrypted personal information is recorded in each node (all in public chain 4). If the information is stored in the node), there is an inconvenience that each node (user terminal) is required to have a huge storage capacity. As a means to solve this, a secret sharing technology for storing the divided data on a plurality of computers. The data is divided and fragmented, and each fragmented data is distributed and stored in a plurality of nodes. The data stored in each node is redundantly (duplicated) stored. Sufficient redundancy is provided. By having it, even if a part of the fragmented data is lost, there is no problem in restoration, and the difficulty of falsification as a blockchain can be guaranteed. Furthermore, the storage amount for storing the fragmented data is stored in each node. It may be controlled so as to be decided by intention, and the consideration may be given to each node in the form of a token or the like according to the amount of storage to be carried.

（２）前述の説明では、暗号化個人情報であるＥ_ＫＡ(Ｅ_ＫＢ(個人情報）をブロックチェーンに直接記録していたが、図１９〜図２３に示す変形例では、暗号化個人情報Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報）)を認定事業者１７の個人情報ＤＢ２９に記憶し、その暗号化個人情報のハッシュ値をブロックチェーンに記録するようにしている。図１９（Ａ）に示す取引Ｉには、Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））のハッシュ値＋Ｅ_Ｋ２(インデックス＋２.４トークンで提供）＋暗号文識別子と電子署名とが記録されている。また、図１９（Ｂ）に示す認定事業者１７の個人情報ＤＢ２９には、暗号文識別子に対応付けて暗号化個人情報Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報）)が記憶されている。(2) In the above description, the encrypted personal information E _KA (E _KB (personal information)) was directly recorded on the blockchain, but in the modified examples shown in FIGS. 19 to 23, the encrypted personal information E _KA (E _KB (personal information)) is stored in the personal information DB 29 of the certified business operator 17, and the hash value of the encrypted personal information is recorded in the blockchain. Transaction I shown in FIG. 19 (A). The hash value of E _KA (E _KB (personal information)) + E _K2 (provided by index + 2.4 tokens) + cryptographic identifier and electronic signature are recorded in FIG. 19 (B). personal information DB29 of certified operators 17, encrypted personal information E _KA in association with the ciphertext identifier _{(E KB} (personal information)) is stored.

図２０に基づいて、この変形例でのプライベートチェーン２のノード１９を構成しているユーザ端末とパブリックチェーン４のノード１９を構成しているユーザ端末１６と認定事業者１７のサーバ１８とのメインルーチンのフローチャートを説明する。認定事業者１７のサーバ１８は、ノード１９としてブロックチェーンに参加している。Ｓ２１５によりブロックチェーンへの個人情報記録処理が実行され、Ｓ２１６により記録解読不能処理が実行され、Ｓ２１７により個人情報提供処理が実行され、Ｓ２２０によりハッシュ値記録処理が実行され、Ｓ２２１により暗号文送信処理が実行され、Ｓ２２４により個人情報入手処理が実行される。 Based on FIG. 20, the main of the user terminal constituting the node 19 of the private chain 2 in this modification, the user terminal 16 constituting the node 19 of the public chain 4, and the server 18 of the authorized business operator 17. The flow chart of the routine will be described. The server 18 of the certified business operator 17 participates in the blockchain as a node 19. S215 executes personal information recording processing on the blockchain, S216 executes record unbreakable processing, S217 executes personal information providing processing, S220 executes hash value recording processing, and S221 executes ciphertext transmission processing. Is executed, and the personal information acquisition process is executed by S224.

ブロックチェーンへの個人情報記録処理は、パブリックチェーン４のノード１９を構成しているユーザ端末１６が認定事業者１７のサーバ１８に対し、暗号化個人情報Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報）)を送信する処理である。ハッシュ値記録処理は、暗号化個人情報Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報）)等を受信した認定事業者１７のサーバ１８がそれを記憶すると共にそのハッシュ値を生成してブロックチェーンに記録する処理である。記録解読不能処理は、解読不能にしたい暗号化個人情報Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報）)等の暗号文を解読不能にするための処理である。個人情報提供処理は、パブリックチェーン４のノード１９を構成しているユーザ端末１６がプライベートチェーン２のノード１９を構成しているユーザ端末へ個人情報を提供するために実行する処理である。個人情報入手処理は、プライベートチェーン２のノード１９を構成しているユーザ端末が個人情報を入手する処理である。暗号文送信処理は、認定事業者１７のサーバ１８がプライベートチェーン２のノード１９を構成しているユーザ端末に対し暗号化個人情報Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報）)等の暗号文を送信する処理である。In the personal information recording process on the blockchain, the user terminal 16 constituting the node 19 of the public chain 4 sends the encrypted personal information _EKA ( _EKB (personal information)) to the server 18 of the authorized business operator 17. This is the process of sending. The hash value recording process is a process in which _{the server 18 of the authorized business operator 17 that receives the encrypted personal information EKA} ( _EKB (personal information)) or the like stores it and generates the hash value and records it in the blockchain. Is. The record unbreakable process is a process for making a ciphertext such as encrypted personal information _EKA ( _{EKB (personal information)) that is desired to be unbreakable unbreakable.} The personal information providing process is a process executed by the user terminal 16 constituting the node 19 of the public chain 4 to provide personal information to the user terminal constituting the node 19 of the private chain 2. The personal information acquisition process is a process in which a user terminal constituting the node 19 of the private chain 2 acquires personal information. In the ciphertext transmission process, the server 18 of the authorized business operator 17 transmits a ciphertext such as encrypted personal information _EKA ( _{EKB (personal information)) to the user terminals constituting the node 19 of the private chain 2.} It is a process.

以下に、各処理の詳細を各サブルーチンプログラムのフローチャートに基づいて説明するが、第２実施形態との相違点について主に説明する。 The details of each process will be described below based on the flowchart of each subroutine program, but the differences from the second embodiment will be mainly described.

図２１に基づいて、ブロックチェーンへの個人情報記録処理とハッシュ値記録処理とのサブルーチンプログラムのフローチャートを説明する。Ｓ２３１により、Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））とＥ_Ｋ２(インデックス＋個人情報提供の対価）とが認定事業者１７のサーバ１８へ送信される。それをＳ２４０で受信したサーバ１８では、Ｓ２４１により、Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））のハッシュ値と暗号文識別子とを生成する処理が行われる。次に、Ｓ２４２により、Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））のハッシュ値とＥ_Ｋ２(インデックス＋個人情報提供の対価）と暗号文識別子とをブロックチェーンに記録する処理が行われる。A flowchart of a subroutine program of personal information recording processing and hash value recording processing on the blockchain will be described with reference to FIG. By S231, _EKA ( _EKB (personal information)) and _EK2 (index + consideration for providing personal information) are transmitted to the server 18 of the certified business operator 17. The server 18 receives it in S240, the _S241, the process of generating the hash value and the ciphertext identifier _{_E KA (E} _KB (personal information)) is performed. Next, S242 _{performs a process of recording the hash value of EKA} ( _EKB (personal information)), _EK2 (index + consideration for providing personal information), and the ciphertext identifier in the blockchain.

次にＳ２４３により、暗号文識別子をパブリックチェーン４のユーザ端末１６へ送信する処理が行われる。それをＳ２３２で受信したパブリックチェーン４のユーザ端末１６では、Ｓ２３３により、片割れ鍵ＫＡ及びＫＢと受信した暗号文識別子とを対応付けてＨＤＤ１２に記憶する処理が行われる。認定事業者１７のサーバ１８では、Ｓ２４４により、Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））と暗号文識別子とを対応付けて個人情報ＤＢ２９に記憶する処理が行われる。Next, S243 performs a process of transmitting the ciphertext identifier to the user terminal 16 of the public chain 4. In the user terminal 16 of the public chain 4 that receives it in S232, the processing of associating the half-split key KA and KB with the received ciphertext identifier and storing them in the HDD 12 is performed by S233. In the server 18 approved vendors 17, the _S244, the processing of storing the personal information DB29 in association _{_E KA (E} _KB (personal information)) and the ciphertext identifier is performed.

図２２に示す記録解読不能処理は、第２実施形態の図１７（Ｂ）で既に説明したものおと同じであるため、説明の繰り返しを省略する。 Since the record unbreakable process shown in FIG. 22 is the same as that already described in FIG. 17 (B) of the second embodiment, the repetition of the description will be omitted.

次に、図２３に基づいて、個人情報提供処理と個人情報入手処理と暗号文送信処理とのサブルーチンプログラムのフローチャートを説明する。この変形例では、プライベートチェーン２のユーザ端末は、個人情報の提供に合意する署名と暗号文識別子に対応する片割れ共通鍵ＫＢとをパブリックチェーン４のユーザ端末１６から受信すると（Ｓ２６４）、Ｓ２６５により、受信した署名と入手希望個人情報の暗号文識別子とを認定事業者１７のサーバ１８へ送信する。それをＳ２６６により受信した認定事業者１７のサーバ１８では、Ｓ２６７により、署名を確認した上で暗号文識別子に対応する暗号文を個人情報ＤＢ２９から検索してパブリックチェーン４のユーザ端末１６へ送信する処理が行われる。 Next, a flowchart of the subroutine program of the personal information providing process, the personal information acquisition process, and the ciphertext transmission process will be described with reference to FIG. 23. In this modification, when the user terminal of the private chain 2 receives the half-split common key KB corresponding to the signature that agrees to provide the personal information and the ciphertext identifier from the user terminal 16 of the public chain 4 (S264), S265 , The received signature and the ciphertext identifier of the personal information desired to be obtained are transmitted to the server 18 of the authorized business operator 17. The server 18 of the authorized business operator 17 that received it in S266 searches the personal information DB 29 for the ciphertext corresponding to the ciphertext identifier after confirming the signature by S267 and transmits it to the user terminal 16 of the public chain 4. Processing is done.

それをＳ２６８で受信したパブリックチェーン４のユーザ端末１６では、Ｓ２６９により、Ｄ_KＡ(暗号化個人情報）またはＤ_R(暗号化個人情報））を演算してプライベートチェーン２のユーザ端末へ返信する処理が行われる。具体的には、ユーザ端末１６のＨＤＤ１２に記憶されている片割れ共通鍵ＫＡが既にＲに更新されている場合にはＤ_R(暗号化個人情報）を演算して返信されるが、未だＲに更新されていない場合にはＤ_KＡ(暗号化個人情報）を演算して返信する処理が行われる。In the user terminal 16 of the public chain 4 which received it at S268, the S269, and returns by calculating D _{KA (encrypted} personal information) or D _R (encrypted personal information)) to the user terminal of a private chain of processing Is done. More specifically, although when the halves common key KA stored in the HDD12 of the user terminal 16 has already been updated to R are returned by calculating the D _R (encrypted personal information), yet the R If it has not been updated, _DKA (encrypted personal information) is calculated and returned.

パブリックチェーン４のユーザ端末１６からの返信をＳ２７０で受信したプライベートチェーン２のユーザ端末では、Ｓ２７１により、Ｄ_KＢ(Ｄ_KＡ(暗号化個人情報））＝平文または、Ｄ_KＢ(Ｄ_R(暗号化個人情報））≠平文を演算する処理が行われる。具体的には、Ｄ_KＡ(暗号化個人情報）を受信した場合には、Ｄ_KＢ(Ｄ_KＡ(暗号化個人情報））＝Ｄ_KＢ(Ｄ_KＡ(Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））））＝平文を演算して平文の個人情報を得る。一方、(Ｄ_R(暗号化個人情報））を受信した場合には、Ｄ_KＢ(Ｄ_Ｒ(暗号化個人情報））＝Ｄ_KＢ(Ｄ_Ｒ(Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））））≠平文が演算されることになり、平文の個人情報を得ることができない。これにより、削除したい情報の削除権を保証する要請と削除不可能性とが二律背反するジレンマを解決することができる。The user terminal of a private chain 2 received in S270 a response from the user terminal 16 of the public chain 4, the S271, D _KB (D _KA (encrypted personal information)) = plaintext or, D _KB (D _R (Encryption Personal information)) ≠ Processing to calculate plaintext is performed. Specifically, when _DKA (encrypted personal information) is received, _DKB ( _DKA (encrypted personal information)) = _DKB ( _DKA ( _EKA ( _EKB (personal information)))) ) = Compute the plaintext to obtain the plaintext personal information. On the other hand, (D _R (encrypted personal information)) when receiving the, _D _KB (D _R (encrypted personal _{_{information)) = D KB (D R}} (E KA (E KB ( personal information)))) ≠ Plaintext will be calculated, and personal information in plaintext cannot be obtained. As a result, it is possible to solve the dilemma in which the request for guaranteeing the right to delete the information to be deleted and the impossibility of deletion are in conflict with each other.

なお、認定事業者１７のサーバ１８は、ノード１９としてブロックチェーンに参加することなくインターネット１を介してプライベートチェーン２のユーザ端末及びパブックチェーン４のユーザ端末に接続されるものであってもよい。 The server 18 of the certified business operator 17 may be connected to the user terminal of the private chain 2 and the user terminal of the pabook chain 4 via the Internet 1 without participating in the blockchain as the node 19. ..

（３）前述の説明では片割れ共通鍵ＫＡを個人情報主が保持（ユーザ端末１６のＨＤＤ１２に記憶）していたが、その代わりに、所定機関（第三者機関）の一例の鍵登録センタ３０の鍵ＤＢ３２に片割れ共通鍵ＫＡを登録しておくようにしてもよい。なお、片割れ共通鍵ＫＡは秘匿状態で鍵ＤＢ３２に記憶しておく。この変形例を図２４〜図２７に基づいて説明する。 (3) In the above explanation, the personal information owner holds the half-split common key KA (stored in the HDD 12 of the user terminal 16), but instead, the key registration center 30 is an example of a predetermined institution (third-party institution). The one-sided common key KA may be registered in the key DB 32 of the above. The one-sided common key KA is stored in the key DB 32 in a secret state. This modification will be described with reference to FIGS. 24 to 27.

図２４を参照し、鍵登録センタ３０のサーバ３１がインターネット１に接続されている。そのサーバ３１に接続されている鍵ＤＢ３２には、パブリックチェーン４の各ノード１９であるユーザのアドレス毎に、暗号文識別子と片割れ共通鍵ＫＡとが対応付けられて記憶されている。そして、ユーザから記録解読不能化要求があれば、その要求のあった記録に相当する暗号文識別子に対応付けて記憶されている片割れ共通鍵ＫＡを乱数Ｒに更新する。図２４では、アドレス0x6079ddの暗号文識別子307cd4に対応付けて記憶されている片割れ共通鍵が乱数１Ｒ２に更新されており、アドレス0x6080ddの暗号文識別子4arb56に対応付けて記憶されている片割れ共通鍵が乱数２Ｒnに更新されており、アドレス0x6978ddの暗号文識別子e2c87rに対応付けて記憶されている片割れ共通鍵が乱数mＲ１に更新されている。 With reference to FIG. 24, the server 31 of the key registration center 30 is connected to the Internet 1. In the key DB 32 connected to the server 31, a ciphertext identifier and a half-split common key KA are stored in association with each user's address, which is each node 19 of the public chain 4. Then, when the user requests to make the record unbreakable, the half-split common key KA stored in association with the ciphertext identifier corresponding to the requested record is updated to the random number R. In FIG. 24, the one-sided common key stored in association with the ciphertext identifier 307cd4 at address 0x6079dd is updated to the random number 1R2, and the one-sided common key stored in association with the ciphertext identifier 4arb56 at address 0x6080dd is The one-sided common key that has been updated to the random number 2Rn and stored in association with the ciphertext identifier e2c87r at the address 0x6978dd has been updated to the random number mR1.

次に図２５に基づいて、パブリックチェーン４のユーザ端末１６と鍵登録センタ３０のサーバ３１とプライベートチェーン２のユーザ端末とのメインルーチンのフローチャートを説明する。第２実施形態との共通点については説明の繰り返しを省略し主に相違点について説明する。 Next, a flowchart of the main routine of the user terminal 16 of the public chain 4, the server 31 of the key registration center 30, and the user terminal of the private chain 2 will be described with reference to FIG. 25. Regarding the common points with the second embodiment, the repetition of the description will be omitted, and the differences will be mainly described.

パブリックチェーン４のユーザ端末１６では、Ｓ４６８によりブロックチェーンへの個人情報記録処理が実行され、Ｓ４６９により記録解読不能化要求処理が実行され、Ｓ４７０により復号鍵提供処理が実行される。鍵登録センタ３０のサーバ３１では、Ｓ４６３により鍵登録処理が実行され、Ｓ４６４により記録解読不能処理が実行され、Ｓ４６５によりデータ復号処理が実行される。プライベートチェーン２のユーザ端末では、Ｓ４６０によりデータ入手処理が実行さる。 In the user terminal 16 of the public chain 4, S468 executes the personal information recording process on the blockchain, S469 executes the record decryption request process, and S470 executes the decryption key providing process. In the server 31 of the key registration center 30, the key registration process is executed by S463, the record unbreakable process is executed by S464, and the data decryption process is executed by S465. At the user terminal of the private chain 2, the data acquisition process is executed by S460.

次に図２６（Ａ）に基づいて、ブロックチェーンへの個人情報記録処理と鍵登録処理とのサブルーチンプログラムのフローチャートを説明する。パブリックチェーン４のユーザ端末１６において、Ｓ４７９により、Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））とＥ_Ｋ２(インデックス＋個人情報提供の対価）と暗号文識別子とがブロックチェーンに記録され、Ｓ４８０により、片割れ共通鍵ＫＡと暗号文識別子とを鍵登録センタ３０へ送信する処理が行われる。Next, a flowchart of the subroutine program of the personal information recording process and the key registration process on the blockchain will be described with reference to FIG. 26 (A). In the user terminal 16 of the public chain 4, the _{S479, E KA} and _{(E KB} (personal information)) _{E K2} (the consideration for providing index + identity) and the ciphertext identifier is recorded in a block chain, the S480, halves A process of transmitting the common key KA and the ciphertext identifier to the key registration center 30 is performed.

それをＳ４７４で受信した鍵登録センタ３０のサーバ３１では、Ｓ４７５により、受信した片割れ共通鍵ＫＡと暗号文識別子とを対応付けて鍵ＤＢ３２に記憶する処理が行われる。 In the server 31 of the key registration center 30 that has received it in S474, a process of associating the received symmetric common key KA with the ciphertext identifier and storing it in the key DB 32 is performed by S475.

次に、図２６（Ｂ）に基づいて、記録解読不能化要求処理と記録解読不能処理とのサブルーチンプログラムのフローチャートを説明する。パブリックチェーン４のユーザ端末１６において、Ｓ４９４により、解読不能にしたい暗号文があるか否か判定され、ない場合にはリターンするが、ある場合には、Ｓ４９５により、解読不能を要求する暗号文識別子を鍵登録センタ３０のサーバ３１へ送信する処理が行われる。 Next, a flowchart of the subroutine program of the record unbreakable request process and the record unbreakable process will be described with reference to FIG. 26 (B). In the user terminal 16 of the public chain 4, S494 determines whether or not there is a ciphertext to be unbreakable, and if not, returns, but in some cases, S495 requests a ciphertext identifier that requests unbreakable. Is transmitted to the server 31 of the key registration center 30.

それをＳ４８５で受信した鍵登録センタ３０のサーバ３１では、受信した暗号文識別子に対応付けて記憶されている片割れ共通鍵ＫＡを鍵ＤＢ３２から検索する処理が行われる。次にＳ４８７により乱数Ｒを生成し、Ｓ４８８により、その乱数Ｒ＝ＫＡであるか否か判定される。Ｒ＝ＫＡの場合にはＳ４８７により再度乱数Ｒを生成し直し、Ｒ≠ＫＡとなった段階でＳ４８９により、片割れ共通鍵ＫＡをＲに更新する処理が行われる。 The server 31 of the key registration center 30 that has received it in S485 performs a process of searching the key DB 32 for the half-split common key KA stored in association with the received ciphertext identifier. Next, a random number R is generated by S487, and it is determined by S488 whether or not the random number R = KA. When R = KA, the random number R is regenerated by S487, and when R ≠ KA, S489 performs a process of updating the half-split common key KA to R.

次に図２７により、片割れ共通鍵提供処理とデータ入手処理とデータ復号処理とのサブルーチンプログラムのフローチャートを説明する。プライベートチェーン２のユーザ端末において、Ｓ５０３により、入手希望データの暗号文識別子を鍵登録センタ３０のサーバ３１へ送信する処理が行われる。それをＳ５０４により受信した鍵登録センタ３０のサーバ３１では、Ｓ５０５により、暗号文識別子に対応する暗号文（暗号化個人情報等）をブロックチェーンから検索する処理が行われる。次にＳ５０６により、暗号文識別子に対応する片割れ共通鍵ＫＡまたはＲを検索する処理が行われる。 Next, with reference to FIG. 27, a flowchart of a subroutine program of the half-split common key provision process, the data acquisition process, and the data decoding process will be described. In the user terminal of the private chain 2, S503 performs a process of transmitting the ciphertext identifier of the data desired to be obtained to the server 31 of the key registration center 30. The server 31 of the key registration center 30 that has received it in S504 performs a process in S505 to search the blockchain for the ciphertext (encrypted personal information, etc.) corresponding to the ciphertext identifier. Next, S506 performs a process of searching for the half-split common key KA or R corresponding to the ciphertext identifier.

次にＳ５０７により、Ｄ_KＡ(暗号化個人情報）またはＤ_R(暗号化個人情報）をプライベートチェーン２のユーザ端末に返信する処理が行われる。具体的には、鍵登録センタ３０の鍵ＤＢ３２に記憶されている片割れ共通鍵ＫＡが既にＲに更新されている場合にはＤ_R(暗号化個人情報）を演算して返信されるが、未だＲに更新されていない場合にはＤ_KＡ(暗号化個人情報）を演算して返信する処理が行われる。The next S507, processing of transmitting D _{KA (encrypted} personal information) or D _R (encrypted personal information) to the user terminal of a private chain 2 is performed. More specifically, although in the case where counterpart common key KA stored in the key DB32 key registration center 30 has already been updated to R are returned by calculating the D _R (encrypted personal information), yet If it has not been updated to R, _{a process of calculating DKA} (encrypted personal information) and returning it is performed.

それをＳ５０９で受信したプライベートチェーン２のユーザ端末では、Ｓ５１０により、Ｄ_KA(Ｄ_KB(暗号化個人情報））＝平文または、Ｄ_KA(Ｄ_R(暗号化個人情報））≠平文を演算する処理が行われる。具体的には、Ｄ_KＡ(暗号化個人情報）を受信した場合には、Ｄ_KＢ(Ｄ_KＡ(暗号化個人情報））＝Ｄ_KＢ(Ｄ_KＡ(Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））））＝平文を演算して平文の個人情報を得る。一方、(Ｄ_R(暗号化個人情報））を受信した場合には、Ｄ_KＢ(Ｄ_Ｒ(暗号化個人情報））＝Ｄ_KＢ(Ｄ_Ｒ(Ｅ_ＫＡ(Ｅ_ＫＢ(個人情報））））≠平文が演算されることになり、平文の個人情報を得ることができない。これにより、削除したい情報の削除権を保証する要請と削除不可能性とが二律背反するジレンマを解決することができる。しかも、片割れ共通鍵ＫＡをＲに更新する処理が鍵登録センタ３０において行われるため、ＫＡをＲに更新して削除権が保証されたことの信頼性を担保しやすい。例えば、ＫＡのＲへの更新を所定機関による監査の元で行ないやすいという利点がある。The user terminal of a private chain 2 which has received it at S509, the S510, D _KA (D _KB (encrypted personal information)) = plaintext or, D _KA (D _R (encrypted personal information)) is calculated ≠ plaintext Processing is done. Specifically, when _DKA (encrypted personal information) is received, _DKB ( _DKA (encrypted personal information)) = _DKB ( _DKA ( _EKA ( _EKB (personal information)))) ) = Compute the plaintext to obtain the plaintext personal information. On the other hand, (D _R (encrypted personal information)) when receiving the, _D _KB (D _R (encrypted personal _{_{information)) = D KB (D R}} (E KA (E KB ( personal information)))) ≠ Plaintext will be calculated, and personal information in plaintext cannot be obtained. As a result, it is possible to solve the dilemma in which the request for guaranteeing the right to delete the information to be deleted and the impossibility of deletion are in conflict with each other. Moreover, since the process of updating the half-split common key KA to R is performed at the key registration center 30, it is easy to ensure the reliability that the deletion right is guaranteed by updating KA to R. For example, there is an advantage that it is easy to update KA to R under an audit by a predetermined organization.

（４）削除したい情報の削除権を保証する要請と削除不可能性とが二律背反するジレンマを解決する他の方法として、認定事業者１７の個人情報ＤＢ２９に記憶されている暗号化個人情報であるＥ_ＫＡ(Ｅ_ＫＢ(個人情報）)を個人情報主の要請に応じて削除するようにしてもよい。その場合は、ブロックチェーン上に個人情報のハッシュ値が記録されているにもかかわらずそれに対応する個人情報が個人情報ＤＢ２９に記憶されていないという矛盾した状態が生じるが、この矛盾を許容することができるのであれば、個人情報の削除も有効な手段となる。(4) As another method for solving the dilemma in which the request for guaranteeing the right to delete the information to be deleted and the impossibility of deletion are in conflict with each other, the encrypted personal information stored in the personal information DB 29 of the certified business operator 17 is used. E _KA (E _KB (personal information)) may be deleted at the request of the personal information owner. In that case, although a hash value of personal information is recorded on the blockchain, a contradictory state occurs in which the corresponding personal information is not stored in the personal information DB 29, but this inconsistency should be tolerated. If possible, deleting personal information is also an effective means.

（５）削除権を保証する情報は、個人情報に限らず、例えば、ＳＮＳやブログへの投稿情報（投稿写真及び投稿動画のデータを含む）、遺言や任意後見契約などの公正証書、私文書や会社等の定款、その他確定日付が必要なもの等、どのような情報であってもよい。また、第２実施形態では削除権を保証する情報をブロックチェーンを利用して記録していたが、ブロックチェーンは一例に過ぎず、他のものを利用して記録してもよい。 (5) Information that guarantees the right to delete is not limited to personal information, for example, information posted on SNS and blogs (including data of posted photos and posted videos), notarial acts such as wills and voluntary guardianship contracts, and private documents. Any information may be used, such as the articles of incorporation of the company, the articles of incorporation, and other information that requires a fixed date. Further, in the second embodiment, the information guaranteeing the deletion right is recorded using the blockchain, but the blockchain is only an example, and other information may be used for recording.

（６）各ブロックチェーンのノード１９を構成するユーザ端末１６等及び各種サーバで動作する前述したプログラムは、所定のウェブサイト等からダウンロードしてインストールしてもよいが、例えばＣＤ−ＲＯＭ９９等の記録媒体（非一時的（non-transitory）な記録媒体）に記録させて流通させ、そのＣＤ−ＲＯＭ９９等を購入した者がプログラムをユーザ端末１６及び各種サーバにインストールしてもよい（図６０参照）。 (6) The above-mentioned programs running on the user terminals 16 and the like constituting the node 19 of each blockchain and various servers may be downloaded and installed from a predetermined website or the like, but for example, recording of a CD-ROM 99 or the like. A person who purchases the CD-ROM99 or the like by recording it on a medium (non-transitory recording medium) and distributing it may install the program on the user terminal 16 and various servers (see FIG. 60). ..

（７）前述の説明では、片割れ共通鍵ＫＡで１度暗号化したものを片割れ共通鍵ＫＢで再度暗号化するという２度の暗号化を行い、また、片割れ共通鍵ＫＢで１度復号したものを片割れ共通鍵ＫＡで再度復号するという２度の復号により平文にしている。しかし、これに限定されるものだはなく、片割れ共通鍵ＫＡまたはＫＢでの暗号化を複数回行い、片割れ共通鍵ＫＡまたはＫＢでの復号を複数回行うものであってもよい。さらに、片割れ共通鍵ＫＡ及びＫＢは２つに限定されるものではなく、３つ以上の片割れ共通鍵を用いてもよい。 (7) In the above description, the one encrypted once with the one-sided common key KA is encrypted twice with the one-sided common key KB and then decrypted once with the one-sided common key KB. Is decrypted again with the one-sided common key KA to make it plain text. However, the present invention is not limited to this, and the encryption may be performed with the half-split common key KA or KB a plurality of times, and the decryption may be performed a plurality of times with the one-sided common key KA or KB. Further, the half-split common key KA and KB are not limited to two, and three or more half-split common keys may be used.

さらには、片割れ共通鍵ＫＡ及びＫＢの排他的論理和（イクスクルーシブオア）を演算して１つの鍵Ｋを生成し（ＫＡ(+)ＫＢ＝Ｋ）、その鍵Ｋで個人情報を暗号化し（Ｅ_Ｋ(個人情報））、片割れ共通鍵ＫＡを鍵登録センタ３０の鍵ＤＢ３２に登録すると共に、個人情報の要求者に片割れ共通鍵ＫＢを配布する。個人情報の要求者からの要求を受けた個人情報主は暗号化個人情報を鍵登録センタ３０のサーバ３１へ送信し、個人情報の要求者は配布された片割れ共通鍵ＫＢを鍵登録センタ３０のサーバ３１へ送信する。鍵登録センタ３０のサーバ３１では、受信した片割れ共通鍵ＫＢと鍵ＤＢ３２に登録されている片割れ共通鍵ＫＡとの排他的論理和（イクスクルーシブオア）を演算して１つの鍵Ｋを生成し（ＫＡ(+)ＫＢ＝Ｋ）、受信した暗号化個人情報（Ｅ_Ｋ(個人情報））をその鍵Ｋで復号して平文にした上で（Ｄ_Ｋ(Ｅ_Ｋ(個人情報））＝平文)、その平文個人情報を個人情報の要求者へ送信するようにしてもよい。上記(+)は排他的論理和（イクスクルーシブオア）を記号で表現したものである。Furthermore, the exclusive logical sum (exclusive or) of the one-sided common key KA and KB is calculated to generate one key K (KA (+) KB = K), and the personal information is encrypted with the key K. (E K _(personal information)), and registers the key DB32 halves common key KA key registration center 30 distributes the halves common key KB to the requester of the personal information. Upon receiving the request from the requester of personal information, the personal information owner sends the encrypted personal information to the server 31 of the key registration center 30, and the requester of personal information sends the distributed half-split common key KB to the key registration center 30. Send to server 31. The server 31 of the key registration center 30 calculates an exclusive logical sum (exclusive or) between the received one-sided common key KB and the one-sided common key KA registered in the key DB 32 to generate one key K. (KA (+) KB = K ), received encrypted personal information _{(E K} (personal information)) to decrypt with the key K on which the plaintext _{(D K} _{(E K} (personal information)) = plaintext ), The plaintext personal information may be sent to the requester of the personal information. The above (+) is a symbolic representation of the exclusive OR.

なお、排他的論理和（イクスクルーシブオア）は一例に過ぎず、片割れ共通鍵ＫＡ及びＫＢから１つの鍵Ｋを生成するものであればどのようなアルゴリズムを用いてもよい。 The exclusive OR is only an example, and any algorithm may be used as long as it generates one key K from the half-split common key KA and KB.

また、排他的論理和（イクスクルーシブオア）のような加法群を用いて鍵Ｋを生成する上記方式の場合には、片割れ共通鍵ＫＡ及びＫＢを定期的に更新してセキュリティを維持できる利点がある。例えば、一方の片割れ共通鍵ＫＡをＫＣに更新した場合には、他方の片割れ共通鍵ＫＢ＝Ｋ(+)ＫＣとなり、演算により求めることができる。このようにして片割れ共通鍵ＫＡ及びＫＢを更新することにより、片割れ共通鍵の漏洩に対抗することができるばかりでなく、１度片割れ共通鍵ＫＢを配布した個人情報要求者が再度ブロックチェーン上の暗号化個人情報を復号できないようにして閲覧を阻止することが可能となる。このような片割れ共通鍵の更新を片割れ共通鍵ＫＢの１度の配布毎に実行することにより、片割れ共通鍵ＫＢの配布を受けた者が他人にその片割れ共通鍵ＫＢを横流ししたとしても、その横流しを受けた者によるブロックチェーン上の暗号化個人情報を復号不能にすることが可能となる。つまり、配布する片割れ共通鍵ＫＢを１度のみ使用可能なワンタイム鍵にすることができる。 Further, in the case of the above method in which the key K is generated by using an additive group such as exclusive OR, there is an advantage that the half-split common keys KA and KB can be periodically updated to maintain security. There is. For example, when one half-split common key KA is updated to KC, the other half-split common key KB = K (+) KC, which can be obtained by calculation. By updating the one-sided common key KA and KB in this way, not only can the leakage of the one-sided common key be countered, but also the personal information requester who once distributed the one-sided common key KB can again be on the blockchain. It is possible to prevent browsing by preventing the encrypted personal information from being decrypted. By executing such an update of the one-sided common key KB for each distribution of the one-sided common key KB, even if a person who receives the distribution of the one-sided common key KB circulates the one-sided common key KB to another person, the one-sided common key KB can be distributed. It is possible to make encrypted personal information on the blockchain unbreakable by a person who has been diverted. That is, the one-sided common key KB to be distributed can be used as a one-time key that can be used only once.

また、共通鍵に限定されるものではなく、ＲＳＡや楕円暗号等の公開鍵暗号方式を用いてもよい。 Further, the key is not limited to the common key, and a public key cryptosystem such as RSA or elliptic curve may be used.

また、上記の鍵の更新を実現するにおいて、以下のような条件を満たす暗号アルゴリズムを採用してもよい。
平文をＭ、その暗号文をＣ、暗号鍵をＫＡ、ＫＢ、ＫＣ及びＫＤと表し、
Ｅ_ＫＡ(Ｅ_ＫＢ(Ｍ））＝Ｅ_ＫＣ(Ｅ_ＫＤ(Ｍ））＝Ｃ
の式が成立するアルゴリズム。Further, in realizing the above-mentioned key update, a cryptographic algorithm satisfying the following conditions may be adopted.
The plaintext is represented by M, the ciphertext is represented by C, and the encryption key is represented by KA, KB, KC and KD.
E _KA (E _KB (M)) = E _KC (E _KD (M)) = C
Algorithm that holds the formula of.

このようなアルゴリズムが共通鍵暗号アルゴリズムの場合には、片割れ共通鍵ＫＡ及びＫＢをＫＣ及びＫＤに更新した場合に、ブロックチェーンに記録されている暗号文Ｃを片割れ共通鍵ＫＣ及びＫＤで復号することにより平文Ｍを得ることができる。一方、公開鍵暗号アルゴリズムの場合には、秘密鍵ＫＡ及びＫＢをＫＣ及びＫＤに更新した場合に、秘密鍵ＫＣ及びＫＤに対応するペアの公開鍵ＰＫＣ及びＰＫＤでブロックチェーンに記録されている暗号文Ｃを復号することにより平文Ｍを得ることができる。
（８）前述の説明では、情報保持者が情報要求者に暗号化個人情報（Ｄ_KＡ（暗号化個人情報）またはＤ_Ｒ（暗号化個人情報））と片割れ共通鍵ＫＢとを送信し（Ｓ２０１、Ｓ２０７）、情報要求者自身が片割れ共通鍵ＫＢを用いて暗号化個人情報を平文にするべく復号していたが（Ｓ２０９）、片割れ共通鍵ＫＢを用いた復号を第三者機関（所定のサービス機関）が担ってもよい。この場合、情報保持者が暗号化個人情報（Ｄ_KＡ（暗号化個人情報）またはＤ_Ｒ（暗号化個人情報））と片割れ共通鍵ＫＢとを第三者機関（所定のサービス機関）へ送信し、第三者機関（所定のサービス機関）で復号して情報要求者へ送信する。
[開示内容の特徴点]When such an algorithm is a common key cryptographic algorithm, when the plaintext common keys KA and KB are updated to KC and KD, the ciphertext C recorded in the blockchain is decrypted by the plaintext common keys KC and KD. As a result, the plaintext M can be obtained. On the other hand, in the case of the public key cryptographic algorithm, when the private keys KA and KB are updated to KC and KD, the ciphers recorded in the blockchain with the public keys PKC and PKD of the pair corresponding to the private keys KC and KD. The plaintext M can be obtained by decoding the sentence C.
(8) In the above description, the encrypted personal information data holders to information requester (D _{KA (encrypted} personal information) or D _{R (encrypted} personal information)) and transmits a counterpart common key KB (S201 , S207), the information requester himself decrypted the encrypted personal information using the one-sided common key KB to make it plain (S209), but the decryption using the one-sided common key KB was performed by a third-party organization (predetermined). Service organization) may be responsible. In this case, information holder sends a counterpart common key KB and encrypted personal information (D _{KA (encrypted} personal information) or D _{R (encrypted} personal information)) to the third party (predetermined service organization) , Decrypt at a third-party organization (prescribed service organization) and send it to the information requester.
[Characteristics of disclosure content]

次に、以上説明した実施形態の開示内容の特徴を以下に列挙する。
（特徴１）
[技術分野]Next, the features of the disclosed contents of the embodiments described above are listed below.
(Feature 1)
[Technical field]

特徴１は、例えば、ブロックチェーン等のような改竄や消去が困難な情報記録方式についての処理システムおよびプログラムに関する。
[背景技術]Feature 1 relates to a processing system and a program for an information recording method that is difficult to falsify or erase, such as a blockchain.
[Background technology]

改竄や消去が困難な情報記録方式としてブロックチェーンが従来から一般的に知られている。このブロックチェーンを利用して貨物輸送に関する各種情報を記録しているものとして、例えば、特開２０１８−１２８７２３号がある。
[特徴１の概要]
[特徴１が解決しようとする課題]Blockchain has been generally known as an information recording method that is difficult to falsify or erase. For example, Japanese Patent Application Laid-Open No. 2018-128723 is used to record various information related to freight transportation using this blockchain.
[Overview of Feature 1]
[Problems that Feature 1 tries to solve]

しかし、このようなブロックチェーンを利用した情報の記録は、改竄が困難なばかりでなく消去も困難である（以下「消去不可能性」という）。その結果、一旦ブロックチェーンを利用した個人情報の記録を行った場合には、その個人情報主が個人情報を消去したくなっても消去できず個人情報消去権（いわゆる忘れられる権利）が損なわれるという欠点がある。 However, recording information using such a blockchain is not only difficult to falsify, but also difficult to erase (hereinafter referred to as "non-erasable"). As a result, once personal information is recorded using the blockchain, even if the personal information owner wants to delete the personal information, it cannot be deleted and the right to erase personal information (so-called right to be forgotten) is impaired. There is a drawback.

特徴１は、かかる実情に鑑み考え出されたものであり、その目的は、記録された情報の真正の保証とその情報の削除権の保証とが二律背反するジレンマを解消することである。
[課題を解決するための手段]Feature 1 was devised in view of such circumstances, and its purpose is to solve the dilemma in which the guarantee of the authenticity of the recorded information and the guarantee of the right to delete the information are antinomy.
[Means to solve problems]

特徴１の主題は、例えば以下のような項目として示される。
（項目１）
記録対象の情報（例えば、個人情報）を暗号化する暗号化処理を行う暗号化手段（例えば、Ｓ１７４、Ｓ１７７、または、Ｓ２２８、Ｓ２３１、または、Ｓ４７８、Ｓ４７９）と、
前記暗号化処理を経た後の情報を記録する記録手段（例えば、Ｓ１７７及びブロックチェーン、または、Ｓ２３１、Ｓ２４０、Ｓ２４２、Ｓ２４４、ブロックチェーン及び個人情報ＤＢ２９、または、Ｓ４７９及びブロックチェーン）と、
前記記録手段により記録された情報に対し、第１鍵と第２鍵とを用いて復号処理を行って平文の情報にする復号手段（例えば、Ｓ２０１、Ｓ２０２、Ｓ２０７〜Ｓ２０９、または、Ｓ２６３〜Ｓ２７１、または、Ｓ５００〜Ｓ５１０）と、
前記記録手段により記録された情報を復号できない復号不能化状態にする復号不能化手段（例えば、Ｓ１９１〜Ｓ１９４、または、Ｓ２５０〜Ｓ２５４、または、Ｓ４９４、Ｓ４９５、Ｓ４８５〜Ｓ４８９）と、を備え、
前記復号手段は、前記第２鍵（例えば、片割れ共通鍵ＫＡ）を秘匿して保持する第２鍵秘匿保持手段（例えば、Ｓ１９４、または、Ｓ２３３、または、Ｓ４７５）を含み、
前記復号不能化手段は、前記第２鍵秘匿保持手段により保持されている前記第２鍵を他のもの（例えば、乱数Ｒ）に更新することにより復号不能化状態にする（例えば、Ｓ１９０〜Ｓ１９４、または、Ｓ２５０〜Ｓ２５４、または、Ｓ４９４、Ｓ４９５、Ｓ４８５〜Ｓ４８９）、処理システム。The subject of feature 1 is shown, for example, as the following items.
(Item 1)
An encryption means (for example, S174, S177, or S228, S231, or S478, S479) that performs an encryption process for encrypting information to be recorded (for example, personal information).
Recording means (for example, S177 and blockchain, or S231, S240, S242, S244, blockchain and personal information DB29, or S479 and blockchain) for recording the information after the encryption process.
Decoding means (for example, S201, S202, S207 to S209, or S263 to S271) that decodes the information recorded by the recording means using the first key and the second key to obtain plain text information. , Or S500 to S510),
The information recorded by the recording means is provided with decodable means (for example, S191 to S194, or S250 to S254, or S494, S495, S485 to S489) that make the information recorded by the recording means undecryptable.
The decryption means includes a second key concealment holding means (for example, S194, or S233, or S475) that conceals and holds the second key (for example, the half-split common key KA).
The undecryptable means puts the second key held by the second key concealment holding means into another undecryptable state (for example, S190 to S194) by updating the second key to another one (for example, a random number R). , Or S250 to S254, or S494, S495, S485 to S489), processing system.

（項目２）
前記復号手段は、前記第１鍵（例えば、片割れ共通鍵ＫＢ）を情報の閲覧希望者に配布する第１鍵配布手段（例えば、Ｓ２００、Ｓ２０１、または、Ｓ２５６２、Ｓ２６３、または、Ｓ５００、Ｓ５０１）をさらに含む、項目１に記載の処理システム。
（項目３）
前記記録手段により記録された情報を平文にすることなく検索する検索手段（例えば、Ｓ３７〜Ｓ４０、Ｓ４２〜Ｓ４５）をさらに備えている、項目１または２に記載の処理システム。(Item 2)
The decryption means distributes the first key (for example, a half-split common key KB) to a person who wants to view information (for example, S200, S201, or S2562, S263, or S500, S501). The processing system according to item 1, further comprising.
(Item 3)
The processing system according to item 1 or 2, further comprising search means (for example, S37 to S40, S42 to S45) for searching information recorded by the recording means without making it plain text.

（項目４）
前記記録手段により記録された情報は個人情報を含み、
前記復号不能化手段は、個人情報主の要求に応じて当該個人情報主の個人情報を前記復号不能化状態にする（例えば、Ｓ１９０〜Ｓ１９４、または、Ｓ２５０〜Ｓ２５４、または、Ｓ４９４、Ｓ４９５、Ｓ４８５〜Ｓ４８９）、項目１〜３の何れかに記載の処理システム。(Item 4)
The information recorded by the recording means includes personal information and includes personal information.
The undecryptable means puts the personal information of the personal information owner into the undecryptable state at the request of the personal information owner (for example, S190 to S194, or S250 to S254, or S494, S495, S485. ~ S489), the processing system according to any one of items 1 to 3.

（項目５）
記録対象の情報（例えば、個人情報）を暗号化する暗号化処理を行うステップ（例えば、Ｓ１７４、Ｓ１７７、または、Ｓ２２８、Ｓ２３１、または、Ｓ４７８、Ｓ４７９）と、
前記暗号化処理を経た後の情報を記録する記録手段（例えば、Ｓ１７７及びブロックチェーン、または、Ｓ２３１、Ｓ２４０、Ｓ２４２、Ｓ２４４、ブロックチェーン及び個人情報ＤＢ２９、または、Ｓ４７９及びブロックチェーン）により記録された情報に対し、第１鍵と第２鍵とを用いて復号処理を行って平文の情報にする復号ステップ（例えば、Ｓ２０１、Ｓ２０２、Ｓ２０７〜Ｓ２０９、または、Ｓ２６３〜Ｓ２７１、または、Ｓ５００〜Ｓ５１０）と、
前記記録手段により記録された情報を復号できない復号不能化状態にするステップ（例えば、Ｓ１９１〜Ｓ１９４、または、Ｓ２５０〜Ｓ２５４、または、Ｓ４９４、Ｓ４９５、Ｓ４８５〜Ｓ４８９）とを、
コンピュータに実行させ、
前記復号ステップは、前記第２鍵（例えば、片割れ共通鍵ＫＡ）を秘匿して保持するステップ（例えば、Ｓ１９４、または、Ｓ２３３、または、Ｓ４７５）を含み、
前記復号不能化状態にするステップは、前記保持するステップにより保持されている前記第２鍵を他のもの（例えば、乱数Ｒ）に更新することにより復号不能化状態にする（例えば、Ｓ１９０〜Ｓ１９４、または、Ｓ２５０〜Ｓ２５４、または、Ｓ４９４、Ｓ４９５、Ｓ４８５〜Ｓ４８９）、プログラム。(Item 5)
A step (for example, S174, S177, or S228, S231, or S478, S479) of performing an encryption process for encrypting information to be recorded (for example, personal information).
It was recorded by a recording means (for example, S177 and blockchain, or S231, S240, S242, S244, blockchain and personal information DB29, or S479 and blockchain) that records the information after the encryption process. A decoding step (for example, S201, S202, S207 to S209, or S263 to S271, or S500 to S510) in which information is decrypted using the first key and the second key to obtain plain text information. When,
The steps (for example, S191 to S194, or S250 to S254, or S494, S495, S485 to S489) that make the information recorded by the recording means undecryptable can be performed.
Let the computer run
The decryption step includes a step (eg, S194, or S233, or S475) of concealing and holding the second key (for example, the half-split common key KA).
The step of making the decryption impossible state makes the second key held by the holding step update to another one (for example, a random number R) to make the decoding impossible state (for example, S190 to S194). , Or S250 to S254, or S494, S495, S485-S489), the program.

（特徴１の効果）
特徴１によれば、記録された情報の真正の保証とその情報の削除権の保証とが二律背反するジレンマを極力解消することができる。
（特徴２）
[技術分野](Effect of feature 1)
According to the feature 1, the dilemma in which the guarantee of the authenticity of the recorded information and the guarantee of the right to delete the information conflict with each other can be solved as much as possible.
(Feature 2)
[Technical field]

特徴２は、例えば、ブロックチェーン等で用いられているスマートコントラクトに関する。
[背景技術]Feature 2 relates to, for example, a smart contract used in a blockchain or the like.
[Background technology]

スマートコントラクトは、契約のスムーズな検証、条件確認、執行、実行、交渉を意図したコンピュータプロトコルであり、従来からブロックチェーン等で用いられている。このスマートコントラクトは、契約や取引等を自動化するものとして従来から知られている（例えば、特許第６４０３１７７号）。
[特徴２の概要]
[特徴２が解決しようとする課題]A smart contract is a computer protocol intended for smooth contract verification, condition confirmation, execution, execution, and negotiation, and has been conventionally used in blockchains and the like. This smart contract has been conventionally known as a device for automating contracts, transactions, etc. (for example, Japanese Patent No. 6403177).
[Overview of Feature 2]
[Problems that Feature 2 tries to solve]

このようなスマートコントラクトの分野においては、売買契約及び貸借契約等で代表される各種契約または各種取引等の法律行為をユーザ自身の代理として実行できる高度なスマートコントラクトが望まれている。 In the field of such smart contracts, advanced smart contracts that can execute legal acts such as various contracts represented by sales contracts and lease contracts or various transactions on behalf of the user are desired.

係る実情に鑑み考え出された特徴２の目的は、ユーザ自身の代理として法律行為を実行できる高度なスマートコントラクトを提供することである。
[課題を解決するための手段]
特徴２の主題は、例えば以下のような項目として示される。
（項目１）
複数の自然人または法人が行った法律行為に関する情報を機械学習用のデータとして入力し一般的モデルを生成する機械学習手段（例えば、Ｓ８０〜Ｓ８２）と、
前記一般的なモデルをユーザに適したモデルにパーソナライズ化するための手段であって、当該ユーザが行った法律行為に関する情報に基づいてパーソナライズ化するパーソナライズ化手段（例えば、Ｓ８６〜Ｓ８８、またはＳ９４〜Ｓ９８）と、
前記パーソナライズ化されたモデルを用いて当該ユーザの代理として法律行為を実行させるためのスマートコントラクトを生成するスマートコントラクト生成手段（例えば、Ｓ８６〜Ｓ８８、またはＳ９４〜Ｓ９８）と、を備えている、コンピュータシステム。The purpose of Feature 2, which was conceived in view of such circumstances, is to provide an advanced smart contract that can execute legal acts on behalf of the user.
[Means to solve problems]
The subject of feature 2 is shown, for example, as the following items.
(Item 1)
Machine learning means (for example, S80 to S82) that input information on legal acts performed by a plurality of natural persons or corporations as data for machine learning to generate a general model, and
A means for personalizing the general model into a model suitable for the user, which is a personalization means (for example, S86 to S88, or S94 to) for personalizing based on information about a legal act performed by the user. S98) and
A computer comprising a smart contract generating means (eg, S86-S88, or S94-S98) that generates a smart contract for performing a legal act on behalf of the user using the personalized model. system.

（項目２）
複数の自然人または法人が行った法律行為に関する情報を機械学習用のデータとして入力して生成された一般的なモデルをユーザに適したモデルにパーソナライズ化するための手段であって、当該ユーザが行った法律行為に関する情報に基づいてパーソナライズ化するパーソナライズ化手段（例えば、Ｓ８０〜Ｓ８２、Ｓ８６〜Ｓ８８、またはＳ９４〜Ｓ９８）と、
前記パーソナライズ化されたモデルを用いて当該ユーザの代理として法律行為を実行させるためのスマートコントラクトを生成するスマートコントラクト生成手段（例えば、Ｓ８６〜Ｓ８８、またはＳ９４〜Ｓ９８）と、を備えている、コンピュータシステム。(Item 2)
It is a means for personalizing a general model generated by inputting information on legal acts performed by multiple natural persons or corporations as data for machine learning into a model suitable for the user, and is performed by the user. Personalization means (eg, S80-S82, S86-S88, or S94-S98) that personalize based on information about legal acts.
A computer comprising a smart contract generating means (eg, S86-S88, or S94-S98) that generates a smart contract for performing a legal act on behalf of the user using the personalized model. system.

（項目３）
複数の自然人または法人が行った法律行為に関する情報を機械学習用のデータとして入力して生成された一般的なモデルをユーザに適したモデルにパーソナライズ化するための手段であって、当該ユーザが行った法律行為に関する情報に基づいてパーソナライズ化するパーソナライズ化手段（例えば、Ｓ８０〜Ｓ８２、Ｓ８６〜Ｓ８８、またはＳ９４〜Ｓ９８）と、
前記パーソナライズ化されたモデルをスマートコントラクトとして用いて当該ユーザの代理として法律行為を実行させるサービスを提供するサービス提供手段（例えば、Ｓ９９）と、を備えている、コンピュータシステム。(Item 3)
It is a means for personalizing a general model generated by inputting information on legal acts performed by multiple natural persons or corporations as data for machine learning into a model suitable for the user, and is performed by the user. Personalization means (eg, S80-S82, S86-S88, or S94-S98) that personalize based on information about legal acts.
A computer system comprising a service providing means (for example, S99) that provides a service of using the personalized model as a smart contract to execute a legal act on behalf of the user.

（項目４）
前記サービス提供手段（例えば、Ｓ９９）によるサービスの提供に伴い実行された法律行為に対する報酬を当該実行したモデルに与えることにより該モデルが前記報酬の累積を最大化する方策を学習する強化学習手段（例えば、Ｓ１０５〜Ｓ１０８）をさらに備えた、項目３に記載のコンピュータシステム。(Item 4)
Reinforcement learning means (for example, S99) for learning a measure for maximizing the accumulation of the reward by giving a reward for the legal act executed by the service providing means (for example, S99) to the executed model. For example, the computer system according to item 3, further comprising S105 to S108).

（項目５）
予め定められたテーマ（例えば、政府が採用しようとしている政策や法律（例えば、消費増税に伴う軽減税率、改正出入国管理法、イギリスのＥＵ（European Union）からの離脱、ベーシックインカムの部分的または全面的採用、日本国憲法９条の改正等）が採用されたと仮定した場合における、株取引や先物取引等の投資市場での取引シミュレーション、会社経営シミュレーション、または消費行動シミュレーション等）のシミュレーションをコンピュータ内で行って強化学習を進行させるコンピュータシステムであって、
前記シミュレーションのテーマにマッチする複数のペルソナに属するユーザ群を選定する選定手段（例えば、Ｓ３４４、Ｓ３４５）と、
前記選定手段により選定されたユーザ群を前記複数のペルソナ毎にグルーピングしてグループ毎にユーザ群が行った法律行為に関する情報を収集する収集手段（例えば、Ｓ３４６）と、
前記取集された法律行為に関する情報を学習データとして機械学習を行ってペルソナ毎に学習済みスマートコントラクトモデル群を生成する生成手段（例えば、Ｓ３４７）と、
前記生成された学習済みスマートコントラクトモデル群同士で法律行為を行うシミュレーションをコンピュータ内で実行するシミュレーション手段（例えば、Ｓ３３６〜Ｓ３３９）と、を備え、
前記シミュレーション手段は、実行された法律行為に対する報酬を当該実行した学習済みスマートコントラクトモデルに与えることにより該学習済みスマートコントラクトモデルが前記報酬の累積を最大化する方策を学習する強化学習手段（例えば、Ｓ３３６、Ｓ３３８）を含む、コンピュータシステム。(Item 5)
Predetermined themes (eg, policies and laws that the government is trying to adopt (eg, reduced tax rates associated with consumption tax increases, revised immigration laws, withdrawal from the UK's European Union), partial or full basic income Simulation of trading in the investment market such as stock trading and futures trading, company management simulation, consumption behavior simulation, etc.) on the assumption that the target adoption, revision of Article 9 of the Japanese Constitution, etc.) has been adopted in the computer. It is a computer system that progresses reinforcement learning by going to
Selection means (for example, S344, S345) for selecting a group of users belonging to a plurality of personas matching the theme of the simulation, and
A collection means (for example, S346) for grouping a group of users selected by the selection means for each of the plurality of personas and collecting information on legal acts performed by the group of users for each group.
A generation means (for example, S347) that performs machine learning using the collected information on legal acts as learning data to generate a trained smart contract model group for each persona.
It is provided with a simulation means (for example, S336 to S339) for executing a simulation of performing a legal act between the generated trained smart contract models in a computer.
The simulation means is an enhanced learning means (eg, for example,) in which the learned smart contract model learns how to maximize the accumulation of the reward by giving a reward for the executed legal act to the executed learned smart contract model. A computer system including S336, S338).

（項目６）
シミュレーションをコンピュータ内で行って強化学習を進行させるコンピュータシステムであって、
機械学習により生成された学習済みスマートコントラクトモデル群同士で法律行為を行うシミュレーションをコンピュータ内で実行し、実行された法律行為に対する報酬を当該実行した学習済みスマートコントラクトモデルに与えることにより該学習済みスマートコントラクトモデルが前記報酬の累積を最大化する方策を学習するシミュレーション強化学習処理を行う強化学習手段（例えば、Ｓ３３６、Ｓ３３８）を備えている、コンピュータシステム。(Item 6)
A computer system that performs simulations in a computer to advance reinforcement learning.
The trained smart is created by executing a simulation of performing legal actions between trained smart contract models generated by machine learning in a computer and giving a reward for the executed legal actions to the executed learned smart contract model. A computer system comprising a strengthening learning means (eg, S336, S338) in which a contract model performs a simulation strengthening learning process that learns measures to maximize the accumulation of rewards.

（項目７）
前記学習済みスマートコントラクトモデル群の中から、前記強化学習手段による強化学習結果の成績に基づいて実際に使用する学習済みスマートコントラクトモデルを選抜する選抜手段（例えば、Ｓ３４０）をさらに含む、項目６に記載のコンピュータシステム。(Item 7)
Item 6 further includes a selection means (for example, S340) for selecting a learned smart contract model to be actually used based on the results of the reinforcement learning result by the reinforcement learning means from the trained smart contract model group. Described computer system.

（注）
上記一般的なモデル生成用の「機械学習用のデータ」及びパーソナライズ化に用いる「機械学習用のデータ」は、「法律行為に関する情報」が含まれていれば事足り、「法律行為に関する情報」以外の情報（例えば、ウェブサイトへのアクセス履歴、ＧＰＳ位置情報等）も含まれていてもよい。上記「スマートコントラクト生成手段」は、例えば、法律行為に関する情報により機械学習されたパーソナルアシスタント等の人工知能にスマートコントラクトとしての役割を担わせる場合も包含するものである。(note)
The above-mentioned "data for machine learning" for model generation and "data for machine learning" used for personalization need only include "information on legal acts", other than "information on legal acts". Information (for example, access history to a website, GPS location information, etc.) may also be included. The above-mentioned "smart contract generating means" includes, for example, a case where artificial intelligence such as a personal assistant who has been machine-learned based on information on legal acts plays a role as a smart contract.

（特徴２の効果）
特徴２によれば、各種の法律行為をユーザ自身の代理として実行できる高度なスマートコントラクトを提供可能となる。(Effect of feature 2)
According to Feature 2, it is possible to provide an advanced smart contract that can execute various legal acts on behalf of the user himself / herself.

（特徴３）
[技術分野]
特徴３は、例えば、政府が採用しようとしている政策や法律（例えば、消費増税に伴う軽減税率、改正出入国管理法、イギリスのＥＵ（European Union）からの離脱、ベーシックインカムの部分的または全面的採用、日本国憲法９条の改正等）、マーケティング関連の条件（例えば、新商品（金融商品や生命保険を含む）や新サービスの価格や対価の設定、各種メディアによるプロモーション効果等）、投資市場関連の条件（例えば、先物取引における気象条件、株式市場における金融引き締め政策等）等の条件を設定し、その条件下でコンピュータ内においてシミュレーションを行い、どのようなシミュレーション結果になるかを事前に予測するコンピュータシステムに関する。
[背景技術](Feature 3)
[Technical field]
Feature 3 is, for example, the policies and laws that the government is trying to adopt (for example, the reduced tax rate due to the consumption tax increase, the revised immigration control law, the withdrawal from the EU (European Union) in the United Kingdom, and the partial or full adoption of basic income. , Amendment of Article 9 of the Japanese Constitution, etc.), Marketing-related conditions (for example, setting prices and consideration for new products (including financial products and life insurance), new service prices, promotion effects by various media, etc.), Investment market-related Conditions (for example, weather conditions in futures trading, monetary tightening policy in the stock market, etc.) are set, and simulations are performed in the computer under those conditions to predict in advance what kind of simulation results will be obtained. Regarding computer systems.
[Background technology]

この種のコンピュータシステムとして、国民が将来負担するべき負債や将来利用可能な資源を明確にして、政策レベルの意思決定を支援するために、純資産の変動計算書勘定を新たに設定し、当該年度の政策決定による資産変動を明確にするとともに、将来の国民の負担をシミュレーションできる会計処理方法が提案されている（例えば、特開２００６−１５５２３３）。
[特徴３の概要]
[特徴３が解決しようとする課題]As a computer system of this kind, a new statement of changes in net assets has been set up to help policy-level decision-making by clarifying future liabilities and resources available to the public in the future, and for the year. An accounting method has been proposed that can clarify the asset fluctuations due to the policy decision-making and simulate the future burden on the people (for example, Japanese Patent Application Laid-Open No. 2006-155233).
[Overview of Feature 3]
[Problems that Feature 3 tries to solve]

このようなシミュレーションの分野においては、実社会における自然人や法人の営みを忠実に模したシミュレーションをコンピュータ内で実行し、現実世界との乖離を極力少なくしたシミュレーション結果を導出し得るコンピュータシステムが望まれる。 In the field of such simulations, a computer system that can faithfully execute simulations that faithfully imitate the activities of natural persons and corporations in the real world and derive simulation results with minimal deviation from the real world is desired.

係る実情に鑑み考え出された特徴３の目的は、実社会における自然人や法人の営みを忠実に模したシミュレーションを可能にすることである。
[課題を解決するための手段]The purpose of Feature 3, which was conceived in view of such circumstances, is to enable simulations that faithfully imitate the activities of natural persons and corporations in the real world.
[Means to solve problems]

特徴３の主題は、例えば以下のような項目として示される。
（項目１）
予め定められた条件（例えば、政府が採用しようとしている政策や法律（例えば、消費増税に伴う軽減税率、改正出入国管理法、イギリスのＥＵ（European Union）からの離脱、ベーシックインカムの部分的または全面的採用、日本国憲法９条の改正等）、マーケティング関連の条件（例えば、新商品（金融商品や生命保険を含む）や新サービスの価格や対価の設定、各種メディアによるプロモーション効果等）、投資市場関連の条件（例えば、先物取引における気象条件、株式市場における金融引き締め政策等）等の条件）の下でのシミュレーションをコンピュータ内で行うコンピュータシステムであって、
前記シミュレーションの条件にマッチする複数のペルソナに属するユーザ群を選定する選定手段（例えば、Ｓ１４４、Ｓ１４５）と、
前記選定手段により選定されたユーザ群を前記複数のペルソナ毎にグルーピングしてグループ毎にユーザ群が行った法律行為に関する情報を収集する収集手段（例えば、Ｓ１４６）と、
前記取集された法律行為に関する情報を学習データとして機械学習を行ってペルソナ毎に学習済みスマートコントラクトモデル群を生成する生成手段（例えば、Ｓ１４７）と、
前記生成された学習済みスマートコントラクトモデル群同士で法律行為を行うシミュレーションをコンピュータ内で実行するシミュレーション手段（例えば、Ｓ１３６〜Ｓ１３９）と、
前記シミュレーション手段によるシミュレーションの結果を導出する導出手段（例えば、Ｓ１４０）と、を備え、
前記シミュレーション手段は、実行された法律行為に対する報酬を当該実行した学習済みスマートコントラクトモデルに与えることにより該学習済みスマートコントラクトモデルが前記報酬の累積を最大化する方策を学習する強化学習手段（例えば、Ｓ１３６、Ｓ１３８）を含む、コンピュータシステム。The subject of feature 3 is shown, for example, as the following items.
(Item 1)
Predetermined conditions (eg, policies and laws that the government is trying to adopt (eg, reduced tax rates associated with consumption tax increases, revised immigration laws, withdrawal from the UK's European Union), partial or full basic income Recruitment, revision of Article 9 of the Japanese Constitution, etc.), marketing-related conditions (for example, setting of prices and consideration for new products (including financial products and life insurance), promotion effects by various media, etc.), investment A computer system that performs simulations under market-related conditions (for example, weather conditions in futures trading, monetary tightening policies in the stock market, etc.) in a computer.
Selection means (for example, S144, S145) for selecting a group of users belonging to a plurality of personas that match the conditions of the simulation, and
A collection means (for example, S146) for grouping a group of users selected by the selection means for each of the plurality of personas and collecting information on legal acts performed by the group of users for each group.
A generation means (for example, S147) that performs machine learning using the collected information on legal acts as learning data to generate a trained smart contract model group for each persona.
A simulation means (for example, S136 to S139) that executes a simulation of performing a legal act between the generated trained smart contract models in a computer, and
A derivation means (for example, S140) for deriving the result of simulation by the simulation means is provided.
The simulation means is an enhanced learning means (eg, for example,) in which the learned smart contract model learns how to maximize the accumulation of the reward by giving a reward for the executed legal act to the executed learned smart contract model. A computer system including S136, S138).

（項目２）
シミュレーションをコンピュータ内で行うコンピュータシステムであって、
機械学習（例えば、Ｓ１４４〜Ｓ１４６）により生成された学習済みスマートコントラクトモデル群同士で法律行為を行うシミュレーションをコンピュータ内で実行し、実行された法律行為に対する報酬を当該実行した学習済みスマートコントラクトモデルに与えることにより該学習済みスマートコントラクトモデルが前記報酬の累積を最大化する方策を学習する強化学習を進行させる強化学習手段（例えば、Ｓ１３６、Ｓ１３８）と、
前記強化学習手段による強化学習が進行した学習済みスマートコントラクトモデル群同士で法律行為を行うシミュレーションの結果を導出する導出手段（例えば、Ｓ１４０）と、を備えている、コンピュータシステム。
（特徴３の効果）(Item 2)
A computer system that performs simulations in a computer
A simulation of performing legal actions between trained smart contract models generated by machine learning (for example, S144 to S146) is executed in a computer, and the reward for the executed legal actions is given to the executed learned smart contract model. By giving, the trained smart contract model learns a strategy for maximizing the accumulation of the rewards.
A computer system including a derivation means (for example, S140) for deriving the result of a simulation of performing a legal act between trained smart contract models in which reinforcement learning by the reinforcement learning means has progressed.
(Effect of feature 3)

特徴３によれば、実社会における自然人や法人の営みを極力忠実に模したシミュレーションが可能になる。
[第３実施形態]According to Feature 3, it is possible to simulate the activities of natural persons and corporations in the real world as faithfully as possible.
[Third Embodiment]

次に、第３実施形態を説明する。この第３実施形態は、リアルワールド（現実世界）のデジタルツインから構成されるミラーワールド（サイバー空間）内をシミュレーション環境にしてシミュレーションを行うことにより、例えば、未来を予測した最適解を導き出したり、ＤＡＯ（Decentralized Autonomous Organization）におけるインセンティブ設計の最適解を導き出したり、ＡＩの機械学習（例えば強化学習）を行ったりするシステムに関する。 Next, the third embodiment will be described. In this third embodiment, for example, an optimum solution that predicts the future can be derived by performing a simulation in a mirror world (cyber space) composed of digital twins in the real world as a simulation environment. It relates to a system for deriving the optimum solution of incentive design in DAO (Decentralized Autonomous Organization) and performing AI machine learning (for example, reinforcement learning).

デジタルツインとは、現実世界の実体やシステムをデジタルで表現したものである。ミラーワールドとは、現実の国家、都市、社会、地方自治体、会社等の組織、人々といった、物理世界（リアルワールド）の情報がすべてデジタル化されたデジタルツインで構成される鏡像世界のことである。具体的には、人のデジタルツインを、例えば、当該人の行動（リアルとバーチャル両方の行動）等のライフログを知識として習得し当該人にとって最適な行為をアシスタントするための機械学習（例えばエージェントによる強化学習）を行ったアシスタントＡＩ（以下「パーソナルＡＩ」という）で構成する。この強化学習は、複数のパーソナルＡＩが協調して強化学習を行うマルチエージェント強化学習である。リアル世界の会社等の組織を構成している人々のパーソナルＡＩにより当該組織のデジタルツインを構成し、リアル世界の地方自治体を構成している人々のパーソナルＡＩにより当該地方自治体のデジタルツインを構成し、リアル世界の都市を構成している人々のパーソナルＡＩにより当該都市のデジタルツインを構成し、リアル世界の国家を構成している人々のパーソナルＡＩにより当該国家のデジタルツインを構成する。 A digital twin is a digital representation of a real-world entity or system. The mirror world is a mirror world composed of digital twins in which all information in the physical world (real world) such as real nations, cities, societies, local governments, organizations such as companies, and people is digitized. .. Specifically, machine learning (for example, an agent) for learning a person's digital twin as knowledge of the life log of the person's behavior (both real and virtual behavior) and assisting the optimum behavior for the person. It is composed of assistant AI (hereinafter referred to as "personal AI") who has performed reinforcement learning by. This reinforcement learning is a multi-agent reinforcement learning in which a plurality of personal AIs cooperate to perform reinforcement learning. The personal AIs of the people who make up the organization of the company in the real world make up the digital twin of the organization, and the personal AIs of the people who make up the local government in the real world make up the digital twin of the local government. , The personal AIs of the people who make up the city in the real world make up the digital twins of the city, and the personal AIs of the people who make up the nations of the real world make up the digital twins of the country.

この第３実施形態では、シミュレーション環境としてのミラーワールドの構築を、リアルワールドにおける現実の国家、都市、社会、地方自治体、会社等の組織及び人々等が自ら率先して参加して構築に協力する仕組みを用意する。具体的には、ミラーワールド内での各種シミュレーションを行うことによって、そのシミュレーションに参加しているデジタルツインのパーソナルＡＩを機械学習（例えば強化学習）させ、より高度に学習した学習済みパーソナルＡＩをリアル世界に還元（フィードバック）させる。このメリットの享受をインセンティブとして、リアルワールドの国家、都市、社会、地方自治体、会社等の組織及び人々等が自ら率先して参加してミラーワールドの構築に協力するように仕向ける。 In this third embodiment, organizations and people such as real nations, cities, societies, local governments, companies, etc. in the real world participate in the construction of a mirror world as a simulation environment and cooperate in the construction. Prepare a mechanism. Specifically, by performing various simulations in the mirror world, the personal AI of the digital twins participating in the simulation is machine-learned (for example, reinforcement learning), and the learned personal AI that has been learned more highly is realized. Give back (feedback) to the world. With the enjoyment of this merit as an incentive, we encourage the organizations and people of real-world nations, cities, societies, local governments, companies, etc. to take the initiative in participating and cooperating in the construction of the mirror world.

図２８を参照し、複数台のミラーワールドサーバ（ストレージサーバを含む）４６が設置されたデータセンタ４５において、ミラーワールドのデータが記憶されている。ミラーワールドサーバ４６のハードウェア構成は、図１に示したユーザ端末１６のハードウェア構成と同様であるため、ここではその図示及び説明の繰り返しを省略する。リアルワールド４７における現実の国家（例えば日本国４９）、都市５０、社会、地方自治体、会社等の組織、人々及び地球４８の情報がすべてデジタル化されたデジタルツイン（現実の国家デジタルツイン（例えば日本国デジタルツイン５３）、都市デジタルツイン５４、社会、地方自治体、会社等の組織、人々及び地球デジタルツイン５２）で構成されたミラーワールド５１全体が、デジタルデータとしてデータセンタ４５に記憶されている。 With reference to FIG. 28, the mirror world data is stored in the data center 45 in which a plurality of mirror world servers (including storage servers) 46 are installed. Since the hardware configuration of the mirror world server 46 is the same as the hardware configuration of the user terminal 16 shown in FIG. 1, the illustration and description thereof will be omitted here. Digital twins in which all the information of the real nation (for example, Japan 49), cities 50, society, local governments, companies and other organizations, people and the earth 48 in the real world 47 is digitized (real nation digital twin (for example, Japan) The entire mirror world 51 composed of the national digital twin 53), the urban digital twin 54, organizations such as society, local governments, companies, people, and the earth digital twin 52) is stored in the data center 45 as digital data.

データセンタ４５では、このミラーワールド５１をシミュレーション環境としてシミュレーションを行い、例えば、シミュレーション最適化により未来を予見した最適解を導き出す。シミュレーションとしては、例えば、前述した、政府が採用しようとしている政策や法律（例えば、消費増税に伴う軽減税率、改正出入国管理法、イギリスのＥＵ（European Union）からの離脱、ベーシックインカムの部分的または全面的採用、日本国憲法９条の改正等）が採用されたと仮定した場合における、株取引や先物取引等の投資市場での取引シミュレーション、会社経営シミュレーション、または消費行動シミュレーション等が考えられる。さらには、新商品（金融商品や生命保険を含む）や新サービスの各種メディアによるプロモーションのシミュレーション等でもよい。シミュレーション最適化によって導き出された最適解をリアルワールドにフィードバック（還元）し、最適解の恩恵をリアルワールドに提供する。また、シミュレーションにより機械学習（例えば強化学習）された学習済みパーソナルＡＩをリアルワールドに還元し、より高度な学習済みパーソナルＡＩによるタスクを遂行できるようにする。 In the data center 45, simulation is performed using this mirror world 51 as a simulation environment, and for example, the optimum solution foreseeing the future is derived by simulation optimization. Simulations include, for example, the policies and laws that the government intends to adopt (eg, reduced tax rates associated with consumption tax increases, revised immigration laws, withdrawal from the EU (European Union) in the United Kingdom, partial basic income, or Assuming that full adoption, revision of Article 9 of the Japanese Constitution, etc.) has been adopted, transaction simulations in the investment market such as stock trading and futures trading, company management simulations, consumption behavior simulations, etc. can be considered. Furthermore, promotion simulations of new products (including financial products and life insurance) and new services by various media may be used. The optimum solution derived by simulation optimization is fed back (reduced) to the real world, and the benefits of the optimum solution are provided to the real world. In addition, the learned personal AI that has been machine-learned (for example, reinforcement learning) by simulation is returned to the real world so that tasks by more advanced learned personal AI can be performed.

このパーソナルＡＩとスマートコントラクトとが連携することにより、前述した連携タイプのＡＩスマートコントラクトが構成される。なお、このデータセンタ４５は、図１や図２４に示したインターネット１に接続されている。図２８では、各種ブロックチェーン２、３、４、ＳＮＳ１９、鍵登録センタ３０等は図示を省略している。 By linking this personal AI and a smart contract, the above-mentioned linkage type AI smart contract is configured. The data center 45 is connected to the Internet 1 shown in FIGS. 1 and 24. In FIG. 28, various blockchains 2, 3, 4, SNS 19, key registration center 30, and the like are not shown.

図２９は、ミラーワールド５１における都市デジタルツイン５４の具体例を示している。リアルワールド４７の都市５０内には、株式会社ＡＢＣ５６、人である太郎５５及び太郎の一家５６等がある。それらに対応する都市デジタルツイン５４にも、株式会社ＡＢＣデジタルツイン５９、太郎デジタルツイン（太郎のパーソナルＡＩ）５７及び太郎の一家デジタルツイン５８等がある。これらのデータからなる都市デジタルツインデータがミラーワールドサーバ４６に記憶されている。リアルワールド４９での株式会社ＡＢＣ５６、人である太郎５５及び太郎の一家５６等の各種オブジェクトに変更（例えば、会社での人事異動や就職や退職、人についての結婚や出産等）があれば、対応する各種デジタルツインが変更後の内容にアップデートされる。このような都市デジタルツインデータが全ての都市毎にデータセンタ４５に記憶されて日本国家４９のデジタルツイン５３のデータとなり、各国における都市デジタルツインデータが全ての都市毎にデータセンタ４５に記憶されて各国国家のデジタルツインデータとなり、それら全てのデジタルツインデータにより、地球４８のデジタルツイン５２のデータとなる。 FIG. 29 shows a specific example of the urban digital twin 54 in the mirror world 51. Within the city 50 of Real World 47, there are ABC56 Co., Ltd., Taro 55 who is a person, and Taro's family 56. Corresponding urban digital twins 54 include ABC digital twins 59, Taro digital twins (Taro's personal AI) 57, and Taro's family digital twins 58. Urban digital twin data consisting of these data is stored in the mirror world server 46. If there are changes to various objects such as ABC56 Co., Ltd. in Real World 49, Taro 55 who is a person, and Taro's family 56 (for example, personnel transfer at a company, employment or retirement, marriage or childbirth about a person, etc.) Various corresponding digital twins will be updated with the changed contents. Such urban digital twin data is stored in the data center 45 for each city and becomes the data of the digital twin 53 of the Japanese state 49, and the urban digital twin data in each country is stored in the data center 45 for each city. It becomes the digital twin data of each country, and the data of all the digital twin data becomes the data of the digital twin 52 of the earth 48.

具体例として、ミラーワールドサーバ４６には、太郎デジタルツイン（太郎のパーソナルＡＩ）５７として、氏名：太郎、ＡＩ識別番号：82km9、パーソナルＡＩデータ、太郎のパーソナルデータ（例えば、ライフログ、プロフィール、嗜好データ、電子カルテデータ、バイタルデータ等）が記憶されている。太郎の一家デジタルツイン５８として、氏名：太郎、桜、志郎、家族構成：夫、妻、長男、ＡＩ識別番号：82km9、11zk9、gf43yが記憶されている。株式会社ＡＢＣデジタルツイン５９として、氏名：太郎、花子・・・三郎、役職：代表取締役、専務、部長・・・平社員、ＡＩ識別番号：82km9、ba935、2es14、・・・9w1c2が記憶されている。 As a specific example, the mirror world server 46 has a Taro digital twin (Taro's personal AI) 57 as a name: Taro, AI identification number: 82km9, personal AI data, and Taro's personal data (for example, life log, profile, preference). Data, electronic medical record data, vital data, etc.) are stored. As Taro's family digital twin 58, names: Taro, Sakura, Shiro, family composition: husband, wife, eldest son, AI identification number: 82km9, 11zk9, gf43y are memorized. As ABC Digital Twin 59 Co., Ltd., names: Taro, Hanako ... Saburo, titles: CEO, managing director, general manager ... regular employees, AI identification numbers: 82km9, ba935, 2es14, ... 9w1c2 are memorized. There is.

ユーザ端末１６とミラーワールドサーバ４６とのメインルーチンプログラムのフローチャートを、図３０〜図３３に基づいて説明する。図３０Ａを参照し、ユーザ端末１６のＣＰＵ１０は、シミュレーション環境としてのミラーワールド５１への参加登録を依頼するメンバー登録依頼処理Ｓ５５５、シミュレーション準備応答処理Ｓ５５６、シミュレーション応答処理Ｓ５５７を実行する。ミラーワールドサーバ４６のＣＰＵ１０は、メンバー登録処理５５０、シミュレーション準備処理Ｓ５５１、シミュレーション処理Ｓ５５２を実行する。 The flowchart of the main routine program of the user terminal 16 and the mirror world server 46 will be described with reference to FIGS. 30 to 33. With reference to FIG. 30A, the CPU 10 of the user terminal 16 executes the member registration request process S555, the simulation preparation response process S556, and the simulation response process S557 for requesting participation registration in the mirror world 51 as a simulation environment. The CPU 10 of the mirror world server 46 executes the member registration process 550, the simulation preparation process S551, and the simulation process S552.

メンバー登録処理とメンバー登録依頼処理とを図３０Ｂに基づいて説明する。これら両処理は、ミラーワールド５１をシミュレーション環境としたシミュレーションにデジタルツインとして参加したいメンバーを登録するためのものである。ユーザ端末１６のＣＰＵ１０は、メンバー登録依頼処理において、Ｓ５６０により登録申し込みを行うか否か判定し、登録申し込みを行わないと判定すればこのメンバー登録処理が終了してリターンする。登録申し込みを行うと判定すれば、Ｓ５６１において、登録申し込みに必要な所定事項をミラーワールドサーバ４６へ送信すると共に、パーソナルＡＩを有していない人がいる場合はその旨とその人のブロックチェーンアドレスもミラーワールドサーバ４６へ送信する。登録申し込みに必要な所定事項とは、具体的には、人のデジタルツインにおいては、当該人のパーソナルＡＩのＡＩ識別番号とパーソナルＡＩデータ、家族のデジタルツインにおいては家族の氏名と家族構成とそれぞれのＡＩ識別番号、会社のデジタルツインにおいては従業員の氏名、役職、それぞれのＡＩ識別番号等である。 The member registration process and the member registration request process will be described with reference to FIG. 30B. Both of these processes are for registering members who want to participate as digital twins in a simulation using the mirror world 51 as a simulation environment. In the member registration request process, the CPU 10 of the user terminal 16 determines whether or not to apply for registration in S560, and if it determines that the application for registration is not performed, the member registration process ends and returns. If it is determined that a registration application will be made, in S561, the predetermined items required for the registration application will be transmitted to the mirror world server 46, and if there is a person who does not have a personal AI, that fact and the blockchain address of that person will be sent. Is also transmitted to the mirror world server 46. Specifically, the prescribed items required for registration application are the AI identification number and personal AI data of the person's personal AI in the case of the person's digital twin, and the family name and family composition in the case of the family's digital twin. The AI identification number of the company, the name and position of the employee in the digital twin of the company, the AI identification number of each, and the like.

それをＳ５６５により受信したミラーワールドサーバ４６のＣＰＵ１０は、Ｓ５６６において、パーソナルＡＩを所有済みか否か判定する。Ｓ５６１により送信されてきた情報中に「パーソナルＡＩを有していない旨」の情報が含まれていた場合には制御がＳ５６７に進み、パーソナルＡＩの生成販売処理を行うが、「パーソナルＡＩを有していない旨」の情報が含まれていない場合には、制御がＳ５６８に進み、Ｓ５６２により送られてきたＡＩ識別番号を含む所定事項をミラーワールド５１に登録する。 The CPU 10 of the mirror world server 46, which receives it in S565, determines in S566 whether or not it already owns the personal AI. If the information transmitted by S561 includes the information "not having a personal AI", the control proceeds to S567 to generate and sell the personal AI, but "has a personal AI". If the information "not done" is not included, the control proceeds to S568, and the predetermined items including the AI identification number sent by S562 are registered in the mirror world 51.

Ｓ５６７に示されたパーソナルＡＩの生成販売処理を図３１に基づいて説明する。ミラーワールドサーバ４６のＣＰＵ１０は、Ｓ５７３において、Ｓ５６５により受信したブロックチェーンアドレス（パーソナルＡＩを有していないユーザのブロックチェーンアドレス）に記録されている取引データ及びＳＮＳ等の投稿データを、ブロッチェーンから収集する。次に、Ｓ５７４において、取引データ及びＳＮＳ等の投稿データを学習データとして機械学習を行って学習済みのパーソナルＡＩを生成する。次に、Ｓ５７５において、その学習済みのパーソナルＡＩを対応するユーザに販売する。 The personal AI generation / sales process shown in S567 will be described with reference to FIG. The CPU 10 of the mirror world server 46 transmits the transaction data recorded in the blockchain address (the blockchain address of the user who does not have the personal AI) received by S565 in S573 and the posted data such as SNS from the blockchain. collect. Next, in S574, machine learning is performed using transaction data and posted data such as SNS as learning data to generate a learned personal AI. Next, in S575, the learned personal AI is sold to the corresponding user.

Ｓ５５１に示されたシミュレーション準備処理及びＳ５５５に示されたシミュレーション準備応答処理を、図３２に基づいて説明する。ミラーワールドサーバ４６のＣＰＵ１０は、Ｓ５７７において、シミュレーションの依頼を受けたか否か判定する。受けていないと判定した場合には、このシミュレーション準備処理が終了してリターンする。シミュレーションの依頼を受けたと判定した場合には制御がＳ５７８に進み、依頼されたシミュレーションにマッチするパーソナルＡＩ群及びデジタルツインを割出す処理を行う。例えば、前述の消費増税に伴う軽減税率下での消費行動シミュレーションの場合には、一般消費者に相当するパーソナルＡＩ群であって、性別、年代別、地域別、年収別等の人口統計に従った割合でパーソナルＡＩ群、及び、軽減税率の対象となる消費財のメーカデジタルツインや販売店デジタルツイン等を割出す。次に、その割出されたパーソナルＡＩ群及びデジタルツイン宛にシミュレーションの同意を求める処理を行う。具体的には、割出されたパーソナルＡＩ群及びデジタルツインに対応するユーザ群各々のユーザ端末１６へ、シミュレーションの内容を送信して同意するか否かを問う。 The simulation preparation process shown in S551 and the simulation preparation response process shown in S555 will be described with reference to FIG. The CPU 10 of the mirror world server 46 determines in S577 whether or not the simulation request has been received. If it is determined that the simulation has not been received, this simulation preparation process ends and returns. When it is determined that the simulation request has been received, the control proceeds to S578, and the process of determining the personal AI group and the digital twin that match the requested simulation is performed. For example, in the case of the consumption behavior simulation under the reduced tax rate accompanying the consumption tax increase mentioned above, it is a personal AI group corresponding to general consumers, and according to population statistics such as gender, age group, region, and annual income. The personal AI group and the consumer goods manufacturer digital twins and retailer digital twins that are subject to the reduced tax rate are identified at the same rate. Next, a process of requesting the consent of the simulation is performed on the indexed personal AI group and the digital twin. Specifically, the content of the simulation is transmitted to the user terminals 16 of each of the indexed personal AI group and the user group corresponding to the digital twin, and it is asked whether or not to agree.

割出されたパーソナルＡＩ群及びデジタルツインに対応するユーザ群各々のユーザ端末１６のＣＰＵ１０は、送信されてきたシミュレーションの内容をＳ５８０で受信し、Ｓ５８１において、そのシミュレーションの実行メンバーへの参加に同意するか否か判定する。この判定は、パーソナルＡＩが判定してもよいが、ユーザ自身が判定してもよい。同意しないと判定した場合には、このシミュレーション準備応答処理が終了してリターンするが、同意すると判定した場合には、Ｓ５８２において、同意する旨をミラーワールドサーバ４６へ返信する。 The CPU 10 of the user terminal 16 of each of the indexed personal AI group and the user group corresponding to the digital twin receives the transmitted contents of the simulation in S580, and agrees to participate in the execution member of the simulation in S581. Determine whether or not to do so. This determination may be determined by the personal AI, or may be determined by the user himself. If it is determined that they do not agree, the simulation preparation response process ends and returns. However, if it is determined that they agree, S582 returns a message to that effect to the mirror world server 46.

それをＳ５８３により受信したミラーワールドサーバ４６のＣＰＵ１０は、依頼されたシミュレーションを実行するのに必要な量の同意が得られたか否か判定する。得られたと判定した場合にはＳ５８４において、同意が得られたＡＩ群及びデジタルツインをコピーしてシミュレーション対象としてミラーワールド５１に登録する。その登録された状態が前述の図２９に示されている。 The CPU 10 of the mirror world server 46 that has received it in S583 determines whether or not the amount of consent required to execute the requested simulation has been obtained. If it is determined that the result has been obtained, in S584, the AI group and the digital twin for which consent has been obtained are copied and registered in the mirror world 51 as a simulation target. The registered state is shown in FIG. 29 above.

一方、必要なパーソナルＡＩ群及びデジタルツインから同意が得られていないと判定した場合には制御がＳ５８５に進み、不足しているパーソナルＡＩ群及びデジタルツインにマッチするペルソナ群（メーカや販売店のデジタルツインに対応するペルソナを含む）を設定する処理を行い、Ｓ５８６において、各ペルソナに属するユーザ群（メーカや販売店に従事するユーザ群を含む）を選定し、Ｓ５８７において、ペルソナ毎に属するユーザ群をグループピングしてグループ毎にユーザ群の取引データ（メーカや販売店としての取引データを含む）をブロッチェーンから収集し、Ｓ５８８において、取引データを学習データとして機械学習を行ってペルソナ毎に学習済みのパーソナルＡＩ群及びデジタルツインを生成して補充した上で、Ｓ５８４に進む。このＳ５８５〜Ｓ５８８は、図９（Ｂ）のＳ３４４〜Ｓ３４７と同様の処理であり、ここでは詳細な説明の繰り返しを省略する。 On the other hand, if it is determined that consent has not been obtained from the required personal AI group and digital twin, control proceeds to S585, and the persona group (manufacturer or retailer's) that matches the lacking personal AI group and digital twin. The process of setting the persona corresponding to the digital twin) is performed, the user group belonging to each persona (including the user group engaged in the manufacturer or the dealer) is selected in S586, and the user belonging to each persona in S587. The group is grouped and the transaction data of the user group (including the transaction data as a manufacturer or a dealer) is collected from the blockchain for each group, and in S588, machine learning is performed using the transaction data as learning data for each persona. After generating and replenishing the learned personal AI group and digital twin, the process proceeds to S584. S585 to S588 are the same processes as S344 to S347 of FIG. 9B, and the repetition of detailed description is omitted here.

次に、Ｓ５５２に示したシミュレーション処理及びＳ５５７に示したシミュレーション応答処理の具体的制御を図３３に基づいて説明する。Ｓ５９３〜Ｓ５９５は、前述した図９のＳ３３６、Ｓ３３８、Ｓ３３９、図１２のＳ１３６、Ｓ１３８、Ｓ１３９と同様の処理であり、ここでは詳細な説明を省略する。Ｓ５９６において、シミュレーション結果をシミュレーションの依頼者に通知する。具体的には、シミュレーション結果をシミュレーション依頼者のユーザ端末１６に送信する。次に、Ｓ５９７において、シミュレーションに用いた各パーソナルＡＩ（会社組織等のデジタルツインに従事するパーソナルＡＩを含む）を各々の所持者のユーザ端末１６に送信する。 Next, specific control of the simulation process shown in S552 and the simulation response process shown in S557 will be described with reference to FIG. 33. S593 to S595 are the same processes as S336, S338, S339 in FIG. 9 and S136, S138, S139 in FIG. 12, and detailed description thereof will be omitted here. In S596, the simulation result is notified to the simulation requester. Specifically, the simulation result is transmitted to the user terminal 16 of the simulation requester. Next, in S597, each personal AI used in the simulation (including a personal AI engaged in a digital twin of a company organization or the like) is transmitted to the user terminal 16 of each owner.

それをＳ５９８で受信したユーザ端末１６のＣＰＵ１０は、Ｓ５９９において、受信したパーソナルＡＩを消去するか否か判定する。受信したパーソナルＡＩは、シミュレーションに参加して強化学習（機械学習）された学習済みＡＩであり、その分性能がアップしており高度なタスク処理を実行することが可能である。しかし、シミュレーションの内容によっては、ユーザが望まない強化学習（機械学習）を受けている場合もあるため、そのような場合には、Ｓ５９９によりＹＥＳと判定し、Ｓ６０１において、受信したパーソナルＡＩを消去する。一方、シミュレーションがユーザの望む内容であり、受信したパーソナルＡＩが望ましい強化学習（機械学習）を受けていると認定した場合には、制御がＳ６００に進み、受信した学習済みパーソナルＡＩを上書き保存する。その結果、ユーザは、望ましい強化学習（機械学習）を受けて性能がアップしたパーソナルＡＩを得ることができる利点がある。この利点の享受をインセンティブとして、リアルワールドの国家、都市、社会、地方自治体、会社等の組織及び人々等が自ら率先して参加してミラーワールドの構築に協力するように仕向けることができる。このＳ６００によりパーソナルＡＩを上書き保存することにより、ミラーワールドサーバ４６において、上書き保存された後の新たなパーソナルＡＩのデジタルツイン及び新たなパーソナルＡＩからなる組織のデジタルツインにデータがアップデートされる（図２９参照）。なお、上書き保存ではなく、既存のパーソナルＡＩと学習済みパーソナルＡＩとの両者を共に記憶しておき、必要に応じて使い分けるようにしてもよい。 The CPU 10 of the user terminal 16 that has received it in S598 determines in S599 whether or not to erase the received personal AI. The received personal AI is a learned AI that has been reinforcement-learned (machine-learned) by participating in a simulation, and its performance is improved by that amount, and it is possible to execute advanced task processing. However, depending on the content of the simulation, the user may have received undesired reinforcement learning (machine learning). In such a case, S599 determines YES, and S601 erases the received personal AI. do. On the other hand, if the simulation is the content desired by the user and it is determined that the received personal AI has received the desired reinforcement learning (machine learning), the control proceeds to S600 and the received learned personal AI is overwritten and saved. .. As a result, the user has an advantage that he / she can obtain a personal AI with improved performance by receiving desirable reinforcement learning (machine learning). With the enjoyment of this advantage as an incentive, organizations and people of real-world nations, cities, societies, local governments, companies, etc. can be encouraged to take the initiative in participating and cooperating in the construction of the mirror world. By overwriting and saving the personal AI by this S600, the data is updated in the mirror world server 46 to the digital twin of the new personal AI after the overwriting and the digital twin of the organization consisting of the new personal AI (FIG. 29). In addition, instead of overwriting and saving, both the existing personal AI and the learned personal AI may be stored together and used properly as needed.

次に、ミラーワールド内をシミュレーション環境にしてシミュレーションを行うことにより、ＤＡＯにおけるインセンティブ設計の最適解を導き出すシステムを、図３４〜図５９に基づいて説明する。図３４（Ａ）は、マルチ役務ＤＡＯ構築システムの概略図である。例えば、ビットコインはＤＡＯの一種であるが、ノード（マイナー）がマイニング（記帳権の競争）という１種類の役務のみを担っており、マイニングに成功した者にビットコインを付与するといインセンティブを与えることによりブロックが追加され、ビットコインのシステムが自律的に継続される。これに対し、役務が複数種類存在するＤＡＯをマルチ役務ＤＡＯという。例えば、会社組織のＤＡＯの場合、資材調達、組立、宣伝、販売等の複数の役務が存在し、それら複数の役務を実行したノードに対しどのような割合でどの程度の報酬を分配すれば最適なインセンティブ設計となるのかが困難な問題となる。このようなマルチ役務ＤＡＯにおけるインセンティブ設計の最適解を導き出すシステムを説明する。 Next, a system for deriving an optimum solution for incentive design in DAO by performing a simulation in a mirror world will be described with reference to FIGS. 34 to 59. FIG. 34 (A) is a schematic diagram of a multi-service DAO construction system. For example, Bitcoin is a type of DAO, but a node (minor) has only one type of service, mining (competition of bookkeeping rights), and gives an incentive to give Bitcoin to those who succeed in mining. This will add blocks and the Bitcoin system will continue autonomously. On the other hand, a DAO having multiple types of services is called a multi-service DAO. For example, in the case of DAO, which is a company organization, there are multiple services such as material procurement, assembly, promotion, and sales, and it is optimal to distribute what proportion and how much reward to the nodes that performed these multiple services. It becomes a difficult problem whether it will be an incentive design. A system for deriving an optimum solution for incentive design in such a multi-service DAO will be described.

図３４（Ａ）を参照し、マルチ役務ＤＡＯデータを記憶しているミラーワールドサーバ４６が、ＤＡＯエージェント６１、役務１を行うペルソナエージェント群６２、役務２を行うペルソナエージェント群６３、・・・役務ｎを行うペルソナエージェント群６４のデータを記憶している。さらに、ミラーワールドサーバ４６は、強化学習に伴って各ペルソナエージェント群に与えられる報酬ｒ１、ｒ２、・・・ｒｎの種類も記憶している。 With reference to FIG. 34 (A), the mirror world server 46 that stores the multi-service DAO data is a DAO agent 61, a persona agent group 62 that performs service 1, a persona agent group 63 that performs service 2, ... The data of the persona agent group 64 that performs n is stored. Further, the mirror world server 46 also stores the types of rewards r1, r2, ... rn given to each persona agent group in accordance with reinforcement learning.

マルチ役務ＤＡＯ構築業者の端末１６が、ＤＡＯエージェント６１と、必要なペルソナエージェント群と、報酬ｒ１、ｒ２、・・・ｒｎの種類とを、ミラーワールドサーバ４６からダウンロードしてインストールする。端末１６は、パブリックチェーン４を構成する各ノード１９である。この各ノード１９で構成されたパブリックチェーン４で運用されるマルチ役務ＤＡＯ６５のデジタルツイン６６を、ミラーワールド５１内でシミュレーション強化学習を行い、マルチ役務ＤＡＯにおけるインセンティブ設計の最適解を導き出す。そのインセンティブ設計の最適解を、リアルワールド４７における実際のマルチ役務ＤＡＯ６５に適用し、最適なインセンティブ設計のマルチ役務ＤＡＯ６５を作り上げる。このミラーワールド５１内でのシミュレーション強化学習は、ミラーワールドサーバ４６において実行される。その制御を以下に説明する。 The terminal 16 of the multi-service DAO builder downloads and installs the DAO agent 61, the necessary persona agent group, and the types of rewards r1, r2, ... rn from the mirror world server 46. The terminal 16 is each node 19 constituting the public chain 4. The digital twin 66 of the multi-service DAO 65 operated in the public chain 4 composed of each node 19 is subjected to simulation reinforcement learning in the mirror world 51, and the optimum solution of the incentive design in the multi-service DAO is derived. The optimum solution of the incentive design is applied to the actual multi-service DAO65 in the real world 47 to create the multi-service DAO65 of the optimum incentive design. The simulation reinforcement learning in the mirror world 51 is executed in the mirror world server 46. The control will be described below.

図３４（Ｂ）を参照し、端末１６のＣＰＵ１０は、Ｓ６０６においてシミュレーション強化学習準備応答処理を行い、Ｓ６０７においてシミュレーション強化学習応答処理を行う。 With reference to FIG. 34 (B), the CPU 10 of the terminal 16 performs the simulation reinforcement learning preparation response processing in S606 and the simulation reinforcement learning response processing in S607.

ミラーワールドサーバ４６のＣＰＵ１０は、Ｓ６１１においてシミュレーション強化学習準備処理を行い、Ｓ６１２においてシミュレーション強化学習処理を行い、Ｓ６１３においてＤＡＯエージェント強化学習処理を行う。 The CPU 10 of the mirror world server 46 performs simulation reinforcement learning preparation processing in S611, simulation reinforcement learning processing in S612, and DAO agent reinforcement learning processing in S613.

Ｓ６１１に示したシミュレーション強化学習準備処理及びＳ６０６に示したシミュレーション強化学習準備応答処理の具体的制御を図３５に基づいて説明する。シミュレーション強化学習準備応答処理において、端末１６のＣＰＵ１０は、Ｓ６１５において、シミュレーション強化学習を依頼するか否か判定する。依頼しない場合にはこのシミュレーション学習準備応強化答処理が終了してリターンする。依頼する場合は制御がＳ６１６に進み、マルチ役務ＤＡＯデータをミラーワールドサーバ４６へ送信して依頼する。このマルチ役務ＤＡＯデータには、役務の種類が含まれている。例えば、図３６〜図４２において後述するイノベーション誘発ＤＡＯの場合は、役務が、アイデア発案、改良発案、事業化、侵害発見、トークン購入の５種類存在し、それらの役務を送信する。 Specific control of the simulation reinforcement learning preparation process shown in S611 and the simulation reinforcement learning preparation response process shown in S606 will be described with reference to FIG. 35. In the simulation reinforcement learning preparation response processing, the CPU 10 of the terminal 16 determines in S615 whether or not to request the simulation reinforcement learning. If not requested, this simulation learning preparation and strengthening answer processing is completed and returned. When requesting, the control proceeds to S616, and the multi-service DAO data is transmitted to the mirror world server 46 to request. This multi-service DAO data includes service types. For example, in the case of the innovation-induced DAO described later in FIGS. 36 to 42, there are five types of services: idea idea, improvement idea, commercialization, infringement detection, and token purchase, and these services are transmitted.

それをＳ６２０で受信したミラーワールドサーバ４６のＣＰＵ１０は、Ｓ６２１により、マルチ役務の各々にマッチするペルソナ群を設定する。例えば、上記イノベーション誘発ＤＡＯの場合は、アイデア発案及び改良発案のペルソナ群として発明をよく考え出す人々、事業化のペルソナ群として事業化に興味のある人々、侵害発見のペルソナ群として特許法や著作権法に詳しい人々、トークン購入のペルソナ群として投資に興味のある人々等が考えられる。 The CPU 10 of the mirror world server 46 that receives it in S620 sets a persona group that matches each of the multi-services by S621. For example, in the case of the above innovation-induced DAO, people who often think of inventions as a group of idea and improvement idea, people who are interested in commercialization as a group of commercialization personas, and patent law and copyright as a group of infringement discovery personas. People who are familiar with the law, people who are interested in investing as a group of personas for purchasing tokens, etc. can be considered.

次に、Ｓ６２２において、各ペルソナに属するユーザ群を選定する。例えば、上記イノベーション誘発ＤＡＯの場合は、アイデア発案及び改良発案のペルソナ群に属するユーザ群として特許出願の発明者として掲載されているユーザ群、事業化のペルソナ群に属するユーザ群として会社の経営者のユーザ群、侵害発見のペルソナ群に属するユーザ群として弁理士や弁護士のユーザ群、トークン購入のペルソナ群に属するユーザ群としてビットコイン等の仮想通貨を購入したことのあるユーザ群等が考えられる。 Next, in S622, a user group belonging to each persona is selected. For example, in the case of the above innovation-induced DAO, the user group listed as the inventor of the patent application as the user group belonging to the persona group of the idea idea and the improvement idea, and the company manager as the user group belonging to the commercialization persona group. The user group of patent attorneys and lawyers as the user group belonging to the persona group of infringement detection, the user group of users who have purchased virtual currency such as bit coin as the user group belonging to the persona group of token purchase, etc. can be considered. ..

次に、Ｓ６２３において、ペルソナ毎に属するユーザ群をグループピングしてグループ毎にユーザ群の取引データをブロッチェーンから収集し、Ｓ６２４において、取引データを学習データとして機械学習を行ってペルソナ毎に学習済みのペルソナエージェント群を生成する。これら両制御は、前述した図９（Ｂ）のＳ３４６、Ｓ３４７と同様の処理であり、ここでは詳細な説明の繰り返しを省略する。次に、Ｓ６２５において、ペルソナエージェント群をマルチ役務ＤＡＯ６５内に配備してマルチ役務ＤＡＯデジタルツイン６６を生成し、シミュレーション対象としてミラーワールド５１に登録する。その状態が図３６に示されている。 Next, in S623, the user groups belonging to each persona are grouped and the transaction data of the user group is collected from the blockchain for each group, and in S624, machine learning is performed using the transaction data as learning data to learn for each persona. Generate a group of completed persona agents. Both of these controls are the same processes as those of S346 and S347 of FIG. 9B described above, and the repetition of detailed description thereof will be omitted here. Next, in S625, the persona agent group is deployed in the multi-service DAO 65 to generate the multi-service DAO digital twin 66, and the persona agent group is registered in the mirror world 51 as a simulation target. The state is shown in FIG.

図３６を参照し、パブリックチェーンからなるマルチ役務ＤＡＯ６５のデジタルツイン６６をミラーワールド５１に構築する。マルチ役務ＤＡＯデジタルツイン６６には、上記Ｓ６２５においてペルソナエージェント群が配備されて各ノード毎に１つずつペルソナエージェントが配備された状態となっている。それらペルソナエージェントの識別番号が、各役務（アイデア発案、改良発案、事業化、侵害発見、トークン購入）毎に分類されてミラーワールドサーバ４６に記憶されている。例えば、アイデア発案役務のペルソナエージェントの識別番号として、kc29m,1w13a,・・・9nad8がミラーワールドサーバ４６に記憶されている。このマルチ役務ＤＡＯ６５は、前述したイノベーション誘発ＤＡＯであり、以降、イノベーション誘発ＤＡＯを例としてマルチ役務ＤＡＯを説明する。 With reference to FIG. 36, a digital twin 66 of a multi-service DAO 65 composed of a public chain is constructed in a mirror world 51. In the multi-service DAO digital twin 66, a persona agent group is deployed in S625, and one persona agent is deployed for each node. The identification numbers of those persona agents are classified for each service (idea idea, improvement idea, commercialization, infringement detection, token purchase) and stored in the mirror world server 46. For example, kc29m, 1w13a, ... 9nad8 are stored in the mirror world server 46 as the identification number of the persona agent for the idea idea service. This multi-service DAO 65 is the above-mentioned innovation-induced DAO, and the multi-service DAO will be described below by taking the innovation-induced DAO as an example.

さらに、ミラーワールドサーバ４６には、各ペルソナエージェントの行為に対し報酬（インセンティブ）を与えるＤＡＯエージェントも記憶されている。このＤＡＯエージェントが、与える報酬の分配割合や報酬額を強化学習（機械学習）することにより、インセンティブ設計の最適解を導き出す。その強化学習（機械学習）の概略システムを図３７に示す。 Further, the mirror world server 46 also stores a DAO agent that gives a reward (incentive) for the actions of each persona agent. This DAO agent derives the optimum solution for incentive design by performing reinforcement learning (machine learning) on the distribution ratio of the reward to be given and the reward amount. A schematic system of the reinforcement learning (machine learning) is shown in FIG. 37.

図３７を参照し、ペルソナエージェント群がオリジンのアイデア投稿をアクションa11、a12、・・・a1nとして行えば、環境の状態Ｓ１がＤＡＯエージェント６１に入力され、それらアイデア発案役務を行ったペルソナエージェント群６７に対し報酬r11、r12、・・・r1nが与えられる。このアイデア発案役務は、夢やアイデア、ビジネスプラン、技術思想、著作物等の発案を包含する広い概念である。環境の状態Ｓ１は、アイデア発案役務を行ったペルソナエージェント群６７にも与えられる。この環境の状態Ｓ１は、例えば、オリジンアイデア投稿内容、アイデア発案役務を行ったペルソナエージェント６８各々に報酬として与えられたトークンＡ１の変動相場の価格等である。 With reference to FIG. 37, if the persona agent group posts the origin idea as actions a11, a12, ... a1n, the environment state S1 is input to the DAO agent 61, and the persona agent group performing the idea idea service. Rewards r11, r12, ... r1n are given to 67. This idea-creating service is a broad concept that includes the ideas of dreams, ideas, business plans, technical ideas, copyrighted works, and the like. The environmental state S1 is also given to the persona agent group 67 who has performed the idea-creating service. The state S1 of this environment is, for example, the content of the origin idea posting, the floating exchange rate price of the token A1 given as a reward to each of the persona agents 68 who performed the idea idea service, and the like.

上記オリジンのアイデアに対しペルソナエージェント群６８が改良案の投稿等のアクションa21、a22、・・・a2を行えば、環境の状態Ｓ２がＤＡＯエージェント６１に入力され、それら改良案役務を行ったペルソナエージェント群６８に対し報酬r21、r22、・・・r2nが与えられる。環境の状態Ｓ１は、改良案役務を行ったペルソナエージェント群６８にも与えられる。この環境の状態Ｓ２は、例えば、改良案投稿内容、改良案投稿に付与された「いいね！」の数等である。この「いいね！」を付与する主体は、例えば上記オリジンアイデアの投稿に対し報酬として付与されたトークンＡ１を購入した者（ペルソナエージェント群７１）のみに限定している。このように、「いいね！」の付与主体を利害関係人（ステークフォルダ）に限定（制限）する理由は、不正行為を防止するためである。「いいね！」の付与主体を無制限に広げた場合、例えば、改良案の投稿を行った者（ペルソナエージェント群６８）が多数の者（ペルソナエージェント）と結託して多数の「いいね！」を付けてもらう等の不正行為を防止するためである。事業化役務を行ったペルソナエージェント群６９及び侵害対処役務を行ったペルソナエージェント群７０への「いいね！」の付与主体も、同様の理由により、トークンＡ１を購入した者（ペルソナエージェント群７１）のみに限定している。 If the persona agent group 68 performs actions a21, a22, ... a2 such as posting an improvement plan to the above origin idea, the environmental state S2 is input to the DAO agent 61, and the persona who performed the improvement plan service. Rewards r21, r22, ... r2n are given to the agent group 68. The environmental state S1 is also given to the persona agent group 68 who performed the improvement proposal service. The state S2 of this environment is, for example, the content of the improvement proposal submission, the number of “likes” given to the improvement proposal submission, and the like. The entity that gives this "like" is limited to, for example, only those who have purchased the token A1 given as a reward for posting the origin idea (persona agent group 71). In this way, the reason for limiting (restricting) the granting party of "Like!" To interested parties (stake folders) is to prevent fraudulent activities. If the number of "Like" grantees is expanded without limit, for example, the person who posted the improvement plan (Persona agent group 68) colludes with a large number of people (Persona agents) and a large number of "Like!" This is to prevent fraudulent acts such as having people attach. The person who gave "Like" to the persona agent group 69 who performed the commercialization service and the persona agent group 70 who performed the infringement countermeasure service also purchased the token A1 for the same reason (persona agent group 71). Limited to only.

上記オリジンのアイデアに対し事業化役務を行ったペルソナエージェント群６９がアクションa31、a32、・・・a3nを行えば、環境の状態Ｓ３がＤＡＯエージェント６１に入力され、それら事業化役務を行ったペルソナエージェント群６９に対し報酬r31、r32、・・・r3nが与えられる。ペルソナエージェント群６９のアクションa31、a32、・・・a3nとしては、例えば、事業計画書の投稿、事業化の進行状況の投稿、実際の事業化遂行の状況の投稿、事業化された事業による収益額の投稿等が考えられる。環境の状態Ｓ３は、事業化役務を行ったペルソナエージェント群６９にも与えられる。この環境の状態Ｓ３は、例えば、事業計画書の投稿や事業化の進行状況の投稿、事業化された事業による収益額の投稿等に付与された「いいね！」の数等である。 If the persona agent group 69 who performed the commercialization service for the above origin idea performs actions a31, a32, ... a3n, the environmental state S3 is input to the DAO agent 61, and the persona who performed those commercialization services. Rewards r31, r32, ... r3n are given to the agent group 69. Actions a31, a32, ... a3n of the persona agent group 69 include, for example, posting a business plan, posting the progress of commercialization, posting the actual status of commercialization, and profit from the commercialized business. Posting of the amount can be considered. The environmental state S3 is also given to the persona agent group 69 who performed the commercialization service. This environmental state S3 is, for example, the number of “likes” given to the posting of the business plan, the posting of the progress of commercialization, the posting of the profit amount from the commercialized business, and the like.

上記オリジンのアイデアに対し侵害対処役務を行ったペルソナエージェント群７０がアクションa41、a42、・・・a4nを行えば、環境の状態Ｓ４がＤＡＯエージェント６１に入力され、それら侵害対処役務を行ったペルソナエージェント群７０に対し報酬r41、r42、・・・r4nが与えられる。ペルソナエージェント群７０のアクションa31、a32、・・・a3nとしては、例えば、侵害発見の報告投稿、侵害対処報告投稿、ライセンス交渉報告投稿等が考えられる。さらには、これら役務の前提となる特許出願の報告投稿やその権利化の報告投稿役務も含めてもよい。環境の状態Ｓ４は、侵害対処役務を行ったペルソナエージェント群７０にも与えられる。この環境の状態Ｓ４は、例えば、侵害発見の報告投稿、侵害対処報告投稿、ライセンス交渉報告投稿等に付与された「いいね！」の数等である。 If the persona agent group 70 who performed the infringement countermeasure service for the above origin idea performs actions a41, a42, ... a4n, the environmental state S4 is input to the DAO agent 61, and the persona who performed the infringement countermeasure service. Rewards r41, r42, ... r4n are given to the agent group 70. As the actions a31, a32, ... a3n of the persona agent group 70, for example, an infringement discovery report posting, an infringement handling report posting, a license negotiation report posting, and the like can be considered. Furthermore, the service of submitting a report of a patent application and the service of submitting a report of the acquisition of rights, which are the premise of these services, may be included. The environmental state S4 is also given to the persona agent group 70 who has performed the infringement countermeasure service. The state S4 of this environment is, for example, the number of “likes” given to the infringement discovery report posting, the infringement countermeasure report posting, the license negotiation report posting, and the like.

ペルソナエージェント群が上記オリジンのアイデア発案役務に対し付与されたトークンＡ１を購入する役務をアクションa51、a52、・・・a5nとして行えば、環境の状態Ｓ５がＤＡＯエージェント６１に入力され、それらトークン購入役務を行ったペルソナエージェント群７１に対し報酬r51、r52、・・・r5nが与えられる。環境の状態Ｓ５は、トークン購入役務を行ったペルソナエージェント群７１にも与えられる。この環境の状態Ｓ５は、例えば、トークンの購入数（または購入金額）等である。ペルソナエージェント群７１は、仮想通貨（例えばEthereumのETH等）を消費してトークンＡ１を購入する。なお、購入したトークンは、変動相場での価格に従って仮想通貨に変換（換金）でき、その仮想通貨は、変動相場での価格に従って円やドル等の法定通貨に変換（換金）できる。 If the persona agent group purchases the token A1 granted to the origin idea idea service as actions a51, a52, ... a5n, the environment state S5 is input to the DAO agent 61 and those tokens are purchased. Rewards r51, r52, ... r5n are given to the persona agent group 71 who performed the service. The environmental state S5 is also given to the persona agent group 71 who has performed the token purchasing service. The state S5 of this environment is, for example, the number of tokens purchased (or the purchase price). Persona agent group 71 consumes virtual currency (for example, ETH of Ethereum) to purchase token A1. The purchased token can be converted (cashed) into virtual currency according to the price in the floating market, and the virtual currency can be converted (cashed) into legal currency such as yen or dollar according to the price in the floating market.

各ペルソナエージェント群に付与する報酬ｒ1〜r5は、ＤＡＯエージェント６１が報酬テーブル（図３９（Ａ）参照）に基づいて決定する。アイデア発案役務を行ったペルソナエージェントに対しては、r1=A1+B1・ｂ+G1・ｇ、改良役務を行ったペルソナエージェンに対しては、r2=A2・e＋B2・ｂ+G2・ｇ、事業化役務を行ったペルソナエージェントに対しては、r3=A3・e＋B3・ｂ、侵害対処役務を行ったペルソナエージェントに対しては、r4=A4・e＋B4・ｂ+G4・ｇ、トークン購入役務を行ったペルソナエージェントに対しては、r5=B5・ｂ+G5・ｇと、決定する。 The rewards r1 to r5 given to each persona agent group are determined by the DAO agent 61 based on the reward table (see FIG. 39 (A)). For the persona agent who performed the idea idea service, r1 = A1 + B1 ・ b + G1 ・ g, for the persona agent who performed the improvement service, r2 = A2 ・ e ＋ B2 ・ b + G2 ・ g, business R3 = A3 ・ e ＋ B3 ・ b for the persona agent who performed the conversion service, r4 = A4 ・ e ＋ B4 ・ b + G4 ・ g for the persona agent who performed the infringement countermeasure service, and token purchase service. For the persona agent, r5 = B5 ・ b + G5 ・ g is determined.

ここに、A2〜A4,B1〜B5,G1,G2,G4,G5は、係数であり、ＤＡＯエージェント６１が強化学習により最適なものに収束させる。A1はトークン、ｇはライセンス収入、ｅは「いいね！」の数、ｂは事業化収益である。 Here, A2 to A4, B1 to B5, G1, G2, G4, and G5 are coefficients, and the DAO agent 61 converges to the optimum one by reinforcement learning. A1 is token, g is license revenue, e is the number of likes, and b is commercialization revenue.

なお、各ペルソナエージェント群６８〜７１が役務を行った後に発生したライセンス収入ｇまたは事業化収益ｂのみが報酬r2〜r5として考慮される。ライセンス収入ｇまたは事業化収益ｂが既に発生しているオリジンアイデアに対し後から改良役務やトークン購入役務を行うという不正行為を防止するためである。 Only the license revenue g or the commercialization revenue b generated after each persona agent group 68 to 71 performs the service is considered as the reward r2 to r5. This is to prevent fraudulent acts such as performing improvement services or token purchase services later on an origin idea for which license revenue g or commercialization revenue b has already been generated.

また、改良役務、事業化役務、または侵害対処役務を行ったペルソナエージェント群が併せてトークン購入役務を行ってもよい。さらに、アイデア発案役務を行ったペルソナエージェント群６７が、併せて改良役務、事業化役務、または侵害対処役務を行ってもよい。 In addition, a group of persona agents who have performed improvement services, commercialization services, or infringement countermeasure services may also perform token purchase services. Further, the persona agent group 67 who has performed the idea idea service may also perform the improvement service, the commercialization service, or the infringement countermeasure service.

Ｓ６１３に示したＤＡＯエージェント強化学習処理の詳細を図３８に基づいて説明する。この処理は、ＤＡＯエージェント６１が自ら強化学習を行い報酬r1〜r5の最適化を図るものである。ＤＡＯエージェント６１は、Ｓ６３０において、ペルソナエージェント群の各行為ａを受信したか否か判定する。受信していない場合はＳ６３２へ進むが、受信していると判定した場合には制御がＳ６３１へ進み、受信した各行為ａを記憶する。 The details of the DAO agent reinforcement learning process shown in S613 will be described with reference to FIG. 38. In this process, the DAO agent 61 performs reinforcement learning by itself to optimize the rewards r1 to r5. The DAO agent 61 determines in S630 whether or not each act a of the persona agent group has been received. If it is not received, it proceeds to S632, but if it is determined that it is received, the control proceeds to S631 and each received action a is stored.

Ｓ６３２において、「いいね！」が付与されたか否か判定し、付与されていない場合にはＳ６３４へ進むが、付与されていると判定した場合にはＳ６３３において、ペルソナエージェント毎にいいね！eを記憶する。Ｓ６３４において、事業化収益があったか否か判定し、ない場合にはＳ６３６へ進むが、あったと判定した場合にはＳ６３５において、事業化収益ｂを記憶する。Ｓ６３６において、ライセンス利益ｇがあったか否か判定し、ない場合にはＳ６３８に進むが、あったと判定した場合には、Ｓ６３７において、ライセンス利益ｇを記憶する。 In S632, it is determined whether or not "Like!" Is given, and if it is not given, the process proceeds to S634, but if it is determined that it is given, in S633, each persona agent likes it! Remember e. In S634, it is determined whether or not there is commercialization profit, and if it is not, the process proceeds to S636, but if it is determined that there is, in S635, the commercialization profit b is stored. In S636, it is determined whether or not there is a license profit g, and if it is not present, the process proceeds to S638, but if it is determined that there is, the license profit g is stored in S637.

Ｓ６３８において、報酬算出時期になったか否か判定し、なっていない場合にはＳ６４０に進むがなったと判定した場合には、Ｓ６３９において、報酬テーブル（図３９（Ａ））を参照して各報酬ｒ1〜r5を算出して該当するペルソナエージェントへ付与する。Ｓ６４０において、学習更新時期になったか否か判定し、なっていない場合にはこのＤＡＯエージェント強化学習処理が終了してリターンする。学習更新時期になったと判定した場合にはＳ６４１において、報酬として付与したトークンＡ１のトータル付与価格ＴＴと付与したトークンの変動相場における現時点のトータル価格ＴＢとを算出し、Ｓ６４２において、ＴＢ／ＴＴの値からＤＡＯエージェントの報酬Ｒを算出する。例えば、前回の学習更新時におけるＴＢ／ＴＴの値と今回の学習更新時におけるＴＢ／ＴＴの値とを比較し、今回の学習更新時におけるＴＢ／ＴＴの値の方が大きい場合に大きな報酬Ｒにし、小さい場合に小さな報酬Ｒにする。その結果、ＤＡＯエージェント６１が得ることのできる報酬Ｒは、トークンの変動相場におけるトータル価格ＴＢが高騰すれば大きくなり、トークンの変動相場におけるトータル価格ＴＢが下落すれば小さくなる。 In S638, it is determined whether or not it is time to calculate the reward, and if it is not determined, it is determined that the process proceeds to S640. Calculate r1 to r5 and assign it to the corresponding persona agent. In S640, it is determined whether or not the learning update time has come, and if not, the DAO agent reinforcement learning process ends and returns. When it is determined that the learning update time has come, in S641, the total grant price TT of the token A1 granted as a reward and the current total price TB in the floating exchange rate of the token granted are calculated, and in S642, the TB / TT The reward R of the DAO agent is calculated from the value. For example, the value of TB / TT at the time of the previous learning update is compared with the value of TB / TT at the time of the current learning update, and when the value of TB / TT at the time of the current learning update is larger, a large reward R And if it is small, it will be a small reward R. As a result, the reward R that the DAO agent 61 can obtain increases when the total price TB in the floating exchange rate of the token rises, and decreases when the total price TB in the floating exchange rate of the token falls.

次に、Ｓ６４３において、報酬Ｒに基づいて、ＴＤ学習により最適政策π^＊に従った行為A1〜A4,B1〜B5,G1,G2,G4,G5を求める処理が行われ、Ｓ６４４において、報酬テーブルのA1〜A4,B1〜B5,G1,G2,G4,G5を、求められた行為A1〜A4,B1〜B5,G1,G2,G4,G5に更新する。その結果、ＤＡＯエージェント６１は、トークンの変動相場におけるトータル価格ＴＢを高騰させるための最適な行為A1〜A4,B1〜B5,G1,G2,G4,G5を学習することになる。なお、この学習目標は一例に過ぎず、学習目標としては、他に、オリジンアイデアの投稿数を増加させること、オリジンアイデアと改良案との投稿合計数を増加させること、事業化件数を増加させること、事業化収益合計を増加させること等であってもよい。Next, in S643, based on the reward R, the processing of obtaining the actions A1 to A4, B1 to B5, G1, G2, G4, G5 according ^{to the optimum policy π * by TD learning is performed, and in S644, the reward table is performed.} A1 to A4, B1 to B5, G1, G2, G4, G5 are updated to the requested actions A1 to A4, B1 to B5, G1, G2, G4, G5. As a result, the DAO agent 61 learns the optimum actions A1 to A4, B1 to B5, G1, G2, G4, and G5 for raising the total price TB in the floating exchange rate of tokens. This learning goal is just an example, and other learning goals include increasing the number of posts of origin ideas, increasing the total number of posts of origin ideas and improvement plans, and increasing the number of commercializations. That may be to increase the total commercialization revenue.

次に、Ｓ６４５において、強化学習が完了したか否か判定し、未だ完了していない場合はリターンする。完了したと判定した場合にはＳ６４６において、シミュレーション強化学習の依頼者に学習済みマルチ役務ＤＡＯを送信する。 Next, in S645, it is determined whether or not the reinforcement learning is completed, and if it is not completed yet, it returns. When it is determined that the completion is completed, the learned multi-service DAO is transmitted to the requester of the simulation reinforcement learning in S646.

シミュレーション強化学習の依頼者は、インセンティブ設計が最適化された学習済みのマルチ役務ＤＡＯ（イノベーション誘発ＤＡＯ）６５をリアルワールド４７で運用することができる。その結果、このマルチ役務ＤＡＯ（イノベーション誘発ＤＡＯ）６５では、図３７に示した「各ペルソナエージェント群６７〜７１」が、実際のユーザ群となり、各役務を行うユーザ群に対し、学習済みＤＡＯエージェント６１により最適設計された報酬（インセンティブ）が分配される。このリアルワールド４７での実際の運用の段階では、各投稿内容等の役務やトークンの売買取引内容等をブロックチェーンにタイムスタンプ付きで記録させる。その結果、ブロックチェーンが、オリジンアイデア投稿内容や改良案投稿内容に対する公証人の役目を果たしてくれ、新規性喪失の例外適用（特許法３０条）や冒認出願対策（特許法４９条１項７行、７４条、１２３条１項２号）も行いやすくなる。 The client of simulation reinforcement learning can operate a trained multi-service DAO (innovation-induced DAO) 65 with an optimized incentive design in the real world 47. As a result, in this multi-service DAO (innovation-induced DAO) 65, "each persona agent group 67 to 71" shown in FIG. 37 becomes an actual user group, and the trained DAO agent is used for the user group performing each service. The optimally designed reward (incentive) is distributed by 61. At the stage of actual operation in this real world 47, services such as the contents of each post and the contents of token sales transactions are recorded on the blockchain with a time stamp. As a result, the blockchain acts as a notary for the content of origin idea submissions and the content of improvement proposals, and the exception application of loss of novelty (Patent Law Article 30) and measures against denial application (Patent Law Article 49, Paragraph 1, Line 7). , Article 74, Article 123, Paragraph 1, Item 2) will also be easier to perform.

また、マルチ役務ＤＡＯ（イノベーション誘発ＤＡＯ）６５をリアルワールド４７で運用する段階においても、ＤＡＯエージェント６１により引き続き機械学習（強化学習）を続行させ、自際の運用状況にマッチしたより一層最適にインセンティブ設計されたものになるようにしてもよい。なお、学習済みの各ペルソナエージェント群６７〜７１（図４０（Ａ）（Ｂ）及び図４１（Ａ）（Ｂ）による学習済みのペルソナエージェント群）もマルチ役務ＤＡＯ（イノベーション誘発ＤＡＯ）６５に含めてシミュレーション強化学習の依頼者に送信し、各役務を行うユーザ群に対し各ペルソナエージェント群６７〜７１が相談役として機能させてもよい。さらには、マルチ役務ＤＡＯ（イノベーション誘発ＤＡＯ）６５をリアルワールド４７で運用する段階において、各役務をユーザ群と各ペルソナエージェント群６７〜７１との両者が実行する混在型のマルチ役務ＤＡＯ６５にしてもよく、また、各役務を各ペルソナエージェント群６７〜７１のみが実行するペルソナエージェント運用型のマルチ役務ＤＡＯ（イノベーション誘発ＤＡＯ）６５にしてもよい。なお、このイノベーション誘発ＤＡＯは、前述のミラーワールドでのシミュレーション強化学習を経て生成されるものに限定されず、他の方法、例えば人為的な設計に基づいて人為的に生成されるものであってもよく、さらには、ＤＡＯに限らず、特定の管理者や主体を持った組織（例えば通常の株式会社等）であってもよい。 In addition, even at the stage of operating the multi-service DAO (innovation-induced DAO) 65 in the real world 47, the DAO agent 61 will continue machine learning (reinforcement learning), and an even more optimal incentive that matches the operation situation of one's own. It may be designed. In addition, each trained persona agent group 67 to 71 (learned persona agent group according to FIGS. 40 (A) (B) and 41 (A) (B)) is also included in the multi-service DAO (innovation-induced DAO) 65. It may be transmitted to the requester of the simulation reinforcement learning, and each persona agent group 67 to 71 may function as a consultant for the user group performing each service. Furthermore, at the stage of operating the multi-service DAO (innovation-induced DAO) 65 in the real world 47, even if each service is a mixed type multi-service DAO 65 executed by both the user group and each persona agent group 67 to 71. In addition, it may be a persona agent operation type multi-service DAO (innovation-induced DAO) 65 in which each service is executed only by each persona agent group 67 to 71. It should be noted that this innovation-induced DAO is not limited to the one generated through the simulation reinforcement learning in the mirror world described above, but is artificially generated based on another method, for example, an artificial design. Further, it is not limited to DAO, and may be an organization having a specific manager or entity (for example, an ordinary corporation).

次に、図３９（Ｂ）に基づいてペルソナエージェントが強化学習を行う処理のメインルーチンを説明する。Ｓ６４８において、アイデア発案役務実行処理が行われ、Ｓ６４９において、改良役務実行処理が行われ、Ｓ６５０において、事業化役務実行処理が行われ、Ｓ６５１において、侵害対処役務実行処理が行われ、Ｓ６５２において、トークン購入役務実行処理が行われる。 Next, the main routine of the process in which the persona agent performs reinforcement learning will be described based on FIG. 39 (B). In S648, the idea idea service execution process is performed, in S649, the improved service execution process is performed, in S650, the commercialization service execution process is performed, in S651, the infringement countermeasure service execution process is performed, and in S652, the infringement countermeasure service execution process is performed. Token purchase service execution process is performed.

Ｓ６４８に示したアイデア発案役務実行処理の詳細を図４０（Ａ）に基づいて説明する。Ｓ６５５において、アイデア発案を行うか否か判定し行わない場合にはリターンする。行うと判定した場合には、Ｓ６５６において、アイデアを創出する処理を行う。このアイデアの創出は、例えばDABUSというＡＩを利用する。例えば、ペルソナエージェント６７とDABUSとが協働してアイデアの創出を行う。Ｓ６５７において、アイデア発案の投稿内容を生成し、Ｓ６５８において、アイデア発案投稿行為ａ1iを実行する。 The details of the idea-initiating service execution process shown in S648 will be described with reference to FIG. 40 (A). In S655, it is determined whether or not to propose an idea, and if not, a return is made. If it is determined to be performed, the process of creating an idea is performed in S656. The creation of this idea uses, for example, an AI called DABUS. For example, Persona Agent 67 and DABUS collaborate to create ideas. In S657, the posted content of the idea idea is generated, and in S658, the idea idea posting act a1i is executed.

Ｓ６５９において、ＤＡＯエージェント６１から報酬r1iを受信したか否か判定し、受信していない場合にはリターンする。受信したと判定した場合には、Ｓ６６０において、報酬r1iに基づいて、ＴＤ学習により最適政策π^＊に従った行為ａを求める。この行為ａは、受け取った報酬r1iが満足できるものであればアイデア発案の行為を繰り返し継続することになるが、報酬r1iが満足できるものでなければ、他の行為（例えば、改良役務、事業化役務、侵害対処役務、トークン購入役務、あるいはなにも役務を行わない）を選択することとなる。In S659, it is determined whether or not the reward r1i has been received from the DAO agent 61, and if not, the reward r1i is returned. If it is determined that the signal has been received, in S660, the action a according ^{to the optimum policy π * is obtained by TD learning based on the reward r1i.} In this act a, if the received reward r1i is satisfactory, the act of inventing an idea is repeatedly continued, but if the reward r1i is not satisfactory, another act (for example, improvement service, commercialization) is continued. You will have to select services, infringement handling services, token purchasing services, or no services).

Ｓ６４９に示された改良役務実行処理の詳細を図４０（Ｂ）に基づいて説明する。Ｓ６６４において、改良案を投稿するか否か判定し、投稿しない場合にはリターンする。投稿すると判定した場合にはＳ６６５において改良案を創出する処理を行う。改良案の創出は、例えばDABUSというＡＩを利用する。例えば、ペルソナエージェント６８とDABUSとが協働して改良案を創出する処理を行う。Ｓ６６６において改良案投稿内容を生成し、Ｓ６６７において、改良案投稿行為ａ2iを実行する。 The details of the improved service execution process shown in S649 will be described with reference to FIG. 40 (B). In S664, it is determined whether or not to post the improvement plan, and if it is not posted, a return is made. If it is determined to post, the process of creating an improvement plan is performed in S665. For the creation of improvement plans, for example, AI called DABUS is used. For example, Persona Agent 68 and DABUS collaborate to create an improvement plan. In S666, the content of the improvement proposal posting is generated, and in S667, the improvement proposal posting act a2i is executed.

Ｓ６６８において、ＤＡＯエージェント６１から報酬r２iを受信したか否か判定し、受信していない場合にはリターンする。受信したと判定した場合には、Ｓ６６９において、報酬r２iに基づいて、ＴＤ学習により最適政策π^＊に従った行為ａを求める。この行為ａは、受け取った報酬r２iが満足できるものであれば改良案投稿の行為を繰り返し継続することになるが、報酬r２iが満足できるものでなければ、他の行為（例えば、アイデア発案役務、事業化役務、侵害対処役務、トークン購入役務、あるいはなにも役務を行わない）を選択することとなる。In S668, it is determined whether or not the reward r2i has been received from the DAO agent 61, and if not, the reward r2i is returned. If it is determined that the message has been received, in S669, the action a according ^{to the optimum policy π * is obtained by TD learning based on the reward r2i.} If the reward r2i received is satisfactory, this act a will continue the act of posting the improvement proposal repeatedly, but if the reward r2i is not satisfactory, another act (for example, an idea idea service, etc.) You will have to select commercialization services, infringement handling services, token purchasing services, or no services).

Ｓ６５０に示された事業化役務実行処理の詳細を図４１（Ａ）に基づいて説明する。Ｓ６７４において、事業化するか否か判定し、事業化しない場合はリターンする。事業化すると判定した場合は、Ｓ６７５において、事業計画書を生成し、Ｓ６７６において、事業計画書の投稿行為ａ3iを実行し、Ｓ６７７において、事業化役務を遂行し、Ｓ６７８において、遂行状況投稿行為ａ3iを実行する。この遂行状況投稿行為ａ3iには、前述した事業化により得た収益の投稿等も含まれる。 The details of the commercialization service execution process shown in S650 will be described with reference to FIG. 41 (A). In S674, it is determined whether or not to commercialize, and if it is not commercialized, a return is made. If it is determined to be commercialized, a business plan is generated in S675, the business plan posting act a3i is executed in S676, the commercialization service is executed in S677, and the execution status posting act a3i is executed in S678. To execute. This performance status posting act a3i also includes posting of profits obtained from the above-mentioned commercialization.

Ｓ６７９において、ＤＡＯエージェント６１から報酬r3iを受信したか否か判定し、受信していない場合はリターンする。受信していると判定した場合はＳ６８０において、受信した報酬r3iに基づいて、ＴＤ学習により最適政策π^＊に従った行為aを求める。この行為ａは、受け取った報酬r3iが満足できるものであれば事業化役務の行為を繰り返し継続することになるが、報酬r3iが満足できるものでなければ、他の行為（例えば、アイデア発案役務、改良役務、侵害対処役務、トークン購入役務、あるいはなにも役務を行わない）を選択することとなる。In S679, it is determined whether or not the reward r3i has been received from the DAO agent 61, and if not, the reward r3i is returned. If it is determined that the message has been received, in S680, the action a according ^{to the optimum policy π * is obtained by TD learning based on the received reward r3i.} If the reward r3i received is satisfactory, this act a will continue the act of the commercialization service repeatedly, but if the reward r3i is not satisfactory, another act (for example, the idea idea service, etc.) You will have to select improved services, infringement handling services, token purchasing services, or no services).

Ｓ６５１に示した侵害対処役務実行処理の詳細を図４１（Ｂ）に基づいて説明する。Ｓ６８４において、侵害対処役務を実行するか否か判定する。実行しない場合にはリターンするが、実行すると判定した場合にはＳ６８５において、侵害行為の捜査を行い、Ｓ６８６において、侵害行為を発見したか否か判定する。なお、侵害行為の捜査を行う前に、前述したように、特許出願やその権利化の行為を行ってもよい。侵害行為を発見しなかった場合にはリターンするが、侵害行為を発見したと判定した場合には、Ｓ６８７において、被疑侵害者への警告書を生成し、Ｓ６８８において、警告書投稿行為ａ4iを実行し、Ｓ６８９において、被疑侵害者との交渉等、侵害対処行為ａ4iを遂行し、Ｓ６９０において、遂行状況投稿行為ａ4iを実行する。 The details of the infringement countermeasure service execution process shown in S651 will be described with reference to FIG. 41 (B). In S684, it is determined whether or not to execute the infringement countermeasure service. If it is not executed, it returns, but if it is determined to be executed, the infringing act is investigated in S685, and it is determined in S686 whether or not the infringing act is found. Before investigating the infringing act, as described above, the patent application and the act of acquiring the right may be filed. If the infringing act is not found, it returns, but if it is determined that the infringing act is found, a warning letter to the suspected infringer is generated in S687, and the warning letter posting act a4i is executed in S688. Then, in S689, the infringement coping act a4i such as negotiation with the suspected infringer is executed, and in S690, the execution status posting act a4i is executed.

次に、Ｓ６９１において、ＤＡＯエージェント６１から報酬r4iを受信したか否か判定し、受信していない場合はリターンする。受信していると判定した場合はＳ６９２において、受信した報酬r4iに基づいて、ＴＤ学習により最適政策π^＊に従った行為aを求める。この行為ａは、受け取った報酬r4iが満足できるものであれば事業化役務の行為を繰り返し継続することになるが、報酬r4iが満足できるものでなければ、他の行為（例えば、アイデア発案役務、改良役務、事業化役務、トークン購入役務、あるいはなにも役務を行わない）を選択することとなる。Next, in S691, it is determined whether or not the reward r4i has been received from the DAO agent 61, and if not, the reward r4i is returned. If it is determined that the message has been received, in S692, the action a according ^{to the optimum policy π * is obtained by TD learning based on the received reward r4i.} If the reward r4i received is satisfactory, this act a will continue the act of the commercialization service repeatedly, but if the reward r4i is not satisfactory, another act (for example, the idea idea service, etc.) You will have to select improvement services, commercialization services, token purchase services, or no services).

次に、Ｓ６５２に示されたトークン購入役務実行処理の詳細を図４２（Ａ）に基づいて説明する。Ｓ９６９において、トークンを購入するか否か判定し、購入しない場合にはリターンする。購入すると判定した場合には、Ｓ６９７において、トークン購入行為ａ5iを実行する。次に、Ｓ６９８において、ＤＡＯエージェント６１から報酬r5iを受信したか否か判定し、受信していない場合はリターンする。受信していると判定した場合はＳ６９９において、受信した報酬r5iに基づいて、ＴＤ学習により最適政策π^＊に従った行為aを求める。この行為ａは、受け取った報酬r5iが満足できるものであれば事業化役務の行為を繰り返し継続することになるが、報酬r5iが満足できるものでなければ、他の行為（例えば、アイデア発案役務、改良役務、事業化役務、侵害対処役務、あるいはなにも役務を行わない）を選択することとなる。Next, the details of the token purchase service execution process shown in S652 will be described with reference to FIG. 42 (A). In S969, it is determined whether or not to purchase the token, and if it is not purchased, a return is made. If it is determined to purchase, the token purchase act a5i is executed in S697. Next, in S698, it is determined whether or not the reward r5i has been received from the DAO agent 61, and if not, the reward r5i is returned. If it is determined that the message has been received, in S699, the action a according ^{to the optimum policy π * is obtained by TD learning based on the received reward r5i.} If the reward r5i received is satisfactory, this act a will continue the act of the commercialization service repeatedly, but if the reward r5i is not satisfactory, another act (for example, the idea idea service, etc.) Improvement services, commercialization services, infringement handling services, or no services) will be selected.

図４２（Ｂ）に基づいて、ペルソナエージェント群７１によるトークン購入に伴うトークン７２の変動相場での価格変動を説明する。アイデア発案役務を行ったペルソナエージェント６７に報酬Ａ１として５０トークン（時価総額５万円）７２が付与されており、そのトークン７２の一部（１０トークン）を１万円相当の仮想通貨を支払ってペルソナエージェント７１ａが購入した。次に、その１０トークンを１万５千円相当の仮想通貨を支払ってペルソナエージェント７１ｂが購入した。その結果、１０トークンが１万５千円に価値が高騰する。それをペルソナエージェント７１ｃが２万円相当の仮想通貨を支払って購入した。その結果、１０トークンが２万円に価値が高騰する。それをペルソナエージェント７１ｄが２万５千円相当の仮想通貨を支払って購入した。その結果、１０トークンが２万５千円に価値が高騰する。それをペルソナエージェント７１ｅが３万円相当の仮想通貨を支払って購入した。その結果、１０トークンが３万円に価値が高騰する。 Based on FIG. 42 (B), the price fluctuation of the token 72 in the floating exchange rate due to the token purchase by the persona agent group 71 will be described. 50 tokens (market capitalization 50,000 yen) 72 are given to the persona agent 67 who performed the idea idea service as a reward A1, and a part of the token 72 (10 tokens) is paid a virtual currency equivalent to 10,000 yen. Purchased by Persona Agent 71a. Next, the persona agent 71b purchased the 10 tokens by paying a virtual currency equivalent to 15,000 yen. As a result, the value of 10 tokens will rise to 15,000 yen. Persona Agent 71c purchased it by paying a virtual currency equivalent to 20,000 yen. As a result, the value of 10 tokens will rise to 20,000 yen. Persona Agent 71d purchased it by paying a virtual currency equivalent to 25,000 yen. As a result, the value of 10 tokens will rise to 25,000 yen. Persona Agent 71e purchased it by paying a virtual currency equivalent to 30,000 yen. As a result, the value of 10 tokens will rise to 30,000 yen.

これにより、ペルソナエージェント６７の手持ちの４０トークン（時価総額４万円）が時価総額１２万円に高騰する。このトークンは期待値に正比例して高騰するのであり、人気のあるオリジンアイデアほど高くなり、人気のある（いいね！の多い）改良案が投稿されるほど高くなり、人気のある（いいね！の多い）事業化が投稿されるほど高くなり、人気のある（いいね！の多い）侵害対処が投稿されるほど高くなる。 As a result, 40 tokens (market capitalization of 40,000 yen) on hand of Persona Agent 67 will soar to a market capitalization of 120,000 yen. This token soars in direct proportion to expectations, the more popular origin ideas are, the more popular (likes) improvements are posted, and the more popular (likes!) Are posted. The more commercializations are posted, the higher the price, and the more popular (many likes) infringement countermeasures are posted, the higher the price.

マルチ役務ＤＡＯ６５をリアルワールド４７で実際に運用する段階では、前述したように、図３７の「各ペルソナエージェント群６７〜７１」がリアルワールドにおけるユーザ群となる。その場合に、アイデア発案役務を行ったユーザ群に与えられたトークン７２ばかりでなく、各種役務を行うユーザ自身のトークン（以下「マイトークン」という）も売買させてもよい。その投稿された役務内容を閲覧した他のユーザが当該投稿者に期待して当該投稿者自身のマイトークンを購入することにより、変動相場でのマイトークンの価格が高騰する。この場合に、投稿者の所得の一部を、マイトークン購入者に購入量に応じた割合で配当してもよい。このマイトークンは、マルチ役務ＤＡＯ６５内で発行してもよいが、マイトークンを発行・流通させる専門業者がユーザに対し発行したマイトークンとリンクさせ、その専門業者発行のマイトークンをマルチ役務ＤＡＯ６５のユーザが売買できるようにしてもよい。マイトークンを発行・流通させる専門業者としては、現在、株式会社VALUがある。 At the stage of actually operating the multi-service DAO65 in the real world 47, as described above, “each persona agent group 67 to 71” in FIG. 37 becomes a user group in the real world. In that case, not only the token 72 given to the group of users who have performed the idea-creating service, but also the tokens of the users themselves who perform various services (hereinafter referred to as "my token") may be bought and sold. When another user who browses the posted service content purchases the poster's own My Token in anticipation of the poster, the price of the My Token in the floating exchange rate rises. In this case, a part of the poster's income may be distributed to the My Token purchaser at a rate according to the purchase amount. This My Token may be issued within the Multi-Service DAO65, but the My Token issued by the specialist who issues and distributes the My Token is linked to the My Token issued to the user, and the My Token issued by the Specialist is linked to the Multi-Service DAO65. The user may be able to buy and sell. Currently, VALU Inc. is a specialist company that issues and distributes My Tokens.

次に、複数の機能エレメントが協働して統制のとれた１つのＤＡＯを構築するシステムを図４３〜図５９に基づいて説明する。このようなＤＡＯを以下「エレメント統合ＤＡＯ」と称する。 Next, a system in which a plurality of functional elements work together to construct one well-controlled DAO will be described with reference to FIGS. 43 to 59. Such DAO is hereinafter referred to as "element integrated DAO".

図４３を参照し、このエレメント統合ＤＡＯは、リアルワールド４７において既に存在する会社組織等をＤＡＯで簡単に構築できるようにするものであり、その機能エレメント毎に予めエレメントＤＡＯが生成され用意されている。つまり、機能エレメント毎にモジュール化されたエレメントＤＡＯが用意されており、必要となるエレメントＤＡＯを選んで組み合わせることにより、簡単に所望のエレメント統合ＤＡＯを構築できるように構成されている。エレメントＤＡＯ提供業者７３には、サーバ７４とエレメントＤＡＯプロトコルＤＢ７５とが設けられている。エレメントＤＡＯプロトコルＤＢ７５には、例えば、会社関係に必要な機能エレメント毎に用意されたエレメントＤＡＯ、ＮＰＯ（Nonprofit Organization）関係に必要な機能エレメント毎に用意されたエレメントＤＡＯ、地方自治体に必要な機能エレメント毎に用意されたエレメントＤＡＯ等が記憶されている。 With reference to FIG. 43, this element integrated DAO makes it possible to easily construct a company organization or the like that already exists in the real world 47 with DAO, and an element DAO is generated and prepared in advance for each functional element. There is. That is, a modularized element DAO is prepared for each functional element, and it is configured so that a desired element integrated DAO can be easily constructed by selecting and combining necessary element DAOs. The element DAO provider 73 is provided with a server 74 and an element DAO protocol DB 75. The element DAO protocol DB75 includes, for example, an element DAO prepared for each functional element required for a company relationship, an element DAO prepared for each functional element required for an NPO (Nonprofit Organization) relationship, and a functional element required for a local government. The element DAO and the like prepared for each are stored.

エレメント統合ＤＡＯ構築業者は、依頼者からエレメント統合ＤＡＯ構築の注文を受けて、必要となる機能エレメントに相当するエレメントＤＡＯを、サーバ７４を経由してＰＣ端末７６にインストールする。図４３の例では、Ａ１エレメントＤＡＯ（Ａ１エレメントエージェント含む）、Ａ２エレメントＤＡＯ（Ａ２エレメントエージェント含む）、Ａ５エレメントＤＡＯ（Ａ５エレメントエージェント含む）、Ａ９エレメントＤＡＯ（Ａ９エレメントエージェント含む）がインストールされている。Ａ１〜Ａ９の各エレメントエージェントとは、対応する各エレメントＤＡＯが最高のパフォーマンスを発揮できるように強化学習（機械学習）するためのＡＩである。 The element integrated DAO builder receives an order for element integrated DAO construction from the client, and installs the element DAO corresponding to the required functional element on the PC terminal 76 via the server 74. In the example of FIG. 43, A1 element DAO (including A1 element agent), A2 element DAO (including A2 element agent), A5 element DAO (including A5 element agent), and A9 element DAO (including A9 element agent) are installed. .. Each element agent of A1 to A9 is an AI for reinforcement learning (machine learning) so that each corresponding element DAO can exhibit the best performance.

また、ＰＣ端末７６には、統括エージェントもインストールされる。この統括エージェントは、各エレメントＤＡＯのエレメントエージェントを統括してエレメント統合ＤＡＯ全体が最適化されるように制御するものであり、統括エージェント自身も強化学習（機械学習）を行って全体最適化を達成する。各エレメントエージェントは、担当するエレメントＤＡＯのパフォーマンスを最大化するためのものであるため、エレメントエージェントのみでは部分最適化に陥り、エレメント統合ＤＡＯの全体最適化が達成できない虞がある。そこで、エレメント統合ＤＡＯ全体が最適化されるように制御する統括エージェントが必要となる。これは、例えば、不完全情報ゲームにおけるパレート最適解探しと同じといえる。 In addition, a general agent is also installed in the PC terminal 76. This supervising agent supervises the element agents of each element DAO and controls so that the entire element integrated DAO is optimized, and the supervising agent itself also performs reinforcement learning (machine learning) to achieve overall optimization. do. Since each element agent is for maximizing the performance of the element DAO in charge, there is a possibility that the element agent alone may fall into partial optimization and the overall optimization of the element integrated DAO cannot be achieved. Therefore, a central agent that controls the entire element integrated DAO to be optimized is required. This can be said to be the same as searching for a Pareto optimal solution in a perfect information game, for example.

ＰＣ６７にインストールされたエレメント統合ＤＡＯを、ミラーワールド５１をシミュレーション環境としてシミュレーション強化学習を行うために、複数の端末１６にインストールし、それら端末１６をノード１９とするプライベートチェーン２からなるブロックチェーンのデジタルツイン２Ｔを、ミラーワールド５１内に生成する。 The element integrated DAO installed on the PC 67 is installed on a plurality of terminals 16 in order to perform simulation reinforcement learning using the mirror world 51 as a simulation environment, and the block chain digital consisting of a private chain 2 having the terminals 16 as nodes 19 is digital. Twin 2T is generated in the mirror world 51.

このエレメント統合ＤＡＯのシミュレーション強化学習のメインルーチンを図４４（Ａ）に基づいて説明する。エレメント統合ＤＡＯのシミュレーション強化学習を依頼する依頼者のユーザ端末１６のＣＰＵ１０は、Ｓ６７４においてシミュレーション強化学習準備応答処理を行い、Ｓ６７５においてシミュレーション強化学習応答処理を行う。ミラーワールドサーバ４６のＣＰＵ１０は、Ｓ６７９においてシミュレーション強化学習準備処理を行い、Ｓ６８０においてシミュレーション強化学習処理を行う。 The main routine of simulation reinforcement learning of this element integrated DAO will be described with reference to FIG. 44 (A). The CPU 10 of the client user terminal 16 requesting the simulation reinforcement learning of the element integrated DAO performs the simulation reinforcement learning preparation response processing in S674 and the simulation reinforcement learning response processing in S675. The CPU 10 of the mirror world server 46 performs a simulation reinforcement learning preparation process in S679 and a simulation reinforcement learning process in S680.

Ｓ６７４に示されたシミュレーション強化学習準備応答処理及びＳ６７９に示されたシミュレーション強化学習準備処理の詳細を、図４４（Ｂ）に基づいて説明する。シミュレーション強化学習準備応答処理においてユーザ端末１６のＣＰＵ１０は、Ｓ６７９において、シミュレーション強化学習を依頼するか否か判定し、依頼しない場合はリターンする。依頼すると判定した場合は、Ｓ６８０において、ＤＡＯデータとパーソナルＡＩ群とを送信して依頼する。ＤＡＯデータとは、シミュレーション強化学習してもらいたい組織の機能エレメントである。例えば、家具組立販売会社の場合は、資材調達エレメント、組立エレメント、宣伝エレメント、販売エレメントである。パーソナルＡＩ群とは、リアルワールド４７において実際にエレメント統合ＤＡＯに従事する人々のパーソナルＡＩである。パーソナルＡＩを有していない従事者がいる場合、及び、未だに従事者が決まっていない場合には、前述のＳ５６１、Ｓ５６５〜Ｓ５６８、Ｓ５６２、Ｓ５７３〜Ｓ５７５に基づいて説明したように、シミュレーション強化学習対象のエレメント統合ＤＡＯにマッチするパーソナルＡＩを生成して準備する。 The details of the simulation reinforcement learning preparation response processing shown in S674 and the simulation reinforcement learning preparation processing shown in S679 will be described with reference to FIG. 44 (B). In the simulation reinforcement learning preparation response processing, the CPU 10 of the user terminal 16 determines in S679 whether or not to request the simulation reinforcement learning, and returns if not requested. If it is determined that the request is made, the DAO data and the personal AI group are transmitted in S680 to make the request. DAO data is a functional element of an organization that wants simulation reinforcement learning. For example, in the case of a furniture assembly and sales company, it is a material procurement element, an assembly element, an advertising element, and a sales element. The personal AI group is a personal AI of people who are actually engaged in element integrated DAO in the real world 47. If there is a worker who does not have a personal AI, or if the worker has not been decided yet, simulation reinforcement learning will be performed as described based on S561, S565-S568, S562, and S573 to S575 described above. Generate and prepare a personal AI that matches the target element integration DAO.

シミュレーション強化学習準備処理において、ミラーワールドサーバ４６のＣＰＵ１０は、Ｓ６８３において、シミュレーション強化学習の依頼があったか否か判定し、ない場合にはリターンする。シミュレーション強化学習の依頼があったと判定した場合には、Ｓ６８４において、パーソナルＡＩ群をコピーしてエレメント統合ＤＡＯ内に配備してエレメント統合ＤＡＯデジタルツインを生成し、シミュレーション対象としてミラーワールド５１に登録する。 In the simulation reinforcement learning preparation process, the CPU 10 of the mirror world server 46 determines in S683 whether or not there is a request for simulation reinforcement learning, and returns if there is no request. When it is determined that there is a request for simulation reinforcement learning, in S684, the personal AI group is copied and deployed in the element integrated DAO to generate the element integrated DAO digital twin, and it is registered in the mirror world 51 as a simulation target. ..

その状態を図４５に示している。リアルワールド４７のエレメント統合ＤＡＯ７７のデジタルツイン７８がミラーワールド５１に登録されている。図４５に示すエレメント統合ＤＡＯデジタルツイン７８は、例えば家具組立販売会社のエレメント統合ＤＡＯデジタルツイン７８であり、資材調達、組立、宣伝、販売の、各機能エレメントを有し、その各機能エレメントに従事する者たちのパーソナルＡＩ群の識別番号が、ミラーワールドサーバ４６に記憶されている。 The state is shown in FIG. The digital twin 78 of the element integrated DAO77 of the real world 47 is registered in the mirror world 51. The element-integrated DAO digital twin 78 shown in FIG. 45 is, for example, an element-integrated DAO digital twin 78 of a furniture assembly and sales company, has each functional element of material procurement, assembly, promotion, and sales, and engages in each functional element. The identification numbers of the personal AI groups of those who do so are stored in the mirror world server 46.

このエレメント統合ＤＡＯデジタルツイン７８について、シミュレーション強化学習により最適なインセンティブ設計を導き出す。その強化学習（機械学習）の概略システムを図４６に示す。 For this element-integrated DAO digital twin 78, the optimum incentive design is derived by simulation reinforcement learning. A schematic system of the reinforcement learning (machine learning) is shown in FIG.

図４６を参照し、エレメント統合ＤＡＯデジタルツイン７８では、資材調達、組立、宣伝、販売の、各機能エレメントに対応して、資材調達エレメントエージェント８０及び資材調達担当のパーソナルＡＩ群８４、組立エレメントエージェント８１及び組立担当のパーソナルＡＩ群８５、宣伝エレメントエージェント８２及び宣伝担当のパーソナルＡＩ群８６、資材調達エレメントエージェント８０及び資材調達担当のパーソナルＡＩ群８４、販売エレメントエージェント８３及び販売担当のパーソナルＡＩ群８７が、形成されている。それら各エレメントエージェント８０〜８３を統括エージェント７９が統括する。 With reference to FIG. 46, in the element integrated DAO digital twin 78, the material procurement element agent 80, the personal AI group 84 in charge of material procurement, and the assembly element agent correspond to each functional element of material procurement, assembly, advertisement, and sales. 81 and personal AI group 85 in charge of assembly, advertising element agent 82 and personal AI group 86 in charge of advertising, material procurement element agent 80 and personal AI group 84 in charge of material procurement, sales element agent 83 and personal AI group 87 in charge of sales Is formed. The controlling agent 79 controls each of these element agents 80 to 83.

資材調達担当のパーソナルＡＩ群８４が内部打合せにおいて提案等の行為ａ11,a12,・・・a1nを行い、最終的にまとまった行為a1を資材提供業者のデジタルツイン群８８に対して実行する。その行為a1に対する資材提供業者のデジタルツイン群８８の状態Ｓ１が資材調達エレメントエージェント８０及び資材調達担当のパーソナルＡＩ群８４に入力される。この状態Ｓ１は、例えば、資材要求数及び価格交渉の行為a1に対しての返答資材数及び返答価格等である。なお、資材調達担当のパーソナルＡＩ群８４の各行為a11,a12,・・・a1nは資材調達エレメントエージェント８０にも入力され、まとまった行為a1は、統括エージェント７９及び資材調達エレメントエージェント８０にも入力される。 The personal AI group 84 in charge of material procurement performs actions a11, a12, ... a1n such as proposals at an internal meeting, and finally executes the set actions a1 to the digital twin group 88 of the material provider. The state S1 of the digital twin group 88 of the material provider for the act a1 is input to the material procurement element agent 80 and the personal AI group 84 in charge of material procurement. This state S1 is, for example, the number of material requests, the number of response materials for the act of price negotiation a1, the response price, and the like. In addition, each act a11, a12, ... a1n of the personal AI group 84 in charge of material procurement is also input to the material procurement element agent 80, and the collective action a1 is also input to the general agent 79 and the material procurement element agent 80. Will be done.

資材調達エレメントエージェント８０は、行為a1及び状態Ｓ１に基づいて、資材調達担当のパーソナルＡＩ群８４によるパフォーマンスp1を算出し、そのパフォーマンスp1を統括エージェント７９に送信する。統括エージェント７９は、そのパフォーマンスp1に基づいて報酬r1を決定し、その報酬r1を資材調達エレメントエージェント８０へ送信する。資材調達エレメントエージェント８０は、資材調達担当のパーソナルＡＩ群８４の各行為ａ11,a12,・・・a1nに基づいて報酬分配率を決定し、その報酬分配率に従って報酬r1を各資材調達担当のパーソナルＡＩ８４に分配する。 The material procurement element agent 80 calculates the performance p1 by the personal AI group 84 in charge of material procurement based on the act a1 and the state S1, and transmits the performance p1 to the controlling agent 79. The general agent 79 determines the reward r1 based on the performance p1, and transmits the reward r1 to the material procurement element agent 80. The material procurement element agent 80 determines the reward distribution rate based on each action a11, a12, ... a1n of the personal AI group 84 in charge of material procurement, and sets the reward r1 to the personal in charge of each material procurement according to the reward distribution rate. Distribute to AI84.

組立担当のパーソナルＡＩ群８５が内部打合せにおいて提案等の行為ａ21,a22,・・・a2iを行い、最終的にまとまった行為a2を組立設備のデジタルツイン群８９に対して実行する。その行為a2に対する組立設備のデジタルツイン群８９の状態Ｓ２が組立エレメントエージェント８１及び組立担当のパーソナルＡＩ群８５に入力される。この状態Ｓ１は、例えば、組立設備デジタルツイン群８９の消費電力及び組立設備デジタルツイン群８９に従事した組立担当のパーソナルＡＩ群８５の総労働時間等である。なお、組立担当のパーソナルＡＩ群８４の各行為a21,a22,・・・a2iは組立エレメントエージェント８１にも入力され、まとまった行為a2は、統括エージェント７９及び組立エレメントエージェント８１にも入力される。 The personal AI group 85 in charge of assembly performs actions a21, a22, ... a2i such as proposals at an internal meeting, and finally executes the set actions a2 on the digital twin group 89 of the assembly facility. The state S2 of the digital twin group 89 of the assembly equipment with respect to the act a2 is input to the assembly element agent 81 and the personal AI group 85 in charge of assembly. This state S1 is, for example, the power consumption of the assembly equipment digital twin group 89 and the total working hours of the personal AI group 85 in charge of assembly engaged in the assembly equipment digital twin group 89. The actions a21, a22, ... a2i of the personal AI group 84 in charge of assembly are also input to the assembly element agent 81, and the combined actions a2 are also input to the control agent 79 and the assembly element agent 81.

組立エレメントエージェント８１は、行為a2及び状態Ｓ２に基づいて、組立担当のパーソナルＡＩ群８５によるパフォーマンスp2を算出し、そのパフォーマンスp2を統括エージェント７９に送信する。統括エージェント７９は、そのパフォーマンスp2に基づいて報酬r2を決定し、その報酬r2を組立エレメントエージェント８１へ送信する。組立エレメントエージェント８１は、組立担当のパーソナルＡＩ群８５の各行為ａ21,a22,・・・a2iに基づいて報酬分配率を決定し、その報酬分配率に従って報酬r2を組立担当のパーソナルＡＩ８５に分配する。 The assembly element agent 81 calculates the performance p2 by the personal AI group 85 in charge of assembly based on the act a2 and the state S2, and transmits the performance p2 to the control agent 79. The controlling agent 79 determines the reward r2 based on the performance p2, and transmits the reward r2 to the assembly element agent 81. The assembly element agent 81 determines the reward distribution rate based on each action a21, a22, ... a2i of the personal AI group 85 in charge of assembly, and distributes the reward r2 to the personal AI 85 in charge of assembly according to the reward distribution rate. ..

宣伝担当のパーソナルＡＩ群８６が内部打合せにおいて提案等の行為ａ51,a52,・・・a5jを行い、最終的にまとまった行為a5を消費者のパーソナルＡＩ群９０に対して実行する。その行為a5に対する消費者のパーソナルＡＩ群９０の状態Ｓ５が宣伝エレメントエージェント８２及び宣伝担当のパーソナルＡＩ群８６に入力される。この状態Ｓ５は、例えば、消費者への商品レコメンド行為a5に対しての当該消費者の当該商品購入の有無や購入金額等である。なお、宣伝担当のパーソナルＡＩ群８６の各行為a51,a５2,・・・a5jは宣伝エレメントエージェント８２にも入力され、まとまった行為a5は、統括エージェント７９及び宣伝エレメントエージェント８２にも入力される。 The personal AI group 86 in charge of advertising performs actions such as proposals a51, a52, ... a5j at an internal meeting, and finally executes a set action a5 on the consumer's personal AI group 90. The state S5 of the consumer's personal AI group 90 with respect to the act a5 is input to the promotion element agent 82 and the personal AI group 86 in charge of promotion. This state S5 is, for example, whether or not the consumer has purchased the product, the purchase price, and the like for the product recommendation act a5 to the consumer. The actions a51, a52, ... a5j of the personal AI group 86 in charge of promotion are also input to the promotion element agent 82, and the combined actions a5 are also input to the general agent 79 and the promotion element agent 82.

宣伝エレメントエージェント８２は、行為a5及び状態Ｓ５に基づいて、宣伝担当のパーソナルＡＩ群８６によるパフォーマンスp5を算出し、そのパフォーマンスp5を統括エージェント７９に送信する。統括エージェント７９は、そのパフォーマンスp5に基づいて報酬r5を決定し、その報酬r5を宣伝エレメントエージェント８２へ送信する。宣伝エレメントエージェント８２は、宣伝担当のパーソナルＡＩ群８６の各行為ａ51,a52,・・・a5jに基づいて報酬分配率を決定し、その報酬分配率に従って報酬r5を宣伝担当のパーソナルＡＩ８６に分配する。 The promotion element agent 82 calculates the performance p5 by the personal AI group 86 in charge of promotion based on the act a5 and the state S5, and transmits the performance p5 to the controlling agent 79. The general agent 79 determines the reward r5 based on the performance p5, and transmits the reward r5 to the promotion element agent 82. The advertising element agent 82 determines the reward distribution rate based on each act a51, a52, ... a5j of the personal AI group 86 in charge of advertising, and distributes the reward r5 to the personal AI 86 in charge of advertising according to the reward distribution rate. ..

販売担当のパーソナルＡＩ群８７が内部打合せにおいて提案等の行為ａ91,a92,・・・a9mを行い、最終的にまとまった行為a9を店舗及び消費者のデジタルツイン群９１に対して実行する。その行為a9に対する店舗及び消費者のデジタルツイン群９１の状態Ｓ９が販売エレメントエージェント８３及び販売担当のパーソナルＡＩ群８７に入力される。この状態Ｓ９は、例えば、店舗での総販売金額等である。販売エレメントエージェント８３には、上記行為a5に対する資材提供業者のデジタルツイン８８の状態Ｓ５も入力される。なお、販売担当のパーソナルＡＩ群８７の各行為a91,a92,・・・a9mは販売エレメントエージェント８３にも入力され、まとまった行為a9は、統括エージェント７９及び販売エレメントエージェント８３にも入力される。 The personal AI group 87 in charge of sales performs actions such as proposals a91, a92, ... a9m at an internal meeting, and finally executes a set action a9 for the store and the consumer's digital twin group 91. The state S9 of the store and consumer digital twin group 91 with respect to the act a9 is input to the sales element agent 83 and the personal AI group 87 in charge of sales. This state S9 is, for example, the total sales amount at the store. The state S5 of the digital twin 88 of the material provider for the above-mentioned act a5 is also input to the sales element agent 83. The actions a91, a92, ... a9m of the personal AI group 87 in charge of sales are also input to the sales element agent 83, and the collective actions a9 are also input to the general agent 79 and the sales element agent 83.

販売エレメントエージェント８３は、行為a9及び状態Ｓ５，Ｓ９に基づいて、販売担当のパーソナルＡＩ群８７によるパフォーマンスp9を算出し、そのパフォーマンスp9を統括エージェント７９に送信する。統括エージェント７９は、そのパフォーマンスp9に基づいて報酬r9を決定し、その報酬r9を販売エレメントエージェント８３へ送信する。販売エレメントエージェント８３は、販売担当のパーソナルＡＩ群８７の各行為ａ91,a92,・・・a9mに基づいて報酬分配率を決定し、その報酬分配率に従って報酬r9を販売担当のパーソナルＡＩ８７に分配する。 The sales element agent 83 calculates the performance p9 by the personal AI group 87 in charge of sales based on the act a9 and the states S5 and S9, and transmits the performance p9 to the controlling agent 79. The general agent 79 determines the reward r9 based on the performance p9, and transmits the reward r9 to the sales element agent 83. The sales element agent 83 determines the reward distribution rate based on each action a91, a92, ... a9m of the personal AI group 87 in charge of sales, and distributes the reward r9 to the personal AI 87 in charge of sales according to the reward distribution rate. ..

上記各パフォーマンスp1〜p9と各報酬分配率との算出方法を、図４７（Ａ）（Ｂ）（Ｃ）、図４８（Ａ）に基づいて説明する。資材調達エレメントエージェント８０は、パフォーマンスp1と分配率との算出アルゴリズムを知識として記憶している。このパフォーマンスp1と分配率との算出アルゴリズムを図４７（Ａ）に基づいて説明する。資材調達エレメントエージェント８０は、今回（Ｓ６８９による前回のＹＥＳ時点から今回のＹＥＳ時点までの間）の資材購入金額uと現在の在庫数ｚとを状態Ｓ１とし、パフォーマンスp1＝{２（平均購入金額／u）＋（ｚ／平均在庫数）}／３の計算式でパフォーマンスp1を算出する。平均購入金額とは、シミュレーション強化学習開始時点から現在までの資材の平均購入金額である。平均在庫数とは、シミュレーション強化学習開始時点から現在までの資材の平均購在庫数である。この計算式の結果、今回の資材購入金額uが安くなればパフォーマンスp1が大きくなり、現在の在庫数ｚが多くなればパフォーマンスp1が小さくなる。 The calculation method of each of the performances p1 to p9 and each reward distribution rate will be described with reference to FIGS. 47 (A) (B) (C) and 48 (A). The material procurement element agent 80 stores the calculation algorithm of the performance p1 and the distribution rate as knowledge. The calculation algorithm of the performance p1 and the distribution rate will be described with reference to FIG. 47 (A). The material procurement element agent 80 sets the material purchase amount u and the current inventory quantity z of this time (between the previous YES time point and the present YES time point by S689) as the state S1, and the performance p1 = {2 (average purchase amount). Calculate performance p1 using the formula / u) + (z / average stock quantity)} / 3. The average purchase price is the average purchase price of materials from the start of simulation reinforcement learning to the present. The average inventory quantity is the average inventory quantity of materials from the start of simulation reinforcement learning to the present. As a result of this calculation formula, the performance p1 increases as the material purchase price u this time decreases, and the performance p1 decreases as the current inventory quantity z increases.

また、報酬分配率は、ｐ1≧１の場合、まとまった行為a1への賛成度合いに比例して算出し、逆に、ｐ1＜１の場合は、まとまった行為a1への賛成度合いに反比例して算出する。ここに、「賛成度合い」の一乗に比例または反比例するものに限らず、「賛成度合い」のｎ乗に比例または反比例するもの等も含まれ、最適な比例関数または反比例関数を資材調達エレメントエージェント８０が強化学習（機械学習）を行うことにより求める。また、「賛成度合い」は、まとまった行為a1自体を提案したパーソナルＡＩが１番高い賛成度合いとなり、パーソナルＡＩの各行為a11,a12,・・・a1nに基づいて資材調達エレメントエージェント８０が各パーソナルＡＩの賛成度合いを判断する（算出する）。 In addition, when p1 ≧ 1, the reward distribution rate is calculated in proportion to the degree of approval for the collective act a1, and conversely, when p1 <1, it is inversely proportional to the degree of approval for the collective act a1. calculate. This includes not only those proportional to or inversely proportional to the first power of the "degree of approval" but also those proportional or inversely proportional to the nth power of the "degree of approval", and the optimum proportional function or inverse proportional function is used in the material procurement element agent 80. Is obtained by performing reinforcement learning (machine learning). In addition, as for the "degree of approval", the personal AI that proposed the collective action a1 itself has the highest degree of approval, and the material procurement element agent 80 is each personal based on each action a11, a12, ... a1n of the personal AI. Judge (calculate) the degree of approval of AI.

組立エレメントエージェント８１が知識として記憶しているパフォーマンスp2と分配率との算出アルゴリズムを、図４７（Ｂ）に基づいて説明する。組立エレメントエージェント８１は、組立設備の今回（Ｓ７１０による前回のＹＥＳ時点から今回のＹＥＳ時点までの間）の消費電力ｅと今回の組立作業員総動労時間ｔとを状態Ｓ２とし、パフォーマンスｐ2＝{（平均消費電力／ｅ）＋（平均総労働時間／ｔ）}／２の計算式でパフォーマンスp2を算出する。今回の組立作業員総動労時間ｔとは、今回の組立設備デジタルツイン群８９に従事した組立担当のパーソナルＡＩ群８５の総労働時間である。平均消費電力とは、シミュレーション強化学習開始時点から現在までの組立設備の消費電力の平均である。平均総労働時間とは、シミュレーション強化学習開始時点から現在までの組立作業員総動労時間の平均である。この計算式の結果、組立設備の今回の消費電力ｅが少なくなればパフォーマンスp2が大きくなり、今回の組立作業員総動労時間ｔが多くなればパフォーマンスp2が小さくなる。 The calculation algorithm of the performance p2 and the distribution rate stored as knowledge by the assembly element agent 81 will be described with reference to FIG. 47 (B). The assembly element agent 81 sets the power consumption e of the assembly facility this time (between the previous YES time point and the YES time point of this time by S710) and the total working time t of the current assembly worker as the state S2, and the performance p2 = { Calculate performance p2 using the formula (average power consumption / e) + (average total working hours / t)} / 2. The total working hours t of the assembly workers this time is the total working hours of the personal AI group 85 in charge of assembly engaged in the assembly equipment digital twin group 89 this time. The average power consumption is the average power consumption of the assembly equipment from the start of simulation reinforcement learning to the present. The average total working hours is the average total working hours of assembly workers from the start of simulation reinforcement learning to the present. As a result of this calculation formula, the performance p2 increases as the current power consumption e of the assembly equipment decreases, and the performance p2 decreases as the total labor time t of the assembly workers increases.

また、報酬分配率は、ｐ２≧１の場合、まとまった行為a2への賛成度合いに比例して算出し、逆に、ｐ２＜１の場合は、まとまった行為a2への賛成度合いに反比例して算出する。ここに、「賛成度合い」の一乗に比例または反比例するものに限らず、「賛成度合い」のｎ乗に比例または反比例するもの等も含まれ、最適な比例関数または反比例関数を組立エレメントエージェント８１が強化学習（機械学習）を行うことにより求める。また、「賛成度合い」は、まとまった行為a2自体を提案したパーソナルＡＩが１番高い賛成度合いとなり、パーソナルＡＩの各行為a21,a22,・・・a2iに基づいて組立エレメントエージェント８１が各パーソナルＡＩの賛成度合いを判断する（算出する）。 In addition, when p2 ≧ 1, the reward distribution rate is calculated in proportion to the degree of approval for the collective act a2, and conversely, when p2 <1, it is inversely proportional to the degree of approval for the collective act a2. calculate. This includes not only those proportional to or inversely proportional to the first power of the "degree of approval" but also those proportional or inversely proportional to the nth power of the "degree of approval", and the assembly element agent 81 assembles the optimum proportional function or inverse proportional function. Obtained by performing reinforcement learning (machine learning). As for the "degree of approval", the personal AI that proposed the collective action a2 itself has the highest degree of approval, and the assembly element agent 81 has each personal AI based on each action a21, a22, ... a2i of the personal AI. Judge (calculate) the degree of approval of.

宣伝エレメントエージェント８２が知識として記憶しているパフォーマンスp5と分配率との算出アルゴリズムを、図４７（Ｃ）に基づいて説明する。宣伝エレメントエージェント８２は、レコメンドした消費者パーソナルＡＩの今回（Ｓ７２８による前回のＹＥＳ時点から今回のＹＥＳ時点までの間）の総購入金額ｋを状態Ｓ５とし、パフォーマンスｐ5＝ｋ／レコメンドした消費者パーソナルＡＩの平均総購入金額Ｋの計算式でパフォーマンスp5を算出する。平均総購入金額Ｋとは、シミュレーション強化学習開始時点から現在までのレコメンドした消費者パーソナルＡＩ群９０の総購入金額の平均である。この計算式の結果、レコメンドした消費者パーソナルＡＩの今回の総購入金額ｋ今が高くなればパフォーマンスp5が大きくなる。 The calculation algorithm of the performance p5 and the distribution rate stored as knowledge by the promotion element agent 82 will be described with reference to FIG. 47 (C). The advertising element agent 82 sets the total purchase amount k of the recommended consumer personal AI this time (between the previous YES time and the YES time of this time by S728) as the state S5, and the performance p5 = k / recommended consumer personal. Calculate performance p5 using the formula for the average total purchase price K of AI. The average total purchase amount K is the average of the total purchase amount of the recommended consumer personal AI group 90 from the start of simulation reinforcement learning to the present. As a result of this formula, the performance p5 will increase if the total purchase amount k of the recommended consumer personal AI this time is higher.

また、報酬分配率は、ｐ５≧１の場合、まとまった行為a5への賛成度合いに比例して算出し、逆に、ｐ５＜１の場合は、まとまった行為a5への賛成度合いに反比例して算出する。ここに、「賛成度合い」の一乗に比例または反比例するものに限らず、「賛成度合い」のｎ乗に比例または反比例するもの等も含まれ、最適な比例関数または反比例関数を宣伝エレメントエージェント８２が強化学習（機械学習）を行うことにより求める。また、「賛成度合い」は、まとまった行為a5自体を提案したパーソナルＡＩが１番高い賛成度合いとなり、パーソナルＡＩの各行為a51,a52,・・・a5jに基づいて宣伝エレメントエージェント８２が各パーソナルＡＩの賛成度合いを判断する（算出する）。 In addition, when p5 ≧ 1, the reward distribution rate is calculated in proportion to the degree of approval for the collective act a5, and conversely, when p5 <1, it is inversely proportional to the degree of approval for the collective act a5. calculate. This includes not only those proportional to or inversely proportional to the first power of the "degree of approval" but also those proportional or inversely proportional to the nth power of the "degree of approval", and the advertising element agent 82 advertises the optimum proportional function or inverse proportional function. Obtained by performing reinforcement learning (machine learning). As for the "degree of approval", the personal AI that proposed the collective act a5 itself has the highest degree of approval, and the advertising element agent 82 has each personal AI based on each act a51, a52, ... a5j of the personal AI. Judge (calculate) the degree of approval of.

販売エレメントエージェント８３が知識として記憶しているパフォーマンスp5と分配率との算出アルゴリズムを、図４８（Ａ）に基づいて説明する。販売エレメントエージェント８３は、店舗での今回（Ｓ７４９による前回のＹＥＳ時点から今回のＹＥＳ時点までの間）の総販売金額hと平均総販売金額Ｈを状態Ｓ９とし、この状態Ｓ９と上記状態Ｓ５とにより、パフォーマンスｐ9＝（h−ｋ）／（Ｈ−Ｋ）の計算式でパフォーマンスp9を算出する。平均総販売金額Ｈとは、シミュレーション強化学習開始時点から現在までの店舗での総販売金額の平均である。また、ｋは、レコメンドした消費者パーソナルＡＩの今回の総購入金額であり、Ｋは、シミュレーション強化学習開始時点から現在までのレコメンドした消費者パーソナルＡＩ群９０の総購入金額の平均である（図４７（Ｃ）及びその説明参照）。この計算式の結果、店舗での今回の総販売金額hからレコメンドした消費者パーソナルＡＩの今回の総購入金額ｋを減算した値が大きくなればパフォーマンスp9が大きくなる。レコメンドした消費者パーソナルＡＩの今回の総購入金額ｋは、宣伝担当のパーソナルＡＩ群８６の手柄であり、販売担当のパーソナルＡＩ群８７のみの手柄は、店舗での今回の総販売金額hからレコメンドした消費者パーソナルＡＩの今回の総購入金額ｋを減算した値となるためである。 The calculation algorithm of the performance p5 and the distribution rate stored as knowledge by the sales element agent 83 will be described with reference to FIG. 48 (A). The sales element agent 83 sets the total sales amount h and the average total sales amount H of this time (between the previous YES point and the YES point of this time by S749) at the store as the state S9, and sets the state S9 and the above state S5. Therefore, the performance p9 is calculated by the formula of performance p9 = (h−k) / (HK). The average total sales amount H is the average of the total sales amount at the store from the start of the simulation reinforcement learning to the present. Further, k is the total purchase amount of the recommended consumer personal AI this time, and K is the average of the total purchase amount of the recommended consumer personal AI group 90 from the start of simulation reinforcement learning to the present (Fig.). See 47 (C) and its description). As a result of this calculation formula, the performance p9 becomes larger as the value obtained by subtracting the current total purchase amount k of the recommended consumer personal AI from the current total sales amount h at the store becomes larger. The total purchase amount k of the recommended consumer personal AI this time is the credit of the personal AI group 86 in charge of advertising, and the credit of only the personal AI group 87 in charge of sales is recommended from the total sales amount h of this time at the store. This is because the value is obtained by subtracting the total purchase amount k of the consumer personal AI this time.

また、報酬分配率は、ｐ９≧１の場合、まとまった行為a9への賛成度合いに比例して算出し、逆に、ｐ９＜１の場合は、まとまった行為a9への賛成度合いに反比例して算出する。ここに、「賛成度合い」の一乗に比例または反比例するものに限らず、「賛成度合い」のｎ乗に比例または反比例するもの等も含まれ、最適な比例関数または反比例関数を宣伝エレメントエージェント８３が強化学習（機械学習）を行うことにより求める。また、「賛成度合い」は、まとまった行為a9自体を提案したパーソナルＡＩが１番高い賛成度合いとなり、パーソナルＡＩの各行為a91,a92,・・・a9mに基づいて販売エレメントエージェント８３が各パーソナルＡＩの賛成度合いを判断する（算出する）。 In addition, when p9 ≧ 1, the reward distribution rate is calculated in proportion to the degree of approval for the collective act a9, and conversely, when p9 <1, it is inversely proportional to the degree of approval for the collective act a9. calculate. This includes not only those proportional to or inversely proportional to the first power of the "degree of approval" but also those proportional or inversely proportional to the nth power of the "degree of approval", and the advertising element agent 83 advertises the optimum proportional function or inverse proportional function. Obtained by performing reinforcement learning (machine learning). As for the "degree of approval", the personal AI that proposed the collective action a9 itself has the highest degree of approval, and the sales element agent 83 has each personal AI based on each action a91, a92, ... a9m of the personal AI. Judge (calculate) the degree of approval of.

次に、統括エージェント７９が知識として記憶している報酬テーブル９２を、図４８（Ｂ）に基づいて説明する。この報酬テーブル９２には、統括エージェント７９が各エレメントエージェント８０〜８３に分配する報酬の算出式が記憶されている。分配する報酬は、係数×（今期の利益）×（対象のエレメントエージェントから送られてきたパフォーマンス）÷（全エレメントエージェントから送られてきたパフォーマンスの合計）で算出される。ここに、「今期」とは、Ｓ６７５による前回のＹＥＳ時点から今回のＹＥＳ時点までの間のことである。 Next, the reward table 92 stored as knowledge by the controlling agent 79 will be described with reference to FIG. 48 (B). In the reward table 92, the calculation formula of the reward distributed by the controlling agent 79 to the element agents 80 to 83 is stored. The reward to be distributed is calculated by multiplying the coefficient x (profit for the current term) x (performance sent from the target element agent) / (total performance sent from all element agents). Here, the "current term" is the period from the previous YES time point to the current YES time point according to S675.

具体的には、資材調達エレメントエージェント８０に分配する報酬r1は、r1＝A1・Ｌｔ・p1／（p1+p2+p5+p9）。組立エレメントエージェント８１に分配する報酬r2は、r2＝A2・Ｌｔ・p2／（p1+p2+p5+p9）。宣伝エレメントエージェント８２に分配する報酬r5は、r5＝A5・Ｌｔ・p5／（p1+p2+p5+p9）。販売エレメントエージェント８３に分配する報酬r9は、r9＝A9・Ｌｔ・p9／（p1+p2+p5+p9）である。ここに、Ｌｔは今期の利益、A1,A2,A5,A9は、統括エージェント７９が決定した行為としての係数である。 Specifically, the reward r1 to be distributed to the material procurement element agent 80 is r1 = A1, Lt, p1 / (p1 + p2 + p5 + p9). The reward r2 distributed to the assembly element agent 81 is r2 = A2 ・ Lt ・ p2 / (p1 + p2 + p5 + p9). The reward r5 to be distributed to the advertising element agent 82 is r5 = A5 ・ Lt ・ p5 / (p1 + p2 + p5 + p9). The reward r9 distributed to the sales element agent 83 is r9 = A9 ・ Lt ・ p9 / (p1 + p2 + p5 + p9). Here, Lt is the profit for this term, and A1, A2, A5, A9 are the coefficients as the actions decided by the controlling agent 79.

次に、Ｓ６８０に示したシミュレーション強化学習処理の具体的内容を図４９に基づいて説明する。Ｓ６８７により統括エージェント強化学習処理を実行し、Ｓ６８８により資材調達エレメントエージェント強化学習処理を実行し、Ｓ６８９により組立エレメントエージェント強化学習処理を実行し、Ｓ６９０により宣伝エレメントエージェント強化学習処理を実行し、Ｓ６９１により販売エージェント強化学習処理を実行し、Ｓ６９２により資材調達担当パーソナルＡＩ強化学習処理を実行し、Ｓ６９３により組立担当パーソナルＡＩ強化学習処理を実行し、Ｓ６９４により宣伝担当パーソナルＡＩ強化学習処理を実行し、Ｓ６９５により販売担当パーソナルＡＩ強化学習処理を実行する。 Next, the specific contents of the simulation reinforcement learning process shown in S680 will be described with reference to FIG. 49. S687 executes the integrated agent reinforcement learning process, S688 executes the material procurement element agent reinforcement learning process, S689 executes the assembly element agent reinforcement learning process, S690 executes the advertising element agent reinforcement learning process, and S691 executes the advertising element agent reinforcement learning process. Sales agent reinforcement learning process is executed, material procurement person personal AI reinforcement learning process is executed by S692, assembly person personal AI reinforcement learning process is executed by S693, advertising person personal AI reinforcement learning process is executed by S694, and S695. Executes the personal AI reinforcement learning process in charge of sales.

Ｓ６８７に示した統括エージェント強化学習処理の詳細を図５０に基づいて説明する。統括エージェント７９は、Ｓ６９９において、各エレメントエージェント８０〜８３から送られてくる各パフォーマンスｐを受信したか否か判定する。受信していない場合には制御がＳ６７１に進むが、受信していると判定した場合はＳ６７０において、受信した各パフォーマンスｐを記憶する。 The details of the integrated agent reinforcement learning process shown in S687 will be described with reference to FIG. In S699, the general agent 79 determines whether or not each performance p sent from each element agent 80 to 83 has been received. If it is not received, the control proceeds to S671, but if it is determined that it is being received, each received performance p is stored in S670.

次に、Ｓ６７１において、各行為ａ1〜a9を受信したか否か判定し、受信していない場合は制御がＳ６７３に進む。受信していると判定した場合にはＳ６７２において、受信した各行為ａ1〜a9を記憶する。次に、Ｓ６７３において、店舗及び消費者のパーソナルＡＩ群９１から送られてくる状態Ｓ９の入力があったか否か判定し、ない場合には制御がＳ６７５に進む。入力があったと判定した場合にはＳ６７４において、売上＝ΣＳ９を算出する。 Next, in S671, it is determined whether or not each of the actions a1 to a9 has been received, and if not, the control proceeds to S673. If it is determined that the signal has been received, the received actions a1 to a9 are stored in S672. Next, in S673, it is determined whether or not there is an input of the state S9 sent from the personal AI group 91 of the store and the consumer, and if not, the control proceeds to S675. If it is determined that there is an input, sales = ΣS9 is calculated in S674.

次に、Ｓ６７５において、報酬算出時期になったか否か判定し、なっていない場合には制御がＳ６７７に進むが、なっていると判定した場合には、Ｓ６７６において、報酬テーブル９２を参照して報酬ｒ1,r2,r3,r5,r9を算出して該当するエレメントエージェント８０〜８３へ送信する。 Next, in S675, it is determined whether or not it is time to calculate the reward, and if it is not, the control proceeds to S677, but if it is determined, in S676, the reward table 92 is referred to. The rewards r1, r2, r3, r5, and r9 are calculated and transmitted to the corresponding element agents 80 to 83.

次に、Ｓ６７７において、強化学習（機械学習）の更新時期になったか否か判定し、なっていない場合にはリターンする。更新時期になったと判定した場合にはＳ６８７において、今期の利益Ｌt＝売上−経費を算出する。次に、Ｓ６７９において、各報酬ｒ1,ｒ2,ｒ5,ｒ9を算出して対応するエレメントエージェント８０〜８３に配布する。 Next, in S677, it is determined whether or not it is time to update the reinforcement learning (machine learning), and if not, a return is made. If it is determined that the renewal time has come, the profit Lt = sales-expense for the current period is calculated in S687. Next, in S679, each reward r1, r2, r5, r9 is calculated and distributed to the corresponding element agents 80 to 83.

次に、Ｓ６８０において、利益Ｌtから統括エージェント７９の報酬Ｒを算出する。この報酬Ｒは、利益Ｌtに比例する。次に、Ｓ６８１において、上記報酬Ｒに基づいて、ＴＤ学習により最適政策π^＊に従った行為（係数）A1,A2,A5,A9を求める。次にＳ６８２において、報酬テーブル９２のA1,A2,A5,A9を、Ｓ６８１で求めた行為（係数）A1,A2,A5,A9に更新する。この結果、統括エージェント７９は、利益Ｌtが最大となる行為（係数）A1,A2,A5,A9を学習することになる。Next, in S680, the reward R of the controlling agent 79 is calculated from the profit Lt. This reward R is proportional to the profit Lt. Next, in S681, based on the above reward R, the actions (coefficients) A1, A2, A5, A9 according ^{to the optimum policy π * are obtained by TD learning.} Next, in S682, A1, A2, A5, A9 of the reward table 92 are updated to the actions (coefficients) A1, A2, A5, A9 obtained in S681. As a result, the supervising agent 79 learns the actions (coefficients) A1, A2, A5, and A9 that maximize the profit Lt.

Ｓ６８８に示した資材調達エレメントエージェント強化学習処理の詳細を図５１に基づいて説明する。資材調達エレメントエージェント８０は、Ｓ６８４において、クローラによる情報収集処理を行う。クローラとは、ウェブ上の文書や画像などを周期的に取得し、自動的にデータベース化するプログラムである。「ボット（Bot）」、「スパイダー」、「ロボット」などとも呼ばれる。 The details of the material procurement element agent reinforcement learning process shown in S688 will be described with reference to FIG. 51. The material procurement element agent 80 performs information collection processing by the crawler in S684. A crawler is a program that periodically acquires documents and images on the Web and automatically creates a database. Also called "bot", "spider", "robot", etc.

このクローラによる情報収集処理の詳細を図５２（Ａ）に基づいて説明する。資材調達エレメントエージェント８０は、Ｓ７０２において、クローラがネット上を巡回して収集した情報を受信する。次に、Ｓ７０３において、その受信した情報を資材調達ＤＢ９３に格納する。 The details of the information collection process by the crawler will be described with reference to FIG. 52 (A). The material procurement element agent 80 receives the information collected by the crawler patrolling the net in S702. Next, in S703, the received information is stored in the material procurement DB 93.

その資材調達ＤＢ９３に格納された情報を図５２（Ｂ）に示す。資材調達ＤＢ９３には、図示するように、経済情報、社会情報、気象情報、在庫情報、市場情報、・・・仕入先情報等の、資材調達業務に必要となる各種情報が格納されている。 The information stored in the material procurement DB 93 is shown in FIG. 52 (B). As shown in the figure, the material procurement DB 93 stores various information necessary for material procurement operations, such as economic information, social information, weather information, inventory information, market information, and so on. ..

図５１に戻り、資材調達エレメントエージェント８０は、Ｓ６８５において、パーソナルＡＩ群８４からの行為a11,a12,・・・a1nを受信したか否か判定する。受信していない場合には制御がＳ６８７に進むが、受信したと判定した場合にはＳ６８６において、受信した各行為a11,a12,・・・a1nを記憶する。Ｓ６８７において、資材供給業者のデジタルツイン群８８からの状態Ｓ１を受信したか否か判定し、受信していない場合は制御がＳ６８９に進む。受信したと判定した場合はＳ６８８において、受信したＳ１を記憶する。 Returning to FIG. 51, the material procurement element agent 80 determines whether or not the actions a11, a12, ... a1n from the personal AI group 84 have been received in S685. If it is not received, the control proceeds to S687, but if it is determined that it has been received, the received actions a11, a12, ... a1n are stored in S686. In S687, it is determined whether or not the state S1 from the digital twin group 88 of the material supplier has been received, and if not, the control proceeds to S689. If it is determined that the signal has been received, the received S1 is stored in S688.

Ｓ６８９において、パフォーマンスｐ１の算出時期であるか否か判定し、算出時期でない場合には制御がＳ６９２に進む。算出時期と判定した場合には、Ｓ６９０において、パフォーマンスｐ1＝{２（平均購入金額／u）＋（ｚ／平均在庫数）}／３を算出する。そのパフォーマンスｐ1を統括エージェント７９へ送信する（Ｓ６９１）。 In S689, it is determined whether or not it is the calculation time of the performance p1, and if it is not the calculation time, the control proceeds to S692. If it is determined that it is the calculation time, the performance p1 = {2 (average purchase amount / u) + (z / average inventory quantity)} / 3 is calculated in S690. The performance p1 is transmitted to the controlling agent 79 (S691).

Ｓ６９２において、統括エージェント７９から送信された報酬ｒ１を受信したか否か判定し、受信していない場合にはリターンする。受信したと判定した場合には、図４７（Ａ）に示した報酬分配率のアルゴリズムに基づいて報酬分配率を算出する（Ｓ６９３）。Ｓ６９４において、報酬に各分配率を乗じて各報酬r11,r12…r1nを算出し、Ｓ６９５において、各報酬r11,r12…r1nを各資材調達担当パーソナルＡＩ群８４に付与する。Ｓ６９６において、受信した報酬r1にもとづいて、ＴＤ学習により最適政策π^＊に従った行為（比例関数または反比例関数）を求める。Ｓ６９７において、比例関数または反比例関数を、Ｓ６９６で求めたものに更新する。その結果、資材調達エレメントエージェント８０は、上記パフォーマンスｐ1を最大にする比例関数または反比例関数を学習することになる。In S692, it is determined whether or not the reward r1 transmitted from the control agent 79 has been received, and if not, the reward r1 is returned. If it is determined that the reward has been received, the reward distribution rate is calculated based on the reward distribution rate algorithm shown in FIG. 47 (A) (S693). In S694, each reward r11, r12 ... r1n is calculated by multiplying the reward by each distribution rate, and in S695, each reward r11, r12 ... r1n is given to each material procurement personal AI group 84. In S696, based on the received reward r1, the action (proportional function or inverse proportional function) according ^{to the optimum policy π * is obtained by TD learning.} In S697, the proportional function or the inverse proportional function is updated to the one obtained in S696. As a result, the material procurement element agent 80 learns the proportional function or the inverse proportional function that maximizes the performance p1.

次に、Ｓ６８９に示した組立エレメントエージェント強化学習処理の詳細を図５３に基づいて説明する。組立エレメントエージェント８１は、Ｓ７０６において、パーソナルＡＩ群８５からの行為a21,a22,・・・a2nを受信したか否か判定する。受信していない場合には制御がＳ７０８に進むが、受信したと判定した場合にはＳ７０７において、受信した各行為a21,a22,・・・a2nを記憶する。Ｓ７０８において、組立設備のデジタルツイン群８９からの状態Ｓ２を受信したか否か判定し、受信していない場合は制御がＳ７１０に進む。受信したと判定した場合はＳ７０９において、受信した状態Ｓ２を記憶する。 Next, the details of the assembly element agent reinforcement learning process shown in S689 will be described with reference to FIG. 53. The assembly element agent 81 determines in S706 whether or not the actions a21, a22, ... a2n from the personal AI group 85 have been received. If it is not received, the control proceeds to S708, but if it is determined that it has been received, each of the received actions a21, a22, ... a2n is stored in S707. In S708, it is determined whether or not the state S2 from the digital twin group 89 of the assembly equipment has been received, and if not, the control proceeds to S710. When it is determined that the signal has been received, the received state S2 is stored in S709.

Ｓ７１０において、パフォーマンスｐ２の算出時期であるか否か判定し、算出時期でない場合には制御がＳ７１３に進む。算出時期と判定した場合には、Ｓ７１１において、パフォーマンスｐ2＝{（平均消費電力／ｅ）＋（平均総労働時間／ｔ）}／２を算出する。そのパフォーマンスｐ２を統括エージェント７９へ送信する（Ｓ７１２）。 In S710, it is determined whether or not it is the calculation time of the performance p2, and if it is not the calculation time, the control proceeds to S713. If it is determined that it is the calculation time, the performance p2 = {(average power consumption / e) + (average total working hours / t)} / 2 is calculated in S711. The performance p2 is transmitted to the controlling agent 79 (S712).

Ｓ７１３において、統括エージェント７９から送られてくる報酬ｒ２を受信したか否か判定し、受信していない場合にはリターンする。受信したと判定した場合には、図４７（Ｂ）に示した報酬分配率のアルゴリズムに基づいて報酬分配率を算出する（Ｓ７１４）。Ｓ７１５において、報酬に各分配率を乗じて各報酬r11,r12…r1nを算出し、Ｓ６９５において、各報酬r21,r22…r2iを組立て担当パーソナルＡＩ群８６に付与する。Ｓ７１７において、受信した報酬r２にもとづいて、ＴＤ学習により最適政策π^＊に従った行為（比例関数または反比例関数）を求める。Ｓ７１８において、比例関数または反比例関数を、Ｓ７１７で求めたものに更新する。その結果、組立エレメントエージェント８１は、上記パフォーマンスｐ２を最大にする比例関数または反比例関数を学習することになる。In S713, it is determined whether or not the reward r2 sent from the controlling agent 79 has been received, and if not, the reward r2 is returned. If it is determined that the reward has been received, the reward distribution rate is calculated based on the reward distribution rate algorithm shown in FIG. 47 (B) (S714). In S715, each reward r11, r12 ... r1n is calculated by multiplying the reward by each distribution rate, and in S695, each reward r21, r22 ... r2i is assembled and given to the personal AI group 86 in charge. In S717, based on the received reward r2, the action (proportional function or inverse proportional function) according ^{to the optimum policy π * is obtained by TD learning.} In S718, the proportional function or the inverse proportional function is updated to the one obtained in S717. As a result, the assembly element agent 81 learns a proportional function or an inverse proportional function that maximizes the performance p2.

Ｓ６９０に示した宣伝エレメントエージェント強化学習処理の詳細を図５４に基づいて説明する。宣伝エレメントエージェント８３は、Ｓ７２３において、クローラによる情報収集処理を行う。この処理の詳細を図５５（Ａ）に基づいて説明する。宣伝エレメントエージェント８３は、Ｓ７４０において、クローラがネット上を巡回して収集した情報を受信し、Ｓ７４１において、受信した情報を宣伝ＤＢ９４に格納する。 The details of the promotion element agent reinforcement learning process shown in S690 will be described with reference to FIG. 54. The promotion element agent 83 performs information collection processing by the crawler in S723. The details of this process will be described with reference to FIG. 55 (A). The promotion element agent 83 receives the information collected by the crawler patrolling the net in S740, and stores the received information in the promotion DB 94 in S741.

その宣伝ＤＢ９４に格納された収集データを図５５（Ｂ）に示す。宣伝ＤＢ９４には、消費者である太郎、次郎、・・・花子等の各種行動データが格納されている。例えば太郎の場合、「一戸建てを注文した」という情報から家具を購入する可能性が高いと判断し、太郎に対し家具の宣伝を行う。次郎の場合、「夫婦じゃわん購入」という情報から、近々結婚するために新居で家具を購入する可能性が高いと判断し、次郎に対し家具の宣伝を行う。 The collected data stored in the promotion DB 94 is shown in FIG. 55 (B). Various behavior data of consumers such as Taro, Jiro, ... Hanako, etc. are stored in the promotion DB 94. For example, in the case of Taro, it is judged that there is a high possibility of purchasing furniture from the information that "I ordered a detached house", and the furniture is advertised to Taro. In the case of Jiro, it is judged that there is a high possibility that he will buy furniture in his new house to get married soon, based on the information that he will buy a couple, and he will advertise the furniture to Jiro.

図５４に戻り、宣伝エレメントエージェント８３は、Ｓ７４２において、各パーソナルＡＩ群８６からの行為を受信したか否か判定する。受信していない場合は制御がＳ７２６に進むが、受信していると判定した場合にはＳ７２５において、受信した各行為を記憶する。Ｓ７２６において、消費者のパーソナルＡＩ群９０から送られてくる状態Ｓ５を受信したか否か判定し、未だ受信していない場合には制御がＳ７２８に進む。受信したと判定した場合にはＳ７２７において、受信した状態Ｓ５を記憶する。 Returning to FIG. 54, the promotion element agent 83 determines whether or not the action from each personal AI group 86 has been received in S742. If it is not received, the control proceeds to S726, but if it is determined that it is being received, each act received is stored in S725. In S726, it is determined whether or not the state S5 sent from the consumer's personal AI group 90 has been received, and if not yet received, the control proceeds to S728. If it is determined that the signal has been received, the received state S5 is stored in S727.

Ｓ７２８において、パフォーマンスｐ5の算出時期であるか否か判定し、パフォーマンスｐ5の算出時期でない場合には制御がＳ７３１に進む。パフォーマンスｐ5の算出時期であると判定した場合にはＳ７２９において、パフォーマンスｐ5＝ｋ／レコメンドした消費者パーソナルＡＩの平均総購入金額Ｋを算出する。次にＳ７３０において、パフォーマンスＰ５を統括エージェント７９へ送信する。 In S728, it is determined whether or not it is the time to calculate the performance p5, and if it is not the time to calculate the performance p5, the control proceeds to S731. If it is determined that it is time to calculate the performance p5, the average total purchase amount K of the performance p5 = k / recommended consumer personal AI is calculated in S729. Next, in S730, the performance P5 is transmitted to the controlling agent 79.

Ｓ７３１において、統括エージェント７９から送られてくる報酬ｒ５を受信したか否か判定し、未だ受信していない場合はリターンする。受信したと判定した場合にはＳ７３２において、図４７（Ｃ）に示した報酬分配率のアルゴリズムに基づいて報酬分配率を算出する（Ｓ７３２）。Ｓ７３３において、報酬に各分配率を乗じて各報酬r51,r52…r5jを算出し、Ｓ７３４において、各報酬r51,r52…r5jを宣伝担当パーソナルＡＩ群８７に付与する。Ｓ７３５において、受信した報酬r５にもとづいて、ＴＤ学習により最適政策π^＊に従った行為（比例関数または反比例関数）を求める。Ｓ７３６において、比例関数または反比例関数を、Ｓ７３５で求めたものに更新する。その結果、宣伝エレメントエージェント８２は、上記パフォーマンスｐ５を最大にする比例関数または反比例関数を学習することになる。In S731, it is determined whether or not the reward r5 sent from the controlling agent 79 has been received, and if it has not been received yet, the reward r5 is returned. When it is determined that the reward has been received, the reward distribution rate is calculated in S732 based on the reward distribution rate algorithm shown in FIG. 47 (C) (S732). In S733, each reward r51, r52 ... r5j is calculated by multiplying the reward by each distribution rate, and in S734, each reward r51, r52 ... r5j is given to the personal AI group 87 in charge of promotion. In S735, based on the received reward r5, the action (proportional function or inverse proportional function) according ^{to the optimum policy π * is obtained by TD learning.} In S736, the proportional function or the inverse proportional function is updated to the one obtained in S735. As a result, the advertising element agent 82 learns a proportional function or an inverse proportional function that maximizes the performance p5.

次に、Ｓ６９１に示した販売エレメントエージェント強化学習処理の詳細を図５６にもとづいて説明する。販売エレメントエージェント８４は、Ｓ７４４において、クローラによる情報収集処理を行う。この処理の詳細を図５７（Ａ）に基づいて説明する。販売エレメントエージェント８４は、Ｓ７６０において、クローラがネット上を巡回して収集した情報を受信し、Ｓ７６１において、受信した情報を販売ＤＢ９５に格納する。さらに、Ｓ７６２において、店舗におけるＰＯＳデータを販売ＤＢ９５に格納する。 Next, the details of the sales element agent reinforcement learning process shown in S691 will be described with reference to FIG. 56. The sales element agent 84 performs information collection processing by the crawler in S744. The details of this process will be described with reference to FIG. 57 (A). The sales element agent 84 receives the information collected by the crawler patrolling the net in S760, and stores the received information in the sales DB 95 in S761. Further, in S762, the POS data in the store is stored in the sales DB 95.

その販売ＤＢ９５に格納された収集データを図５７（Ｂ）に示す。販売ＤＢ９５には、気象データやＰＯＳデータ等の各種データが格納されている。気象情報における「日付別」とは、曜日別を含む概念である。気象情報（日付別時間別天候気温データ）とＰＯＳデータ（日付別時間別販売商品データ）とに基づいて、例えば、曜日と時間と気象状況とを考慮した陳列商品の配置替え等を行うことができる。 The collected data stored in the sales DB 95 is shown in FIG. 57 (B). Various data such as weather data and POS data are stored in the sales DB 95. "By date" in meteorological information is a concept that includes each day of the week. Based on the weather information (hourly weather temperature data by date) and POS data (hourly sales product data by date), for example, it is possible to rearrange the displayed products in consideration of the day, time, and weather conditions. can.

図５６に戻り、販売エレメントエージェント８４は、Ｓ７４５において、各パーソナルＡＩ群８７からの行為を受信したか否か判定する。受信していない場合は制御がＳ７４７に進むが、受信していると判定した場合にはＳ７４６において、受信した各行為を記憶する。Ｓ７４７において、販売店及び消費者のパーソナルＡＩ群９１から送られてくる状態Ｓ９を受信したか否か判定し、未だ受信していない場合には制御がＳ７４９に進む。受信したと判定した場合にはＳ７４８において、受信した状態Ｓ９を記憶する。 Returning to FIG. 56, the sales element agent 84 determines in S745 whether or not the action from each personal AI group 87 has been received. If it is not received, the control proceeds to S747, but if it is determined that it is being received, each act received is stored in S746. In S747, it is determined whether or not the state S9 sent from the personal AI group 91 of the store and the consumer has been received, and if not yet received, the control proceeds to S747. If it is determined that the signal has been received, the received state S9 is stored in S748.

Ｓ７４９において、パフォーマンスｐ9の算出時期であるか否か判定し、パフォーマンスｐ9の算出時期でない場合には制御がＳ７５２に進む。パフォーマンスｐ9の算出時期であると判定した場合にはＳ７５０において、パフォーマンスｐ9＝（h−ｋ）／（Ｈ−Ｋ）を算出する。次にＳ７５１において、パフォーマンスp9を統括エージェント７９へ送信する。 In S749, it is determined whether or not it is the calculation time of the performance p9, and if it is not the calculation time of the performance p9, the control proceeds to S752. If it is determined that it is time to calculate the performance p9, the performance p9 = (h-k) / (HK) is calculated in S750. Next, in S751, the performance p9 is transmitted to the controlling agent 79.

Ｓ７５２において、統括エージェント７９から送られてくる報酬ｒ９を受信したか否か判定し、未だ受信していない場合はリターンする。受信したと判定した場合にはＳ７５３において、図４８（Ａ）に示した報酬分配率のアルゴリズムに基づいて報酬分配率を算出する（Ｓ７５３）。Ｓ７５４において、報酬に各分配率を乗じて各報酬r91,r92…r9mを算出し、Ｓ７５５において、各報酬r91,r92…r9mを販売担当パーソナルＡＩ群８８に付与する。Ｓ７５６において、受信した報酬r９にもとづいて、ＴＤ学習により最適政策π^＊に従った行為（比例関数または反比例関数）を求める。Ｓ７５７において、比例関数または反比例関数を、Ｓ７５６で求めたものに更新する。その結果、販売エレメントエージェント８３は、上記パフォーマンスｐ９を最大にする比例関数または反比例関数を学習することになる。In S752, it is determined whether or not the reward r9 sent from the controlling agent 79 has been received, and if it has not been received yet, the reward r9 is returned. If it is determined that the reward has been received, the reward distribution rate is calculated in S753 based on the reward distribution rate algorithm shown in FIG. 48 (A) (S753). In S754, each reward r91, r92 ... r9m is calculated by multiplying the reward by each distribution rate, and in S755, each reward r91, r92 ... r9m is given to the personal AI group 88 in charge of sales. In S756, based on the received reward r9, the action (proportional function or inverse proportional function) according ^{to the optimum policy π * is obtained by TD learning.} In S757, the proportional function or the inverse proportional function is updated to the one obtained in S756. As a result, the sales element agent 83 learns the proportional function or the inverse proportional function that maximizes the performance p9.

次に、Ｓ６９２に示した資材調達担当パーソナルＡＩ強化学習処理の詳細を図５８（Ａ）に基づいて説明する。資材調達担当パーソナルＡＩ群８４は、Ｓ７６５において、資材供給業者のデジタルツイン群８８と交渉するか否か判定し、交渉しない場合は制御がＳ７７０に進む。交渉すると判定した場合はＳ７６６において、資金調達ＤＢ９３の格納データを閲覧し、格納データを参考にして内部打合せを行いながら行為ａ1を決定し（Ｓ７６７）、資材供給業者のデジタルツイン群８８と交渉する（Ｓ７６８）。Ｓ７６９において、交渉が終了いたか否か判定し、未だ終了していない場合にはＳ７６６に戻り、Ｓ７６７→Ｓ７６８→Ｓ７６９→Ｓ７６６のループを巡回する。Ｓ７６９において交渉が終了したと判定した段階で制御がＳ７７０に進む。 Next, the details of the material procurement personal AI reinforcement learning process shown in S692 will be described with reference to FIG. 58 (A). In S765, the personal AI group 84 in charge of material procurement determines whether or not to negotiate with the digital twin group 88 of the material supplier, and if not, the control proceeds to S770. If it is determined to negotiate, in S766, the stored data of the fund procurement DB 93 is browsed, the action a1 is determined while conducting an internal meeting with reference to the stored data (S767), and negotiations are made with the digital twin group 88 of the material supplier. (S768). In S769, it is determined whether or not the negotiation has been completed, and if it has not been completed, the process returns to S766 and the loop of S767 → S768 → S769 → S766 is patrolled. When it is determined in S769 that the negotiation is completed, the control proceeds to S770.

Ｓ７７０において、資材調達エレメントエージェント８０から報酬r11,r12・・・r1nを受信したか否か判定し、受信していない場合にはリターンする。報酬を受信したと判定した場合にはＳ７７１において、受信した報酬に基づいて、ＴＤ学習により最適政策π^＊に従った行為（a11,a12・・・a1n）を求める。この行為a1iは、受け取った報酬r1iが満足できるものでなければ、他の会社ＤＡＯデジタルツイン（例えば、図４５の株式会社ＡＢＣのＤＡＯデジタルツイン５９）に移る（転職する）ものも含む。この強化学習の結果、資材調達担当パーソナルＡＩの各々は、前述のパフォーマンスｐ１を増加させる行為を学習することになる。In S770, it is determined whether or not the rewards r11, r12 ... r1n have been received from the material procurement element agent 80, and if not, the reward is returned. When it is determined that the reward has been received, in S771, the action (a11, a12 ... a1n) according ^{to the optimum policy π * is obtained by TD learning based on the received reward.} This act a1i also includes a move (change of job) to another company DAO Digital Twin (for example, DAO Digital Twin 59 of ABC Co., Ltd. in FIG. 45) if the received reward r1i is not satisfactory. As a result of this reinforcement learning, each of the personal AIs in charge of material procurement will learn the above-mentioned act of increasing the performance p1.

次に、Ｓ６９３に示した組立担当パーソナルＡＩ強化学習処理の詳細を図５８（Ｂ）に基づいて説明する。組立担当パーソナルＡＩ群８５は、Ｓ７７５において、内部打合せするか否か判定し、しない場合は制御がＳ７７９に進む。打合せすると判定した場合はＳ７７６において、各組立担当パーソナルＡＩが内部打合せを行いながら行為ａ2を決定する。次に、Ｓ７７７において、行為ａ2に従って、組立設備デジタルツイン群８９を試運転し、行為ａ2の妥当性を検証する。Ｓ７７８において、打合せが終了いたか否か判定し、未だ終了していない場合にはＳ７７６に戻り、Ｓ７７７→Ｓ７７８→Ｓ７７６のループを巡回する。Ｓ７７７の試運転の結果行為ａ2が妥当である場合にはＳ７７８により打合せ終了と判定され、制御がＳ７７９に進む。 Next, the details of the assembly-in-charge personal AI reinforcement learning process shown in S693 will be described with reference to FIG. 58 (B). In S775, the personal AI group 85 in charge of assembly determines whether or not to have an internal meeting, and if not, the control proceeds to S779. If it is determined that a meeting is to be held, in S776, each personal AI in charge of assembly decides the action a2 while conducting an internal meeting. Next, in S777, the assembly equipment digital twin group 89 is commissioned according to the act a2, and the validity of the act a2 is verified. In S778, it is determined whether or not the meeting has been completed, and if it has not been completed, the process returns to S777 and the loop of S777 → S778 → S776 is patrolled. If the action a2 is appropriate as a result of the trial run of S777, it is determined by S778 that the meeting is completed, and the control proceeds to S779.

Ｓ７７９において、組立エレメントエージェント８１から報酬r21,r22・・・r2nを受信したか否か判定し、受信していない場合にはリターンする。報酬を受信したと判定した場合にはＳ７８０において、受信した報酬に基づいて、ＴＤ学習により最適政策π^＊に従った行為（a21,a22・・・a2i）を求める。この行為a2iは、受け取った報酬r2iが満足できるものでなければ、他の会社ＤＡＯデジタルツイン（例えば、図４５の株式会社ＡＢＣのＤＡＯデジタルツイン５９）に移る（転職する）ものも含む。この強化学習の結果、組立担当パーソナルＡＩの各々は、前述のパフォーマンスｐ２を増加させる行為を学習することになる。In S779, it is determined whether or not the rewards r21, r22 ... r2n have been received from the assembly element agent 81, and if not, the reward r21, r22 ... r2n is returned. When it is determined that the reward has been received, in S780, the action (a21, a22 ... a2i) according ^{to the optimum policy π * is obtained by TD learning based on the received reward.} This act a2i also includes a move (change of job) to another company DAO Digital Twin (for example, DAO Digital Twin 59 of ABC Co., Ltd. in FIG. 45) if the received reward r2i is not satisfactory. As a result of this reinforcement learning, each of the personal AIs in charge of assembly learns the above-mentioned act of increasing the performance p2.

次に、Ｓ６９４に示した宣伝担当パーソナルＡＩ強化学習処理の詳細を図５９（Ａ）に基づいて説明する。宣伝担当パーソナルＡＩ群８６は、Ｓ７８４において、内部打合せするか否か判定し、しない場合は制御がＳ７８９に進む。打合せすると判定した場合はＳ７８５において、各宣伝担当パーソナルＡＩが内部打合せを行いながら行為ａ5を決定する。次に、Ｓ７８７において、消費者への行為ａ2を実行する。Ｓ７８８において、打合せが終了いたか否か判定し、未だ終了していない場合にはＳ７８５に戻り、Ｓ７８６→Ｓ７８７→Ｓ７８８のループを巡回する。Ｓ７８８により打合せ終了と判定された段階で制御がＳ７８９に進む。 Next, the details of the promotion personal AI reinforcement learning process shown in S694 will be described with reference to FIG. 59 (A). In S784, the personal AI group 86 in charge of promotion determines whether or not to have an internal meeting, and if not, the control proceeds to S789. If it is determined that a meeting is to be held, in S785, each personal AI in charge of advertising decides the act a5 while conducting an internal meeting. Next, in S787, the act a2 for the consumer is executed. In S788, it is determined whether or not the meeting has been completed, and if it has not been completed, the process returns to S785 and the loop of S786 → S787 → S788 is patrolled. Control proceeds to S789 when it is determined by S788 that the meeting is completed.

Ｓ７８９において、宣伝エレメントエージェント８２から報酬r51,r52・・・r5jを受信したか否か判定し、受信していない場合にはリターンする。報酬を受信したと判定した場合にはＳ７９０において、受信した報酬に基づいて、ＴＤ学習により最適政策π^＊に従った行為（a51,a52・・・a5j）を求める。この行為a5iは、受け取った報酬r5iが満足できるものでなければ、他の会社ＤＡＯデジタルツイン（例えば、図４５の株式会社ＡＢＣのＤＡＯデジタルツイン５９）に移る（転職する）ものも含む。この強化学習の結果、宣伝担当パーソナルＡＩの各々は、前述のパフォーマンスｐ５を増加させる行為を学習することになる。In S789, it is determined whether or not the rewards r51, r52 ... r5j have been received from the advertising element agent 82, and if not, the reward r51, r52 ... r5j is returned. When it is determined that the reward has been received, in S790, the action (a51, a52 ... a5j) according ^{to the optimum policy π * is obtained by TD learning based on the received reward.} This act a5i also includes a move (change of job) to another company DAO Digital Twin (for example, DAO Digital Twin 59 of ABC Co., Ltd. in FIG. 45) if the received reward r5i is not satisfactory. As a result of this reinforcement learning, each of the personal AIs in charge of advertising will learn the above-mentioned act of increasing the performance p5.

次に、Ｓ６９５に示した販売担当パーソナルＡＩ強化学習処理の詳細を図５９（Ｂ）に基づいて説明する。販売担当パーソナルＡＩ群８７は、Ｓ７９１において、内部打合せするか否か判定し、しない場合は制御がＳ７９５に進む。打合せすると判定した場合はＳ７９２において、各販売担当パーソナルＡＩが内部打合せを行いながら行為ａ9を決定する。次に、Ｓ７９３において打合せが終了いたか否か判定し、未だ終了していない場合にはＳ７９２に戻り、Ｓ７９２→Ｓ７９３→Ｓ７９２のループを巡回する。Ｓ７９３により打合せ終了と判定された段階で制御がＳ７９４に進む。Ｓ７９４において、上記打合せで決定された行為を消費者及び店舗に対し実行する。 Next, the details of the sales person in charge personal AI reinforcement learning process shown in S695 will be described with reference to FIG. 59 (B). In S791, the sales person in charge of personal AI group 87 determines whether or not to have an internal meeting, and if not, the control proceeds to S795. If it is determined that a meeting is to be made, in S792, each sales person in charge of personal AI decides the act a9 while conducting an internal meeting. Next, it is determined in S793 whether or not the meeting has been completed, and if it has not been completed, the process returns to S792 and the loop of S792 → S793 → S792 is patrolled. Control proceeds to S794 when it is determined by S793 that the meeting is completed. In S794, the act determined in the above meeting is executed for the consumer and the store.

次に、Ｓ７９５において、販売エレメントエージェント８３から報酬r91,r92・・・r9mを受信したか否か判定し、受信していない場合にはリターンする。報酬を受信したと判定した場合にはＳ７９６において、受信した報酬に基づいて、ＴＤ学習により最適政策π＊に従った行為（a91,a92・・・a9m）を求める。この行為a9iは、受け取った報酬r9iが満足できるものでなければ、他の会社ＤＡＯデジタルツイン（例えば、図４５の株式会社ＡＢＣのＤＡＯデジタルツイン５９）に移る（転職する）ものも含む。この強化学習の結果、販売担当パーソナルＡＩの各々は、前述のパフォーマンスｐ９を増加させる行為を学習することになる。 Next, in S795, it is determined whether or not the rewards r91, r92 ... r9m have been received from the sales element agent 83, and if not, the reward is returned. When it is determined that the reward has been received, in S796, the action (a91, a92 ... a9m) according to the optimum policy π * is obtained by TD learning based on the received reward. This act a9i also includes a move (change of job) to another company DAO Digital Twin (for example, DAO Digital Twin 59 of ABC Co., Ltd. in FIG. 45) if the received reward r9i is not satisfactory. As a result of this reinforcement learning, each of the sales person personal AIs learns the above-mentioned act of increasing the performance p9.

シミュレーション強化学習が終了したエレメント統合ＤＡＯは、リアルワールド４７において実際の組織として運用される。その段階では、図４６の「各パーソナルＡＩ群８４〜８７」をリアルワールドにおける実際の人間（ユーザ）が担当することとなる。その際に、シミュレーション強化学習済みの各パーソナルＡＩ群８４〜８７が実際の人間（ユーザ）の相談役となり、シミュレーション強化学習によって得た知識・経験・ノウハウを実際の人間（ユーザ）に提供することができる。 The element integrated DAO for which simulation reinforcement learning has been completed is operated as an actual organization in the real world 47. At that stage, an actual human (user) in the real world will be in charge of "each personal AI group 84 to 87" in FIG. 46. At that time, each personal AI group 84 to 87 that has undergone simulation reinforcement learning acts as a consultant for an actual human (user), and the knowledge, experience, and know-how gained through simulation reinforcement learning can be provided to the actual human (user). can.

以上説明したエレメント統合ＤＡＯの構築は、会社やＮＰＯや地方自治体等の組織全体を、機能別のエレメントＤＡＯの組み合わせで作成するものを示したが、組織全体ではなく組織の一部（例えば資材調達）のみをエレメントＤＡＯで構築するようにしてもよい。 The construction of the element integrated DAO described above shows that the entire organization such as a company, NPO, or local government is created by combining element DAOs by function, but it is not the entire organization but a part of the organization (for example, material procurement). ) May be constructed with the element DAO.

ユーザ端末１６等及び各種サーバで動作する前述したプログラムは、所定のウェブサイト等からダウンロードしてインストールしてもよいが、例えばＣＤ−ＲＯＭ９９等の記録媒体（非一時的（non-transitory）な記録媒体）に記録させて流通させ、そのＣＤ−ＲＯＭ９９等を購入した者がプログラムをユーザ端末１６及び各種サーバにインストールしてもよい（図６０参照）。
[変形例]The above-mentioned programs running on the user terminal 16 and the like and various servers may be downloaded and installed from a predetermined website or the like, but for example, a recording medium such as a CD-ROM 99 (non-transitory recording). The program may be installed on the user terminal 16 and various servers by a person who has purchased the CD-ROM99 or the like by recording it on a medium) and distributing it (see FIG. 60).
[Modification example]

（１）例えば、図２９に示したデジタルツインデータにおける太郎、次郎、桜、三郎等の氏名は、個人情報保護の観点から仮名（匿名）を用い、同一人物であるとの同定はできるが特定の個人を特定できないようにしてもよい。その場合に、ＡＩ識別番号またはブロックチェーンアドレスを仮名（匿名）として用いてもよい。同様に、株式会社ＡＢＣ等のデジタルツインも、会社名（組織名）に仮名（匿名）を用い、同一会社（同一組織）であるとの同定はできるが特定の会社（組織）を特定できないようにしてもよい。また、人間のデジタルツインは、１人の人間に対し複数のパーソナルＡＩによる複数のデジタルツインを用意してもよい。さらには、１人の人間における１つのデジタルツインを複数のパーソナルＡＩの集合（例えば、各種分野における専門的パーソナルＡＩの集合等）で構成してもよい。 (1) For example, the names of Taro, Jiro, Sakura, Saburo, etc. in the digital twin data shown in FIG. 29 are identified by using pseudonyms (anonymous) from the viewpoint of personal information protection, although they can be identified as the same person. It may not be possible to identify the individual. In that case, the AI identification number or blockchain address may be used as a pseudonym (anonymous). Similarly, digital twins such as ABC Co., Ltd. also use a pseudonym (anonymous) for the company name (organization name) so that they can be identified as the same company (same organization) but not a specific company (organization). It may be. Further, as the human digital twin, a plurality of digital twins by a plurality of personal AIs may be prepared for one person. Further, one digital twin in one person may be composed of a set of a plurality of personal AIs (for example, a set of specialized personal AIs in various fields).

（２）図３４〜図５９では、役務が複数種類存在するマルチ役務ＤＡＯを例にしてシミュレーションを行うことにより、ＤＡＯにおけるインセンティブ設計の最適解を導き出すシステムを説明したが、マルチ役務ＤＡＯに限らず、役務が１種類しか存在しないＤＡＯについて、シミュレーションによるインセンティブ設計の最適解を導き出すシステムであってもよい。 (2) In FIGS. 34 to 59, a system for deriving the optimum solution of incentive design in DAO by performing a simulation using a multi-service DAO having a plurality of types of services as an example has been described, but the system is not limited to the multi-service DAO. , The system may be a system for deriving the optimum solution of incentive design by simulation for DAO in which only one type of service exists.

（３）図３５では、ペルソナ毎に学習済みのペルソナエージェント群を生成しているが、ペルソナ毎に属するユーザ群の各のパーソナルＡＩを、ミラーワールド５１に登録されている既存のパーソナルＡＩ群の中から選んでペルソナエージェント群として用いてもよい。この場合は、ペルソナ毎に属するユーザ群に対し、パーソナルＡＩをシミュレーションに用いてもよいか否か問合わせ、用いてもよいとの承諾を得ておく必要がある。承諾を得たユーザ群の各パーソナルＡＩをコピーしてシミュレーションに用い、シミュレーション完了後の学習済みパーソナルＡＩ群を該当する各ユーザに送信する。それを受信した各ユーザは、学習済みパーソナルＡＩが有用（必要）であると判断した場合に、既存のパーソナルＡＩに対し学習済みパーソナルＡＩを上書き保存する。なお、既存のパーソナルＡＩと学習済みパーソナルＡＩとの両者を共に記憶しておき、必要に応じて使い分けるようにしてもよい。 (3) In FIG. 35, a trained persona agent group is generated for each persona, but each personal AI of the user group belonging to each persona is set to the existing personal AI group registered in the mirror world 51. You may choose from them and use it as a group of persona agents. In this case, it is necessary to inquire the user group belonging to each persona whether or not the personal AI may be used for the simulation, and obtain the consent that the personal AI may be used. Each personal AI of the user group with the consent is copied and used for the simulation, and the learned personal AI group after the simulation is completed is transmitted to each corresponding user. When each user who receives it determines that the learned personal AI is useful (necessary), it overwrites the learned personal AI with the existing personal AI and saves it. In addition, both the existing personal AI and the learned personal AI may be stored together and used properly as needed.

（４）マルチエージェント強化学習として、全体最適化を担う統括エージェント（マスターエージェント）が各エージェントに報酬の配分を行うと共に統括エージェント自身も強化学習を行って報酬配分の行為を最適なものに収束させるマスターエージェント方式を示した。しかし、マルチエージェント強化学習としては、これに限定されるものではなく、例えば、マルコフ決定過程の下で最適解に収束することができるD-learning、あるいは、Classifier Systemにおける強化学習アルゴリズムとしてのBucket BrigadeやProfit Sharingを用いてもよい。 (4) As multi-agent reinforcement learning, the general agent (master agent) responsible for overall optimization distributes rewards to each agent, and the general agent itself also performs reinforcement learning to converge the reward distribution action to the optimum one. The master agent method is shown. However, multi-agent reinforcement learning is not limited to this, for example, D-learning that can converge to the optimum solution under the Markov decision process, or Bucket Brigade as a reinforcement learning algorithm in the Classifier System. Or Profit Sharing may be used.

（５）デジタルツインを用いたシミュレーションとしては、人または人によって構成された組織（例えば株式会社やＮＰＯ等）のデジタルツインに限定されない。例えば、ＡＩ搭載の機械や電気製品等のオブジェクト（例えばＡＩ搭載掃除機）において、当該オブジェクトが動作する環境（例えば、自立して移動するＡＩ搭載掃除機が動作するユーザ宅の室内）のデジタルツインをサイバー空間内に生成し、その環境デジタルツイン内でオブジェクトに搭載されているＡＩを事前にシミュレーションして強化学習（機械学習）させ、そのカスタマイズ（パーソナライズ）された学習済みＡＩ搭載のオブジェクトを該当するユーザに提供するようにしてもよい。 (5) The simulation using the digital twin is not limited to the digital twin of a person or an organization composed of people (for example, a corporation, an NPO, etc.). For example, in an object such as an AI-equipped machine or an electric product (for example, an AI-equipped vacuum cleaner), a digital twin in an environment in which the object operates (for example, a user's house in which an AI-equipped vacuum cleaner that moves independently operates). Is generated in the cyber space, the AI mounted on the object is simulated in advance in the environment digital twin, reinforcement learning (machine learning) is performed, and the customized (personalized) trained AI-equipped object is applicable. It may be provided to the user who does.

記録された情報の真正の保証とその情報の削除権の保証とが二律背反するジレンマを極力解消することができるため、ブロックチェーン等の消去不可能性を有する情報記録方式に対し利用できる。 Since the dilemma in which the guarantee of the authenticity of the recorded information and the guarantee of the right to delete the information conflict with each other can be solved as much as possible, it can be used for an information recording method having an indelible property such as a blockchain.

１インターネット
２プライベートチェーン
３コンソーシアムチェーン
４パブリックチェーン
１２ＨＤＤ
１６ユーザ端末
１９ノード
３０鍵登録センタ
３２鍵ＤＢ
４６ミラーワールドサーバ
５１ミラーワールド
５２地球デジタルツイン
５３日本デジタルツイン
５４町デジタルツイン
５７太郎デジタルツイン
５８太郎一家デジタルツイン
５９株式会社ＡＢＣデジタルツイン
６１ＤＡＯエージェント
７２トークン
７８ＤＡＯデジタルツイン
７９統括エージェント。1 Internet 2 Private chain 3 Consortium chain 4 Public chain 12 HDD
16 User terminal 19 Node 30 Key registration center 32 Key DB
46 Mirror World Server 51 Mirror World 52 Earth Digital Twin 53 Japan Digital Twin 54 Town Digital Twin 57 Taro Digital Twin 58 Taro Family Digital Twin 59 ABC Digital Twin Co., Ltd. 61 DAO Agent 72 Token 78 DAO Digital Twin 79 General Agent.

Claims

An encryption means that performs encryption processing to encrypt the information to be recorded,
A recording means for recording information after the encryption process, and
Decoding means that performs decoding processing on the information recorded by the recording means using the first key and the second key to obtain plain text information.
The information recorded by the recording means is provided with an undecryptable means for making the information undecryptable, which cannot be decoded.
The decryption means includes a second key concealment holding means for concealing and holding the second key.
The decryption-disabled means is a processing system that puts the second key held by the second key concealment-holding means into a decryption-disabled state by updating it with another key.

The processing system according to claim 1, wherein the decryption means further includes a first key distribution means for distributing the first key to a person who wants to view information.

The processing system according to claim 1 or 2, further comprising a search means for retrieving information recorded by the recording means without making it plain text.

The information recorded by the recording means includes personal information and includes personal information.
The processing system according to any one of claims 1 to 3, wherein the undecryptable means puts the personal information of the personal information owner into the undecryptable state at the request of the personal information owner.

Steps to perform encryption processing to encrypt the information to be recorded,
A decryption step of decrypting the information recorded by the recording means for recording the information after the encryption process using the first key and the second key to obtain plain text information.
The step of making the information recorded by the recording means undecipherable, which cannot be decoded,
Let the computer run
The decryption step includes a step of keeping the second key secret.
The step of making the decryption impossible state is a program that makes the decryption state by updating the second key held by the holding step to another one.