WO2022193173A1 - Blockchain-based financial data information federated transfer learning system and method - Google Patents

Blockchain-based financial data information federated transfer learning system and method Download PDF

Info

Publication number
WO2022193173A1
WO2022193173A1 PCT/CN2021/081308 CN2021081308W WO2022193173A1 WO 2022193173 A1 WO2022193173 A1 WO 2022193173A1 CN 2021081308 W CN2021081308 W CN 2021081308W WO 2022193173 A1 WO2022193173 A1 WO 2022193173A1
Authority
WO
WIPO (PCT)
Prior art keywords
model training
task
data
smart contract
blockchain
Prior art date
Application number
PCT/CN2021/081308
Other languages
French (fr)
Chinese (zh)
Inventor
王化
赵建
Original Assignee
深圳技术大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳技术大学 filed Critical 深圳技术大学
Priority to PCT/CN2021/081308 priority Critical patent/WO2022193173A1/en
Publication of WO2022193173A1 publication Critical patent/WO2022193173A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning

Definitions

  • the invention relates to the technical field of blockchain, in particular to a blockchain-based financial data information federated transfer learning system and method.
  • Machine learning techniques have achieved significant success in many fields, but machine learning methods only work well under the assumption that the training and test data are in the same feature space or have the same distribution. When the distribution changes, most statistical models require rebuilding the model using newly collected training data. In many real-world applications, re-collecting the required training data and rebuilding the model is prohibitively expensive. Machine learning generates predictive models from data, so high-quality data is often needed to help regulate statistical models.
  • the present invention provides a blockchain-based financial data information federated transfer learning system and method, aiming to solve the problem that when traditional machine learning is applied to actual financial scenarios, there is less application scenario data and users Privacy and data security concerns.
  • the present invention adopts the following technical solutions:
  • a blockchain-based financial data information federated transfer learning system including:
  • a task publishing module used to publish the smart contract of the model training task on the blockchain, where the smart contract of the model training task includes the model training graph;
  • a task management module which is connected with the task release module, and is used for recording and managing the smart contracts of the model training tasks and generating an address list;
  • a running node which is connected to the task management module, is used to read the address list, download the model training graph from the smart contract of the model training task, and use local data to train the model training graph.
  • the gradient data is uploaded to the designated storage location of the smart contract of the model training task for data sharing.
  • a blockchain-based financial data information federated transfer learning method which is applied to a blockchain-based financial data information federated transfer learning system.
  • the system includes a task publishing module, a task management module, and an operating node. , the method includes:
  • the task publishing module publishes the smart contract of the model training task on the blockchain, and the smart contract of the model training task includes the model training graph;
  • the task management module records and manages the smart contract of the model training task and generates an address list
  • the operating node reads the address list, downloads the model training graph from the smart contract of the model training task, uses local data to train the model training graph, and uploads the trained gradient data to the model training task's Designated storage location for smart contracts for data sharing.
  • the task management module of the present invention is respectively connected with the task release module and the running node, and the smart contract of the model training task is released on the blockchain through the task release module.
  • the task management module records and manages the released smart contract and generates its address list, and runs the
  • the node reads the address list, downloads the model training graph from the corresponding smart contract, uses local data to train its own model, and uploads the trained gradient data to the designated storage location of the smart contract for data sharing, avoiding the risk of uploading original data.
  • sharing gradient data can solve the problem of less data in application scenarios and improve training accuracy.
  • FIG. 1 is a block connection diagram of a blockchain-based financial data information federated transfer learning system provided by an embodiment of the present invention
  • FIG. 2 is a flowchart of a blockchain-based method for federated transfer learning of financial data information provided by an embodiment of the present invention.
  • FIG. 1 is a module connection diagram of a blockchain-based financial data information federated transfer learning system provided by an embodiment of the present invention.
  • the present invention provides a blockchain-based financial data information federated transfer learning system, including:
  • the task publishing module 10 is used to publish the smart contract of the model training task on the blockchain, and the smart contract of the model training task includes the model training graph;
  • the blockchain is a consortium chain, which is between the private chain and the public chain. It is a blockchain that requires registration and permission, and is limited to members of the consortium who have permission to participate in the reading and writing of the ledger.
  • the roles and functional division of nodes need to be preset, and the consensus, operation and maintenance and access in the network are controlled by the preset nodes.
  • the alliance chain is suitable for cross-institutional transactions, settlement, collaborative office and certificate deposit. After the transaction is reached, each participant on the blockchain first verifies the transaction. Once all participants reach a consensus, the transaction information is stamped with a timestamp indicating the order in which the transaction occurred. The timestamp function guarantees the traceability of transactions.
  • the application of blockchain technology solves the pain point of high credit risk in traditional transactions and improves the security of transactions.
  • each participant in the blockchain has a complete set of books, which has unique advantages in reconciliation, which reduces the It reduces the cost of reconciliation and improves the efficiency of settlement.
  • the product has the characteristics of decentralization, de-trust, time stamp, etc., as the underlying technology of the platform architecture, making all transaction information open, transparent and tamper-proof, greatly reducing the occurrence of operational risks and credit risks, enabling transactions safer.
  • the blockchain includes a data layer, a network layer, a consensus layer, an incentive layer, a contract layer and an application layer.
  • the task management module 20 which is connected with the task release module, is used for recording and managing the smart contracts of the model training tasks and generating an address list.
  • the task management module 20 is a factory-mode smart contract, which is encapsulated in the contract layer of the blockchain.
  • Running node 30 which is connected to the task management module 20, is used to read the address list, download the model training graph from the smart contract of the model training task, and use local data to train the model training graph, The trained gradient data is uploaded to the designated storage location of the smart contract of the model training task for data sharing.
  • the task management module 20 of the present invention is respectively connected with the task release module 10 and the operation node 30, and the smart contract of the model training task is released on the blockchain through the task release module 10.
  • the task management module 20 records and manages the released smart contract and generates For its address list, the operating node 30 reads the address list, downloads the model training graph from the corresponding smart contract, uses local data to train its own model, and uploads the trained gradient data to the designated storage location of the smart contract for data sharing, The privacy problem caused by uploading the original data is avoided, and the sharing of gradient data can solve the problem of less data in application scenarios and improve the training accuracy.
  • the blockchain-based financial data information federated transfer learning system further includes a file storage module for storing data files, which is connected to the task management module 20, and the hash values and acquisition paths of the data files are stored in the data file. in the smart contract that describes the model training task.
  • the task management module 20 has an API interface, so that the running node 30 can read the address list.
  • the smart contract of the model training task further includes a training data set storage path, a test data set storage path and accuracy requirements.
  • FIG. 2 is a flowchart of a method for federated transfer learning of financial data information based on blockchain provided by an embodiment of the present invention.
  • the present invention also provides a blockchain-based financial data information federated transfer learning method, which is applied to a blockchain-based financial data information federated transfer learning system.
  • the system includes a task issuing module 10, a task management Module 20 and operating node 30, the method includes:
  • Step S101 the task publishing module 10 publishes the smart contract of the model training task on the blockchain, and the smart contract of the model training task includes the model training graph;
  • Step S102 the task management module 20 records and manages the smart contract of the model training task and generates an address list
  • Step S103 the operating node 30 reads the address list, downloads the model training graph from the smart contract of the model training task, uses local data to train the model training graph, and uploads the trained gradient data to the model training graph. Designated storage location for smart contracts for model training tasks for data sharing.
  • the running node 30 includes a first running node and a second running node, then the model training graph is trained by using local data, and the trained gradient data is uploaded to
  • the designated storage location of the smart contract of the model training task includes:
  • the first running node and the second running node respectively initialize their respective models
  • the second running node calculates part of the estimated value and part of the loss based on its own characteristics, and encrypts it and sends it to the first running node;
  • the first running node calculates part of the estimated value based on its own features, and combines the estimated value of the second running node to calculate the final loss function and gradient data, and then sends the gradient data and loss function required by the second running node back to the first running node.
  • the first running node and the second running node After completing the gradient data calculation, the first running node and the second running node encrypt and mask the gradient data respectively, and send them to the off-chain storage server;
  • Off-chain storage servers decrypt and aggregate gradient data
  • the first running node and the second running node download the aggregated gradient data from the off-chain storage server, remove their own masks and update their own models;
  • the operating nodes 30 include multiple, and the model training graph is trained by using local data, and the trained gradient data is uploaded to the smart contract of the model training task.
  • Designated storage locations include:
  • the off-chain storage server initializes and encrypts the item profile
  • the off-chain storage server sends the encrypted item profile to each running node
  • Each running node decrypts the item profile and calculates the loss based on the local data, updates its own user profile, and encrypts and transmits the gradient data of the item profile to the off-chain storage server;
  • Each running node uses a matrix to decompose its user profile and item profile and recommend it.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a blockchain-based financial data information federated transfer learning system and method. The system comprises: a task publishing module, which is used to publish on a blockchain a smart contract of a model training task, wherein the smart contract of the model training task comprises a model training graph; a task management module, which is connected to the task publishing module and used to record and manage the smart contract of the model training task and generate an address list; and a running node, which is connected to the task management module and used to read the address list, download the model training graph from the smart contract of the model training task, train the model training graph by using local data, and upload gradient data of training to a specified storage location of the smart contract of the model training task, so as to perform data sharing. According to the present invention, data privacy leakage of financial data information during machine learning is avoided, and training accuracy is improved.

Description

一种基于区块链的金融数据信息联邦迁移学习系统及方法A blockchain-based financial data information federated transfer learning system and method 技术领域technical field
本发明涉及区块链技术领域,尤其是指一种基于区块链的金融数据信息联邦迁移学习系统及方法。The invention relates to the technical field of blockchain, in particular to a blockchain-based financial data information federated transfer learning system and method.
背景技术Background technique
机器学习技术在许多领域取得了重大成功,但是机器学习方法只有在训练数据和测试数据在相同的特征空间中或具有相同分布的假设下才能很好地发挥作用。当分布发生变化时,大多数统计模型需要使用新收集的训练数据重建模型。在许多实际应用中,重新收集所需的训练数据并重建模型的代价是非常昂贵的。机器学习根据数据生成预测模型,因此往往需要高质量的数据来帮助调控统计模型。Machine learning techniques have achieved significant success in many fields, but machine learning methods only work well under the assumption that the training and test data are in the same feature space or have the same distribution. When the distribution changes, most statistical models require rebuilding the model using newly collected training data. In many real-world applications, re-collecting the required training data and rebuilding the model is prohibitively expensive. Machine learning generates predictive models from data, so high-quality data is often needed to help regulate statistical models.
目前金融机构的数据来源纷杂,数据格式不标准,数据更新周期不稳定,因此将机器学习应用于实际金融场景存在着数据孤岛导致的应用场景数据较少,以及在实际应用中常常需要利用多个数据集导致的用户隐私和数据安全问题。At present, financial institutions have various data sources, non-standard data formats, and unstable data update cycles. Therefore, when applying machine learning to actual financial scenarios, there are data islands resulting in less data in application scenarios, and in practical applications, it is often necessary to use multiple User privacy and data security issues caused by datasets.
技术问题technical problem
针对现有技术的不足,本发明提供了一种基于区块链的金融数据信息联邦迁移学习系统及方法,旨在解决将传统机器学习应用于实际金融场景时存在的应用场景数据较少以及用户隐私和数据安全问题。Aiming at the deficiencies of the prior art, the present invention provides a blockchain-based financial data information federated transfer learning system and method, aiming to solve the problem that when traditional machine learning is applied to actual financial scenarios, there is less application scenario data and users Privacy and data security concerns.
技术解决方案technical solutions
为了解决上述技术问题,本发明采用了如下技术方案:In order to solve the above-mentioned technical problems, the present invention adopts the following technical solutions:
第一方面,提供了一种基于区块链的金融数据信息联邦迁移学习系统,包括:In the first aspect, a blockchain-based financial data information federated transfer learning system is provided, including:
任务发布模块,用于在区块链上发布模型训练任务的智能合约,所述模型训练任务的智能合约包括模型训练图;A task publishing module, used to publish the smart contract of the model training task on the blockchain, where the smart contract of the model training task includes the model training graph;
任务管理模块,其与任务发布模块连接,用于记录和管理所述模型训练任务的智能合约并生成地址列表;a task management module, which is connected with the task release module, and is used for recording and managing the smart contracts of the model training tasks and generating an address list;
运行节点,其与任务管理模块连接,用于读取所述地址列表,从所述模型训练任务的智能合约中下载模型训练图,并利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置,以进行数据共享。A running node, which is connected to the task management module, is used to read the address list, download the model training graph from the smart contract of the model training task, and use local data to train the model training graph. The gradient data is uploaded to the designated storage location of the smart contract of the model training task for data sharing.
第二方面,提供了一种基于区块链的金融数据信息联邦迁移学习方法,应用于基于区块链的金融数据信息联邦迁移学习系统,所述系统包括任务发布模块、任务管理模块及运行节点,所述方法包括:In a second aspect, a blockchain-based financial data information federated transfer learning method is provided, which is applied to a blockchain-based financial data information federated transfer learning system. The system includes a task publishing module, a task management module, and an operating node. , the method includes:
任务发布模块在区块链上发布模型训练任务的智能合约,所述模型训练任务的智能合约包括模型训练图;The task publishing module publishes the smart contract of the model training task on the blockchain, and the smart contract of the model training task includes the model training graph;
任务管理模块记录和管理所述模型训练任务的智能合约并生成地址列表;The task management module records and manages the smart contract of the model training task and generates an address list;
运行节点读取所述地址列表,从所述模型训练任务的智能合约中下载模型训练图,并利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置,以进行数据共享。The operating node reads the address list, downloads the model training graph from the smart contract of the model training task, uses local data to train the model training graph, and uploads the trained gradient data to the model training task's Designated storage location for smart contracts for data sharing.
有益效果beneficial effect
本发明的有益效果在于:The beneficial effects of the present invention are:
本发明的任务管理模块分别与任务发布模块、运行节点连接,通过任务发布模块在区块链上发布模型训练任务的智能合约,任务管理模块记录和管理发布的智能合约并生成其地址列表,运行节点读取地址列表,从对应的智能合约中下载模型训练图利用本地数据对自身模型进行训练,并将训练的梯度数据上传至该智能合约的指定存储位置进行数据共享,避免了上传原始数据造成的隐私问题,同时共享梯度数据可解决应用场景数据较少的问题,提升训练准确度。The task management module of the present invention is respectively connected with the task release module and the running node, and the smart contract of the model training task is released on the blockchain through the task release module. The task management module records and manages the released smart contract and generates its address list, and runs the The node reads the address list, downloads the model training graph from the corresponding smart contract, uses local data to train its own model, and uploads the trained gradient data to the designated storage location of the smart contract for data sharing, avoiding the risk of uploading original data. At the same time, sharing gradient data can solve the problem of less data in application scenarios and improve training accuracy.
附图说明Description of drawings
下面结合附图详述本发明的具体结构The specific structure of the present invention will be described in detail below in conjunction with the accompanying drawings
图1为本发明实施例提供的基于区块链的金融数据信息联邦迁移学习系统的模块连接图;1 is a block connection diagram of a blockchain-based financial data information federated transfer learning system provided by an embodiment of the present invention;
图2为本发明实施例提供的基于区块链的金融数据信息联邦迁移学习方法的流程框图。FIG. 2 is a flowchart of a blockchain-based method for federated transfer learning of financial data information provided by an embodiment of the present invention.
本发明的实施方式Embodiments of the present invention
为详细说明本发明的技术内容、构造特征、所实现目的及效果,以下结合实施方式并配合附图详予说明。In order to describe the technical content, structural features, achieved objects and effects of the present invention in detail, the following detailed description is given in conjunction with the embodiments and the accompanying drawings.
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本发明,而不能理解为对本发明的限制。The following describes in detail the embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the accompanying drawings are exemplary, and are intended to explain the present invention and should not be construed as limiting the present invention.
请参考图1,图1为本发明实施例提供的基于区块链的金融数据信息联邦迁移学习系统的模块连接图Please refer to FIG. 1. FIG. 1 is a module connection diagram of a blockchain-based financial data information federated transfer learning system provided by an embodiment of the present invention.
第一方面,本发明提供了一种基于区块链的金融数据信息联邦迁移学习系统,包括:In a first aspect, the present invention provides a blockchain-based financial data information federated transfer learning system, including:
任务发布模块10,用于在区块链上发布模型训练任务的智能合约,所述模型训练任务的智能合约包括模型训练图;The task publishing module 10 is used to publish the smart contract of the model training task on the blockchain, and the smart contract of the model training task includes the model training graph;
具体地,所述区块链为联盟链,其介于私有链与公有链之间,是一种需要注册许可的区块链,仅限于联盟中具有权限的成员参与账本的读写,网络中节点的角色及功能划分需预先设定,且网络中的共识、运维和接入均由预先设定的节点控制。一般来说,联盟链适合于跨机构的交易、结算、协同办公及存证等。交易达成后,首先由区块链上的各个参与者对交易进行验证,一旦所有的参与者达成共识,该条交易信息盖上表明交易发生的先后顺序的时间戳。时间戳功能保证了交易的可追溯性。区块链技术的应用解决了传统交易信用风险高的痛点,提高了交易的安全性,同时区块链每个参与者都有一套完整的账本,在对账方面具有得天独厚的优势,这就降低了对账的成本,提高了清算的效率。产品具有去中心化、去信任、时间戳等特征的区块链技术作为平台架构的底层技术,使得所有的交易信息公开透明、不可被篡改,大大降低了操作风险及信用风险的发生,使交易更加安全。Specifically, the blockchain is a consortium chain, which is between the private chain and the public chain. It is a blockchain that requires registration and permission, and is limited to members of the consortium who have permission to participate in the reading and writing of the ledger. The roles and functional division of nodes need to be preset, and the consensus, operation and maintenance and access in the network are controlled by the preset nodes. Generally speaking, the alliance chain is suitable for cross-institutional transactions, settlement, collaborative office and certificate deposit. After the transaction is reached, each participant on the blockchain first verifies the transaction. Once all participants reach a consensus, the transaction information is stamped with a timestamp indicating the order in which the transaction occurred. The timestamp function guarantees the traceability of transactions. The application of blockchain technology solves the pain point of high credit risk in traditional transactions and improves the security of transactions. At the same time, each participant in the blockchain has a complete set of books, which has unique advantages in reconciliation, which reduces the It reduces the cost of reconciliation and improves the efficiency of settlement. The product has the characteristics of decentralization, de-trust, time stamp, etc., as the underlying technology of the platform architecture, making all transaction information open, transparent and tamper-proof, greatly reducing the occurrence of operational risks and credit risks, enabling transactions safer.
进一步地,所述区块链包括数据层、网络层、共识层、激励层、合约层和应用层。Further, the blockchain includes a data layer, a network layer, a consensus layer, an incentive layer, a contract layer and an application layer.
任务管理模块20,其与任务发布模块连接,用于记录和管理所述模型训练任务的智能合约并生成地址列表。The task management module 20, which is connected with the task release module, is used for recording and managing the smart contracts of the model training tasks and generating an address list.
具体地,任务管理模块20为工厂模式的智能合约,其封装在区块链的合约层中。Specifically, the task management module 20 is a factory-mode smart contract, which is encapsulated in the contract layer of the blockchain.
运行节点30,其与任务管理模块20连接,用于读取所述地址列表,从所述模型训练任务的智能合约中下载模型训练图,并利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置,以进行数据共享。Running node 30, which is connected to the task management module 20, is used to read the address list, download the model training graph from the smart contract of the model training task, and use local data to train the model training graph, The trained gradient data is uploaded to the designated storage location of the smart contract of the model training task for data sharing.
本发明的有益效果在于:The beneficial effects of the present invention are:
本发明的任务管理模块20分别与任务发布模块10、运行节点30连接,通过任务发布模块10在区块链上发布模型训练任务的智能合约,任务管理模块20记录和管理发布的智能合约并生成其地址列表,运行节点30读取地址列表,从对应的智能合约中下载模型训练图利用本地数据对自身模型进行训练,并将训练的梯度数据上传至该智能合约的指定存储位置进行数据共享,避免了上传原始数据造成的隐私问题,同时共享梯度数据可解决应用场景数据较少的问题,提升训练准确度。The task management module 20 of the present invention is respectively connected with the task release module 10 and the operation node 30, and the smart contract of the model training task is released on the blockchain through the task release module 10. The task management module 20 records and manages the released smart contract and generates For its address list, the operating node 30 reads the address list, downloads the model training graph from the corresponding smart contract, uses local data to train its own model, and uploads the trained gradient data to the designated storage location of the smart contract for data sharing, The privacy problem caused by uploading the original data is avoided, and the sharing of gradient data can solve the problem of less data in application scenarios and improve the training accuracy.
其中,所述基于区块链的金融数据信息联邦迁移学习系统还包括用于存储数据文件的文件存储模块,其与任务管理模块20连接,所述数据文件的哈希值及获取路径存储在所述模型训练任务的智能合约中。Wherein, the blockchain-based financial data information federated transfer learning system further includes a file storage module for storing data files, which is connected to the task management module 20, and the hash values and acquisition paths of the data files are stored in the data file. in the smart contract that describes the model training task.
其中,所述任务管理模块20具有API接口,以使所述运行节点30进行地址列表的读取。Wherein, the task management module 20 has an API interface, so that the running node 30 can read the address list.
其中,所述模型训练任务的智能合约还包括训练数据集存储路径、测试数据集存储路径及准确度要求。Wherein, the smart contract of the model training task further includes a training data set storage path, a test data set storage path and accuracy requirements.
请参考图2,图2为本发明实施例提供的基于区块链的金融数据信息联邦迁移学习方法的流程框图。Please refer to FIG. 2. FIG. 2 is a flowchart of a method for federated transfer learning of financial data information based on blockchain provided by an embodiment of the present invention.
第二方面,本发明还提供了一种基于区块链的金融数据信息联邦迁移学习方法,应用于基于区块链的金融数据信息联邦迁移学习系统,所述系统包括任务发布模块10、任务管理模块20及运行节点30,所述方法包括:In the second aspect, the present invention also provides a blockchain-based financial data information federated transfer learning method, which is applied to a blockchain-based financial data information federated transfer learning system. The system includes a task issuing module 10, a task management Module 20 and operating node 30, the method includes:
步骤S101,任务发布模块10在区块链上发布模型训练任务的智能合约,所述模型训练任务的智能合约包括模型训练图;Step S101, the task publishing module 10 publishes the smart contract of the model training task on the blockchain, and the smart contract of the model training task includes the model training graph;
步骤S102,任务管理模块20记录和管理所述模型训练任务的智能合约并生成地址列表;Step S102, the task management module 20 records and manages the smart contract of the model training task and generates an address list;
步骤S103,运行节点30读取所述地址列表,从所述模型训练任务的智能合约中下载模型训练图,并利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置,以进行数据共享。Step S103, the operating node 30 reads the address list, downloads the model training graph from the smart contract of the model training task, uses local data to train the model training graph, and uploads the trained gradient data to the model training graph. Designated storage location for smart contracts for model training tasks for data sharing.
进一步地,在一种可能的实施方式中,所述运行节点30包括第一运行节点及第二运行节点,则所述利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置包括:Further, in a possible implementation manner, the running node 30 includes a first running node and a second running node, then the model training graph is trained by using local data, and the trained gradient data is uploaded to The designated storage location of the smart contract of the model training task includes:
第一运行节点和第二运行节点分别初始化各自的模型;The first running node and the second running node respectively initialize their respective models;
第二运行节点基于自己的特征计算部分预估值和部分loss,并加密发送给第一运行节点;The second running node calculates part of the estimated value and part of the loss based on its own characteristics, and encrypts it and sends it to the first running node;
第一运行节点基于自己的特征计算部分预估值,并结合第二运行节点的预估值,计算最终的损失函数和梯度数据,然后将第二运行节点需要的梯度数据和损失函数发送回第二运行节点;The first running node calculates part of the estimated value based on its own features, and combines the estimated value of the second running node to calculate the final loss function and gradient data, and then sends the gradient data and loss function required by the second running node back to the first running node. Two running nodes;
第一运行节点和第二运行节点在完成梯度数据计算后,分别将梯度数据进行加密并掩码,发送给链下存储服务器;After completing the gradient data calculation, the first running node and the second running node encrypt and mask the gradient data respectively, and send them to the off-chain storage server;
链下存储服务器解密并汇总梯度数据;Off-chain storage servers decrypt and aggregate gradient data;
第一运行节点和第二运行节点从链下存储服务器下载汇总的梯度数据,去除自身的掩码并更新自有模型;The first running node and the second running node download the aggregated gradient data from the off-chain storage server, remove their own masks and update their own models;
循环上述步骤直至收敛,分别得到第一运行节点和第二运行节点的模型。The above steps are repeated until convergence, and the models of the first running node and the second running node are obtained respectively.
在另一种可能的实施方式中,所述运行节点30包括多个,则所述利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置包括:In another possible implementation manner, the operating nodes 30 include multiple, and the model training graph is trained by using local data, and the trained gradient data is uploaded to the smart contract of the model training task. Designated storage locations include:
链下存储服务器初始化并加密item profile;The off-chain storage server initializes and encrypts the item profile;
链下存储服务器将加密的item profile 发送给各运行节点;The off-chain storage server sends the encrypted item profile to each running node;
各运行节点解密item profile并基于本地数据计算loss,更新各自的user profile,并将item profile的梯度数据加密传输给链下存储服务器;Each running node decrypts the item profile and calculates the loss based on the local data, updates its own user profile, and encrypts and transmits the gradient data of the item profile to the off-chain storage server;
链下存储服务器汇总item profile的梯度数据并更新item profiles;Off-chain storage server summary item profile gradient data and update item profiles;
各运行节点利用矩阵分解各自的user profile和item profile并进行推荐。Each running node uses a matrix to decompose its user profile and item profile and recommend it.
以上所述仅为本发明的实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above descriptions are only the embodiments of the present invention, and are not intended to limit the scope of the present invention. Any equivalent structure or equivalent process transformation made by using the contents of the description and drawings of the present invention, or directly or indirectly applied to other related technologies Fields are similarly included in the scope of patent protection of the present invention.

Claims (7)

  1. 一种基于区块链的金融数据信息联邦迁移学习系统,其特征在于,包括:A blockchain-based financial data information federated transfer learning system, characterized in that it includes:
    任务发布模块,用于在区块链上发布模型训练任务的智能合约,所述模型训练任务的智能合约包括模型训练图;A task publishing module, used to publish the smart contract of the model training task on the blockchain, where the smart contract of the model training task includes the model training graph;
    任务管理模块,其与任务发布模块连接,用于记录和管理所述模型训练任务的智能合约并生成地址列表;a task management module, which is connected with the task release module, and is used for recording and managing the smart contracts of the model training tasks and generating an address list;
    运行节点,其与任务管理模块连接,用于读取所述地址列表,从所述模型训练任务的智能合约中下载模型训练图,并利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置,以进行数据共享。A running node, which is connected to the task management module, is used to read the address list, download the model training graph from the smart contract of the model training task, and use local data to train the model training graph. The gradient data is uploaded to the designated storage location of the smart contract of the model training task for data sharing.
  2. 如权利要求1所述的基于区块链的金融数据信息联邦迁移学习系统,其特征在于,还包括用于存储数据文件的文件存储模块,所述数据文件的哈希值及获取路径存储在所述模型训练任务的智能合约中。The blockchain-based financial data information federated transfer learning system according to claim 1, further comprising a file storage module for storing data files, the hash values and acquisition paths of the data files are stored in the in the smart contract that describes the model training task.
  3. 如权利要求1所述的基于区块链的金融数据信息联邦迁移学习系统,其特征在于,所述任务管理模块具有API接口,以使所述运行节点进行地址列表的读取。The blockchain-based financial data information federated transfer learning system according to claim 1, wherein the task management module has an API interface, so that the running node can read the address list.
  4. 如权利要求1所述的基于区块链的金融数据信息联邦迁移学习系统,其特征在于,所述模型训练任务的智能合约还包括训练数据集存储路径、测试数据集存储路径及准确度要求。The blockchain-based financial data information federated transfer learning system according to claim 1, wherein the smart contract for the model training task further includes a training data set storage path, a test data set storage path, and accuracy requirements.
  5. 一种基于区块链的金融数据信息联邦迁移学习方法,应用于基于区块链的金融数据信息联邦迁移学习系统,所述系统包括任务发布模块、任务管理模块及运行节点,其特征在于,包括:A blockchain-based financial data information federated transfer learning method is applied to a blockchain-based financial data information federated transfer learning system. The system includes a task issuing module, a task management module and an operation node, and is characterized in that it includes: :
    任务发布模块在区块链上发布模型训练任务的智能合约,所述模型训练任务的智能合约包括模型训练图;The task publishing module publishes the smart contract of the model training task on the blockchain, and the smart contract of the model training task includes the model training graph;
    任务管理模块记录和管理所述模型训练任务的智能合约并生成地址列表;The task management module records and manages the smart contract of the model training task and generates an address list;
    运行节点读取所述地址列表,从所述模型训练任务的智能合约中下载模型训练图,并利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置,以进行数据共享。The operating node reads the address list, downloads the model training graph from the smart contract of the model training task, uses local data to train the model training graph, and uploads the trained gradient data to the model training task's Designated storage location for smart contracts for data sharing.
  6. 如权利要求5所述的基于区块链的金融数据信息联邦迁移学习方法,其特征在于,所述运行节点包括第一运行节点及第二运行节点,则所述利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置包括:The method for federated transfer learning of financial data information based on blockchain according to claim 5, wherein the operation node includes a first operation node and a second operation node, and the model is trained by using local data The specified storage location of the smart contract for uploading the trained gradient data to the model training task includes:
    第一运行节点和第二运行节点分别初始化各自的模型;The first running node and the second running node respectively initialize their respective models;
    第二运行节点基于自己的特征计算部分预估值和部分loss,并加密发送给第一运行节点;The second running node calculates part of the estimated value and part of the loss based on its own characteristics, and encrypts it and sends it to the first running node;
    第一运行节点基于自己的特征计算部分预估值,并结合第二运行节点的预估值,计算最终的损失函数和梯度数据,然后将第二运行节点需要的梯度数据和损失函数发送回第二运行节点;The first running node calculates part of the estimated value based on its own features, and combines the estimated value of the second running node to calculate the final loss function and gradient data, and then sends the gradient data and loss function required by the second running node back to the first running node. Two running nodes;
    第一运行节点和第二运行节点在完成梯度数据计算后,分别将梯度数据进行加密并掩码,发送给链下存储服务器;After completing the gradient data calculation, the first running node and the second running node encrypt and mask the gradient data respectively, and send them to the off-chain storage server;
    链下存储服务器解密并汇总梯度数据;Off-chain storage servers decrypt and aggregate gradient data;
    第一运行节点和第二运行节点从链下存储服务器下载汇总的梯度数据,去除自身的掩码并更新自有模型;The first running node and the second running node download the aggregated gradient data from the off-chain storage server, remove their own masks and update their own models;
    循环上述步骤直至收敛,分别得到第一运行节点和第二运行节点的模型。The above steps are repeated until convergence, and the models of the first running node and the second running node are obtained respectively.
  7. 如权利要求5所述的基于区块链的金融数据信息联邦迁移学习方法,其特征在于,所述运行节点包括多个,则所述利用本地数据对所述模型训练图进行训练,将训练的梯度数据上传至所述模型训练任务的智能合约的指定存储位置包括:The method for federated transfer learning of financial data information based on blockchain according to claim 5, characterized in that, if the operating nodes include a plurality of nodes, the model training graph is trained by using local data, and the trained The designated storage location of the gradient data uploaded to the smart contract of the model training task includes:
    链下存储服务器初始化并加密item profile;The off-chain storage server initializes and encrypts the item profile;
    链下存储服务器将加密的item profile 发送给各运行节点;The off-chain storage server sends the encrypted item profile to each running node;
    各运行节点解密item profile并基于本地数据计算loss,更新各自的user profile,并将item profile的梯度数据加密传输给链下存储服务器;Each running node decrypts the item profile and calculates the loss based on the local data, updates its own user profile, and encrypts and transmits the gradient data of the item profile to the off-chain storage server;
    链下存储服务器汇总item profile的梯度数据并更新item profiles;Off-chain storage server summary item profile gradient data and update item profiles;
    各运行节点利用矩阵分解各自的user profile和item profile并进行推荐。Each running node uses a matrix to decompose its user profile and item profile and recommend it.
PCT/CN2021/081308 2021-03-17 2021-03-17 Blockchain-based financial data information federated transfer learning system and method WO2022193173A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/081308 WO2022193173A1 (en) 2021-03-17 2021-03-17 Blockchain-based financial data information federated transfer learning system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/081308 WO2022193173A1 (en) 2021-03-17 2021-03-17 Blockchain-based financial data information federated transfer learning system and method

Publications (1)

Publication Number Publication Date
WO2022193173A1 true WO2022193173A1 (en) 2022-09-22

Family

ID=83321796

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/081308 WO2022193173A1 (en) 2021-03-17 2021-03-17 Blockchain-based financial data information federated transfer learning system and method

Country Status (1)

Country Link
WO (1) WO2022193173A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190188706A1 (en) * 2017-12-18 2019-06-20 Apple Inc. Transference tracking
CN111428881A (en) * 2020-03-20 2020-07-17 深圳前海微众银行股份有限公司 Recognition model training method, device, equipment and readable storage medium
CN112257873A (en) * 2020-11-11 2021-01-22 深圳前海微众银行股份有限公司 Training method, device, system, equipment and storage medium of machine learning model

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190188706A1 (en) * 2017-12-18 2019-06-20 Apple Inc. Transference tracking
CN111428881A (en) * 2020-03-20 2020-07-17 深圳前海微众银行股份有限公司 Recognition model training method, device, equipment and readable storage medium
CN112257873A (en) * 2020-11-11 2021-01-22 深圳前海微众银行股份有限公司 Training method, device, system, equipment and storage medium of machine learning model

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHANG YANYAN: ""Federal Learning" and Its Application Analysis in Finance", RURAL FINANCE RESEARCH, no. 12, 15 December 2020 (2020-12-15), pages 52 - 58, XP055967397, ISSN: 1003-1812, DOI: 10.16127/j.cnki.issn1003-1812.2020.12.015 *

Similar Documents

Publication Publication Date Title
Liu et al. Fedcoin: A peer-to-peer payment system for federated learning
CN109314636B (en) Cryptographic method and system for secure extraction of data from blockchains
CN107180350B (en) Method, device and system for multi-party sharing transaction metadata based on block chain
EP3953834A1 (en) Systems, devices, and methods for dlt-based data management platforms and data products
CN108418783A (en) A kind of protection method of block chain intelligence contract privacy, medium
CN107577427A (en) For the data migration method of block catenary system, equipment and storage medium
JP7312746B2 (en) Hierarchical data exchange management system
US11159331B2 (en) Traceability of a multi-actor job string by block chain, allowing at least two levels of confidence in the information stored
Han et al. Fluid: A blockchain based framework for crowdsourcing
WO2020134631A1 (en) Block chain-based data processing method and apparatus
CN109937430A (en) Platform for the atom transfer intellectual capital in block chain network
JP2022527375A (en) Systems and methods for virtual distributed ledger networks
CN111596956B (en) Information processing method and device based on block chain, electronic equipment and medium
Michelson et al. Accumulate: An identity-based blockchain protocol with cross-chain support, human-readable addresses, and key management capabilities
Lu et al. Educoin: a secure and efficient payment solution for mooc environment
CN113657609B (en) Data management method and system based on block chain and federation transfer learning
Liu et al. EduChain: a blockchain-based education data management system
WO2022193173A1 (en) Blockchain-based financial data information federated transfer learning system and method
Liu et al. Blockchain and mobile client privacy protection in e-commerce consumer shopping tendency identification application
CN111861117A (en) Musical instrument evaluation data sharing method and system based on alliance chain
CN113420083B (en) System with heterogeneous parallel block chain structure of extensible distributed ledger
Bhole et al. Allocation and Tracking of Public Funds using Blockchain
Luhanga WHY BUSINESS SCHOOLS IN TANZANIA SHOULD TEACH BLOCK CHAIN TECHNOLOGY
CN112016898A (en) Block chain-based collaborative office method, device and system
Ştefan Blockchain technologies and microservices for open learning communities. a software architecture perspective

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21930767

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21930767

Country of ref document: EP

Kind code of ref document: A1