WO2022077720A1 - Method and apparatus for sharing medical data - Google Patents

Method and apparatus for sharing medical data Download PDF

Info

Publication number
WO2022077720A1
WO2022077720A1 PCT/CN2020/132615 CN2020132615W WO2022077720A1 WO 2022077720 A1 WO2022077720 A1 WO 2022077720A1 CN 2020132615 W CN2020132615 W CN 2020132615W WO 2022077720 A1 WO2022077720 A1 WO 2022077720A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
medical data
assistant
shadow
seeking
Prior art date
Application number
PCT/CN2020/132615
Other languages
French (fr)
Chinese (zh)
Inventor
赵建
罗凯文
相韶华
林宏壤
Original Assignee
深圳技术大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳技术大学 filed Critical 深圳技术大学
Publication of WO2022077720A1 publication Critical patent/WO2022077720A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/176Support for shared access to files; File sharing support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H30/00ICT specially adapted for the handling or processing of medical images
    • G16H30/20ICT specially adapted for the handling or processing of medical images for handling medical images, e.g. DICOM, HL7 or PACS
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H40/00ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices
    • G16H40/60ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices
    • G16H40/67ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices for remote operation

Definitions

  • the present invention relates to the technical field of data processing, and in particular, to a medical data sharing method and device.
  • Federated machine learning (Federated machine learning/Federated Learning): also known as federated learning, federated learning, and federated learning.
  • Federated Machine Learning is a machine learning framework that can effectively help multiple agencies conduct data usage and machine learning modeling while meeting user privacy protection, data security, and government regulations.
  • federated learning can effectively solve the problem of data silos, allowing participants to jointly model without sharing data, which can technically break the data silos and realize AI collaboration.
  • Federated learning defines a machine learning framework under which the problem of different data owners collaborating without exchanging data is solved by designing virtual models.
  • the virtual model is the optimal model for all parties to aggregate data together, and each region serves local goals according to the model.
  • Federated learning requires that this modeling result should be infinitely close to the traditional model, that is, the result of aggregating the data of multiple data owners into one place for modeling.
  • the identity and status of each participant are the same, and a shared data policy can be established. Since the data is not transferred, it does not reveal user privacy or affect data compliance. To protect data privacy and meet legal compliance requirements.
  • the main purpose of the present invention is to propose a medical data sharing method and device, which realizes decentralization through blockchain technology and federated machine learning training, and solves the shortcomings of traditional federated learning relying on a single server.
  • the main purpose of the present invention is to propose a medical data sharing method and device, which realizes decentralization through blockchain technology and federated machine learning training, and solves the shortcomings of traditional federated learning relying on a single server.
  • a medical data sharing method includes:
  • consortium chains to connect multiple data islands includes:
  • Synthetic training is performed by the temporary center generator to generate synthetic data samples that are similar to the data in the plurality of data islands.
  • the summary and update of the medical data of each node in the shadow-seeking assistant is specifically:
  • an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI-assisted labeling and multi-node.
  • OCT image quality control AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI-assisted labeling and multi-node.
  • OCT image quality control AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI-assisted labeling and multi-node.
  • the smart ophthalmology OCT module is mounted on a Zhilian platform.
  • a medical data sharing device comprising:
  • connection module is used to connect multiple data islands using the alliance chain
  • the publishing module is used to publish federated machine learning tasks in the form of smart contracts on the alliance chain;
  • the update module is used to summarize and update the medical data of each node in the shadow-seeking assistant.
  • connection module includes:
  • a setting unit for setting a one-to-one corresponding multiple distributed asynchronous discriminators in the multiple data islands
  • an adversarial unit configured to form an adversarial network of the plurality of distributed asynchronous discriminators and a temporary center generator
  • a training unit configured to perform synthetic training through the temporary center generator to generate synthetic data samples that are similar to the data in the plurality of data islands.
  • the summary and update of the medical data of each node in the shadow-seeking assistant is specifically:
  • an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI-assisted labeling and multi-node.
  • OCT image quality control AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI-assisted labeling and multi-node.
  • OCT image quality control AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI-assisted labeling and multi-node.
  • the smart ophthalmology OCT module is mounted on a Zhilian platform.
  • a medical data sharing method and device provided by the present invention include: using a consortium chain to connect multiple data islands; publishing a federated machine learning task in the form of a smart contract on the consortium chain;
  • the shadow assistant joins federated machine learning; it aggregates and updates the medical data of each node in the shadow assistant; through blockchain technology and federated machine learning training, it achieves decentralization and solves the shortcomings of traditional federated learning relying on a single server.
  • FIG. 1 is a flowchart of a medical data sharing method according to Embodiment 1 of the present invention
  • Fig. 2 is the method flow chart of step S10 in Fig. 1;
  • FIG. 3 is an exemplary structural block diagram of a medical data sharing apparatus according to Embodiment 2 of the present invention.
  • FIG. 4 is an exemplary structural block diagram of a connection module according to Embodiment 2 of the present invention.
  • a medical data sharing method includes:
  • the decentralization of medical data is realized through blockchain technology and federated machine learning training, which solves the disadvantage that traditional federated learning relies on a single server.
  • the smart ophthalmology OCT module in the shadow-seeking assistant is configured with a smart contract API interface, and nodes on the chain can join federated machine learning training through this interface, and the blockchain can continuously store each node in such an iterative process.
  • the update record of achieves decentralization, and solves the shortcomings of traditional federated learning relying on a single server.
  • the system selects temporary nodes at regular intervals to summarize and update the parameters, and feed the results back to the nodes and auxiliary systems.
  • the step S10 includes:
  • the Zhilian platform introduces federated machine learning to realize multi-party collaborative training, so that enterprises and institutions can collaborate on the shared model without sharing patient data to achieve decentralized Neural network training.
  • the Zhilian platform After acquiring the training task, the Zhilian platform forms an adversarial network by combining distributed asynchronous discriminators located in multiple data islands and a temporary central generator, so that the temporary central generator can also conduct training without touching the original private data. Synthetic training, so that synthetic data samples that are similar to the original data in each data island can be generated for downstream tasks. On this basis, two loss functions are also used, so that the temporary center generator has a certain lifelong learning ability, and can continue to learn in the dynamically changing data island environment. There are cases where institutions drop out and learn incrementally from different data silos to approximate the distribution of homogeneous data or even different data.
  • the Zhilian platform adheres to the core advantages of federated learning and solves the problem of privacy protection of medical data well. Not only that, compared with the traditional federated learning, the Zhilian platform also effectively reduces the amount of communication data between the temporary center generator and the data island due to the "different approach" in the implementation method, only the transmission of synthetic image data and feedback errors are not required. Not all parameter data of the entire model, and there is no need to exchange any data or parameters between data islands, which can significantly reduce the cost of research through federated learning between medical institutions, and speed up research efficiency and AI model production.
  • the summary and update of the medical data of each node in the shadow-seeking assistant is specifically:
  • each node adopts an algorithm suitable for nodes to update model parameters in a distributed manner, and completes the distributed update and convergence of machine learning model parameters with the support of a decentralized parameter server.
  • a node will be randomly selected by the smart contract as a temporary server, which is responsible for processing the gradient information of all nodes.
  • the smart contract will send the public key of the ephemeral server to other nodes participating in the training.
  • Each node will train the model locally, obtain gradient information, store the gradient information locally, and then encrypt the storage path with the public key, and write the encrypted storage path into the smart contract. Subsequently, the ephemeral server reads all the collected encrypted storage paths from the smart contract.
  • the temporary server After the temporary server reads the encrypted paths, it decrypts them, and then downloads the gradient information of this round from the corresponding nodes. After the temporary server obtains all the gradient information, it will integrate the gradient information. The simplest way to integrate is to average all the gradients. After integration, the temporary server stores the integrated parameters locally, encrypts the storage paths with the public keys of all nodes, and writes them into the smart contract. Each node obtains the parameter path encrypted by its own public key by reading the smart contract. After decrypting the parameter path, connect to the temporary server to download the parameter information updated in this round. Because the temporary server for each round of training is randomly selected by the smart contract, the temporary server for each round is almost different, so no node can obtain all the intermediate gradient information. However, each node only grasps the intermediate gradients of very few rounds, thus eliminating the risk of privacy leakage caused by intermediate gradients.
  • an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has the functions of OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI auxiliary annotation and The function of multi-node collaborative training.
  • using the system can help the ophthalmologist to complete the preliminary fundus disease screening, and greatly save the patient's examination time.
  • Intelligent ophthalmology OCT combines OCT fundus examination and AI lesion screening.
  • the AI-assisted labeling system will also continue to learn during this process, and perform image processing and labeling, so that each new labelled image can become training data. Further Improve the accuracy of the model.
  • the smart ophthalmic OCT module is mounted on the Zhilian platform and Clara.
  • the Clara platform is a software toolkit launched by Nvidia that provides medical application developers with a series of libraries for GPUs, such as computing, advanced visualization, and AI technology-related libraries.
  • the smart ophthalmology oct system is also equipped with a smart contract API interface. Nodes on the chain can join federated machine learning training through this interface.
  • the blockchain can continuously store the update records of each node in such an iterative process, realizing decentralization. It solves the shortcomings of traditional federated learning relying on a single server.
  • the system selects temporary nodes at regular intervals to summarize and update the parameters, and feed the results back to the nodes and auxiliary systems.
  • a medical data sharing device includes:
  • connection module 10 is used to connect multiple data islands by using the alliance chain
  • the publishing module 20 is used to publish federated machine learning tasks in the form of smart contracts on the alliance chain;
  • the joining module 30 is used to add the shadow-seeking assistant to the federated machine learning through the smart contract API interface;
  • the updating module 40 is used for summarizing and updating the medical data of each node in the shadow-seeking assistant.
  • the decentralization of medical data is realized through blockchain technology and federated machine learning training, which solves the disadvantage that traditional federated learning relies on a single server.
  • the smart ophthalmology OCT module in the shadow-seeking assistant is configured with a smart contract API interface, and nodes on the chain can join federated machine learning training through this interface, and the blockchain can continuously store each node in such an iterative process.
  • the update record of achieves decentralization, and solves the shortcomings of traditional federated learning relying on a single server.
  • the system selects temporary nodes at regular intervals to summarize and update the parameters, and feed the results back to the nodes and auxiliary systems.
  • connection module includes:
  • a setting unit 11 is used to set a plurality of distributed asynchronous discriminators in one-to-one correspondence in the plurality of data islands;
  • an adversarial unit 12 configured to form an adversarial network with the multiple distributed asynchronous discriminators and a temporary center generator
  • the training unit 13 is configured to perform synthetic training through the temporary center generator to generate synthetic data samples that are similar to the data in the multiple data islands.
  • the Zhilian platform introduces federated machine learning to realize multi-party collaborative training, so that enterprises and institutions can collaborate on the shared model without sharing patient data to achieve decentralized Neural network training.
  • the Zhilian platform After acquiring the training task, the Zhilian platform forms an adversarial network by combining distributed asynchronous discriminators located in multiple data islands and a temporary central generator, so that the temporary central generator can also conduct training without touching the original private data. Synthetic training, so that synthetic data samples that are similar to the original data in each data island can be generated for downstream tasks. On this basis, two loss functions are also used, so that the temporary center generator has a certain lifelong learning ability, and can continue to learn in the dynamically changing data island environment. There are cases where institutions drop out and learn incrementally from different data silos to approximate the distribution of homogeneous data or even different data.
  • the Zhilian platform adheres to the core advantages of federated learning and solves the problem of privacy protection of medical data well. Not only that, compared with the traditional federated learning, the Zhilian platform also effectively reduces the amount of communication data between the temporary center generator and the data island due to the "different approach" in the implementation method, only the transmission of synthetic image data and feedback errors are not required. Not all parameter data of the entire model, and there is no need to exchange any data or parameters between data islands, which can significantly reduce the cost of research through federated learning between medical institutions, and speed up research efficiency and AI model production.
  • the summary and update of the medical data of each node in the shadow-seeking assistant is specifically:
  • an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has the functions of OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI auxiliary annotation and The function of multi-node collaborative training.
  • each node adopts an algorithm suitable for nodes to update model parameters in a distributed manner, and completes the distributed update and convergence of machine learning model parameters with the support of a decentralized parameter server.
  • a node will be randomly selected by the smart contract as a temporary server, which is responsible for processing the gradient information of all nodes.
  • the smart contract will send the public key of the ephemeral server to other nodes participating in the training.
  • Each node will train the model locally, obtain gradient information, store the gradient information locally, and then encrypt the storage path with the public key, and write the encrypted storage path into the smart contract. Subsequently, the ephemeral server reads all the collected encrypted storage paths from the smart contract.
  • the temporary server After the temporary server reads the encrypted paths, it decrypts them, and then downloads the gradient information of this round from the corresponding nodes. After the temporary server obtains all the gradient information, it will integrate the gradient information. The simplest way to integrate is to average all the gradients. After integration, the temporary server stores the integrated parameters locally, encrypts the storage paths with the public keys of all nodes, and writes them into the smart contract. Each node obtains the parameter path encrypted by its own public key by reading the smart contract. After decrypting the parameter path, connect to the temporary server to download the parameter information updated in this round. Because the temporary server for each round of training is randomly selected by the smart contract, the temporary server for each round is almost different, so no node can obtain all the intermediate gradient information. However, each node only grasps the intermediate gradients of very few rounds, thus eliminating the risk of privacy leakage caused by intermediate gradients.
  • an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has the functions of OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI auxiliary annotation and The function of multi-node collaborative training.
  • using the system can help the ophthalmologist to complete the preliminary fundus disease screening, and greatly save the patient's examination time.
  • Intelligent ophthalmology OCT combines OCT fundus examination and AI lesion screening.
  • the AI-assisted labeling system will also continue to learn during this process, and perform image processing and labeling, so that each new labelled image can become training data. Further Improve the accuracy of the model.
  • the smart ophthalmic OCT module is mounted on the Zhilian platform and Clara.
  • the Clara platform is a software toolkit launched by Nvidia that provides medical application developers with a series of libraries for GPUs, such as computing, advanced visualization, and AI technology-related libraries.
  • the smart ophthalmology oct system is also equipped with a smart contract API interface. Nodes on the chain can join federated machine learning training through this interface.
  • the blockchain can continuously store the update records of each node in such an iterative process, realizing decentralization. It solves the shortcomings of traditional federated learning relying on a single server.
  • the system selects temporary nodes at regular intervals to summarize and update the parameters, and feed the results back to the nodes and auxiliary systems.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Epidemiology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Business, Economics & Management (AREA)
  • Mathematical Physics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Radiology & Medical Imaging (AREA)
  • Eye Examination Apparatus (AREA)
  • Medical Treatment And Welfare Office Work (AREA)
  • Image Analysis (AREA)

Abstract

The present invention relates to the technical field of data processing. Disclosed are a method and apparatus for sharing medical data. The method comprises: using a consortium chain to connect a plurality of data islands; publishing a federated machine learning task in the form of a smart contract on the consortium chain; adding an image-finding assistant to federated machine learning by means of a smart contract API interface; and summarizing and updating medical data of each node in the image-finding assistant. Decentralization is achieved by means of blockchain technology and federated machine learning training, which ameliorates the shortcoming of traditional federated learning relying on a single server.

Description

一种医疗数据共享方法及装置Method and device for sharing medical data 技术领域technical field
本发明涉及数据处理技术领域,尤其涉及一种医疗数据共享方法及装置。The present invention relates to the technical field of data processing, and in particular, to a medical data sharing method and device.
背景技术Background technique
联邦机器学习(Federated machine learning/Federated Learning):又名联邦学习,联合学习,联盟学习。联邦机器学习是一个机器学习框架,能有效帮助多个机构在满足用户隐私保护、数据安全和政府法规的要求下,进行数据使用和机器学习建模。联邦学习作为分布式的机器学习范式,可以有效解决数据孤岛问题,让参与方在不共享数据的基础上联合建模,能从技术上打破数据孤岛,实现AI协作。Federated machine learning (Federated machine learning/Federated Learning): also known as federated learning, federated learning, and federated learning. Federated Machine Learning is a machine learning framework that can effectively help multiple agencies conduct data usage and machine learning modeling while meeting user privacy protection, data security, and government regulations. As a distributed machine learning paradigm, federated learning can effectively solve the problem of data silos, allowing participants to jointly model without sharing data, which can technically break the data silos and realize AI collaboration.
联邦学习定义了机器学习框架,在此框架下通过设计虚拟模型解决不同数据拥有方在不交换数据的情况下进行协作的问题。虚拟模型是各方将数据聚合在一起的最优模型,各自区域依据模型为本地目标服务。联邦学习要求此建模结果应当无限接近传统模式,即将多个数据拥有方的数据汇聚到一处进行建模的结果。在联邦机制下,各参与者的身份和地位相同,可建立共享数据策略。由于数据不发生转移,因此不会泄露用户隐私或影响数据规范。为了保护数据隐私、满足合法合规的要求。Federated learning defines a machine learning framework under which the problem of different data owners collaborating without exchanging data is solved by designing virtual models. The virtual model is the optimal model for all parties to aggregate data together, and each region serves local goals according to the model. Federated learning requires that this modeling result should be infinitely close to the traditional model, that is, the result of aggregating the data of multiple data owners into one place for modeling. Under the federal mechanism, the identity and status of each participant are the same, and a shared data policy can be established. Since the data is not transferred, it does not reveal user privacy or affect data compliance. To protect data privacy and meet legal compliance requirements.
虽然传统的联邦学习能够很好的解决数据孤岛的问题,但始终无法解决的,是中心化的问题。在传统的联邦学习中,参数的最终汇总更新需要在中心化的服务器上进行,而这极为耗费算力及带宽。Although traditional federated learning can solve the problem of data silos well, the problem that cannot be solved is the problem of centralization. In traditional federated learning, the final summary update of parameters needs to be performed on a centralized server, which consumes computing power and bandwidth.
技术问题technical problem
本发明的主要目的在于提出一种医疗数据共享方法及装置,通过区块链技术和联邦机器学习训练,实现了去中心化,解决了传统联邦学习依赖单一服务器的缺点。The main purpose of the present invention is to propose a medical data sharing method and device, which realizes decentralization through blockchain technology and federated machine learning training, and solves the shortcomings of traditional federated learning relying on a single server.
技术解决方案technical solutions
本发明的主要目的在于提出一种医疗数据共享方法及装置,通过区块链技术和联邦机器学习训练,实现了去中心化,解决了传统联邦学习依赖单一服务器的缺点。The main purpose of the present invention is to propose a medical data sharing method and device, which realizes decentralization through blockchain technology and federated machine learning training, and solves the shortcomings of traditional federated learning relying on a single server.
为实现上述目的,本发明提供的一种医疗数据共享方法,包括:To achieve the above purpose, a medical data sharing method provided by the present invention includes:
利用联盟链将多个数据孤岛连接起来;Use consortium chains to connect multiple data silos;
在联盟链上以智能合约的形式发布联邦机器学习任务;Publish federated machine learning tasks in the form of smart contracts on the consortium chain;
通过智能合约API接口将寻影助手加入联邦机器学习;Add the shadow-seeking assistant to federated machine learning through the smart contract API interface;
对寻影助手中各节点的医疗数据进行汇总更新。Summarize and update the medical data of each node in the shadow-seeking assistant.
可选地,所述利用联盟链将多个数据孤岛连接起来包括:Optionally, the use of consortium chains to connect multiple data islands includes:
在所述多个数据孤岛中设置一一对应的多个分布式异步鉴别器;setting a one-to-one corresponding multiple distributed asynchronous discriminators in the multiple data islands;
将所述多个分布式异步鉴别器和一个临时中心生成器组成对抗网络;forming an adversarial network of the plurality of distributed asynchronous discriminators and a temporary center generator;
通过所述临时中心生成器进行合成训练,生成与所述多个数据孤岛中的数据相近似的合成数据样本。Synthetic training is performed by the temporary center generator to generate synthetic data samples that are similar to the data in the plurality of data islands.
可选地,所述对寻影助手中各节点的医疗数据进行汇总更新具体为:Optionally, the summary and update of the medical data of each node in the shadow-seeking assistant is specifically:
通过分布式更新模型参数算法对寻影助手中各节点的医疗数据进行汇总更新。Through the distributed update model parameter algorithm, the medical data of each node in the shadow-seeking assistant is summarized and updated.
可选地,所述寻影助手中设置有智能眼科OCT模块,所述智能眼科OCT模块具有OCT图像质量控制、AI病变筛查、AI病灶自动检测、OCT图像质量增强、AI辅助标注和多节点协同训练的功能。Optionally, an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI-assisted labeling and multi-node. The function of collaborative training.
可选地,所述智能眼科OCT模块搭载在智联平台上。Optionally, the smart ophthalmology OCT module is mounted on a Zhilian platform.
作为本发明的另一方面,提供的一种医疗数据共享装置,包括:As another aspect of the present invention, a medical data sharing device is provided, comprising:
连接模块,用于利用联盟链将多个数据孤岛连接起来;The connection module is used to connect multiple data islands using the alliance chain;
发布模块,用于在联盟链上以智能合约的形式发布联邦机器学习任务;The publishing module is used to publish federated machine learning tasks in the form of smart contracts on the alliance chain;
加入模块,用于通过智能合约API接口将寻影助手加入联邦机器学习;Add a module for adding the shadow-seeking assistant to federated machine learning through the smart contract API interface;
更新模块,用于对寻影助手中各节点的医疗数据进行汇总更新。The update module is used to summarize and update the medical data of each node in the shadow-seeking assistant.
可选地,所述连接模块包括:Optionally, the connection module includes:
设置单元,用于在所述多个数据孤岛中设置一一对应的多个分布式异步鉴别器;a setting unit for setting a one-to-one corresponding multiple distributed asynchronous discriminators in the multiple data islands;
对抗单元,用于将所述多个分布式异步鉴别器和一个临时中心生成器组成对抗网络;an adversarial unit, configured to form an adversarial network of the plurality of distributed asynchronous discriminators and a temporary center generator;
训练单元,用于通过所述临时中心生成器进行合成训练,生成与所述多个数据孤岛中的数据相近似的合成数据样本。A training unit, configured to perform synthetic training through the temporary center generator to generate synthetic data samples that are similar to the data in the plurality of data islands.
可选地,所述对寻影助手中各节点的医疗数据进行汇总更新具体为:Optionally, the summary and update of the medical data of each node in the shadow-seeking assistant is specifically:
通过分布式更新模型参数算法对寻影助手中各节点的医疗数据进行汇总更新。Through the distributed update model parameter algorithm, the medical data of each node in the shadow-seeking assistant is summarized and updated.
可选地,所述寻影助手中设置有智能眼科OCT模块,所述智能眼科OCT模块具有OCT图像质量控制、AI病变筛查、AI病灶自动检测、OCT图像质量增强、AI辅助标注和多节点协同训练的功能。Optionally, an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI-assisted labeling and multi-node. The function of collaborative training.
可选地,所述智能眼科OCT模块搭载在智联平台上。Optionally, the smart ophthalmology OCT module is mounted on a Zhilian platform.
有益效果beneficial effect
本发明提出的一种医疗数据共享方法及装置,该方法包括:利用联盟链将多个数据孤岛连接起来;在联盟链上以智能合约的形式发布联邦机器学习任务;通过智能合约API接口将寻影助手加入联邦机器学习;对寻影助手中各节点的医疗数据进行汇总更新;通过区块链技术和联邦机器学习训练,实现了去中心化,解决了传统联邦学习依赖单一服务器的缺点。A medical data sharing method and device provided by the present invention include: using a consortium chain to connect multiple data islands; publishing a federated machine learning task in the form of a smart contract on the consortium chain; The shadow assistant joins federated machine learning; it aggregates and updates the medical data of each node in the shadow assistant; through blockchain technology and federated machine learning training, it achieves decentralization and solves the shortcomings of traditional federated learning relying on a single server.
附图说明Description of drawings
图1为本发明实施例一提供的一种医疗数据共享方法的流程图;FIG. 1 is a flowchart of a medical data sharing method according to Embodiment 1 of the present invention;
图2为图1中步骤S10的方法流程图;Fig. 2 is the method flow chart of step S10 in Fig. 1;
图3为本发明实施例二提供的一种医疗数据共享装置的示范性结构框图;3 is an exemplary structural block diagram of a medical data sharing apparatus according to Embodiment 2 of the present invention;
图4为本发明实施例二提供的一种连接模块的示范性结构框图。FIG. 4 is an exemplary structural block diagram of a connection module according to Embodiment 2 of the present invention.
本发明的实施方式Embodiments of the present invention
本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics and advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.
应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.
实施例一Example 1
如图1所示,在本实施例中,一种医疗数据共享方法,包括:As shown in FIG. 1, in this embodiment, a medical data sharing method includes:
S10、利用联盟链将多个数据孤岛连接起来;S10. Use the alliance chain to connect multiple data islands;
S20、在联盟链上以智能合约的形式发布联邦机器学习任务;S20. Publish federated machine learning tasks in the form of smart contracts on the alliance chain;
S30、通过智能合约API接口将寻影助手加入联邦机器学习;S30. Add the shadow-seeking assistant to the federated machine learning through the smart contract API interface;
S40、对寻影助手中各节点的医疗数据进行汇总更新。S40, summarize and update the medical data of each node in the shadow-seeking assistant.
在本实施例中,通过区块链技术和联邦机器学习训练,实现了医疗数据的去中心化,解决了传统联邦学习依赖单一服务器的缺点。In this embodiment, the decentralization of medical data is realized through blockchain technology and federated machine learning training, which solves the disadvantage that traditional federated learning relies on a single server.
在本实施例中,寻影助手中的智能眼科OCT模块配置了智能合约API接口,链上的节点可通过此接口加入联邦机器学习训练,区块链可在这样的迭代过程中不断储存各节点的更新记录,实现了去中心化,解决了传统联邦学习依赖单一服务器的缺点。系统每隔一段时间选取临时节点进行参数的汇总更新,并将结果反馈回节点以及辅助系统。In this embodiment, the smart ophthalmology OCT module in the shadow-seeking assistant is configured with a smart contract API interface, and nodes on the chain can join federated machine learning training through this interface, and the blockchain can continuously store each node in such an iterative process. The update record of , achieves decentralization, and solves the shortcomings of traditional federated learning relying on a single server. The system selects temporary nodes at regular intervals to summarize and update the parameters, and feed the results back to the nodes and auxiliary systems.
如图2所示,在本实施例中,所述步骤S10包括:As shown in FIG. 2, in this embodiment, the step S10 includes:
S11、在所述多个数据孤岛中设置一一对应的多个分布式异步鉴别器;S11, setting a one-to-one corresponding multiple distributed asynchronous discriminators in the multiple data islands;
S12、将所述多个分布式异步鉴别器和一个临时中心生成器组成对抗网络;S12, forming an adversarial network with the multiple distributed asynchronous discriminators and a temporary center generator;
S13、通过所述临时中心生成器进行合成训练,生成与所述多个数据孤岛中的数据相近似的合成数据样本。S13. Perform synthetic training through the temporary center generator to generate synthetic data samples that are similar to the data in the multiple data islands.
在本实施例中,在机器学习的模式上,智联平台引入了联邦机器学习,实现了多方协作训练,使得各企业机构在无需共享患者数据的情况下,针对共享模型开展协作,实现分散化的神经网络训练。In this embodiment, in the mode of machine learning, the Zhilian platform introduces federated machine learning to realize multi-party collaborative training, so that enterprises and institutions can collaborate on the shared model without sharing patient data to achieve decentralized Neural network training.
当获取训练任务后,智联平台通过将位于多个数据孤岛的分布式异步鉴别器和一个临时的中心生成器组成对抗网络,让临时中心生成器在不接触原始隐私数据的情况下,也能进行合成训练,从而能够生成与各数据孤岛中原始数据相近似的合成数据样本,供下游任务使用。在此基础上,还采用了2种损失函数,使得临时中心生成器具备一定的终身学习能力,可以在动态变化的数据孤岛环境中持续学习,比如学习过程中有新的机构加入或某些原有机构退出的情况,并从不同的数据孤岛中渐进地学习到同类数据甚至不同类数据的近似分布。由于避免了对原始数据的直接访问,智联平台秉承了联邦学习的核心优势,很好地解决了医疗数据的隐私保护问题。不仅如此,相较于传统的联邦学习,由于在实现方法上“另辟蹊径”,智联平台还有效减少了临时中心生成器与数据孤岛之间的通信数据量,仅需传输合成图像数据和反馈误差而非整个模型的所有参数数据,而且各数据孤岛之间无需交换任何数据或参数,因而可显著降低医疗机构之间通过联邦学习进行研究的成本,加快研究效率和AI模型的生产速度。After acquiring the training task, the Zhilian platform forms an adversarial network by combining distributed asynchronous discriminators located in multiple data islands and a temporary central generator, so that the temporary central generator can also conduct training without touching the original private data. Synthetic training, so that synthetic data samples that are similar to the original data in each data island can be generated for downstream tasks. On this basis, two loss functions are also used, so that the temporary center generator has a certain lifelong learning ability, and can continue to learn in the dynamically changing data island environment. There are cases where institutions drop out and learn incrementally from different data silos to approximate the distribution of homogeneous data or even different data. By avoiding direct access to the original data, the Zhilian platform adheres to the core advantages of federated learning and solves the problem of privacy protection of medical data well. Not only that, compared with the traditional federated learning, the Zhilian platform also effectively reduces the amount of communication data between the temporary center generator and the data island due to the "different approach" in the implementation method, only the transmission of synthetic image data and feedback errors are not required. Not all parameter data of the entire model, and there is no need to exchange any data or parameters between data islands, which can significantly reduce the cost of research through federated learning between medical institutions, and speed up research efficiency and AI model production.
在本实施例中,所述对寻影助手中各节点的医疗数据进行汇总更新具体为:In this embodiment, the summary and update of the medical data of each node in the shadow-seeking assistant is specifically:
通过分布式更新模型参数算法对寻影助手中各节点的医疗数据进行汇总更新。Through the distributed update model parameter algorithm, the medical data of each node in the shadow-seeking assistant is summarized and updated.
在本实施例中,各节点采用适用于节点分布式更新模型参数的算法,在无中心化参数服务器的支持下完成机器学习模型参数的分布式更新与收敛。在每个训练回合开始,会由智能合约随机选出一个节点作为临时服务器,来担当处理全部节点梯度信息的责任。智能合约会将临时服务器的公钥发送给参与训练的其他节点。每个节点会在本地训练模型,并获得梯度信息,并将梯度信息存储在本地,然后利用公钥加密存储路径,将加密的存储路径写入智能合约。随后,临时服务器从智能合约读取收集到的全部加密存储路径。临时服务器读取到加密路径后,会对它们进行解密,然后从相应节点处下载本回合的梯度信息。临时服务器获得全部的梯度信息后,会对梯度信息进行整合。最简单的整合方式就是将全部的梯度取平均值。整合之后,临时服务器将整合后的参数存储在本地,并将存储路径分别用全部节点的公钥分别进行加密后,再写入智能合约。各节点通过读取智能合约,获取由它自己的公钥进行加密的参数路径。解密参数路径后连接到临时服务器下载本轮更新的参数信息。因为每轮训练的临时服务器都是由智能合约随机选择得出,每一轮的临时服务器几乎都是不同的,所以没有任何一个节点能够获得全部的中间梯度信息。而每个节点只掌握极个别轮次的中间梯度,因此排除了由于中间梯度所导致的隐私泄露的风险。In this embodiment, each node adopts an algorithm suitable for nodes to update model parameters in a distributed manner, and completes the distributed update and convergence of machine learning model parameters with the support of a decentralized parameter server. At the beginning of each training round, a node will be randomly selected by the smart contract as a temporary server, which is responsible for processing the gradient information of all nodes. The smart contract will send the public key of the ephemeral server to other nodes participating in the training. Each node will train the model locally, obtain gradient information, store the gradient information locally, and then encrypt the storage path with the public key, and write the encrypted storage path into the smart contract. Subsequently, the ephemeral server reads all the collected encrypted storage paths from the smart contract. After the temporary server reads the encrypted paths, it decrypts them, and then downloads the gradient information of this round from the corresponding nodes. After the temporary server obtains all the gradient information, it will integrate the gradient information. The simplest way to integrate is to average all the gradients. After integration, the temporary server stores the integrated parameters locally, encrypts the storage paths with the public keys of all nodes, and writes them into the smart contract. Each node obtains the parameter path encrypted by its own public key by reading the smart contract. After decrypting the parameter path, connect to the temporary server to download the parameter information updated in this round. Because the temporary server for each round of training is randomly selected by the smart contract, the temporary server for each round is almost different, so no node can obtain all the intermediate gradient information. However, each node only grasps the intermediate gradients of very few rounds, thus eliminating the risk of privacy leakage caused by intermediate gradients.
在本实施例中,所述寻影助手中设置有智能眼科OCT模块,所述智能眼科OCT模块具有OCT图像质量控制、AI病变筛查、AI病灶自动检测、OCT图像质量增强、AI辅助标注和多节点协同训练的功能。In this embodiment, an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has the functions of OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI auxiliary annotation and The function of multi-node collaborative training.
在本实施例中,使用该系统可以帮助眼科医生完成初步的眼底疾病筛查,大幅度节省病患的检查时间。智能眼科OCT结合了OCT眼底检查和AI病灶筛查,AI辅助标注系统也会在此过程中不断学习,并进行图像处理与标注,使得每一张新的标注影像都能够成为训练的数据,进一步改善模型的准确度。In this embodiment, using the system can help the ophthalmologist to complete the preliminary fundus disease screening, and greatly save the patient's examination time. Intelligent ophthalmology OCT combines OCT fundus examination and AI lesion screening. The AI-assisted labeling system will also continue to learn during this process, and perform image processing and labeling, so that each new labelled image can become training data. Further Improve the accuracy of the model.
在本实施例中,所述智能眼科OCT模块搭载在智联平台以及Clara上。Clara平台是Nvidia推出的一个软件工具包,能够提供医学应用开发人员一系列用于GPU的函式库,像是计算、进阶可视化、AI技术相关的函式库。智能眼科oct系统同时配置了智能合约API接口,链上的节点可通过此接口加入联邦机器学习训练,区块链可在这样的迭代过程中不断储存各节点的更新记录,实现了去中心化,解决了传统联邦学习依赖单一服务器的缺点。系统每隔一段时间选取临时节点进行参数的汇总更新,并将结果反馈回节点以及辅助系统。In this embodiment, the smart ophthalmic OCT module is mounted on the Zhilian platform and Clara. The Clara platform is a software toolkit launched by Nvidia that provides medical application developers with a series of libraries for GPUs, such as computing, advanced visualization, and AI technology-related libraries. The smart ophthalmology oct system is also equipped with a smart contract API interface. Nodes on the chain can join federated machine learning training through this interface. The blockchain can continuously store the update records of each node in such an iterative process, realizing decentralization. It solves the shortcomings of traditional federated learning relying on a single server. The system selects temporary nodes at regular intervals to summarize and update the parameters, and feed the results back to the nodes and auxiliary systems.
实施例二Embodiment 2
如图3所示,一种医疗数据共享装置,包括:As shown in Figure 3, a medical data sharing device includes:
连接模块10,用于利用联盟链将多个数据孤岛连接起来;The connection module 10 is used to connect multiple data islands by using the alliance chain;
发布模块20,用于在联盟链上以智能合约的形式发布联邦机器学习任务;The publishing module 20 is used to publish federated machine learning tasks in the form of smart contracts on the alliance chain;
加入模块30,用于通过智能合约API接口将寻影助手加入联邦机器学习;The joining module 30 is used to add the shadow-seeking assistant to the federated machine learning through the smart contract API interface;
更新模块40,用于对寻影助手中各节点的医疗数据进行汇总更新。The updating module 40 is used for summarizing and updating the medical data of each node in the shadow-seeking assistant.
在本实施例中,通过区块链技术和联邦机器学习训练,实现了医疗数据的去中心化,解决了传统联邦学习依赖单一服务器的缺点。In this embodiment, the decentralization of medical data is realized through blockchain technology and federated machine learning training, which solves the disadvantage that traditional federated learning relies on a single server.
在本实施例中,寻影助手中的智能眼科OCT模块配置了智能合约API接口,链上的节点可通过此接口加入联邦机器学习训练,区块链可在这样的迭代过程中不断储存各节点的更新记录,实现了去中心化,解决了传统联邦学习依赖单一服务器的缺点。系统每隔一段时间选取临时节点进行参数的汇总更新,并将结果反馈回节点以及辅助系统。In this embodiment, the smart ophthalmology OCT module in the shadow-seeking assistant is configured with a smart contract API interface, and nodes on the chain can join federated machine learning training through this interface, and the blockchain can continuously store each node in such an iterative process. The update record of , achieves decentralization, and solves the shortcomings of traditional federated learning relying on a single server. The system selects temporary nodes at regular intervals to summarize and update the parameters, and feed the results back to the nodes and auxiliary systems.
如图4所示,在本实施例中,所述连接模块包括:As shown in Figure 4, in this embodiment, the connection module includes:
设置单元11,用于在所述多个数据孤岛中设置一一对应的多个分布式异步鉴别器;A setting unit 11 is used to set a plurality of distributed asynchronous discriminators in one-to-one correspondence in the plurality of data islands;
对抗单元12,用于将所述多个分布式异步鉴别器和一个临时中心生成器组成对抗网络;an adversarial unit 12, configured to form an adversarial network with the multiple distributed asynchronous discriminators and a temporary center generator;
训练单元13,用于通过所述临时中心生成器进行合成训练,生成与所述多个数据孤岛中的数据相近似的合成数据样本。The training unit 13 is configured to perform synthetic training through the temporary center generator to generate synthetic data samples that are similar to the data in the multiple data islands.
在本实施例中,在机器学习的模式上,智联平台引入了联邦机器学习,实现了多方协作训练,使得各企业机构在无需共享患者数据的情况下,针对共享模型开展协作,实现分散化的神经网络训练。In this embodiment, in the mode of machine learning, the Zhilian platform introduces federated machine learning to realize multi-party collaborative training, so that enterprises and institutions can collaborate on the shared model without sharing patient data to achieve decentralized Neural network training.
当获取训练任务后,智联平台通过将位于多个数据孤岛的分布式异步鉴别器和一个临时的中心生成器组成对抗网络,让临时中心生成器在不接触原始隐私数据的情况下,也能进行合成训练,从而能够生成与各数据孤岛中原始数据相近似的合成数据样本,供下游任务使用。在此基础上,还采用了2种损失函数,使得临时中心生成器具备一定的终身学习能力,可以在动态变化的数据孤岛环境中持续学习,比如学习过程中有新的机构加入或某些原有机构退出的情况,并从不同的数据孤岛中渐进地学习到同类数据甚至不同类数据的近似分布。由于避免了对原始数据的直接访问,智联平台秉承了联邦学习的核心优势,很好地解决了医疗数据的隐私保护问题。不仅如此,相较于传统的联邦学习,由于在实现方法上“另辟蹊径”,智联平台还有效减少了临时中心生成器与数据孤岛之间的通信数据量,仅需传输合成图像数据和反馈误差而非整个模型的所有参数数据,而且各数据孤岛之间无需交换任何数据或参数,因而可显著降低医疗机构之间通过联邦学习进行研究的成本,加快研究效率和AI模型的生产速度。After acquiring the training task, the Zhilian platform forms an adversarial network by combining distributed asynchronous discriminators located in multiple data islands and a temporary central generator, so that the temporary central generator can also conduct training without touching the original private data. Synthetic training, so that synthetic data samples that are similar to the original data in each data island can be generated for downstream tasks. On this basis, two loss functions are also used, so that the temporary center generator has a certain lifelong learning ability, and can continue to learn in the dynamically changing data island environment. There are cases where institutions drop out and learn incrementally from different data silos to approximate the distribution of homogeneous data or even different data. By avoiding direct access to the original data, the Zhilian platform adheres to the core advantages of federated learning and solves the problem of privacy protection of medical data well. Not only that, compared with the traditional federated learning, the Zhilian platform also effectively reduces the amount of communication data between the temporary center generator and the data island due to the "different approach" in the implementation method, only the transmission of synthetic image data and feedback errors are not required. Not all parameter data of the entire model, and there is no need to exchange any data or parameters between data islands, which can significantly reduce the cost of research through federated learning between medical institutions, and speed up research efficiency and AI model production.
在本实施例中,所述对寻影助手中各节点的医疗数据进行汇总更新具体为:In this embodiment, the summary and update of the medical data of each node in the shadow-seeking assistant is specifically:
通过分布式更新模型参数算法对寻影助手中各节点的医疗数据进行汇总更新。Through the distributed update model parameter algorithm, the medical data of each node in the shadow-seeking assistant is summarized and updated.
在本实施例中,所述寻影助手中设置有智能眼科OCT模块,所述智能眼科OCT模块具有OCT图像质量控制、AI病变筛查、AI病灶自动检测、OCT图像质量增强、AI辅助标注和多节点协同训练的功能。In this embodiment, an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has the functions of OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI auxiliary annotation and The function of multi-node collaborative training.
在本实施例中,各节点采用适用于节点分布式更新模型参数的算法,在无中心化参数服务器的支持下完成机器学习模型参数的分布式更新与收敛。在每个训练回合开始,会由智能合约随机选出一个节点作为临时服务器,来担当处理全部节点梯度信息的责任。智能合约会将临时服务器的公钥发送给参与训练的其他节点。每个节点会在本地训练模型,并获得梯度信息,并将梯度信息存储在本地,然后利用公钥加密存储路径,将加密的存储路径写入智能合约。随后,临时服务器从智能合约读取收集到的全部加密存储路径。临时服务器读取到加密路径后,会对它们进行解密,然后从相应节点处下载本回合的梯度信息。临时服务器获得全部的梯度信息后,会对梯度信息进行整合。最简单的整合方式就是将全部的梯度取平均值。整合之后,临时服务器将整合后的参数存储在本地,并将存储路径分别用全部节点的公钥分别进行加密后,再写入智能合约。各节点通过读取智能合约,获取由它自己的公钥进行加密的参数路径。解密参数路径后连接到临时服务器下载本轮更新的参数信息。因为每轮训练的临时服务器都是由智能合约随机选择得出,每一轮的临时服务器几乎都是不同的,所以没有任何一个节点能够获得全部的中间梯度信息。而每个节点只掌握极个别轮次的中间梯度,因此排除了由于中间梯度所导致的隐私泄露的风险。In this embodiment, each node adopts an algorithm suitable for nodes to update model parameters in a distributed manner, and completes the distributed update and convergence of machine learning model parameters with the support of a decentralized parameter server. At the beginning of each training round, a node will be randomly selected by the smart contract as a temporary server, which is responsible for processing the gradient information of all nodes. The smart contract will send the public key of the ephemeral server to other nodes participating in the training. Each node will train the model locally, obtain gradient information, store the gradient information locally, and then encrypt the storage path with the public key, and write the encrypted storage path into the smart contract. Subsequently, the ephemeral server reads all the collected encrypted storage paths from the smart contract. After the temporary server reads the encrypted paths, it decrypts them, and then downloads the gradient information of this round from the corresponding nodes. After the temporary server obtains all the gradient information, it will integrate the gradient information. The simplest way to integrate is to average all the gradients. After integration, the temporary server stores the integrated parameters locally, encrypts the storage paths with the public keys of all nodes, and writes them into the smart contract. Each node obtains the parameter path encrypted by its own public key by reading the smart contract. After decrypting the parameter path, connect to the temporary server to download the parameter information updated in this round. Because the temporary server for each round of training is randomly selected by the smart contract, the temporary server for each round is almost different, so no node can obtain all the intermediate gradient information. However, each node only grasps the intermediate gradients of very few rounds, thus eliminating the risk of privacy leakage caused by intermediate gradients.
在本实施例中,所述寻影助手中设置有智能眼科OCT模块,所述智能眼科OCT模块具有OCT图像质量控制、AI病变筛查、AI病灶自动检测、OCT图像质量增强、AI辅助标注和多节点协同训练的功能。In this embodiment, an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has the functions of OCT image quality control, AI lesion screening, AI lesion automatic detection, OCT image quality enhancement, AI auxiliary annotation and The function of multi-node collaborative training.
在本实施例中,使用该系统可以帮助眼科医生完成初步的眼底疾病筛查,大幅度节省病患的检查时间。智能眼科OCT结合了OCT眼底检查和AI病灶筛查,AI辅助标注系统也会在此过程中不断学习,并进行图像处理与标注,使得每一张新的标注影像都能够成为训练的数据,进一步改善模型的准确度。In this embodiment, using the system can help the ophthalmologist to complete the preliminary fundus disease screening, and greatly save the patient's examination time. Intelligent ophthalmology OCT combines OCT fundus examination and AI lesion screening. The AI-assisted labeling system will also continue to learn during this process, and perform image processing and labeling, so that each new labelled image can become training data. Further Improve the accuracy of the model.
在本实施例中,所述智能眼科OCT模块搭载在智联平台以及Clara上。Clara平台是Nvidia推出的一个软件工具包,能够提供医学应用开发人员一系列用于GPU的函式库,像是计算、进阶可视化、AI技术相关的函式库。智能眼科oct系统同时配置了智能合约API接口,链上的节点可通过此接口加入联邦机器学习训练,区块链可在这样的迭代过程中不断储存各节点的更新记录,实现了去中心化,解决了传统联邦学习依赖单一服务器的缺点。系统每隔一段时间选取临时节点进行参数的汇总更新,并将结果反馈回节点以及辅助系统。In this embodiment, the smart ophthalmic OCT module is mounted on the Zhilian platform and Clara. The Clara platform is a software toolkit launched by Nvidia that provides medical application developers with a series of libraries for GPUs, such as computing, advanced visualization, and AI technology-related libraries. The smart ophthalmology oct system is also equipped with a smart contract API interface. Nodes on the chain can join federated machine learning training through this interface. The blockchain can continuously store the update records of each node in such an iterative process, realizing decentralization. It solves the shortcomings of traditional federated learning relying on a single server. The system selects temporary nodes at regular intervals to summarize and update the parameters, and feed the results back to the nodes and auxiliary systems.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。It should be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or device comprising a series of elements includes not only those elements, It also includes other elements not expressly listed or inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。The above-mentioned serial numbers of the embodiments of the present invention are only for description, and do not represent the advantages or disadvantages of the embodiments.
以上仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above are only preferred embodiments of the present invention, and are not intended to limit the scope of the present invention. Any equivalent structure or equivalent process transformation made by using the contents of the description and drawings of the present invention, or directly or indirectly applied in other related technical fields , are similarly included in the scope of patent protection of the present invention.

Claims (10)

  1. 一种医疗数据共享方法,其特征在于,包括:A method for sharing medical data, comprising:
    利用联盟链将多个数据孤岛连接起来;Use consortium chains to connect multiple data silos;
    在联盟链上以智能合约的形式发布联邦机器学习任务;Publish federated machine learning tasks in the form of smart contracts on the consortium chain;
    通过智能合约API接口将寻影助手加入联邦机器学习;Add the shadow-seeking assistant to federated machine learning through the smart contract API interface;
    对寻影助手中各节点的医疗数据进行汇总更新。Summarize and update the medical data of each node in the shadow-seeking assistant.
  2. 根据权利要求1所述的一种医疗数据共享方法,其特征在于,所述利用联盟链将多个数据孤岛连接起来包括:The method for sharing medical data according to claim 1, wherein the connecting a plurality of data islands by using a consortium chain comprises:
    在所述多个数据孤岛中设置一一对应的多个分布式异步鉴别器;setting a one-to-one corresponding multiple distributed asynchronous discriminators in the multiple data islands;
    将所述多个分布式异步鉴别器和一个临时中心生成器组成对抗网络;forming an adversarial network of the plurality of distributed asynchronous discriminators and a temporary center generator;
    通过所述临时中心生成器进行合成训练,生成与所述多个数据孤岛中的数据相近似的合成数据样本。Synthetic training is performed by the temporary center generator to generate synthetic data samples that are similar to the data in the plurality of data islands.
  3. 根据权利要求1所述的一种医疗数据共享方法,其特征在于,所述对寻影助手中各节点的医疗数据进行汇总更新具体为:The method for sharing medical data according to claim 1, wherein the summarizing and updating the medical data of each node in the shadow-seeking assistant is specifically:
    通过分布式更新模型参数算法对寻影助手中各节点的医疗数据进行汇总更新。Through the distributed update model parameter algorithm, the medical data of each node in the shadow-seeking assistant is summarized and updated.
  4. 根据权利要求1所述的一种医疗数据共享方法,其特征在于,所述寻影助手中设置有智能眼科OCT模块,所述智能眼科OCT模块具有OCT图像质量控制、AI病变筛查、AI病灶自动检测、OCT图像质量增强、AI辅助标注和多节点协同训练的功能。The medical data sharing method according to claim 1, wherein an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has the functions of OCT image quality control, AI lesion screening, AI lesion Functions of automatic detection, OCT image quality enhancement, AI-assisted annotation and multi-node collaborative training.
  5. 根据权利要求4所述的一种医疗数据共享方法,其特征在于,所述智能眼科OCT模块搭载在智联平台上。The medical data sharing method according to claim 4, wherein the intelligent ophthalmology OCT module is mounted on a Zhilian platform.
  6. 一种医疗数据共享装置,其特征在于,包括:A medical data sharing device, comprising:
    连接模块,用于利用联盟链将多个数据孤岛连接起来;The connection module is used to connect multiple data islands with the alliance chain;
    发布模块,用于在联盟链上以智能合约的形式发布联邦机器学习任务;The publishing module is used to publish federated machine learning tasks in the form of smart contracts on the alliance chain;
    加入模块,用于通过智能合约API接口将寻影助手加入联邦机器学习;Add a module for adding the shadow-seeking assistant to federated machine learning through the smart contract API interface;
    更新模块,用于对寻影助手中各节点的医疗数据进行汇总更新。The update module is used to summarize and update the medical data of each node in the shadow-seeking assistant.
  7. 根据权利要求6所述的一种医疗数据共享装置,其特征在于,所述连接模块包括:A medical data sharing device according to claim 6, wherein the connection module comprises:
    设置单元,用于在所述多个数据孤岛中设置一一对应的多个分布式异步鉴别器;a setting unit for setting a one-to-one corresponding multiple distributed asynchronous discriminators in the multiple data islands;
    对抗单元,用于将所述多个分布式异步鉴别器和一个临时中心生成器组成对抗网络;an adversarial unit, configured to form an adversarial network of the plurality of distributed asynchronous discriminators and a temporary center generator;
    训练单元,用于通过所述临时中心生成器进行合成训练,生成与所述多个数据孤岛中的数据相近似的合成数据样本。A training unit, configured to perform synthetic training through the temporary center generator to generate synthetic data samples that are similar to the data in the plurality of data islands.
  8. 根据权利要求6所述的一种医疗数据共享装置,其特征在于,所述对寻影助手中各节点的医疗数据进行汇总更新具体为:The medical data sharing device according to claim 6, wherein the summarizing and updating the medical data of each node in the shadow-seeking assistant is specifically:
    通过分布式更新模型参数算法对寻影助手中各节点的医疗数据进行汇总更新。Through the distributed update model parameter algorithm, the medical data of each node in the shadow-seeking assistant is summarized and updated.
  9. 根据权利要求6所述的一种医疗数据共享方法,其特征在于,所述寻影助手中设置有智能眼科OCT模块,所述智能眼科OCT模块具有OCT图像质量控制、AI病变筛查、AI病灶自动检测、OCT图像质量增强、AI辅助标注和多节点协同训练的功能。The medical data sharing method according to claim 6, wherein an intelligent ophthalmology OCT module is set in the shadow-seeking assistant, and the intelligent ophthalmology OCT module has the functions of OCT image quality control, AI lesion screening, AI lesion Functions of automatic detection, OCT image quality enhancement, AI-assisted annotation and multi-node collaborative training.
  10. 根据权利要求9所述的一种医疗数据共享方法,其特征在于,所述智能眼科OCT模块搭载在智联平台上。The medical data sharing method according to claim 9, wherein the intelligent ophthalmology OCT module is mounted on a Zhilian platform.
PCT/CN2020/132615 2020-10-15 2020-11-30 Method and apparatus for sharing medical data WO2022077720A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011104552.9 2020-10-15
CN202011104552.9A CN112162959B (en) 2020-10-15 2020-10-15 Medical data sharing method and device

Publications (1)

Publication Number Publication Date
WO2022077720A1 true WO2022077720A1 (en) 2022-04-21

Family

ID=73867194

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/132615 WO2022077720A1 (en) 2020-10-15 2020-11-30 Method and apparatus for sharing medical data

Country Status (2)

Country Link
CN (1) CN112162959B (en)
WO (1) WO2022077720A1 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113011598B (en) * 2021-03-17 2023-06-02 深圳技术大学 Financial data information federation migration learning method and device based on blockchain
CN113033652B (en) * 2021-03-23 2023-03-24 电子科技大学 Image recognition system and method based on block chain and federal learning
CN117157651A (en) * 2021-04-23 2023-12-01 Oppo广东移动通信有限公司 Federal learning method, federal learning system, first device, and third device
CN113778966B (en) * 2021-09-15 2024-03-26 深圳技术大学 Cross-school information sharing method and related device for university teaching and course score
CN114154645B (en) * 2021-12-03 2022-05-17 中国科学院空间应用工程与技术中心 Cross-center image joint learning method and system, storage medium and electronic equipment
CN116682543B (en) * 2023-08-03 2023-11-10 山东大学齐鲁医院 Sharing method and system of regional rehabilitation information
CN117894437A (en) * 2023-12-29 2024-04-16 湖南红普创新科技发展有限公司 Integrated sharing method, device, equipment and storage medium for digital pathology data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108063752A (en) * 2017-11-02 2018-05-22 暨南大学 A kind of credible genetic test and data sharing method based on block chain and proxy re-encryption technology
WO2019086553A1 (en) * 2017-10-31 2019-05-09 Twinpeek Privacy management
CN111698322A (en) * 2020-06-11 2020-09-22 福州数据技术研究院有限公司 Medical data safety sharing method based on block chain and federal learning
CN111754473A (en) * 2020-06-17 2020-10-09 平安科技(深圳)有限公司 Abnormal image screening method, device and equipment for 3D image and storage medium

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11043307B2 (en) * 2013-03-15 2021-06-22 James Paul Smurro Cognitive collaboration with neurosynaptic imaging networks, augmented medical intelligence and cybernetic workflow streams
US11676008B2 (en) * 2018-09-27 2023-06-13 Google Llc Parameter-efficient multi-task and transfer learning
WO2020185973A1 (en) * 2019-03-11 2020-09-17 doc.ai incorporated System and method with federated learning model for medical research applications
CN110008696A (en) * 2019-03-29 2019-07-12 武汉大学 A kind of user data Rebuilding Attack method towards the study of depth federation
CN110460600B (en) * 2019-08-13 2021-09-03 南京理工大学 Joint deep learning method capable of resisting generation of counterattack network attacks
CN110827147A (en) * 2019-10-31 2020-02-21 山东浪潮人工智能研究院有限公司 Federal learning incentive method and system based on alliance chain
CN111370098B (en) * 2020-04-27 2024-08-23 北京贝叶科技有限公司 Pathological diagnosis system and method based on edge side computing and service device
CN111695675B (en) * 2020-05-14 2024-05-07 平安科技(深圳)有限公司 Federal learning model training method and related equipment
CN111695701B (en) * 2020-06-12 2021-08-13 上海富数科技有限公司 System for realizing data set construction processing based on federal learning and construction generation method thereof
CN111723948A (en) * 2020-06-19 2020-09-29 深圳前海微众银行股份有限公司 Federal learning method, device, equipment and medium based on evolution calculation
CN111563275B (en) * 2020-07-14 2020-10-20 中国人民解放军国防科技大学 Data desensitization method based on generation countermeasure network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019086553A1 (en) * 2017-10-31 2019-05-09 Twinpeek Privacy management
CN108063752A (en) * 2017-11-02 2018-05-22 暨南大学 A kind of credible genetic test and data sharing method based on block chain and proxy re-encryption technology
CN111698322A (en) * 2020-06-11 2020-09-22 福州数据技术研究院有限公司 Medical data safety sharing method based on block chain and federal learning
CN111754473A (en) * 2020-06-17 2020-10-09 平安科技(深圳)有限公司 Abnormal image screening method, device and equipment for 3D image and storage medium

Also Published As

Publication number Publication date
CN112162959B (en) 2023-10-10
CN112162959A (en) 2021-01-01

Similar Documents

Publication Publication Date Title
WO2022077720A1 (en) Method and apparatus for sharing medical data
CN111698322B (en) Medical data safe sharing method based on blockchain and federal learning
CN106209947B (en) Data processing method and system for decentralized autonomous organization
Lounis et al. Healing on the cloud: Secure cloud architecture for medical wireless sensor networks
WO2020024607A1 (en) Block chain maintenance method and device, server, and computer readable storage medium
Sun et al. A blockchain-based secure storage scheme for medical information
Liu et al. A blockchain-empowered federated learning in healthcare-based cyber physical systems
Somy et al. Ownership preserving AI market places using blockchain
CN110727737A (en) Intelligent medical data storage method based on multi-level block chain system architecture
CN109617699A (en) A kind of key generation method, block chain network service platform and storage medium
CN113177645A (en) Federal learning method and device, computing equipment and storage medium
CN113609508A (en) Block chain-based federal learning method, device, equipment and storage medium
CN113127916A (en) Data set processing method, data processing device and storage medium
CN112487103B (en) Deep learning data set trusted sharing system based on blockchain intelligent contract
JP6296630B1 (en) Distributed ledger system and program
Tian et al. Robust and privacy-preserving decentralized deep federated learning training: Focusing on digital healthcare applications
CN113268758B (en) Data sharing system, method, medium and device based on federal learning
CN112565289B (en) System and method for credible issuing and verifying of medical certificate based on block chain
KR20220027809A (en) Systems and Methods for Virtual Distributed Ledger Networks
CN106845075A (en) One kind concentrates diagnosis report system
Liang et al. EduChain: A highly available education consortium blockchain platform based on Hyperledger Fabric
CN113642034A (en) Medical big data safety sharing method and system based on horizontal and vertical federal learning
CN115169576B (en) Model training method and device based on federal learning and electronic equipment
CN113271345A (en) Method for collaboratively maintaining reliable data evidence based on alliance block chain manufacturing industry department
CN111311165A (en) Labor information management method and labor information alliance chain system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20957494

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20957494

Country of ref document: EP

Kind code of ref document: A1