CN108985011A - Genomic data management method and system based on blockchain technology


Info

Publication number
CN108985011A
Authority
CN
China
Prior art keywords
data
block chain
access
genomic data
genomic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810809959.8A
Other languages
Chinese (zh)
Inventor
李厦戎
王志波
孙兴强
郭登峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Genedock Technology Co Ltd
Original Assignee
Beijing Genedock Technology Co Ltd

Abstract

The invention discloses a genomic data management system and method based on blockchain technology. The management system comprises entities and a blockchain. The entities comprise data users and data providers. An entity is configured to register on the blockchain, undergo identity verification, and send initial registration information and genomic data to the blockchain; it is further configured to issue or obtain the access permission certificate and sharing certificate corresponding to the genomic data. The blockchain storage system in the blockchain stores the initial registration information, the genomic data, and the access permission certificate and sharing certificate corresponding to the genomic data. The blockchain service platform determines an access protocol according to the access permission certificate and sharing certificate, stores the access rights and access history recorded when an entity accesses the genomic data, generates log information, and provides proof of existence. The management system and method provided by the invention can improve the security of genomic data management systems and enable resource sharing.

Description

Genomic data management method and system based on blockchain technology
Technical field
The present invention relates to the field of computers, and in particular to a genomic data management method and system based on blockchain technology.
Background
In recent years, the prices of gene sequencing and analysis services have fallen steadily, so that these services have begun to reach ordinary households, and the management of genomic data has become very important. On the one hand, genomic data involves personal privacy; on the other hand, research on genomic data needs to be supported by large amounts of personal genomic data. At present, genomic data is managed separately by individuals or by individual enterprises and institutions. As a result, genomic data owners and genomic research institutions find it very difficult to share data, which hinders genomic data research.
In 2009, the emergence of Bitcoin introduced a new technology, the blockchain. Blockchain technology has brought new ideas to business data management in many existing industries. However, because the specific business of each industry is complex and blockchain technology is still immature, there is still some distance to go before blockchain technology takes root in every industry, and applying it to a given industry requires necessary modifications to its technical details.
Traditional genomic data management has the following problems:
Each data management institution maintains its own data rights management system and must invest substantial maintenance costs to keep it running. Once the data management system of a single institution is compromised, very serious security problems may result. Each institution manages its own data independently, so sharing data between institutions is difficult. Information is asymmetric between genomic data management institutions and the enterprises that provide genomic data services, so coordinating and sharing resources is difficult. The specific management practices of genomic data management institutions differ and each institution manages its data on its own, so the authenticity of access logs and the later traceability and auditing of those logs are problematic.
These shortcomings arise mainly because traditional genomic data management institutions are dispersed and independent, and the trust problem among the various genomic data institutions is difficult to solve. Consequently, traditional genomic data management systems have low security and cannot achieve resource sharing.
Summary of the invention
The object of the present invention is to provide a genomic data management method and system based on blockchain technology, so as to solve the problems that traditional genomic data management systems have low security and cannot achieve resource sharing.
To achieve the above object, the present invention provides the following solutions:
A genomic data management system based on blockchain technology, comprising entities and a blockchain;
The entities comprise data users and data providers. An entity is configured to register on the blockchain, undergo identity verification, and send initial registration information and genomic data to the blockchain. The initial registration information comprises an account, a password and a user identifier. The genomic data is the necessary summary information of the data material, which is obtained statistically from gene phenotype information or reference gene locus coordinate information. The entity is further configured to issue or obtain the access permission certificate and sharing certificate corresponding to the genomic data. The access permission certificate and the sharing certificate comprise an access count, the coordinate position of the accessed data, and a validity period;
The blockchain comprises a blockchain storage system and a blockchain service platform. The blockchain storage system is configured to store the initial registration information, the genomic data, and the access permission certificate and sharing certificate corresponding to the genomic data. The blockchain service platform is configured to determine an access protocol according to the access permission certificate and the sharing certificate, to store the access rights and access history recorded when the entity accesses the genomic data, to generate log information, and to provide proof of existence.
Optionally, the data users comprise genomic data analysis institutions and genomic data research institutions; when registering, a data user provides a digital certificate issued by a third party that proves the data user's real identity.
Optionally, the data providers comprise genomic data hosting institutions and individual sequencing participants; when registering, a data provider provides a user identifier.
Optionally, the entity is further configured to update and maintain the access permission certificate and the sharing certificate.
Optionally, the blockchain storage system comprises an on-chain data storage area, a cache data storage area and a genomic data storage area.
Optionally, the on-chain data storage area contains the genomic data, the initial registration information, gene data access history records, and the access permission certificates and sharing certificates corresponding to the genomic data;
The cache data storage area is kept in the buffer or on the disk of a blockchain node, and contains broadcast genomic data access requests and unconfirmed or unfinished data sharing records;
The genomic data storage area is kept in a genome database or genomic data warehouse, and contains raw genomic data whose size exceeds a storage-space threshold, or intermediate data and result data obtained from gene data analysis.
Optionally, the entities further comprise data accessors;
A data accessor is a genomic data analysis institution or a genomic data research institution.
A genomic data management method based on blockchain technology, comprising:
An entity registers on the blockchain, undergoes identity verification, and sends initial registration information and genomic data to the blockchain. The initial registration information comprises an account, a password and a user identifier. The genomic data is the necessary summary information of the data material, which is obtained statistically from gene phenotype information or reference gene locus coordinate information;
The blockchain stores the initial registration information, the genomic data, and the access permission certificate and sharing certificate corresponding to the genomic data. The access permission certificate and the sharing certificate comprise an access count, the coordinate position of the accessed data, and a validity period;
An access protocol is determined according to the access permission certificate and the sharing certificate, the access rights and access history recorded when the entity accesses the genomic data are stored, log information is generated, and proof of existence is provided.
Optionally, after the blockchain stores the initial registration information, the genomic data, and the access permission certificate and sharing certificate corresponding to the genomic data, the method further comprises:
Adding to, modifying or deleting the necessary summary information of the data material.
Optionally, the step in which the entity registers on the blockchain, undergoes identity verification, and sends initial registration information and genomic data to the blockchain specifically comprises:
Dividing the entities into data users and data providers;
Determining different registration conditions for different types of entity;
Registering according to the registration conditions, undergoing identity verification, and sending initial registration information and genomic data to the blockchain.
Optionally, after the access permission certificate and sharing certificate corresponding to the genomic data are issued, the method further comprises:
Updating and maintaining the access permission certificate and the sharing certificate.
Optionally, before the access protocol is determined according to the access permission certificate and the sharing certificate, the access rights and access history are stored, the log information is generated and the proof of existence is provided, the method further comprises:
Dividing the genomic data into genomic fragments;
Obtaining the hash value of each genomic fragment;
Constructing a Merkle tree from the hash values;
Storing the Merkle root of the Merkle tree on the blockchain.
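The four steps above (fragmenting, hashing, tree construction, storing the root) can be illustrated with a minimal Python sketch. The patent does not name a hash function, a fragment size, or a rule for levels with an odd number of nodes, so SHA-256, the fixed-size `split_fragments` helper and the "duplicate the last node" convention are all assumptions made for illustration only.

```python
import hashlib


def sha256(data: bytes) -> bytes:
    """Hash helper; SHA-256 is an assumed choice, not specified by the source."""
    return hashlib.sha256(data).digest()


def split_fragments(sequence: str, size: int) -> list[bytes]:
    """Split a sequence into fixed-size fragments (hypothetical fragmenting rule)."""
    return [sequence[i:i + size].encode() for i in range(0, len(sequence), size)]


def merkle_root(fragments: list[bytes]) -> bytes:
    """Hash each fragment, then hash adjacent pairs upward until one root remains."""
    if not fragments:
        raise ValueError("at least one fragment is required")
    level = [sha256(fragment) for fragment in fragments]
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # assumed convention: duplicate the last node
        level = [sha256(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]


# Only this 32-byte value would need to be written to the chain.
root = merkle_root(split_fragments("ACGTACGTAC", 4))
```

Because only the root is recorded, the bulk sequence data can stay in an ordinary database while any later change to a fragment still produces a different root.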
According to the specific embodiments provided by the present invention, the invention discloses the following technical effects. The present invention provides a genomic data management method and system based on blockchain technology in which the blockchain network nodes are operated by the enterprises that provide genomic data services, the data owners and stakeholders, the genomic data users and others, who jointly participate in, maintain and audit the access to and use of genomic data services, thereby ensuring the authenticity and reliability of the information recorded on the blockchain. The genomic data management method and system provided by the present invention can offer genomic data storage institutions and enterprises a unified and secure data sharing platform, reduce the cost each institution incurs in maintaining data management, and provide convenience to each institution. Meanwhile, blockchain technology is introduced with the necessary modifications to traditional blockchain technology, making the present invention better suited to deployment and use in real environments.
Brief description of the drawings
In order to explain the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed in the embodiments are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention; those of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a structural diagram of the genomic data management system provided by the present invention;
Fig. 2 is a workflow diagram provided by the present invention;
Fig. 3 is a schematic diagram of the underlying data storage organization provided by the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
The object of the present invention is to provide a genomic data management method and system based on blockchain technology that can improve the security of genomic data management systems and achieve resource sharing.
In order to make the above objects, features and advantages of the present invention clearer and easier to understand, the present invention is described in further detail below with reference to the drawings and specific embodiments.
The present invention is based on blockchain technology. Taking into account the traceability of the blockchain, the tamper resistance of its records and its decentralized trust mechanism, a genomic data management method and system based on blockchain technology is proposed. The present invention fully considers the advantages of blockchain technology and the shortcomings of traditional genomic data management, as well as the ways in which classical blockchain technology is ill-suited to the genomic data management setting, and makes appropriate modifications to the relevant mechanisms of classical blockchain technology so that the system is of greater practical value.
Fig. 1 is a structural diagram of the genomic data management system provided by the present invention. As shown in Fig. 1, the structure can be divided into two layers: the bottom layer is the blockchain, and the upper layer consists of the various entities that access the blockchain.
The blockchain, as the operating basis of the system, provides blockchain-specific services for genomic data management, such as the execution of smart contracts, trusted recording of genomic data usage, and trusted tracing at a later stage. To suit the way the relevant data is managed and used, the blockchain may be designed as a single chain or as multiple chains, and the data within a block may be organized as one or more Merkle trees. The present invention takes a traditional single chain with a single Merkle tree as an example. It should be noted that the Merkle tree used to prove the integrity and accuracy of genomic data is not the same Merkle tree as the one used to organize data within a block.
The blockchain comprises a blockchain storage system and a blockchain service platform. The blockchain storage system is configured to store the initial registration information, the genomic data, and the access permission certificate and sharing certificate corresponding to the genomic data. The blockchain service platform is configured to determine an access protocol according to the access permission certificate and the sharing certificate, to store the access rights and access history recorded when the entity accesses the genomic data, to generate log information, and to provide proof of existence.
The blockchain storage system comprises an on-chain data storage area, a cache data storage area and a genomic data storage area.
The on-chain data storage area contains the genomic data, the initial registration information, gene data access history records, and the access permission certificates and sharing certificates corresponding to the genomic data.
The cache data storage area is kept in the buffer or on the disk of a blockchain node, and contains broadcast genomic data access requests and unconfirmed or unfinished data sharing records. These items may eventually be discarded or may be added to the on-chain storage area; they are data that is meaningful only for a short time.
The genomic data storage area contains raw genomic data whose size exceeds a storage-space threshold, or intermediate data and result data obtained from gene data analysis.
The entities that access the platform at the upper layer mainly comprise data providers and data users. A data provider can generally provide two kinds of data: raw genomic data and the results of bioinformatics analysis. Data providers include genomic data hosting institutions, the sources of genomic data (sequencing participants), and so on. Data users generally include genomic data analysis institutions, research institutions working with genomic data, and so on.
The detailed workflow is shown in Fig. 2 and is described as follows:
1) User registration. A user here means any entity that accesses the blockchain. This blockchain-based genomic data management method is mainly concerned with a blockchain data management model for a consortium chain. User registration mainly implements authentication and management of user identity: whether an institution or an individual, a user must provide sufficient proof of identity to determine its role and the associated initial rights. To prevent illegitimate users from registering in order to use or damage the blockchain, a registration application is published to the blockchain as a broadcast, and the legitimate users of the whole blockchain network, together with the registration rules recorded on the blockchain, verify the validity of the registration and decide whether the requested initial rights are legitimate. An individual who is a source of genomic data need only provide simple anonymous information such as a user identifier, whereas a data user must provide the necessary proof of identity, such as a certificate issued by an authority.
User registration: every entity that accesses the genomic data management system provided by the present invention must undergo the necessary identity verification on the platform and record its initial registration information on the blockchain. Whether registration succeeds is decided by the registration rules already established on the blockchain and by a vote of the legitimate users on the blockchain. Any private information involved is encrypted using cryptographic methods. The users who take part in registration generally include:
1. Data owners and stakeholders: data owners are generally hosting institutions and the corresponding data sources (individuals who take part in sequencing). To protect the private data of such entities, and individual privacy in particular, these entities only need to provide information such as a user identifier when registering.
2. Data users: when registering, a data user must provide material that can prove its real identity, such as a digital certificate issued by a third party.
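The role-dependent registration conditions described above can be sketched in Python as follows. This is a simplified illustration only: the `RegistrationRequest` fields, the blacklist set and the simple-majority voting threshold are assumptions, and a real deployment would validate the third-party certificate cryptographically rather than merely checking that one was supplied.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Role(Enum):
    PROVIDER = "data_provider"  # hosting institutions, sequencing participants
    USER = "data_user"          # analysis and research institutions


@dataclass
class RegistrationRequest:
    account: str
    role: Role
    user_identifier: Optional[str] = None            # enough for providers
    third_party_certificate: Optional[bytes] = None  # required for data users


def meets_registration_condition(req: RegistrationRequest, blacklist: set[str]) -> bool:
    """Apply the role-specific rule; certificate validation itself is out of scope here."""
    if req.account in blacklist:
        return False
    if req.role is Role.PROVIDER:
        return bool(req.user_identifier)
    return bool(req.third_party_certificate)


def registration_accepted(req: RegistrationRequest, blacklist: set[str],
                          votes: list[bool]) -> bool:
    """Rule check plus a vote of existing users (simple majority is an assumption)."""
    return meets_registration_condition(req, blacklist) and sum(votes) * 2 > len(votes)
```

Keeping the rule check separate from the vote mirrors the description, in which both the recorded registration rules and the existing legitimate users must agree before a registration succeeds.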
The genomic data management system based on blockchain technology consists of the blockchain platform and the upper-layer access entities:
1. The blockchain platform provides blockchain-specific service functions, including a decentralized trust mechanism, recording of genomic data access and permissions, proof of existence, and so on.
2. The upper-layer access entities are divided into data providers and data applicants. A data provider generally provides two kinds of data content: raw data and bioinformatics analysis results. Data accessors generally include genomic data analysis institutions, research institutions working with genomic data, and so on; data providers generally include data hosting institutions and so on.
2) Acquisition of genomic data: registered users (including enterprises, institutions and individuals) can choose to share their own valid data resources conditionally or unconditionally; individual users can also use the data management services offered by the platform to share their personal genomic data conditionally.
Users are divided into two classes by role, and the registration process sets different registration conditions for each. The final outcome of registration is decided jointly by checks against content recorded on the blockchain, such as the blacklist, and by the users who have already completed registration.
After registering, each entity discloses its own data material conditionally or unconditionally, records the necessary summary information of that material (obtained statistically from gene phenotype information, reference gene locus coordinate information and the like) on the blockchain, and is responsible for adding to and modifying this summary information.
3) Data access rights management: a) A data provider that accesses the blockchain can generally set simple initial access rights for the data it owns, and record on the blockchain a brief overview of the data it is willing to share. b) A data user can query all relevant data overviews and apply for access to the data it is interested in. The application is also submitted through the blockchain, and the party controlling the data decides whether to reject or grant it. c) The data user and the data provider can also negotiate mutual data usage rights and publish the negotiated access permission certificate to the blockchain for storage. d) An entity that needs to apply for certain gene data can also broadcast to the blockchain network the specific phenotype gene data it needs, and users who hold relevant data can respond to the broadcast and provide it. To prevent too many applications that stakeholders have not yet reviewed from accumulating in the blockchain network, each data packet in the network should carry a validity timestamp; once the packet expires, the application is discarded.
For genomic data shared on the platform, the data owners and stakeholders generally sign and issue the various access permission certificates, and update and maintain them. Each certificate includes information such as the access count, the coordinate position of the accessed data, and the validity period. Every substantive data sharing action between entities is recorded on the blockchain.
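A certificate carrying an access count, a coordinate range and a validity period could be checked roughly as in the following Python sketch. The field names, the half-open coordinate interval and the use of Unix timestamps are assumptions for illustration; the patent lists only the kinds of information a certificate contains, and signature handling is omitted.

```python
from dataclasses import dataclass


@dataclass
class AccessCertificate:
    holder: str
    remaining_accesses: int  # access count still available
    region_start: int        # first permitted coordinate (inclusive)
    region_end: int          # end of permitted range (exclusive)
    valid_until: float       # validity period as a Unix timestamp

    def permits(self, holder: str, start: int, end: int, now: float) -> bool:
        """True if the request is by the holder, in range, in time, and within quota."""
        return (
            holder == self.holder
            and self.remaining_accesses > 0
            and now <= self.valid_until
            and self.region_start <= start < end <= self.region_end
        )

    def consume(self, holder: str, start: int, end: int, now: float) -> bool:
        """Check the request and, if allowed, use up one access."""
        if not self.permits(holder, start, end, now):
            return False
        self.remaining_accesses -= 1
        return True


cert = AccessCertificate("lab-1", remaining_accesses=2,
                         region_start=1000, region_end=2000, valid_until=100.0)
granted = cert.consume("lab-1", 1200, 1300, now=50.0)
```

In the system described here, each successful `consume` call would correspond to a substantive sharing action that is also written to the access history on the chain.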
The word "substantive" here merely distinguishes such actions from data that must be broadcast to the whole blockchain network but will not necessarily be recorded on the blockchain, such as the cached information in Fig. 3. If such data expires without a response, no substantive data sharing action is deemed to have occurred.
The present invention provides a decentralized trusted platform. The platform does not need to judge whether the two parties to a data sharing arrangement have actually shared data; it only needs to serve as a platform for parties who are willing to share data through it, and later to provide the various sharing log records.
Genomic data can be shared either by the data owner proactively issuing a sharing certificate, or by a data applicant submitting an access request that the data owner reviews, signs and answers by issuing a certificate.
A data user can obtain an access permission certificate in the following ways:
1. The data applicant actively searches the data overviews mentioned in step 2 and, using the information it supplied at registration, applies to the owner of the specific data (gene data that meets the applicant's requirements, that is, its phenotype requirements or reference gene locus coordinate requirements) for access rights.
"Specific data" refers to the specific gene data that the applicant applies for, that is, gene data that meets its locus coordinate requirements or phenotype information requirements.
The specific data is identified as follows: the summary information contains statistics on the data that a data owner is willing to share. The applicant obtains the summary information of each data owner by querying the blockchain and, based on its own data requirements, can determine which data owners hold the data it needs. These are the data owners that hold the specific data.
2. The data applicant broadcasts the phenotype information of the data it needs to the blockchain network, and the data owner decides whether to issue an access permission certificate based on the registration information the applicant provided and the fee it is willing to pay.
3. The data user and the data provider sign a long-term access permission certificate, which is stored on the blockchain; each access request is judged valid as long as the certificate is still valid.
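The first route, in which an applicant scans the on-chain summary information to find owners of the specific data, might look like the Python sketch below. The `DataSummary` structure, with a set of phenotype labels and a list of (chromosome, start, end) regions, is a hypothetical representation; the patent states only that summaries are derived from phenotype information or reference locus coordinates.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class DataSummary:
    owner: str
    phenotypes: set[str] = field(default_factory=set)
    # Each region is (chromosome, start, end); this layout is an assumption.
    regions: list[tuple[str, int, int]] = field(default_factory=list)


def owners_with_data(summaries: list[DataSummary],
                     phenotype: Optional[str] = None,
                     region: Optional[tuple[str, int, int]] = None) -> list[str]:
    """Return owners whose summary satisfies every requirement that was given."""
    matches = []
    for summary in summaries:
        if phenotype is not None and phenotype not in summary.phenotypes:
            continue
        if region is not None:
            chrom, start, end = region
            covered = any(c == chrom and r_start <= start and end <= r_end
                          for c, r_start, r_end in summary.regions)
            if not covered:
                continue
        matches.append(summary.owner)
    return matches


summaries = [
    DataSummary("owner-a", {"height"}, [("chr1", 0, 5000)]),
    DataSummary("owner-b", {"eye_color"}, [("chr2", 100, 900)]),
]
candidates = owners_with_data(summaries, region=("chr2", 200, 300))
```

The applicant would then send an access application to each owner in `candidates`, who decides whether to sign a certificate.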
4) Provision of related services: the management platform mainly manages genomic data; the services themselves are provided by the enterprises and organizations that access the platform. They mainly include genome sequencing services provided by gene sequencing companies, data hosting services provided by genomic data hosting companies, data analysis services provided by genomic data analysis companies, and other services provided by other genomic data companies and research institutions. For the agreements and other important documents signed in the course of using these services, the service provider and the service user can store digests of the agreements or documents on the blockchain, and can also sign and deploy smart contracts based on agreed conditions.
Provision of genomic data services: the institutions that access the platform can provide services to one another or to ordinary users. Digests of the agreements signed by the service provider and the service user, and of the necessary documents produced during the service, are saved on the blockchain; a signed service agreement can also be written as a smart contract and deployed on the blockchain.
5) Data integrity and accuracy control mechanism: after obtaining data access rights, the data user receives from the data provider the data content together with the corresponding Merkle tree branch. Using this branch and the Merkle root recorded on the blockchain, the data user can check the integrity and accuracy of the data it has obtained. For gene fragments specified by position coordinates, a smart contract automatically computes the digest of the corresponding data and sends it to the data accessor together with the data content.
Data integrity and accuracy can be controlled in two ways:
1. To guarantee the integrity and accuracy of shared data, each entity that holds data divides its data content into fragments according to the specific meaning of the genomic data, organizes the hash values of the fragments into a Merkle tree, and stores the Merkle root on the blockchain as the basis for later verification of data integrity and accuracy.
2. After the data applicant obtains access permission, a smart contract automatically queries the corresponding data, computes the hash value of the query result, and sends both, encrypted, to the data user.
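The first mode can be illustrated from the data user's side: given a fragment, its Merkle branch and the root read from the chain, the user recomputes the path to the root. The sketch below assumes SHA-256 and a branch encoded as (sibling hash, side) pairs; neither detail is specified in the patent.

```python
import hashlib


def sha256(data: bytes) -> bytes:
    """Hash helper; SHA-256 is an assumed choice."""
    return hashlib.sha256(data).digest()


def verify_branch(fragment: bytes, branch: list[tuple[bytes, str]], root: bytes) -> bool:
    """Recompute the path from a fragment up to the root.

    Each branch entry is (sibling_hash, side): side is "L" when the sibling
    sits to the left of the running hash, and "R" when it sits to the right.
    """
    current = sha256(fragment)
    for sibling, side in branch:
        if side == "L":
            current = sha256(sibling + current)
        else:
            current = sha256(current + sibling)
    return current == root


# Build a four-leaf tree by hand to exercise the check.
leaves = [sha256(x) for x in (b"ACGT", b"TTGA", b"CCAT", b"GGTA")]
left = sha256(leaves[0] + leaves[1])
right = sha256(leaves[2] + leaves[3])
on_chain_root = sha256(left + right)

ok = verify_branch(b"CCAT", [(leaves[3], "R"), (left, "L")], on_chain_root)
```

The branch grows with the logarithm of the number of fragments, so a user can verify one fragment without downloading the rest of the data set.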
6) Later tracing and auditing: the blockchain stores all important information produced during the allocation of genomic data rights and the use of the data, with sensitive private information stored encrypted using cryptographic methods. For any genomic data brought into the system, its usage records and all corresponding authorizations can later be traced. For the agreements and other important documents signed by service providers and service users, digests are stored on the blockchain, and the digests recorded there provide proof of existence for these documents.
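Proof of existence by digest can be sketched as follows. The `DigestLedger` class is a hypothetical in-memory stand-in for the on-chain record, and SHA-256 is an assumed digest function; the point is only that the document itself never needs to be stored on the chain.

```python
import hashlib
from typing import Optional


class DigestLedger:
    """Append-only list standing in for digests recorded on the blockchain."""

    def __init__(self) -> None:
        self._entries: list[tuple[str, float]] = []

    def record(self, document: bytes, timestamp: float) -> str:
        """Store only the digest of the document together with a timestamp."""
        digest = hashlib.sha256(document).hexdigest()
        self._entries.append((digest, timestamp))
        return digest

    def proof_of_existence(self, document: bytes) -> Optional[float]:
        """Return the earliest time this exact document was recorded, or None."""
        digest = hashlib.sha256(document).hexdigest()
        times = [t for d, t in self._entries if d == digest]
        return min(times) if times else None


ledger = DigestLedger()
ledger.record(b"service agreement v1", timestamp=10.0)
recorded_at = ledger.proof_of_existence(b"service agreement v1")
```

A party that later presents the original agreement can show it existed at `recorded_at`, while any altered copy hashes to a digest that is absent from the ledger.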
The underlying data storage organization is shown in Fig. 3:
On-chain content:
1) The registration information of each entity (including data providers and data applicants) that accesses the management platform;
2) The initial rights allocation and brief overview that a data provider stores on the blockchain for its genomic data;
3) Authorization certificates that data owners proactively issue to relevant medical institutions and the like;
4) Authorization certificates obtained when a data user submits a data access application and the data owner approves it by signature;
5) Records of later modifications to the various authorization certificates;
6) Records of gene data usage, including necessary information such as the accessor and the access time;
7) Digests of the agreements and other important documents signed by service providers and service users, as well as deployed smart contracts and so on;
8) The hash value of each Merkle root of the gene data digests.
The underlying data storage is divided into three parts:
1. On-chain data: user registration information, permission allocation update records, data usage records and similar information recorded in the blockchain.
2. Cached data: data that is meaningful only for a short time. For example, the data access request initiated by a data user, as mentioned in step 3, is kept in the cache while it has not expired and has not been authorized. In other words, data that has not yet led to a substantive permission change or an actual data-sharing event is stored in the cache until it either expires and is discarded, or is authorized. Some advertisement-like information that is never meant to be saved to the blockchain also belongs to the cached data.
3. Genomic data: because of its very large volume, genomic data is still stored in a database, data warehouse or similar system; only the information needed to verify its integrity and accuracy is recorded on the blockchain.
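The three-part split above can be summarized as a routing table. This is only an illustrative sketch: the record kind names and the `route` helper are hypothetical labels for the categories listed in the text, not identifiers from the patent.

```python
from enum import Enum


class Tier(Enum):
    ON_CHAIN = "on-chain data"
    CACHE = "cached data"
    GENOME_STORE = "genome database or data warehouse"


# Hypothetical record kinds mapped to the storage part described above.
ROUTES = {
    "registration": Tier.ON_CHAIN,
    "permission_update": Tier.ON_CHAIN,
    "usage_record": Tier.ON_CHAIN,
    "integrity_digest": Tier.ON_CHAIN,
    "pending_access_request": Tier.CACHE,
    "broadcast": Tier.CACHE,
    "raw_genome": Tier.GENOME_STORE,
}


def route(kind: str) -> Tier:
    """Return the storage tier for a record kind, or fail loudly."""
    try:
        return ROUTES[kind]
    except KeyError:
        raise ValueError(f"unknown record kind: {kind}") from None


for kind in ("usage_record", "pending_access_request", "raw_genome"):
    print(kind, "->", route(kind).value)
```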
Cache contents:
The cache mainly holds unexpired data access applications, information broadcast by parties seeking data, and other content with short-term relevance. The main reason is that blockchain storage is a scarce resource: data that matters only briefly and has little value for later tracing is not suitable for recording in the blockchain.
Entities accessing the platform may also keep frequently accessed blockchain data as cached data, which reduces the computational cost of querying the original blockchain data.
Note that although the cache contents here resemble the transaction cache pool of a classical blockchain, the two are fundamentally different.
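The life cycle of a cached access request (held until it expires and is discarded, or until it is authorized and recorded) can be sketched as follows. The class `RequestCache`, its time-to-live parameter, and the use of a plain list as the chain are assumptions made for illustration.

```python
import time


class RequestCache:
    """Holds pending access requests until they expire or are authorized."""

    def __init__(self, ttl_seconds: float):
        self.ttl = ttl_seconds
        self._pending = {}  # request_id -> (request, expiry timestamp)

    def submit(self, request_id, request, now=None):
        now = time.time() if now is None else now
        self._pending[request_id] = (request, now + self.ttl)

    def purge_expired(self, now=None):
        """Drop expired requests and return their ids."""
        now = time.time() if now is None else now
        expired = [rid for rid, (_, exp) in self._pending.items() if exp <= now]
        for rid in expired:
            del self._pending[rid]
        return expired

    def authorize(self, request_id, chain, now=None):
        """Move an unexpired request from the cache onto `chain`."""
        now = time.time() if now is None else now
        entry = self._pending.pop(request_id, None)
        if entry is None or entry[1] <= now:
            return False  # unknown or expired: nothing is written on chain
        chain.append({"request": entry[0], "authorized_at": now})
        return True


chain = []  # toy stand-in for on-chain records
cache = RequestCache(ttl_seconds=60)
cache.submit("req-1", {"user": "lab-A", "range": (1000, 2000)}, now=0)
cache.submit("req-2", {"user": "lab-B", "range": (3000, 4000)}, now=0)
print(cache.authorize("req-1", chain, now=30))
print(cache.purge_expired(now=61))
print(len(chain))
```

Only the authorized request reaches the chain; the request that expires leaves no on-chain trace, which matches the stated goal of conserving blockchain storage.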
Genomic data content:
Genomic data generally occupies a large amount of storage, so it is kept in the traditional way, in a database or data warehouse, and divided into regions according to their specific meaning to make querying easier. To guarantee the quality of the data that a data user obtains, the digest generated for each region of the genomic data must be recorded on the blockchain. Given the complexity of genomic data, these digests are organized as a Merkle tree, and only the value of the Merkle root is stored on the blockchain.
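A minimal sketch of how per-region digests could be folded into a single Merkle root is given below. The text does not specify the hash function or how an odd number of nodes is handled; SHA-256 and duplication of the last node are assumptions, and the region contents are made-up placeholders.

```python
import hashlib
from typing import List


def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_root(region_digests: List[bytes]) -> bytes:
    """Fold a list of region digests into one Merkle root."""
    if not region_digests:
        raise ValueError("at least one region digest is required")
    level = list(region_digests)
    while len(level) > 1:
        if len(level) % 2 == 1:
            # Assumed convention: pair an unmatched last node with itself.
            level.append(level[-1])
        level = [
            sha256(level[i] + level[i + 1])
            for i in range(0, len(level), 2)
        ]
    return level[0]


# Placeholder regions; real regions would be slices of the stored genome.
regions = [b"ACGTACGT", b"TTGACCA", b"GGCATA"]
root = merkle_root([sha256(r) for r in regions])
print(root.hex())  # the only value that would need to go on chain
```

Because any change to a region changes its digest and therefore the root, a data user can recompute the root from the regions received and compare it with the on-chain value.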
The embodiments in this specification are described progressively: each embodiment focuses on its differences from the others, and the same or similar parts of the embodiments may be cross-referenced. Because the system disclosed in the embodiments corresponds to the method disclosed in the embodiments, its description is relatively brief; see the method description for the relevant details.
Specific examples are used herein to explain the principles and implementation of the invention. The description of the above embodiments is intended only to help in understanding the method of the invention and its core idea. For those of ordinary skill in the art, the specific implementation and the scope of application may vary in accordance with the idea of the invention. In summary, the content of this specification should not be construed as limiting the invention.

Claims (12)

1. A genomic data management system based on blockchain technology, characterized by comprising: an entity and a blockchain;
The entity includes a data consumer and a data provider; the entity is configured to register on the blockchain, undergo identity verification, and send initialization registration information and genomic data to the blockchain; the initialization registration information includes an account, a password and a user identifier; the genomic data is necessary summary information of the data information; the necessary summary information of the data information is obtained by statistics based on gene table information or reference gene locus coordinate information; the entity is further configured to issue or obtain the access permission certificate and shared certificate corresponding to the genomic data; the access permission certificate and the shared certificate include the number of accesses, the coordinate position of the accessed data, and the validity period;
The blockchain includes a blockchain storage system and a blockchain service platform; the blockchain storage system is configured to store the initialization registration information, the genomic data, and the access permission certificate and shared certificate corresponding to the genomic data; the blockchain service platform is configured to determine an access protocol according to the access permission certificate and the shared certificate, to store the access permissions and access history records when the entity accesses the genomic data, and to generate log information and provide proof of existence.
2. The genomic data management system according to claim 1, characterized in that the data consumer includes genomic data analysis institutions and genomic data research institutions; the data consumer provides a digital certificate, issued by a third party, that proves the authenticity of the data consumer's identity.
3. The genomic data management system according to claim 1, characterized in that the data provider includes genomic data hosting institutions and individual sequencing participants; the data provider is configured to provide the user identifier used when the data consumer registers.
4. The genomic data management system according to claim 1, characterized in that the entity is further configured to update and maintain the access permission certificate and the shared certificate.
5. The genomic data management system according to claim 1, characterized in that the blockchain storage system includes an on-chain data storage area, a cached data storage area and a genomic data storage area.
6. The genomic data management system according to claim 5, characterized in that the on-chain data storage area includes the genomic data, the initialization registration information, gene data access history records, and the access permission certificate and shared certificate corresponding to the genomic data;
The cached data storage area is stored in the buffer or on the disk of a blockchain node, and includes genomic data access application broadcast messages and unconfirmed or unfinished data sharing records;
The genomic data storage area is stored in a genome database or genomic data warehouse, and includes raw genome data, or intermediate data and result data of gene data analysis, whose size exceeds a storage space threshold.
7. The genomic data management system according to claim 3, characterized in that the entity further includes a data accessor;
The data accessor is a genomic data analysis institution or a genomic data research institution.
8. A genomic data management method based on blockchain technology, characterized by comprising:
An entity registers on the blockchain, undergoes identity verification, and sends initialization registration information and genomic data to the blockchain; the initialization registration information includes an account, a password and a user identifier; the genomic data is necessary summary information of the data information; the necessary summary information of the data information is obtained by statistics based on gene table information or reference gene locus coordinate information;
The blockchain stores the initialization registration information, the genomic data, and the access permission certificate and shared certificate corresponding to the genomic data; the access permission certificate and the shared certificate include the number of accesses, the coordinate position of the accessed data, and the validity period;
An access protocol is determined according to the access permission certificate and the shared certificate, the access permissions and access history records when the entity accesses the genomic data are stored, and log information is generated and proof of existence is provided.
9. The genomic data management method according to claim 8, characterized in that, after the blockchain stores the initialization registration information, the genomic data, and the access permission certificate and shared certificate corresponding to the genomic data, the method further comprises:
Adding to, modifying or deleting the necessary summary information of the data information.
10. The genomic data management method according to claim 8, characterized in that the entity registering on the blockchain, undergoing identity verification, and sending initialization registration information and genomic data to the blockchain specifically comprises:
Dividing the entity into data consumers and data providers;
Determining different registration conditions according to the different types of entity;
Registering according to the registration conditions, undergoing identity verification, and sending initialization registration information and genomic data to the blockchain.
11. The genomic data management method according to claim 8, characterized in that, after the access permission certificate and shared certificate corresponding to the genomic data are issued, the method further comprises:
Updating and maintaining the access permission certificate and the shared certificate.
12. The genomic data management method according to claim 8, characterized in that, before determining the access protocol according to the access permission certificate and the shared certificate, storing the access permissions and access history records when the entity accesses the genomic data, and generating log information and providing proof of existence, the method further comprises:
Dividing the genomic data into genomic fragments;
Obtaining the hash value of each genomic fragment;
Constructing a Merkle tree from the hash values;
Storing the Merkle root of the Merkle tree on the blockchain.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810809959.8A CN108985011A (en) 2018-07-23 2018-07-23 A kind of genomic data management method and system based on block chain technology

Publications (1)

Publication Number Publication Date
CN108985011A (en) 2018-12-11

Family

ID=64550610

Country Status (1)

Country Link
CN (1) CN108985011A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021170049A1 (en) * 2020-02-29 2021-09-02 华为技术有限公司 Method and apparatus for recording access behavior

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407795A (en) * 2016-09-05 2017-02-15 北京众享比特科技有限公司 Data existence authentication system, authentication method and verification method
CN106682530A (en) * 2017-01-10 2017-05-17 杭州电子科技大学 Method and device for medical information sharing privacy protection based on blockchain technology
CN106796688A (en) * 2016-12-26 2017-05-31 深圳前海达闼云端智能科技有限公司 Permission control method, device and system of block chain and node equipment
CN107391944A (en) * 2017-07-27 2017-11-24 北京太云科技有限公司 A kind of electronic health record shared system based on block chain
CN107450979A (en) * 2017-03-28 2017-12-08 阿里巴巴集团控股有限公司 A kind of block chain common recognition method and device
CN107769925A (en) * 2017-09-15 2018-03-06 山东大学 Public key infrastructure system and its certificate management method based on block chain
CN108023894A (en) * 2017-12-18 2018-05-11 苏州优千网络科技有限公司 Visa information system and its processing method based on block chain
CN108092982A (en) * 2017-12-22 2018-05-29 广东工业大学 A kind of date storage method and system based on alliance's chain

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
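Xue Tengfei et al.: "Research on a Blockchain-Based Medical Data Sharing Model", Acta Automatica Sinica *
Chen Ye et al.: "A Survey of Blockchain-Based Network Security Technology", Telecommunications Science *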

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181211