CN102025482B - Construction method of virtual genome-based cryptosystem (VGC) - Google Patents

Construction method of virtual genome-based cryptosystem (VGC) Download PDF

Info

Publication number
CN102025482B
CN102025482B CN200910192218.0A CN200910192218A CN102025482B CN 102025482 B CN102025482 B CN 102025482B CN 200910192218 A CN200910192218 A CN 200910192218A CN 102025482 B CN102025482 B CN 102025482B
Authority
CN
China
Prior art keywords
virtual
vgdb
vdmc
dna
dna sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN200910192218.0A
Other languages
Chinese (zh)
Other versions
CN102025482A (en
Inventor
曾纪晴
张明永
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China Botanical Garden of CAS
Original Assignee
South China Botanical Garden of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China Botanical Garden of CAS filed Critical South China Botanical Garden of CAS
Priority to CN200910192218.0A priority Critical patent/CN102025482B/en
Publication of CN102025482A publication Critical patent/CN102025482A/en
Application granted granted Critical
Publication of CN102025482B publication Critical patent/CN102025482B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention relates to information security technology, in particular to a virtual genome-based cryptosystem (VGC). The cryptosystem is provided with two matched keys, of which one is a virtual genome database (VGDB) consisting of random deoxyribonucleic acid (DNA) sequences and the other one is a position table that virtual genes of the VGDB are randomly distributed in a two-dimensional microarray, namely a virtual DNA microarray chip (VDMC). Any plaintext information can be freely written on the VDMC, namely points for forming the plaintext information are selected from the VDMC microarray. The selected points correspond to the virtual genes in the VGDB; small segments of DNA sequences are randomly selected from the virtual genes; and the uniqueness of the small segments of DNA sequences in the VGDB is determined by using a common tool of the bioinformatics, namely a basic local alignment search tool (BLAST), or other character string search algorithms such as a Knuth-Morris-Pratt (KMP) algorithm and the like. A cipher text is combined by the small segments of DNA sequences. The small segments of DNA sequences need only to perform BLAST on the VGDB during decryption, namely the points for forming the plaintext information can be discovered, and the plaintext information can be restored according to the VDMC. Any non-VGDB sequence can be randomly inserted into the cipher text and does not have any influence on the encryption. Thus, the VGC is an excellent information hiding system. In addition, the VGC key can be updated automatically so as to realize an indecipherable one-time-pad system. The cryptosystem is used for real-time quick secret information communication, digital signature and identity authentication.

Description

A kind of building method based on virtual genomic cryptographic system (VGC)
Technical field
The invention belongs to information security field, particularly relate to a kind of based on virtual genomic cryptographic system.
Background technology
Cryptographic system is the core of information security.Traditional cryptographic system is substantially all the cryptographic system that depends on mathematics, that is to say that they only have the fail safe in mathematical computations.At present only there is the cryptographic system of so-called one-time pad to there is the fail safe that can not decode in theory.But in fact the cryptographic system of one-time pad, owing to having secret key management and assignment problem, is therefore difficult to application.Cryptologists are being explored new cryptographic system now, such as quantum cryptography and DNA password.
Because DNA has ultra-large concurrency, ultralow energy consumption and the memory capacity of super-high density, DNA has now been used to the fields such as calculating, data storing and cryptography.The DNA password maturation that is all far from theory and practice, effectively DNA cryptographic system few.At present, DNA cryptographic system is not paid attention to coding method especially, often adopt G, A, T, tetra-basic group letter coding binary information of C or the out of Memory such as 1 English alphabet, numeral or punctuate of encoding with 3 basic group letters, this makes as easy as rolling off a log being cracked of this coded system itself.The fail safe of DNA cryptographic system does not just lie in this coded system naturally yet, and is the biology difficult problem of DNA operation itself.Such as Celland etc. has invented micro-some technology of a kind of DNA.Gehani etc. have designed an one-time cryptosystem scheme based on DNA, although this scheme can not be decoded in theory, but also there is obvious defect: not only cost is huge to build such One-time pad DNA, encryption and decryption are wasted time and energy, and the pollution of DNA processing ease, thereby cause this scheme in fact unavailable.
In fact, the existing cryptographic system based on DNA is all directly information " to be write " to DNA molecular, a little less than this coded system is highly brittle, and biologic operation all will be through complex processes such as synthetic, the extraction of DNA, clone, pcr amplification, DNA sequencing, molecular hyridizations, need expensive laboratory apparatus, also waste time and energy simultaneously, have in addition DNA pollution, PCR and order-checking to have certain probability of makeing mistakes, there is no positive control problems such as (such as not knowing in advance to carry sequence and the clip size of the DNA molecular of message).The DNA molecular that DNA cryptographic system is transmitted can only rely on physical transportation mode and transmit, can not transmit by electric wire, optical fiber and wireless channel, thereby can not be for real time communication, can only be for large-scale parallel data encryption, the secure data storage not high to requirement of real-time, and the cryptographic applications field such as authentication, digital signature and Information hiding.
Summary of the invention
The object of the invention is to set up a kind of based on virtual genomic novel cipher system, this system has been utilized BLAST (the Basic Local Alignment Search Tool) algorithm of bioinformatics, avoided the directly biologic operation to DNA molecular, not only can reach " one-time pad " the such fail safe that can not decode in theory, can on electronics and the network media, communicate real-time again, equally also can be for digital signature and authentication.
In order to reach object of the present invention, " based on virtual genomic cryptographic system " takes following steps and method (Fig. 1):
(1) first, produce a large amount of random dna sequences, and random number or name (each random dna sequence numbering or virtual gene of called after), each virtual gene adopts FASTA form, all these virtual genomic constitutions " virtual genome ", generate " virtual genome database " (Virtual Genome Database, VGDB).Also can as required, easily " virtual genome database " be converted to " virtual Protein Data Bank " (Virtual Protein Database, VPDB).
(2) then, all virtual gene Random assignment in " virtual genome database " is tieed up on array table at one 2, obtain a virtual gene distribution position table, this table is " virtual genetic chip " (Virtual DNA Microarray Chip, VDMC).
(3) information sender can be at " virtual genetic chip " (VDMC) upper " writing " any secret information that will transmit, and this " writing " is exactly to select for forming " point " of secret information having on the VDMC of huge dot matrix.
(4) due to each point on VDMC corresponding to " virtual genome database " a virtual gene in (VGDB), therefore on VDMC, select the point that forms secret information, be exactly to have found out the virtual gene in VGDB accordingly.
(5) then, the virtual gene of finding out from each, select randomly a DNA sequence dna small fragment, utilize local blast program to carry out similarity searching to VGDB, confirm that this small fragment DNA sequence dna of random choose is unique in VGDB.That is to say, this small fragment DNA sequence dna is one to one with the virtual gene of selecting, and in other virtual gene, there is no identical DNA sequence dna.
(6) by all these random chooses DNA sequence dna out in no particular order sequentially random combine get up (each small fragment DNA sequence dna comes as space, branch etc. separate by comma or other interval mode) just become cipher-text information.
(7) receiving party receives after ciphertext, by separate each small fragment DNA sequence dna coming in modes such as commas, VGDB is carried out to BLAST, find out each self-corresponding virtual gene, then on VDMC these virtual genetic markers out, secret information has just displayed naturally.
In order to strengthen security performance, can take following two kinds of methods:
(1) random dna sequence of the non-VGDB database of radom insertion in ciphertext.Because the random dna sequence of the non-VGDB database in ciphertext is analyzed and can not obtained any result through BLAST, so they can not affect decrypted result.The characteristic of this insertion redundant DNA sequence that can be random, makes this cryptographic system have unlimited secret key space.
(2) secret key can self, and VGC cryptographic system becomes the cryptographic system of one-time pad.Such as, i the position of the random small fragment DNA sequence dna of selecting in certain virtual gene, and this virtual gene is capable in m on VDMC, n row.So, after encrypting or deciphering, random small fragment sequence is deleted and is again inserted into (i+j) individual position from the i position of virtual gene, and the m of the virtual gene after rearrangement from VDMC is capable, n column position is deleted and is also again inserted on (m+a) row, (n+b) column position.J, a, numerical value transmit leg and the recipient of b can freely arrange.Therefore,, after each encryption and decryption process completes, secret key is just upgraded once automatically.Like this, VGC cryptographic system has just become the cryptographic system of one-time pad.
The present invention has following outstanding feature:
1, the secret key of this cryptographic system comprises VGDB and two secret keys of mutually mating of VDMC.VGDB is that a huge random dna sequence is virtual genome database, has huge secret key space; VDMC is virtual gene random distribution 2 dimension array tables in a VGDB.
2, secret key completely random, ciphertext is completely random also.
3, the algorithm of this cryptographic system is very simple, adopts the similarity searching program BLAST of DNA sequence dna.Also can adopt other string search algorithm, such as Knuth-Morris-Pratt (KMP) algorithm.
4, ciphertext can arbitrarily be inserted the random dna sequence of non-VGDB database.In addition, because virtual gene can be by a plurality of small fragment DNA sequence dna institutes " representative ", and also can radom insertion redundant DNA sequence in ciphertext, thereby same plaintext uses same secret key to encrypt, and obtains diverse ciphertext.Expressly and between ciphertext there is not relation one to one, thereby by expressly or ciphertext, even expressly with ciphertext to causing effective attack to this cryptographic system.
5, secret key can self, becomes the cryptographic system of one-time pad.Share after a secret key send and receive side, can upgrade by continuous secret key, obtains a brand-new secret key, can the perfect sharing problem for the first time that solves symmetric cryptosystem secret key.That is to say, the shared for the first time of secret key not necessarily will pass through back channel.
6, due to information encryption process " writing " through VGMC, cleartext information can be write in any form on VGMC, comprises image format, cleartext information is encoded into other password form (Fig. 2), personal letter form etc.Therefore this cryptographic system can easily realize the secondary coding of information.
7, this cryptographic system is not only suitable for top-secret confidential corespondence, also can be used for digital signature and authentication.From the 6th feature, this cryptographic system can realize real " handwritten " digital signature and authentication.Due to native system one-time pad, digital signature will cannot be forged completely.
Accompanying drawing explanation:
Fig. 1 is " based on virtual genomic cryptographic system " schematic diagram (VGC).The letters of encryption and decryption " T " of take are example.(1) the virtual genome database (VGDB) being formed by random dna sequence; (2) being the virtual genetic chip (VDMC) matching with VGDB, is the position table of the virtual gene random distribution in 2 dimension dot matrix in a VGDB; (3) on VDMC, " write " letter " T ", select the point that forms letter " T "; (4) find out the virtual gene of each point correspondence in VGDB in (3); (5) the virtual gene of finding out from each, select randomly a DNA sequence dna small fragment, utilize local blast program to carry out similarity searching to VGDB, confirm that this small fragment DNA sequence dna of random choose is unique in VGDB; (6) by all these random chooses DNA sequence dna out in no particular order sequentially random combine get up (each small fragment DNA sequence dna comes as space, branch etc. separate by comma or other interval mode) just become cipher-text information.Deciphering process be, by separate each small fragment DNA sequence dna coming in modes such as commas, VGDB is carried out to BLAST, find out each self-corresponding virtual gene, then on VDMC these virtual genetic markers out, the letter of encryption " T " has just displayed.
Fig. 2 explanation " based on virtual genomic cryptographic system " (VGC) transmits any information with secret.A, B and C represent that respectively VGC can transmit binary coding, Braille braille and Morse's code and transmit " SOS " information.VGC can transmit any enciphered message, that is to say, can transmit with VGC the ciphertext of conventional cipher system; D represents that VGC can transmit image information.
Embodiment
Embodiment 1 sets up VGC cryptographic system, and a letter of send and receive " T " (Fig. 1)
(1) first, random generation a large amount of (several more than thousand ten thousand, even millions of up to ten million) the even longer DNA sequence dna of several kb, and random number or name (each random dna sequence numbering or virtual gene of called after), each virtual gene adopts FASTA form, all these virtual genomic constitutions " virtual genome ", generate " virtual genome database " (VirtualGenome Database, VGDB).
(2) then, all virtual gene Random assignment in " virtual genome database " is tieed up on array table at one 2, obtain a virtual gene distribution position table, this table is " virtual genetic chip " (Virtual DNA Microarray Chip, VDMC).
(3) at " virtual genetic chip ", (VDMC) go up written word female " T ", on VDMC, select for forming " point " of letter " T ".
(4) find out the virtual gene of these Ying Yu " virtual genome database " in (VGDB), i.e. B2, C2, D2, C3 and C4.
(5) then, the virtual gene of finding out from each, select randomly a DNA sequence dna small fragment: from the virtual gene of B2, find out " CGACGTGCG ", from the virtual gene of C2, find out " ACTCGGC ", from the virtual gene of D2, find out " GCCGCGGGA ", from the virtual gene of C3, find out " GGAGCCAT ", from the virtual gene of C4, find out " CATGCCTCG ".Utilize local blast program to carry out similarity searching to VGDB, confirm that this small fragment DNA sequence dna of random choose is unique in VGDB.That is to say, this small fragment DNA sequence dna is one to one with the virtual gene of selecting, and in other virtual gene, there is no identical DNA sequence dna.
(6) by all these random chooses DNA sequence dna out in no particular order sequentially random combine get up (each small fragment DNA sequence dna comes with comma) just become cipher-text information: " CGACGTGCG; ACTCGGC; GCCGCGGGA, GGAGCCAT, CATGCCTCG "
(7) receiving party receives after ciphertext, and each small fragment DNA sequence dna in ciphertext " CGACGTGC G, ACTCGGC; GCCGCGGGA; GGAGCCAT, CATGCCTCG " is carried out to BLAST to VGDB, finds out each self-corresponding virtual gene, B2, C2, D2, C3 and C4, again on VDMC these virtual genetic markers out, letter " T " naturally just displayed.
Embodiment 2 digital signature
VGC system has special endorsement method for digital signature.What use due to VGC system is the input information output of image recognition formula, also on VDMC, " writes " and restore information, and therefore, the information itself that VGC transmits is exactly very personalized.VGC digital signature can really be accomplished handwritten authentic work signature.Can also on VDMC specific position, do some special marks, thereby ciphertext cannot be forged completely.

Claims (8)

1. the building method based on virtual genomic cryptographic system (VGC), is characterized in that carrying out with method as follows:
(1) produce a large amount of random dna sequences, and random number or name, each random dna sequence numbering or virtual gene of called after, each virtual gene adopts FASTA form, all these virtual genomic constitutions " virtual genome ", generate " virtual genome database " (Virtual Genome Database, VGDB);
(2) all virtual gene Random assignment in VGDB is tieed up on array table at one 2, obtained a virtual gene distribution position table, this table is " virtual genetic chip " (Virtual DNA Microarray Chip, VDMC);
(3) information sender can " be write " any secret information that will transmit on VDMC, namely selects for forming " point " of secret information having on the VDMC of huge dot matrix;
(4) because each point on VDMC is corresponding to a virtual gene in VGDB, therefore on VDMC, select the point that forms secret information, find out exactly these virtual genes corresponding in VGDB;
(5) the virtual gene of finding out from each, select randomly a small fragment DNA sequence dna, utilize bioinformatics to commonly use sequence alignment research tool BLAST (Basic Local Alignment Search Tool) program VGDB is carried out to similarity searching, this small fragment DNA sequence dna of confirming random choose is unique in VGDB, thereby this small fragment DNA sequence dna is one to one with the virtual gene of selecting;
(6) by all these random chooses small fragment DNA sequence dna out in no particular order sequentially random combine get up just to have become cipher-text information;
(7) receiving party receives after ciphertext, and each small fragment DNA sequence dna in ciphertext is carried out to BLAST analysis to VGDB, finds out each self-corresponding virtual gene, then on VDMC these virtual genetic markers out, secret information has just displayed.
2. the building method based on virtual genomic cryptographic system according to claim 1, is characterized in that its secret key consists of jointly virtual genome database (VGDB) and virtual genetic chip (VDMC).
3. the building method based on virtual genomic cryptographic system according to claim 1, is characterized in that described VGDB can be exchanged into virtual Protein Data Bank (Virtual Protein Database, VPDB).
4. the building method based on virtual genomic cryptographic system according to claim 1, is characterized in that described from virtual gene, select randomly the optional position that a small fragment DNA sequence dna refers in virtual gene DNA sequence and selecting randomly a small fragment DNA sequence dna.
5. the building method based on virtual genomic cryptographic system according to claim 1, it is characterized in that described blast program refers to the similarity searching instrument in bioinformatics, the object of using BLAST in ciphering process is to confirm the uniqueness of the small fragment DNA sequence dna of random choose, and the object of using BLAST in decrypting process is in order to utilize the uniqueness of small fragment DNA sequence dna to find out the virtual gene of its correspondence.
6. the building method based on virtual genomic cryptographic system according to claim 1, is characterized in that described blast program also can replace with other string search algorithm.
7. the building method based on virtual genomic cryptographic system according to claim 1, is characterized in that, in order to strengthen security performance, can arbitrarily inserting the random dna sequence of non-VGDB in ciphertext.
8. the building method based on virtual genomic cryptographic system according to claim 1, it is characterized in that its secret key can self, update method is: i the position of the random small fragment DNA sequence dna of selecting in certain virtual gene, and this virtual gene is capable in m on VDMC, n row, after encrypting or deciphering, (i+j) individual position is deleted and be again inserted into the small fragment DNA sequence dna of described random selection from the i position of virtual gene, the m from VDMC is capable for virtual gene after rearrangement, on n column position, delete and be again inserted into (m+a) OK, on (n+b) column position, j, a, numerical value transmit leg and the recipient of b can freely arrange.
CN200910192218.0A 2009-09-10 2009-09-10 Construction method of virtual genome-based cryptosystem (VGC) Expired - Fee Related CN102025482B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200910192218.0A CN102025482B (en) 2009-09-10 2009-09-10 Construction method of virtual genome-based cryptosystem (VGC)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200910192218.0A CN102025482B (en) 2009-09-10 2009-09-10 Construction method of virtual genome-based cryptosystem (VGC)

Publications (2)

Publication Number Publication Date
CN102025482A CN102025482A (en) 2011-04-20
CN102025482B true CN102025482B (en) 2014-02-12

Family

ID=43866386

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200910192218.0A Expired - Fee Related CN102025482B (en) 2009-09-10 2009-09-10 Construction method of virtual genome-based cryptosystem (VGC)

Country Status (1)

Country Link
CN (1) CN102025482B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9950261B2 (en) 2016-04-29 2018-04-24 International Business Machines Corporation Secure data encoding for low-resource remote systems

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103117850B (en) * 2011-11-16 2016-01-20 中国科学院华南植物园 A kind of method for building up of the cryptographic system based on random sequence database
CN103114127B (en) * 2011-11-16 2014-07-16 中国科学院华南植物园 DNA chip based cipher system
US9425825B2 (en) 2012-05-22 2016-08-23 International Business Machines Corporation Path encoding and decoding
CN103200011B (en) * 2013-04-26 2016-04-13 福建伊时代信息科技股份有限公司 A kind of cipher authentication method and device
CN104243148A (en) * 2013-06-24 2014-12-24 中科英华湖州工程技术研究中心有限公司 Encryption method and encryption device
GB2533391A (en) 2014-12-19 2016-06-22 Ibm Wall encoding and decoding
GB2533392A (en) 2014-12-19 2016-06-22 Ibm Path encoding and decoding
GB2533393A (en) 2014-12-19 2016-06-22 Ibm Pad encoding and decoding
CN104734848A (en) * 2015-03-02 2015-06-24 郑州轻工业学院 Recombinant DNA technology based information encrypting and hiding method and application
IL295370A (en) * 2020-02-05 2022-10-01 Quantum Digital Solutions Corp Information theory genomics-enabled hyper-scalability

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004088585A2 (en) * 2003-03-31 2004-10-14 Council Of Scientific And Industrial Research A method for storing information in dna
CN1790983A (en) * 2004-12-15 2006-06-21 卢明欣 Method for realizing asymmetric cipher system using DNA chip

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004088585A2 (en) * 2003-03-31 2004-10-14 Council Of Scientific And Industrial Research A method for storing information in dna
CN1790983A (en) * 2004-12-15 2006-06-21 卢明欣 Method for realizing asymmetric cipher system using DNA chip

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9950261B2 (en) 2016-04-29 2018-04-24 International Business Machines Corporation Secure data encoding for low-resource remote systems
US9968852B2 (en) 2016-04-29 2018-05-15 International Business Machines Corporation Secure data encoding for low-resource remote systems
US9968851B2 (en) 2016-04-29 2018-05-15 International Business Machines Corporation Secure data encoding for low-resource remote systems
US10080965B2 (en) 2016-04-29 2018-09-25 International Business Machines Corporation Secure data encoding for low-resource remote systems
US10166475B2 (en) 2016-04-29 2019-01-01 International Business Machines Corporation Secure data encoding for low-resource remote systems
US10166476B2 (en) 2016-04-29 2019-01-01 International Business Machines Corporation Secure data encoding for low-resource remote systems

Also Published As

Publication number Publication date
CN102025482A (en) 2011-04-20

Similar Documents

Publication Publication Date Title
CN102025482B (en) Construction method of virtual genome-based cryptosystem (VGC)
Khalifa et al. High-capacity DNA-based steganography
Marwan et al. DNA-based cryptographic methods for data hiding in DNA media
US10225074B2 (en) Encryption system and method based on biometric technology
Namasudra et al. Introduction of DNA computing in cryptography
Hamed et al. DNA based steganography: survey and analysis for parameters optimization
Mondal et al. Review on DNA cryptography
Sreeja et al. DNA for information security: A Survey on DNA computing and a pseudo DNA method based on central dogma of molecular biology
Gupta et al. DNA based cryptographic techniques: A review
Khalifa et al. Secure blind data hiding into pseudo DNA sequences using playfair ciphering and generic complementary substitution
Popovici Aspects of DNA cryptography
Vinodhini et al. A survey on DNA and image steganography
Agrawal et al. Data hiding using dictionary based substitution method in DNA sequences
Marwan et al. An Enhanced DNA-based Steganography Technique with a Higher Hiding Capacity.
JP6175453B2 (en) Encryption and decryption method using nucleic acid
Hirabayashi et al. Analysis on secure and effective applications of a DNA-based cryptosystem
Singh et al. A Review on DNA based Cryptography for Data hiding
Ibrahim et al. Enhancing the security of data hiding using double DNA sequences
Singh et al. Information hiding techniques based on DNA inconsistency: An overview
Bazli et al. Data encryption using bio-molecular information
CN114221783A (en) Data selective encryption and decryption system
Zhang et al. A new DNA cryptography algorithm based on the biological puzzle and DNA chip techniques
Gao et al. DNA computing in cryptography
Thangavel et al. A comparative study on DNA-based cryptosystem
Chaudhary et al. Hybrid approach for secure communication of data using chemical DNA

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20140212

Termination date: 20160910