CN109411022A - A kind of gRNA of structure containing PAM targeting sequence screening method and application based on character microtomy - Google Patents
A kind of gRNA of structure containing PAM targeting sequence screening method and application based on character microtomy Download PDFInfo
- Publication number
- CN109411022A CN109411022A CN201811317917.9A CN201811317917A CN109411022A CN 109411022 A CN109411022 A CN 109411022A CN 201811317917 A CN201811317917 A CN 201811317917A CN 109411022 A CN109411022 A CN 109411022A
- Authority
- CN
- China
- Prior art keywords
- sequence
- dna
- pam
- module
- character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
The invention belongs to genetic engineering field more particularly to a kind of Motif of Adjacent containing PAM(Protospacer based on character microtomy) structure gRNA targeting sequence screening method and application system.The following steps are included: (1) reads in deoxynucleotide (DNA) sequential file data of target gene;(2) interactive interface input needs the PAM sequence of Analysis and Screening;(3) it interprets PAM module and specified PAM motif is converted into character lists;(4) compare substring module PAM motif is converted into character lists and be compared one by one with moving window sequence given position, and decision logic relationship.(5) search in given DNA sequence dna and its reverse complementary sequence of global search module meets the sequence of condition and stores into empty list.(6) file output module is presented result with text or spreadsheet.The present invention solves the problems, such as that the prior art cannot achieve the screening of the gRNA target sequence to screening in given DNA sequence dna containing any PAM identification motif, establishes technical foundation for the assessment and selection of next step gRNA target sequence.
Description
Technical field
The invention belongs to genetic engineering field more particularly to it is a kind of based on character microtomy contain PAM
(Protospacer Adjacent Motif) structure gRNA targets sequence screening method and application system.
Background technique
CRISPR(Clustered regularly interspaced short palindromicrepeats) rule
The short palindrome in cluster interval repeats;Cas9(CRISPR associated nuclease) it is CRISPR associated nucleic acid enzyme,
CCRISPR/Cas9 is a kind of most emerging skill for being instructed by RNA, being edited using Cas9 nuclease to target gene
Art.
CRISPR/Cas9 system is widely present in prokaryotic gene, is bacterium and archeobacteria for reply virus and matter
Grain is constantly attacked and the acquired immunity defense mechanism of coming that develops.In these organisms, the exogenic heredity object from bacteriophage
Matter obtains and is integrated into the site CRISPR.The segment of these sequence specifics is transcribed into the RNA(CRISPR- of short CRISPR
DerivedRNA), crRNA is combined with tracrRNA(trans-activating RNA) by base pairing and is formed double-stranded RNA,
Then tracrRNA/crRNA complex instructs Cas9 albumen to cut off double-stranded DNA.
Once crRNA is integrated to Cas9, the conformation of Cas9 nuclease changes, and generates an energy channel, DNA is allowed more to hold
Easily combine.Cas9/crRNA complex can identify that the site PAM (5'-NGG) causes DNA to untwist, and crRNA is made to find the site PAM
Adjacent DNA complementary strand.When Cas9 is integrated to, the site PAM is adjacent, on the DNA sequence dna complementary with crRNA, REC interlobar part
Bridge spiral and target DNA form RNA-DNA heteroduplex structure.The identification in the site PAM includes that can make target DNA double-strand break (DSB)
HNH and RuvC nuclear fragmentation activation, lead to DNA degradation.If crRNA is not complementary with target DNA, Cas9 will be released,
Find the new site PAM.It can be engaged by nonhomologous end after linear target gene group fracture in DNA (NHEJ) or homologous
The reparation (HDR) of mediation is repaired, and nonhomologous end engagement (NHEJ) can cause insertion or deletion error, to reach
The purpose of certain gene is knocked out to fixed point.
During carrying out genome editor using CRISPR/Cas9 system, tracrRNA and crRNA can be fused into
Targeting shearing can equally be played the role of for 1 RNA(sgRNA) expression.Therefore our CRISPR/Cas9 tool now
Only Cas9 nuclease and gRNA two parts.Using this tool, can the very convenient editor for efficiently carrying out any gene change
It makes, for example gene knockout, knocks in, rite-directed mutagenesis etc..Compared with ZFN and TALEN technology, CRISPR/Cas9 technology has
The features such as vector construction is simple, gene editing is high-efficient, at low cost is now widely used for gene functional research and animals and plants essence
The fields such as quasi-molecule breeding [1-2].
PAM sequence is to combine target essential, and particular sequence depends on the type of Cas9.Currently, widely used
The PAM that streptococcus pyogenes (Streptococcus pyogenes) Sp Cas9 is identified in plant is mainly [3] NGG.
In order to extend editor range of the CRISPR/Cas9 in genome, it is different that people identify identification from different microorganisms
The homologous protein of PAM, as Streptococcus thermophilus (Streptococcus thermophiles) CRISPR3 Cas9 is identified
NGGNG PAM [4], streptococcus thermophilus (Streptococcus thermophiles) CRISPR1 Cas9 identify NNAGAAW
PAM [5], Neisseria meningitidis (Neisseria meningitides) Nme Cas9 identify NNNNGATT PAM [6], gold
Yellow staphylococcus pyogenes (Staphylococcus aureus) Sa Cas9 identifies GGAGT PAM [7], campylobacter jejuni
(Campylobacter jejuni) CjCas9 identifies NNNNACAC or NNNNRYAC PAM [8].Sp Cas9 variant VQR
(D1135V/R1335Q/T1337R) and VRER (D1135V/G1218R/R1335E/T1337R) can identify NGA respectively
PAM and NGCG PAM [9];And variant x Cas9 can identify 3 kinds of PAM [10] of NG, GAA and GAT.
With going deep into for research, scientists have found a variety of CRISPR/Cas systems, can according to the quantity of Cas albumen
To be divided into two classes (Class I and Class II), can be divided into 6 kinds (Type I ~ VI) according to the structure and function of Cas, and
Multiple hypotypes (Subtype) can be further divided into.Compared to Class I, Class II only needs a Cas albumen, therefore mesh
Common system is Class II, such as Cas9 in preceding gene editing, and does not need the Cpf1 of tracr RNA
(Cas12a) and the Cas13 with RNA cleavage activity [11].Either Class I and Class II Cas9, is cutting
In DNA or RNA, require to identify the target sequence containing PAM structure.
In biomedicine field, it is accurately fixed that scientists can carry out target sequence using CRISPR/Cas9 technology
Bit manipulation provides new treatment means for genetic disease, cancer and disease of viral infection etc..Genetic disease is always
The refractory disease of human health, the discovery of genetic disease related mutation gene are threatened, CRISPR- is arrived in the building of mouse model
Cas9 treats genetic disease, before which has light in fields such as human genetic diseases' fundamental research, clinical treatments
Scape [12-13].
In crop breeding, the dependency basis such as CRISPR/Cas9 gene editing technical antagonism, yield, quality and fertility
The fixed point editor of cause, make its in crop orientation genetic improvement using more and more extensive, struck including to unfavorable gene
It removes, the editor to beneficial gene control region, the regulation to apparent gene etc. [11,14-18].
However the gene editing of animal, plant and microorganism is operated, require the screening and assessment to target sequence.But
It is that PAM identification motif complicated and changeable brings certain difficulty to the screening of gRNA target sequence, and especially PAM motif has letter
And property when.Currently, there is no specifically for gRNA target sequence algorithm and application system containing any PAM identification motif.
The present invention is based on character microtomies, using simple scripting language (python), quickly and accurately from DNA sequence
It is filtered out in column containing being arbitrarily designated the gRNA target sequence of PAM motif, while calculating the G/C content of the sequence, in DNA sequence dna
Position and the position of chain, and presented with data form or text formatting as a result, being the assessment and selection of next step gRNA target sequence
Establish technical foundation.
Summary of the invention
Technical problem to be solved by the present invention lies in provide one kind based on character microtomy quickly and accurately from DNA
It is filtered out in sequence containing the gRNA target sequence method and system for being arbitrarily designated PAM motif, it is intended to which solving the prior art can not be real
Now to the problem of screening identifies the gRNA target sequence of motif containing any PAM in given DNA sequence dna.
The invention is realized in this way a kind of quickly and accurately filtered out from DNA sequence dna based on character microtomy is contained
There is the gRNA target sequence method for being arbitrarily designated PAM motif, comprising:
File output module: specified path inputs DNA or cDNA in the form of text.
It calculates GC module: calculating gRNA target sequence G/C content.
DNA complementation module: given DNA sequence dna is converted into reverse complementary sequence.
The reversed reordering module of DNA: given DNA sequence dna is converted into reverse sequence.
It interprets PAM module: being inputted by user and specified PAM motif is converted into character lists.
Substring position module: initial position of the substring in total character string is returned to character style.
Compare substring module: by PAM motif be converted into character lists one by one with moving window sequence given position into
Row compares, and decision logic relationship.
Global search module: search meets the sequence of condition and deposits in given DNA sequence dna and its reverse complementary sequence
It stores up in empty list.
File output module: result is presented with text or spreadsheet.
Further, the calculating GC module is specifically used for:
G/C content is calculated according to list entries.
def gc(s):
gcc = 100 * (s.count('G') + s.count('C')) / len(s)
return str(float('%.3f' % gcc))+'%'。
Further, the DNA complementation module is specifically used for:
Former given sequence is converted into reverse complementary sequence using character string Hash substitution method, specifically replaces original with " T "
" A " in sequence;" A " replaces " T " in former sequence;" C " in former sequence is replaced with " G ";It is replaced in former sequence with " C "
" G ", finally gets up string-concatenation.
def DNA_complement(sequence):
sequence = sequence.upper()
basecomplement = {"A": "T",
"G": "C",
"T": "A",
"C": "G",
}
letters = list(sequence)
letters = [basecomplement[base] for base in letters]
return ''.join(letters)。
Further, the reversed reordering module of the DNA is specifically used for:
It is obtained in the way of the output of character string inverted order.
def DNA_reverse(sequence):
sequence = sequence.upper()
return sequence[::-1]。
Further, the interpretation PAM module is specifically used for:
Forbidden character is excluded first, and single or degeneracy base is then reduced into single or multiple alkali using Hash dictionary format
The form of base, and put into PAM list with character style.
def pamstring(string):
string = string.upper()
Print " the PAM structure that you input is: ", string
convert = []
dict = {'N': 'AGTC',
'M': 'AC',
'K': 'GT',
'S': 'CG',
'Y': 'TC',
'W': 'AT',
'R': 'AG',
'A': 'A',
'T': 'T',
'C': 'C',
'G': 'G',
'V': 'ACG',
'H': 'ACT',
'D': 'AGT',
'B': 'CGT'}
for i in string:
if i not in "MRWSYKVHDBNATGC":
print "PAM including illegal string"
convert = []
return convert
else:
convert.append(dict[i])
return convert。
Further, the substring position module is specifically used for:
Corresponding location information is obtained using the method for character index.
def substr(substr, strtotal):
start_position = strtotal.index(substr) + 1
end_position = start_position + len(substr) - 1
return str(start_position), str(end_position)。
Further, the relatively substring module is specifically used for:
Each character in PAM list is taken out using loop control, then carries out judgement ratio with moving window designated position sequence
Compared with, and comparison result is returned, as shown in Fig. 1.
def compare(l1,str2):
j = 0
flag = []
last = False
for i in l1:
if str2[-len(l1)+j] in i:
flag.append(True)
else:
flag.append(False)
j +=1
for i in flag:
if i == False:
last = False
break
else:last = True
return last。
Further, the global search module is specifically used for:
In given DNA sequence dna and using DNA, reversed, complementary module is obtained in reverse complementary sequence, and continuous moving window taking-up refers to
The sequence of measured length is compared using substring module is compared, while carrying out sequence information, GC to the sequence for the condition that meets
Content, chain location information store into a list.
def findpam(s,pamlist):
s = s.strip()
s = s.upper()
if len(s) < 30: return 'sequence must be >30 base'
i = 0
j = 30
l = []
while len(s) - j >= 0:
fragment = s[i:j]
p1 = substr(fragment, s)[0]
p2 = substr(fragment, s)[1]
if compare(pamlist, fragment):
l.append(fragment)
l.append(p1)
l.append(p2)
l.append(gc(fragment))
l.append('+')
if compare(pamlist, DNA_reverse(DNA_complement(fragment))) :
l.append(fragment)
l.append(p1)
l.append(p2)
l.append(gc(fragment))
l.append('-')
i += 1
j += 1
return l。
Further, the file output module is specifically used for:
It is exported using spcial character control result, such as "+", the characters such as "-" are presented the sequence for the condition that meets one by one.
Compared with prior art, the present invention quickly exhaustive satisfaction arbitrary PAM motif can be specified in given DNA sequence dna
All gRNA target sequences.All screening operations can be completed in single machine (desktop computer), do not need Internet remote computation or big
The networking of type integrated computer calculates.The present invention can provide the accurate information of target site to gene editing designer, into one
Step all plays an important role to the efficiency evaluation and function prediction of target site.
Detailed description of the invention
1. moving window schematic diagram of attached drawing.
Specific embodiment 1
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, to this hair
Bright further description.It should be appreciated that this place allows described specific embodiment to be only used to explain the present invention, it is not used to
Limit the present invention.
Widely used streptococcus pyogenes (Streptococcus pyogenes) Sp Cas9 identified in plant
PAM motif is " NGG ", it is assumed that a gene order is X_seq: " CCCTCACGTCGAGAGAGCTCGAGTGCACAGG ".
Firstly, X_seq sequence is stored in the form of text file, while the path of this document is specified in systems.
Secondly, any character can theoretically be inputted by inputting NGG(on interactive interface, but base letter can be limited
And the character in table).
In interpreting PAM module, PAM motif list is converted into [' ATGC ', ' G ', ' G '].
In global search module, in X_seq " CCCTCACGTCGAGAGAGCTCGAGTGCACAGG ", 30 bases
(the base number before PAM motif can specify, and usual situation is 30 nt) series of windows provides P1, P2.
X_seq:CCCTCACGTCGAGAGAGCTCGAGTGCACAGG
P1:CCCTCACGTCGAGAGAGCTCGAGTGCACAG-
P1RC:CTGTGCACTCGAGCTCTCTCGACGTGAGGG
P2:-CCTCACGTCGAGAGAGCTCGAGTGCACAGG
P2RC:CCTGTGCACTCGAGCTCTCTCGACGTGAGG.
Detailed process is as follows:
Successively compare the list of PAM motif and P1 using the function of comparing in substring module, i.e., third in the list of PAM motif
It the 30th progresss logical comparison and is returned in a logic list L in element and P1, is successively the 29 of second element and P2
Position charactor comparison simultaneously returns in above-mentioned logic list L, 29 charactor comparisons of first element and P2.The number that compares with
Element number is consistent in PAM motif list, and all elements in logic list L are finally carried out logic and operation.According to finally patrolling
It collects and decides whether to be recorded in final result list T with operation, if meeting condition (value T), the sequence of record window sequence
Information, the location information for calculating G/C content and chain.Similarly, window sequence is obtained with DNA complementation module, the reversed reordering module of DNA
The reverse complementary sequence of column, is equally judged.
Therefore in above-mentioned series of windows, the condition of satisfaction is tri- sequences of P2RC and P1RC on P2 and minus strand in normal chain
Column.Using file output module by above- mentioned information text output.It is as follows:
PAM: AGTC G G
30nt sequence Start position End position GC% Strand
CCCTCACGTCGAGAGAGCTCGAGTGCACAG 1 30 63.0% -
CCTCACGTCGAGAGAGCTCGAGTGCACAGG 2 31 63.0% +
CCTCACGTCGAGAGAGCTCGAGTGCACAGG 2 31 63.0% -
Above-mentioned selected target sequence meets NGG requirement, i.e. underlined sequences, while having marked location information, G/C content and in chain
On position.
Specific embodiment 2
With rice Phytoene dehydrogenase geneOsPDS(LOC_Os03g08570) cDNA is list entries, is selected empty
Intestines campylobacter (Campylobacter jejuni) CjCas9 identification NRYAC PAM motif, it will obtain:
PAM: AGTC AG TC A C
30nt sequence Start position End position GC% Strand
GGTGCTTCGCAAGTAGCAGCATCCAAGCAC 83 112 56.0% +
GTGCTTCGCAAGTAGCAGCATCCAAGCACT 84 113 53.0% -
GTGCTCTACAGGTTGTTTGCCAGGACTTTC 194 223 50.0% -
GGACTTTCCAAGACCTCCACTAGAAAACAC 216 245 46.0% +
GTATGAAACTGGGCTTCATATCTTTTTTGG 459 488 36.0% -
ATATCTTTTTTGGAGCTTATCCCAACATAC 476 505 33.0% +
GTATTAATGATCGGTTGCAATGGAAGGAAC 527 556 40.0% -
ATTAATGATCGGTTGCAATGGAAGGAACAC 529 558 40.0% +
GGTTTGATTTTCCTGAAACATTGCCTGCAC 602 631 43.0% +
CTGCACCCTTAAATGGAATATGGGCCATAC 626 655 46.0% +
GTGTTCCTGATCGAGTGAACGATGAGGTTT 791 820 46.0% -
ATGAGGTTTTCATTGCAATGTCAAAGGCAC 812 841 40.0% +
GTGCATTCTGATTGCTTTAAACCGATTTCT 876 905 36.0% -
GTATTCAGAAAATAGAACTTAATCCTGATG 1022 1051 30.0% -
GAACTTAATCCTGATGGAACAGTGAAACAC 1036 1065 40.0% +
ATCCTGATGGAACAGTGAAACACTTTGCAC 1043 1072 43.0% +
TAACTGGAGATGCTTATGTTTTTGCAACAC 1091 1120 36.0% +
CACCAGTTGATATCTTGAAGCTTCTTGTAC 1118 1147 40.0% +
GTACCTCAAGAGTGGAAAGAAATATCTTAT 1144 1173 33.0% -
TATATGGTTTGATAGAAAACTGAAGAACAC 1221 1250 30.0% +
GTGTTTATGCGGACATGTCAGTAACTTGCA 1289 1318 43.0% -
GCGGACATGTCAGTAACTTGCAAGGAATAC 1297 1326 46.0% +
TGCAGAGGAATGGGTTGGACGGAGTGACAC 1368 1397 56.0% +
AGATTCTGAAGTATCATGTTGTGAAGACAC 1475 1504 36.0% +
GTATCATGTTGTGAAGACACCAAGATCTGT 1485 1514 40.0% -
TGAAGGGTTCTATCTAGCTGGTGACTACAC 1569 1598 46.0% +
GTGCAGTTCTATCTGGGAAGCTTTGTGCTC 1628 1657 50.0% -
GTGCTCAGTCTGTAGTGGAGGATTATAAAA 1652 1681 40.0% -。
Specific embodiment 3
With rice betaine-aldehyde dehydrogenaseOsBADH2 (LOC_Os08g32870) DNA is list entries, selects thermophilus
Bacterium (Streptococcus thermophiles) CRISPR1 Cas9 identification NAGAAW PAM motif, it will obtain:
PAM: AGTC A G A A AT
30nt sequence start position end position GC% strand
ATTCTGCTTCTGTTTGGAATAAGTTGGAAG 823 852 36.0% -
TTTCTAGTGCCAAATGCATGCTAGATTTCT 1172 1201 36.0% -
TTTCTCACAGTTTTTCTCTTCAGGTTATAT 1197 1226 30.0% -
TTTCTCTTCAGGTTATATTTCTCGTATTTC 1209 1238 30.0% -
TTTCTCGTATTTCCTTTTCCTAAAGGATTG 1226 1255 33.0% -
TTTCTGGCATATATAGGTTATTATTATTAT 1269 1298 20.0% -
ATTCTCCAGAACAAGATTACCCATATTATG 1300 1329 33.0% -
TTTCTAGCAAAGCAGGGGATGCTAGCCTTC 1425 1454 50.0% -
TTTCTATCATAAAAATTTTCATGGCATATG 1512 1541 23.0% -
TTTCTTCAGATAATCGAGAGGAAATCTGAG 1570 1599 36.0% -
GCTTTGAGTACTTTGCAGATCTTGCAGAAT 1771 1800 40.0% +
TTTCTCTCATCCTGCGCTTATATTTATTTA 1916 1945 30.0% -
ATGTGTTAAGTTTGACCAAGTTTATAGAAA 2158 2187 26.0% +
TTTCTTCTTAATATAATGATACACAGCTCT 2271 2300 26.0% -
TTTCTCATATGTTGTCAGCATGATTCACTT 2398 2427 33.0% -
ATTCTAATTTGTTGTTTCTTTGTTATGTTC 2510 2539 23.0% -
TTTCTTTGTTATGTTCTTATCGACAATTAC 2524 2553 26.0% -
ATCGACAATTACAAATTTGATTCTGAGAAT 2542 2571 26.0% +
ATTCTGAGAATCATGTTCGGGATGTGTATT 2561 2590 36.0% -
TTTCTACTGCAGGAACTATCCTCTCCTGAT 2589 2618 43.0% -
TTTCTGTTAGGTTGCATTTACTGGGAGTTA 3023 3052 36.0% -
TTTCTGTGGATATTTTTTGTTCTCTTTCTA 3112 3141 26.0% -
TTTCTACTAACTCTCTATTATCAATTCTCA 3136 3165 26.0% -
ATTCTCAATGTTGTCCTTTTCTTTTAACTC 3159 3188 30.0% -
TTTTCTTTTAACTCCTTTACTTTTTAGAAT 3175 3204 20.0% +
TTTCTTTTAACTCCTTTACTTTTTAGAATT 3176 3205 20.0% -
ATTCTAGTAGCCAGTTCTATCCTGTTTCTT 3229 3258 36.0% -
TTTCTTACCTTTTTATGGTTCGTCTTTTCT 3253 3282 30.0% -
TTTCTTGACAGCCTGTTTCACTGGAACTTG 3278 3307 43.0% -
ATTCTGAAGTGCGGGACTTTGTAAAGCACT 3383 3412 43.0% -
CTTTTTGGTGTCTTGGGCTTGTTGCAGAAA 3445 3474 43.0% +
ACTGGTCCCAGACGAGCAGGATGCAAGAAA 3476 3505 53.0% +
TTTCTTAGAAGTTACACCTCAAGGATTAGC 3533 3562 36.0% -
TTTCTTAAAATGTGCTATTGATTAAAAAGA 3568 3597 20.0% -
TTATAATGCCATGCCAACTGAGTAAAGAAA 3664 3693 33.0% +
TTTCTTTTCGTGGCAAGGAAGGCAGTTAGG 3749 3778 46.0% -
ATTCTTAGTTCTGGAAAACTGTGTTCTTTA 3807 3836 30.0% -
ATTCTAGCTGATTATGAATTCTGTTTATAT 3886 3915 23.0% -
ATTCTGTTTATATTTCACTAATTTTGAATC 3903 3932 20.0% -
ATTCTTCATGTAAGCATTGAATATATCCGT 4116 4145 30.0% -
TTTCTGATCAACTCCTGAGTTCAGATTATT 4177 4206 33.0% -
TTGCTCCTGACCATGAAAGTTTTGCAGAAA 4453 4482 40.0% +
AAGTTTTGCAGAAAAAAATCGCTAAAGAAT 4469 4498 26.0% +
AGAAAAAAATCGCTAAAGAATTTCAAGAAA 4478 4507 23.0% +
TGTAAACTTTTTCTAAATTCAAAAAAGAAA 4602 4631 16.0% +
TTTCTAAATTCAAAAAAGAAATGCCACTGA 4611 4640 26.0% -
CTTTTGTATATATTTTCAAAGCACCAGAAT 4915 4944 26.0% +
ATTCTGACTGGTGGGGTTAGACCCAAGGTA 5045 5074 50.0% -
AGGTACCCACATATCATTATGAAGTAGAAA 5101 5130 33.0% +
CTTGTATGTTTTTGTCAGCATCTGGAGAAA 5135 5164 36.0% +
TTTCTATATTGAACCCACAATCATTACTGA 5167 5196 30.0% -
TTTTTGGTCCAGTGCTCTGTGTGAAAGAAT 5232 5261 40.0% +
ATTCTGCTACTACTACTTTTGATAGTTATG 5383 5412 30.0% -
CATGGTTGCATCAAGCTGATATTCAAGAAT 5512 5541 36.0% +
ATTCTATGCATCTCCAGTTCTTCCCTGGAC 5558 5587 46.0% -
ATATTTGACCCCTTTTTTTTGCAAAAGAAA 5678 5707 26.0% +。
Specific embodiment 4
In gene editing design, the site of editor is often selected on the exon of gene, but only does screening with cDNA sequence,
Usually the sequence errors at splicing site are screened into.Editing sites are selected on exon, but be located at exon with
The sequence of introne stitching portion.It at this moment, can be respectively using the DNA of target gene and cDNA as input file, then in the result
Take shared part.
The foregoing is merely preferred embodiments of the invention, are not intended to limit the invention, all in spirit of the invention
With any modification made within principle, equivalent replacement and improvement etc. be should all be included in the protection scope of the present invention.
Bibliography
[1] Voytas DF. Plant Genome Engineering with Sequence-specific Nucleases
[J]. Annual Review of Plant Biology, 2013, 64(64): 327-350.
[2] Wang Y,Cheng X,Shan Q, et al. Simultaneous Editing of Three
Homoeoalleles in Hexaploid Bread Wheat Confers Heritable Resistance to
Powdery Mildew[J]. Nature Biotechnology, 2014, 32(9): 947-951.
[3] Hsu PD,Lander ES,Zhang F. Development and Applications of Crispr-cas9
for Genome Engineering[J]. Cell, 2014, 157(6): 1262-1278.
[4] Horvath P., Romero D.-A., Coute-monvoisin A.-C., wait Diversity,
activity, and evolution of CRISPR loci in Streptococcus thermophilus[J].
Journal of bacteriology, 2008, 190(4): 12-1401.
[5] Deveau H., Barrangou R., Garneau J.-E. wait Phage response to CRISPR-
encoded resistance in Streptococcus thermophilus[J]. Journal of bacteriology,
2008, 190(4): 400-1390.
[6] Zhang Y., Heidrich N., Ampattu B.-J., wait Processing-independent CRISPR
RNAs limit natural transformation in Neisseria meningitidis[J]. Molecular
cell, 2013, 50(4): 488-503.
[7] Ran FA,Cong L,Yan WX, et al. In Vivo Genome Editing Using
Staphylococcus Aureus Cas9.[J]. Nature, 2015, 520(7546): 186.
[8] Kim E,Koo T,Park SW, et al. In Vivo Genome Editing with a Small Cas9
Orthologue Derived From Campylobacter Jejuni[J]. Nature Communications, 2017,
8 (phase missings): 14500.
[9] Kleinstiver BP,Prew MS,Tsai SQ, et al. Engineered Crispr-cas9
Nucleases with Altered Pam Specificities[J]. Nature, 2015, 523(7561): 481.
[10] Hu JH,Miller SM,Geurts MH, et al. Evolved Cas9 Variants with Broad
Pam Compatibility and High Dna Specificity [J] Nature, 2018,556 (7699): the page number
Range lacks
[11] Yang Yizhou, Li Wei, Yi Tuyong wait plant virus interaction research and gene editing technology in breeding for disease resistance
It is notified to using progress [J] biotechnology, 2018,34 (8): 8-16.
[12] Sun Wei, Liu Shanshan CRISPR_Cas9 gene editing system research progress [J] animal medicine progress, 2018,
v.39;No.302(8): 97-100.
[13] Mei Wen, Sun Meitao, Wang Weisi, wait research of the CRISPR_Cas9 technology in genetic disease gene therapy into
Open up [J] biotechnology communications, 2018,29 (4): 551-557.
[14] midsummer ice, Tan Yanning, Sun Zhizhong wait to orient reduction rice using CRISPR/Cas9 genome editing technique and fall
Graininess [J] Scientia Agricultura Sinica, 2018,51 (14): 2631-2641.
[15] Xin Gaowei, recklessly prosperous Xun, Wang Kejian, region sequence is adjacent between waiting before Cas9 protein variant VQR efficient identification rice NGAC
Nearly motif [J] heredity, publishes year missing, volume missing (phase missing): 1-10.
[16] Wang Huiyuan, Fan Yuelei, Chu Xin wait CRISPR gene editing Development situation to analyze [J] life science,
Publish year missing, volume missing (phase missing): 1-11.
[17] Zhou Xin, Deng Li, Wang Qing wait to cultivate glutinous rice [J] Molecular Plant Breeding using gene editing technology,
2018, 16(17): 5608-5615.
[18] Xue Mande, Long Yan, Pei Xinwu gene editing technology and its in crop breeding application and safety management [J]
Chinese agriculture science and technology Leader, 2018,20 (9): 12-22.
Sequence table
<110>Ningxia Academy of Agri-Forestry Sciences's agricultural biotechnologies research center (Ningxia agricultural biotechnologies emphasis room)
<120>a kind of gRNA of structure containing PAM targeting sequence screening method and application based on character microtomy
<141> 2018-11-07
<160> 89
<170> SIPOSequenceListing 1.0
<210> 1
<211> 31
<212> DNA
<213>artificial sequence (Oryza sativa)
<400> 1
ccctcacgtc gagagagctc gagtgcacag g 31
<210> 2
<211> 30
<212> DNA
<213>artificial sequence (Oryza sativa)
<400> 2
ccctcacgtc gagagagctc gagtgcacag 30
<210> 3
<211> 30
<212> DNA
<213>artificial sequence (Oryza sativa)
<400> 3
ctgtgcactc gagctctctc gacgtgaggg 30
<210> 4
<211> 30
<212> DNA
<213>artificial sequence (Oryza sativa)
<400> 4
cctcacgtcg agagagctcg agtgcacagg 30
<210> 5
<211> 30
<212> DNA
<213>artificial sequence (Oryza sativa)
<400> 5
cctgtgcact cgagctctct cgacgtgagg 30
<210> 6
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 6
ggtgcttcgc aagtagcagc atccaagcac 30
<210> 7
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 7
gtgcttcgca agtagcagca tccaagcact 30
<210> 8
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 8
gtgctctaca ggttgtttgc caggactttc 30
<210> 9
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 9
ggactttcca agacctccac tagaaaacac 30
<210> 10
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 10
gtatgaaact gggcttcata tcttttttgg 30
<210> 11
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 11
atatcttttt tggagcttat cccaacatac 30
<210> 12
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 12
gtattaatga tcggttgcaa tggaaggaac 30
<210> 13
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 13
attaatgatc ggttgcaatg gaaggaacac 30
<210> 14
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 14
ggtttgattt tcctgaaaca ttgcctgcac 30
<210> 15
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 15
ctgcaccctt aaatggaata tgggccatac 30
<210> 17
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 17
gtgttcctga tcgagtgaac gatgaggttt 30
<210> 17
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 17
atgaggtttt cattgcaatg tcaaaggcac 30
<210> 18
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 18
gtgcattctg attgctttaa accgatttct 30
<210> 19
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 19
gtattcagaa aatagaactt aatcctgatg 30
<210> 20
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 20
gaacttaatc ctgatggaac agtgaaacac 30
<210> 21
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 21
atcctgatgg aacagtgaaa cactttgcac 30
<210> 22
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 22
taactggaga tgcttatgtt tttgcaacac 30
<210> 23
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 23
caccagttga tatcttgaag cttcttgtac 30
<210> 24
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 24
gtacctcaag agtggaaaga aatatcttat 30
<210> 25
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 25
tatatggttt gatagaaaac tgaagaacac 30
<210> 26
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 26
gtgtttatgc ggacatgtca gtaacttgca 30
<210> 27
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 27
gcggacatgt cagtaacttg caaggaatac 30
<210> 28
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 28
tgcagaggaa tgggttggac ggagtgacac 30
<210> 29
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 29
agattctgaa gtatcatgtt gtgaagacac 30
<210> 30
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 30
gtatcatgtt gtgaagacac caagatctgt 30
<210> 31
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 31
tgaagggttc tatctagctg gtgactacac 30
<210> 32
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 32
gtgcagttct atctgggaag ctttgtgctc 30
<210> 33
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 33
gtgctcagtc tgtagtggag gattataaaa 30
<210> 34
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 34
attctgcttc tgtttggaat aagttggaag 30
<210> 35
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 35
tttctagtgc caaatgcatg ctagatttct 30
<210> 36
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 36
tttctcacag tttttctctt caggttatat 30
<210> 37
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 37
tttctcttca ggttatattt ctcgtatttc 30
<210> 38
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 38
tttctcgtat ttccttttcc taaaggattg 30
<210> 39
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 39
tttctggcat atataggtta ttattattat 30
<210> 40
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 40
attctccaga acaagattac ccatattatg 30
<210> 41
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 41
tttctagcaa agcaggggat gctagccttc 30
<210> 42
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 42
tttctatcat aaaaattttc atggcatatg 30
<210> 43
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 43
tttcttcaga taatcgagag gaaatctgag 30
<210> 44
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 44
gctttgagta ctttgcagat cttgcagaat 30
<210> 45
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 45
tttctctcat cctgcgctta tatttattta 30
<210> 46
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 46
atgtgttaag tttgaccaag tttatagaaa 30
<210> 47
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 47
tttcttctta atataatgat acacagctct 30
<210> 48
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 48
tttctcatat gttgtcagca tgattcactt 30
<210> 49
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 49
attctaattt gttgtttctt tgttatgttc 30
<210> 50
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 50
tttctttgtt atgttcttat cgacaattac 30
<210> 51
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 51
atcgacaatt acaaatttga ttctgagaat 30
<210> 52
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 52
attctgagaa tcatgttcgg gatgtgtatt 30
<210> 53
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 53
tttctactgc aggaactatc ctctcctgat 30
<210> 54
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 54
tttctgttag gttgcattta ctgggagtta 30
<210> 55
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 55
tttctgtgga tattttttgt tctctttcta 30
<210> 56
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 56
tttctactaa ctctctatta tcaattctca 30
<210> 57
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 57
attctcaatg ttgtcctttt cttttaactc 30
<210> 58
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 58
ttttctttta actcctttac tttttagaat 30
<210> 59
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 59
tttcttttaa ctcctttact ttttagaatt 30
<210> 60
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 60
attctagtag ccagttctat cctgtttctt 30
<210> 61
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 61
tttcttacct ttttatggtt cgtcttttct 30
<210> 62
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 62
tttcttgaca gcctgtttca ctggaacttg 30
<210> 63
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 63
attctgaagt gcgggacttt gtaaagcact 30
<210> 64
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 64
ctttttggtg tcttgggctt gttgcagaaa 30
<210> 65
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 65
actggtccca gacgagcagg atgcaagaaa 30
<210> 66
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 66
tttcttagaa gttacacctc aaggattagc 30
<210> 67
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 67
tttcttaaaa tgtgctattg attaaaaaga 30
<210> 68
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 68
ttataatgcc atgccaactg agtaaagaaa 30
<210> 69
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 69
tttcttttcg tggcaaggaa ggcagttagg 30
<210> 70
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 70
attcttagtt ctggaaaact gtgttcttta 30
<210> 71
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 71
attctagctg attatgaatt ctgtttatat 30
<210> 72
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 72
attctgttta tatttcacta attttgaatc 30
<210> 73
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 73
attcttcatg taagcattga atatatccgt 30
<210> 74
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 74
tttctgatca actcctgagt tcagattatt 30
<210> 75
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 75
ttgctcctga ccatgaaagt tttgcagaaa 30
<210> 76
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 76
aagttttgca gaaaaaaatc gctaaagaat 30
<210> 77
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 77
agaaaaaaat cgctaaagaa tttcaagaaa 30
<210> 78
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 78
tgtaaacttt ttctaaattc aaaaaagaaa 30
<210> 79
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 79
tttctaaatt caaaaaagaa atgccactga 30
<210> 80
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 80
cttttgtata tattttcaaa gcaccagaat 30
<210> 81
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 81
attctgactg gtggggttag acccaaggta 30
<210> 82
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 82
aggtacccac atatcattat gaagtagaaa 30
<210> 83
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 83
cttgtatgtt tttgtcagca tctggagaaa 30
<210> 84
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 84
tttctatatt gaacccacaa tcattactga 30
<210> 85
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 85
tttttggtcc agtgctctgt gtgaaagaat 30
<210> 86
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 86
attctgctac tactactttt gatagttatg 30
<210> 87
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 87
catggttgca tcaagctgat attcaagaat 30
<210> 88
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 88
attctatgca tctccagttc ttccctggac 30
<210> 89
<211> 30
<212> DNA
<213>rice (Oryza sativa)
<400> 89
atatttgacc cctttttttt gcaaaagaaa 30
Claims (6)
1. a kind of Motif of Adjacent containing Protospacer (PAM) structure gRNA based on character microtomy targets sequence
Column filter method and application, which comprises the following steps: (1) read in deoxynucleotide (DNA) sequence of target gene
Column file data;(2) interactive interface input needs the PAM sequence of Analysis and Screening;(3) PAM module is interpreted by specified PAM motif
It is converted into character lists;(4) compare substring module by PAM motif be converted into character lists one by one with moving window sequence to
Positioning, which is set, to be compared, and decision logic relationship;(5) global search module is in given DNA sequence dna and its reverse complementary sequence
The middle sequence for searching for the condition that meets simultaneously is stored into empty list;(6) file output module is by result with text or electronic watch trellis
Formula is presented.
2. a kind of gRNA of structure containing PAM based on character microtomy according to claim 1 targets sequence screening method
And application, which is characterized in that entire method include interpret PAM module, compare substring module, global search module and
File output module.
3. interpretation PAM module according to claim 2, which is characterized in that secondly exclusion forbidden character first utilizes Hash
Algorithm restores degeneracy sequence, and returns in a PAM character lists.
4. according to comparison substring module described in right 2, which is characterized in that taken out using loop control every in PAM list
One character compared with then sequence carries out judgement with sliding window designated position, and returns to comparison result, wherein sliding window benefit
It is taken out with character dicing method.
5. according to global search module described in right 2, which is characterized in that given DNA sequence dna and using DNA it is reversed,
Complementary module obtains in reverse complementary sequence, and continuous sliding window takes out the sequence of designated length, using comparing substring mould
Block is compared, while storing the location information of the sequence for the condition that meets progress sequence information, G/C content, chain to a list
In.
6. according to global search module described in right 2, which is characterized in that exported, will be met using spcial character control result
The sequence of condition is presented one by one.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811317917.9A CN109411022A (en) | 2018-11-07 | 2018-11-07 | A kind of gRNA of structure containing PAM targeting sequence screening method and application based on character microtomy |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811317917.9A CN109411022A (en) | 2018-11-07 | 2018-11-07 | A kind of gRNA of structure containing PAM targeting sequence screening method and application based on character microtomy |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109411022A true CN109411022A (en) | 2019-03-01 |
Family
ID=65471918
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811317917.9A Pending CN109411022A (en) | 2018-11-07 | 2018-11-07 | A kind of gRNA of structure containing PAM targeting sequence screening method and application based on character microtomy |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109411022A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150356239A1 (en) * | 2012-12-12 | 2015-12-10 | The Broad Institute Inc. | Methods, models, systems, and apparatus for identifying target sequences for cas enzymes or crispr-cas systems for target sequences and conveying results thereof |
CN105492611A (en) * | 2013-06-17 | 2016-04-13 | 布罗德研究所有限公司 | Optimized CRISPR-CAS double nickase systems, methods and compositions for sequence manipulation |
CN106845151A (en) * | 2015-12-07 | 2017-06-13 | 中国农业大学 | The screening technique and device of CRISPR-Cas9 system sgRNA action target spots |
CN108026523A (en) * | 2015-07-06 | 2018-05-11 | 帝斯曼知识产权资产管理有限公司 | Guide RNA assembles carrier |
CN108034671A (en) * | 2017-12-08 | 2018-05-15 | 中国农业科学院植物保护研究所 | One plasmid vector and establish the method for plant population using it |
-
2018
- 2018-11-07 CN CN201811317917.9A patent/CN109411022A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150356239A1 (en) * | 2012-12-12 | 2015-12-10 | The Broad Institute Inc. | Methods, models, systems, and apparatus for identifying target sequences for cas enzymes or crispr-cas systems for target sequences and conveying results thereof |
CN105492611A (en) * | 2013-06-17 | 2016-04-13 | 布罗德研究所有限公司 | Optimized CRISPR-CAS double nickase systems, methods and compositions for sequence manipulation |
CN108026523A (en) * | 2015-07-06 | 2018-05-11 | 帝斯曼知识产权资产管理有限公司 | Guide RNA assembles carrier |
CN106845151A (en) * | 2015-12-07 | 2017-06-13 | 中国农业大学 | The screening technique and device of CRISPR-Cas9 system sgRNA action target spots |
CN108034671A (en) * | 2017-12-08 | 2018-05-15 | 中国农业科学院植物保护研究所 | One plasmid vector and establish the method for plant population using it |
Non-Patent Citations (3)
Title |
---|
JOSH MCMENMY: "Programming the genome with CRISPR", 《HTTPS://WWW.FREECODECAMP.ORG/NEWS/PROGRAMMING-THE-GENOME-WITH-CRISPR-BD567A214E2A/》 * |
SERGEY V,PRYKHOZHIJ: "CRISPR MultiTargeter: A Web Tool to Find Common and Unique CRISPR Single Guide RNA Targets in a Set of Similar Sequences", 《PLOS ONE》 * |
王远立: "计算机辅助CRISPR向导RNA设计", 《生物工程学报》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Dickinson et al. | CRISPR-based methods for Caenorhabditis elegans genome engineering | |
Talla et al. | Rapid increase in genome size as a consequence of transposable element hyperactivity in wood-white (Leptidea) butterflies | |
US20210317445A1 (en) | System and method for gene editing cassette design | |
Cong et al. | Tiger swallowtail genome reveals mechanisms for speciation and caterpillar chemical defense | |
Babushok et al. | Current topics in genome evolution: molecular mechanisms of new gene formation | |
Glöckner et al. | Sequence and analysis of chromosome 2 of Dictyostelium discoideum | |
Xia et al. | RNA-Seq analysis and de novo transcriptome assembly of Hevea brasiliensis | |
Góngora-Castillo et al. | Genomic approaches for interrogating the biochemistry of medicinal plant species | |
Kojima et al. | Long-term inheritance of the 28S rDNA-specific retrotransposon R2 | |
CN109997192A (en) | Method for rule-based genome design | |
Tsanova et al. | DNA-free gene editing in plants: a brief overview | |
KR102029197B1 (en) | A deep-learning based system for evaluating the RNA-guided nuclease activity using deep-learning | |
Ren et al. | A mechanism for genome size reduction following genomic rearrangements | |
Lee et al. | Comprehensive analyses of the complete mitochondrial genome of Figulus binodulus (Coleoptera: Lucanidae) | |
Yi et al. | ConTEdb: a comprehensive database of transposable elements in conifers | |
Grover et al. | The Gossypium stocksii genome as a novel resource for cotton improvement | |
Hong et al. | The PacBio full-length transcriptome of the tea aphid as a reference resource | |
Li et al. | Highly efficient genome editing using geminivirus-based CRISPR/Cas9 system in cotton plant | |
Mohamed et al. | Multi-tissue transcriptome profiling of North American derived Atlantic salmon | |
Coates et al. | Mobilizing the genome of Lepidoptera through novel sequence gains and end creation by non-autonomous Lep 1 Helitrons | |
AU2021100248A4 (en) | Method for Screening gRNA Target Sequence Containing PAM Structure Based on Character Slicing Technology and Application Thereof | |
Cui et al. | Advances in cis-element-and natural variation-mediated transcriptional regulation and applications in gene editing of major crops | |
Kamra et al. | A phylogenomic study of Iridaceae Juss. based on complete plastid genome sequences | |
CN109411022A (en) | A kind of gRNA of structure containing PAM targeting sequence screening method and application based on character microtomy | |
Liu et al. | Extensively current activity of transposable elements in natural rice accessions revealed by singleton insertions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |