Wax-apple EST-SSR molecular labeling
Technical field
The invention belongs to field of molecular marker, and in particular to wax-apple EST-SSR molecular labeling.
Background technique
Wax-apple [Syzygium samarangense(Blume) Merri.et Perry] originate in the Malay Peninsula and peace
Dammam archipelago, be Myrtaceae (Myrtaceae) Syzygium (Syzygium) tropical evergreen fruit tree, alias wax jambo, Nan Yang
Portugal, Java Portugal, sweet tea mist, mirabilitum crystallina pomegranate etc..Wax-apple is rich in a variety of medicinal ingredients, such as flavonoids, chalcone, hexane, tool
There are antidiarrheal, analgesia, antibacterial, antihyperglycemic, anticholinesterase, anti-inflammatory, raising immunity and other effects.Wax-apple fruit colour is gorgeous,
Form is unique, and fresh and sweet tasty and refreshing, mouthfeel is good, is rich in phenolic substances, is one of the fruit of great health value, by domestic and international consumer
Like.There is extensive plantation on Malaysia, Thailand, Indonesia and other places, is the important component of local fruit industry, fresh fruit
Past American-European countries in constant demand.The area of China's early start commercialization plantation wax-apple is Taiwan, in recent years, with traditional hot-zone advantage
The falling of the prices such as fruit tree such as lichee, longan, citrus, wax-apple become emerging tropical fruit tree, and cultivated area gradually expands, main product
Area is the ground such as Taiwan, Hainan, Fujian, Guangdong and Guangxi, according to incompletely statistics, China's wax-apple cultivated area 6.87 ten thousand in 2014
Mu, 13.6 ten thousand tons of yield, 30 ~ 50 yuan of per kilogram price, high financial profit has development potentiality.
Wax-apple has with genetic diversity abundant, but correlative study report is fewer.Taiwan is according to fruit color by lotus
Mist is divided into big (depth) Red Indian race, light red kind, pink kind, five classes (positive loyalty of woods etc., 2004) such as blueness kind and white race.Wang Jiabao etc.
(2004) Studies on Isozymes is carried out to 11 parts of wax-apple resources in Hainan, the results showed that 11 parts of resources are polymerized to red according to color difference
Color pericarp, white pericarp, 3 major class of green pericarp, red peel are close with the white pericarp more green pericarp of affiliation.But what bridge etc.
(2006) ISSR analysis carried out to 12 parts of wax-apple resources and 2 wax-apple sibling species, cluster analysis result show wax-apple not with
Fruit colour is identical to be gathered for one kind, and with the maturity period it is close gather for one kind, therefore he points out that the maturity period may close in wax-apple relationship
It plays an important role in system and classification position.Wax-apple is planted after the states such as Java, Malaysia introduce China in long-term adaptability
Many local varieties and resource are produced under training, these local varieties are mostly named with fruit color, such as ' bright red wax-apple ', ' pink
Wax-apple ', ' pake purpke mist ' etc., this is very likely to cause homonym and synonym, not only collects, protects to Wax Apple Germplasm
It deposits, identify and using difficulty is caused, also constrain the breeding and popularization and application of excellent variety.
DNA molecular marker is the important means for identifying Genetic Diversity of Germplasm, wherein microsatellite sequence (i.e. simple weight
Complex sequences Simple sequence repeat, SSR) molecular labeling is because of its site-specific, high polymorphism, high stability, repetition
Property it is good, be widely used in the advantages that codominance analysis of genetic diversity, cultivar identification, genetic map establish etc..
EST-SSR label derives from the transcriptional domain of DNA sequence dna, more than the SSR marker cross-species amplification developed based on genome sequence
Height, also more economical convenience are widely used in recent years in the SSR molecular marker exploitation of various plants, Rosa roxburghii Tratt, onion,
It is had been reported that on the gardening plants such as Lonicera edulis, sponge gourd.The present invention carries out SSR marker using the data that transcript profile sequencing obtains
Search analyzes its distribution and composition characteristic, carries out preliminary usability evaluation, more to cultivar identification, the germ plasm resource for wax-apple
Sample analysis establishes Core Germplasms and molecular mark lays the foundation.
Summary of the invention
The purpose of the present invention is to provide wax-apple EST-SSR molecular labelings, carry out SSR marker using wax-apple transcript profile data
Exploitation can obtain the site SSR of upper frequency, and type is abundant, provides for wax-apple analysis of genetic diversity and genetic map construction
More abundant reliable label selection.
To achieve the above object, the present invention adopts the following technical scheme:
Wax-apple EST-SSR molecular labeling number is WASSR3, WASSR12-13, WASSR37-39, WASSR41-42,
WASSR44、WASSR50、WASSR55、WASSR60、WASSR63、WASSR70、WASSR72、WASSR74、WASSR80、
WASSR85, WASSR87-88, WASSR91-92, the primer sequence of the corresponding wax-apple EST-SSR molecular labeling of each number is such as
Shown in SEQ ID NO.1-44.
Application of the wax-apple EST-SSR molecular labeling in wax-apple analysis of genetic diversity and genetic map construction.
Specific method of the invention is:
(1) wax-apple genomic DNA is extracted using CTAB method;
(2) search of the site SSR is carried out using MISA program (http://pgrc.ikp-gatersleben.de/misa),
It is respectively to be scanned for for standard for 6,5,5,5 and 5 times with two to the minimum number of repetition of Hexanucleotide.With 3.0 software of Primer
Design of primers and evaluation are carried out to the sequence before and after SSR repetitive unit, every SSR generates 3 primers.Primer sequence length 18 ~
27bp, G/C content 40% ~ 60%, 57 ~ 63 DEG C of annealing temperature, the Tm value of upstream and downstream primer differs≤2 DEG C, it is contemplated that amplified production is long
Spend 100 ~ 280 bp;And without secondary structure and dimer;
(3) EST-SSR primer screening: PCR reaction system is 20ul, wherein 4 μ l, 5U Taq enzyme of 2.5mmol/L dNTP
0.3 μ l, 100 ng DNA 1.5 μ l, the upstream and downstream primer of 10 μm of ol/L each 1 μ l, 10 × Buffer(Mg2+) 2.5 μ l,
Add ddH2O is mended to 20 μ l.PCR amplification program are as follows: 95 DEG C of 5 min of initial denaturation;Then 35 circulations are carried out, each circulation includes
94 DEG C of 30 s of denaturation, 56 DEG C of 30 s(annealing temperatures of annealing are different because of different primers), 72 DEG C of 1 min of extension;Last 72 DEG C of extensions
7 min。
The present invention has the advantages that
The present invention is to 21 153 Unigene(4.75Mb from 3 wax-apple kind fruits) sequence carry out EST-SSR search
Rope, obtains 11 641 sites SSR, the frequency of occurrences 55.03%, be higher than Lonicera edulis (32.51%), Rosa roxburghii Tratt (20.37%),
The orchard fruits such as lichee (16.35%), citrus (21.74%, 5.2kb), pears (7.1%, 7.2%), also above radish (23.79%), silk
The herbaceous plant such as melon (14.97%), capsicum (7.84%), lavender (9.88%).From the point of view of forefathers are to the result of study of plant, greatly
The EST-SSR of most plants is based on two, Trinucleotide repeats type, and leading repetition motif is different because of species.Present invention hair
Show wax-apple based on mononucleotide repeat type (43.44%), followed by dinucleotides repeat type (30.98%) and trinucleotide
Repeat type (24.38%), though four, five and Hexanucleotide repeat type occur, proportion very little (1.21%), this and cottonrose hibiscus
Lee Rong, Cortex Eucommiae are similar with the result of study of Fructus Forsythiae, are different from lichee, loquat and Lonicera edulis.The mononucleotide of wax-apple repeats
The abundantest with A/T in motif, the fruit tree research result such as this is with citrus and Fu-Rong plums is consistent;Dinucleotides repeats AG/ in motif
CT is most, identical as the most fruit trees having been reported, and is different from lichee (GA/TC);Trinucleotide repeats motif is with CCG/CGG
It is main, it is different based on AAG/CTT from Kiwi berry, citrus and strawberry etc., also different from lichee (GAA/TTC) and pears (ACC/GGT).
The present invention selects 100 pairs of SSR primer of 14 ~ 25bp to synthesize from 25734 pairs of primer kinds at random, wherein 67 pairs
Primer can amplify ideal PCR product, and effective amplification rate is 67.00%.Wherein, 22 pairs of SSR primers presentations are polymorphic in 30 pairs of primers
Property, occupy the 73.33% of effect primer.This ratio is higher than Rosa roxburghii Tratt (52.17%), Fu-Rong plums (42.5%), Lonicera edulis
(62.5%), a variety of fruit trees such as lichee (66.67%).This illustrates that the site SSR in wax-apple est sequence is more, primer amplification efficiency
It is higher, and its polymorphism is also relatively high.22 pairs of polymorphic differences primer pairs, 30 parts of wax-apple materials carry out UPGMA cluster, are divided
It is threshold value with genetic distance 0.68 for 2 major class, the 1st class can be divided into 4 groups, gathers so that fruit colour or source place are close for one kind,
2nd class can be divided into 2 groups, distinguished with fruit colour difference, more accurately reflect the difference between 30 parts of wax-apple materials.
The present invention is higher using the SSR marker availability that wax-apple transcript profile data mining goes out, and grinds for Wax Apple Germplasm genetic diversity
Study carefully, genetic map construction, gene mapping and cloning and molecular mark etc. are laid a good foundation.
Detailed description of the invention
Polymorphism of Fig. 1 primer 63 in 30 wax-apple materials.Kind number title is shown in Table 1.
For Fig. 2 for trying the UPGMA dendrogram of wax-apple material, kind number title is shown in Table 1.
Specific embodiment
1 materials and methods
1.1 transcript profile data sources
Wax-apple transcript profile data source carries out Illumina high throughput deep sequencing to fruit in 2016 in this seminar
As a result.Acquire that ' purplish red ', ' black pearl ', ' blueness bores ' 3 wax-apple kind fruit, and each kind fetches from 3 differences when sequencing
The wax-apple fruit of single plant each 20, liquid nitrogen flash freezer, -80 DEG C of preservations.When sequencing, each single plant constructs 1 library.Sequencing commission north
Jing Baimaike Biotechnology Co., Ltd carries out RNA-Seq transcription using 2500 PE125 system of Illumina HiSeqTM
Group sequencing (no ginseng) carries out sequence assembling using Trinity, the valid data of 7.80G is obtained through filtering.
1.2 materials and its DNA are extracted
It is Inst. of Fruit Trees, Fujian Prov. Academy of Agricultural Sciences's wax-apple germplasm for the material of SSR primer screening and usability evaluation
30 parts of germ plasm resources (table 1) of resource garden.Extracting genome DNA is carried out using CTAB method.
1 wax-apple SSR polymorphism analysis material of table
1.3 sites transcript profile SSR identify and SSR design of primers
The search of the site SSR is carried out using MISA program (http://pgrc.ikp-gatersleben.de/misa), with two
It is respectively to be scanned for for standard for 6,5,5,5 and 5 times to the minimum number of repetition of Hexanucleotide.With 3.0 software of Primer to SSR
Sequence before and after repetitive unit carries out design of primers and evaluation, and every SSR generates 3 primers.Primer sequence 18 ~ 27bp of length,
G/C content 40% ~ 60%, 57 ~ 63 DEG C of annealing temperature, the Tm value of upstream and downstream primer differs≤2 DEG C, it is contemplated that and amplified production length 100 ~
280 bp;And without secondary structure and dimer.
1.4 EST-SSR primer screenings
PCR reaction system is 20ul, wherein 0.3 μ l, 100 ng DNA of 4 μ l, 5U Taq enzyme of 2.5mmol/L dNTP
1.5 μ l, the upstream and downstream primer of 10 μm of ol/L each 1 μ l, 10 × Buffer(Mg2+) 2.5 μ l, add ddH2O is mended to 20 μ l.
PCR amplification program are as follows: 95 DEG C of 5 min of initial denaturation;Then 35 circulations are carried out, each circulation includes 94 DEG C of denaturation 30 s, and 56 DEG C
30 s(annealing temperatures of annealing are different because of different primers), 72 DEG C of 1 min of extension;7 min of last 72 DEG C of extensions.Pcr amplification product
With 2% agar sugar detection, 6% denaturing polyacrylamide gel is detected, 160V voltage, 3h, is observed and is taken pictures after silver staining colour developing.
1.5 data statistics
The Unigene quantity and total Unigene ratio of number that SSR occurrence frequency is SSR, the frequency of occurrences of SSR are SSR's
The quantity ratio of number and total Unigene, SSR be evenly distributed distance be 1kb or more Unigene base number and SSR quantity it
Than.Using the method manually read tape, the clear band repeated on electrophoretogram is denoted as " 1 ", same position is without band or is not easy point
The weak band distinguished is denoted as " 0 ", establishes raw data matrix.Cluster drawing is carried out by Hierarchical Clustering using software NTsys2.10.
2 results and analysis
The distribution and design feature of SSR in 2.1 transcript profiles
87 538 Unigene(sequences are obtained after wax-apple transcript profile is assembled altogether and are always about 75 156 717 bp), it uses
For MISA software to Unigene(21 153 of 1kb or more, sequence is 47 491 726 bp) it scans for, discovery is wherein
Contain 11 641 sites SSR in 8 115 Unigene sequences, wherein 2 773 Unigene contain two or more
The site EST-SSR.SSR occurrence frequency is 38.36%, the frequency of occurrences 55.03%, and 1 SSR occurs in average 4.08kb, wherein
Composite S SR has 1432, accounts for 12.3%.6 kinds of SSR repeat type have, i.e., mononucleotide to Hexanucleotide repeats.Wherein monokaryon
Thuja acid, dinucleotides and the Trinucleotide repeats frequency of occurrences are dominant, respectively 43.44%, 30.98% and the 24.38% of the total SSR of Zhan;
Tetranucleotide, pentanucleotide and Hexanucleotide repeat type negligible amounts account for 0.92%, the 0.14% and 0.15%(table of sum respectively
2).
The number of repetition of wax-apple transcript profile SSR repetitive unit is distributed between 5 ~ 23 times, wherein 5 ~ 10 SSR are shared
8753, account for the 75.19% of sum;Secondly it is 11 ~ 20 SSR, shares 2880, accounts for the 24.74% of sum;20 repetitions with
On only 8, only account for 0.07%.The length of wax-apple transcript profile SSR is concentrated mainly on 10 ~ 256bp, and wherein length is less than 12bp
SSR have 2873 (24.68%), length 12 ~ 20bp SSR up to 7094, account for sum 60.94%, length 20bp with
On SSR have 920, only account for 7.90%.
Type, quantity and the distribution frequency of 2 wax-apple SSR of table
SSR motif repeat type and frequecy characteristic in 2.2 transcript profiles
11 641 sites SSR contain 72 kinds of repetition motifs altogether in wax-apple transcript profile, and mononucleotide to Hexanucleotide repeats to divide
There are not 2,4,10,24,15 and 17 kind.From the point of view of distribution frequency (table 3), occur at most with mononucleotide repeat type A/T, Zhan is total
The 40.15% of SSR accounts for mononucleotide repeats the sum of motif 92.43%.Followed by dinucleotides repeat type AG/CT, Zhan are total
The 26.99% of SSR accounts for the 87.13% of dinucleotides sum.In addition, with CCG/CGG, AGG/CCT in Trinucleotide repeats motif
With AAG/CTT dominance, 33.62%, 20.05% and the 16.98% of trinucleotide sum is accounted for respectively;Tetranucleotide, pentanucleotide and
Hexanucleotide repetition motif distribution is more dispersed, and the frequency of appearance is lower.
Different microsatellites repeat the frequency that motif (motif) occurs in the transcription of 3 wax-apple of table
2.3 wax-apple transcript profile SSR design of primers and screening
Design of primers is carried out using 8 115 Unigene sequences of the Primer3.0 to the site containing SSR, designs primer altogether
25 734 pairs.The primer pair wax-apple ' pink kind ' of 100 pairs of different repeat units (two, three, four, five, Hexanucleotide) is selected at random
DNA carries out SSR-PCR amplification to verify its validity.The result shows that 76 pairs of primers realize effectively amplification, 100 pairs of SSR primers are accounted for
76%.In 76 pairs of effective amplimers, 67 are consistent to (88.12%) pcr amplification product with expected size, there is 9 pairs
(13.43%) amplified production length is more than to be expected.
2.4 polymorphism analysis
30 parts of wax-apples are chosen, amplification is carried out using 30 pairs of effective EST-SSR primers and polymorphism is evaluated.Wherein draw for 22 pairs
There are polymorphic differences (tables 4) for object, occupy the 73.33% of effect amplimer.The polymorphic bands number that each pair of primer generates is 2 ~ 6
Between, 63 band are obtained in 23 pairs of primers, wherein polymorphic bands 52, and each pair of primer averagely generates 2.36 polymorphism pieces
Section, Fig. 1 are the amplification situation of primer 63.
4 22 pairs of wax-apple SSR primer information of table
Clustering is carried out using 22 pairs of polymorphism SSR primer pairs, 30 parts of wax-apple materials, at genetic distance 0.60, for examination
Material is divided into 2 major class (Fig. 2).1st class includes 26 parts of materials, is threshold value with genetic distance 0.68, such is divided into 4 groups,
Wherein with ' Indonesia great Ye ', ' black diamond ' and ' purplish red ' be the red peel wax-apple of representative gather be it is a kind of, with ' emerald ', ' Thailand
Green kind ' and ' Peth Jinda ' is that the green pericarp wax-apple of representative is got together, ' imperial Wen Shisheng ' and ' Indian red ' are classified as one kind,
Gather from ' green kind of the Changtai ' in same orchard, ' blueness is bored ' and ' pink kind of Changtai ' for one kind.2nd class includes 4 parts of materials, wherein
Red peel wax-apple (' ruby ', ' perfume ', ' agriculture section 4 ') is classified as one kind, and ' white lotus mist ' pericarp is white, is constituted a class by itself.
The foregoing is merely presently preferred embodiments of the present invention, all equivalent changes done according to scope of the present invention patent with
Modification, is all covered by the present invention.
SEQUENCE LISTING
<110>Inst. of Fruit Trees, Fujian Prov. Academy of Agricultural Sciences
<120>wax-apple EST-SSR molecular labeling
<130> 44
<160> 44
<170> PatentIn version 3.3
<210> 1
<211> 20
<212> DNA
<213>artificial sequence
<400> 1
tgtttcaact cgcacgactc 20
<210> 2
<211> 20
<212> DNA
<213>artificial sequence
<400> 2
aaacggtttt cattgatggc 20
<210> 3
<211> 20
<212> DNA
<213>artificial sequence
<400> 3
gtgtcatcga aaccctccaa 20
<210> 4
<211> 20
<212> DNA
<213>artificial sequence
<400> 4
ttgaatcttg caagcgactg 20
<210> 5
<211> 20
<212> DNA
<213>artificial sequence
<400> 5
gtcatgcttg ctctgtccaa 20
<210> 6
<211> 20
<212> DNA
<213>artificial sequence
<400> 6
ttctggattg caggaacctt 20
<210> 7
<211> 20
<212> DNA
<213>artificial sequence
<400> 7
gatggctggt gactcaggtt 20
<210> 8
<211> 20
<212> DNA
<213>artificial sequence
<400> 8
gaccctttcc ttgagtggaa 20
<210> 9
<211> 23
<212> DNA
<213>artificial sequence
<400> 9
ggaaaaggag agaattttct tga 23
<210> 10
<211> 20
<212> DNA
<213>artificial sequence
<400> 10
ttttccccca tgagagtgag 20
<210> 11
<211> 20
<212> DNA
<213>artificial sequence
<400> 11
gggttttcag ccaaaaattg 20
<210> 12
<211> 20
<212> DNA
<213>artificial sequence
<400> 12
cgagcgagag agagagatgg 20
<210> 13
<211> 20
<212> DNA
<213>artificial sequence
<400> 13
gagcttgttt atcgaaggcg 20
<210> 14
<211> 20
<212> DNA
<213>artificial sequence
<400> 14
aaactggcga caaaaacgag 20
<210> 15
<211> 20
<212> DNA
<213>artificial sequence
<400> 15
gtcgcgtcac cattatttca 20
<210> 16
<211> 20
<212> DNA
<213>artificial sequence
<400> 16
ggtttagtta agggcggctc 20
<210> 17
<211> 20
<212> DNA
<213>artificial sequence
<400> 17
gaggagtcgc tgaactgtcc 20
<210> 18
<211> 20
<212> DNA
<213>artificial sequence
<400> 18
cgcacccact gttttaggat 20
<210> 19
<211> 20
<212> DNA
<213>artificial sequence
<400> 19
aagccgaaga tgaagacgaa 20
<210> 20
<211> 20
<212> DNA
<213>artificial sequence
<400> 20
ggacatatat ccgcggtttg 20
<210> 21
<211> 20
<212> DNA
<213>artificial sequence
<400> 21
atatgaatcc cgggccttac 20
<210> 22
<211> 20
<212> DNA
<213>artificial sequence
<400> 22
ctcccgtctc tccacaacat 20
<210> 23
<211> 20
<212> DNA
<213>artificial sequence
<400> 23
tggtgtctgt aactgctgcc 20
<210> 24
<211> 20
<212> DNA
<213>artificial sequence
<400> 24
agttcttggg gaggaatcgt 20
<210> 25
<211> 20
<212> DNA
<213>artificial sequence
<400> 25
ttggtaaatc ggcatcttcc 20
<210> 26
<211> 20
<212> DNA
<213>artificial sequence
<400> 26
accgaaaagg gacaaggagt 20
<210> 27
<211> 20
<212> DNA
<213>artificial sequence
<400> 27
ggctcacttg tgcaacaaaa 20
<210> 28
<211> 20
<212> DNA
<213>artificial sequence
<400> 28
cgactgtcca cattggtgtc 20
<210> 29
<211> 20
<212> DNA
<213>artificial sequence
<400> 29
accttttcat gcaaattccg 20
<210> 30
<211> 20
<212> DNA
<213>artificial sequence
<400> 30
aggatcatgc attcggtttc 20
<210> 31
<211> 20
<212> DNA
<213>artificial sequence
<400> 31
gggtttgggt ttgagttgtg 20
<210> 32
<211> 20
<212> DNA
<213>artificial sequence
<400> 32
aaaagcaaac ggggctactt 20
<210> 33
<211> 20
<212> DNA
<213>artificial sequence
<400> 33
gtcgagagga aggagtggaa 20
<210> 34
<211> 20
<212> DNA
<213>artificial sequence
<400> 34
tcgttataat cgtccggctt 20
<210> 35
<211> 20
<212> DNA
<213>artificial sequence
<400> 35
ctccaccaca accacaacag 20
<210> 36
<211> 20
<212> DNA
<213>artificial sequence
<400> 36
ggttcacgtt ctctcgaagc 20
<210> 37
<211> 20
<212> DNA
<213>artificial sequence
<400> 37
gcttttaagg aaccgggaac 20
<210> 38
<211> 20
<212> DNA
<213>artificial sequence
<400> 38
agattcccac tgaatgtccg 20
<210> 39
<211> 22
<212> DNA
<213>artificial sequence
<400> 39
tttctgagat ttgacgagga aa 22
<210> 40
<211> 20
<212> DNA
<213>artificial sequence
<400> 40
actgcttgca ccgttgaact 20
<210> 41
<211> 20
<212> DNA
<213>artificial sequence
<400> 41
tgagaaaccc acaaagtccc 20
<210> 42
<211> 20
<212> DNA
<213>artificial sequence
<400> 42
gcgtcaaatc ccaatcactt 20
<210> 43
<211> 20
<212> DNA
<213>artificial sequence
<400> 43
tctagatctt tgagcgggga 20
<210> 44
<211> 20
<212> DNA
<213>artificial sequence
<400> 44
aatgccgtca atttctcacc 20