CN108251517A - A kind of method of similar sequences relative populations in analysis system - Google Patents
A kind of method of similar sequences relative populations in analysis system Download PDFInfo
- Publication number
- CN108251517A CN108251517A CN201711477562.5A CN201711477562A CN108251517A CN 108251517 A CN108251517 A CN 108251517A CN 201711477562 A CN201711477562 A CN 201711477562A CN 108251517 A CN108251517 A CN 108251517A
- Authority
- CN
- China
- Prior art keywords
- sequence
- homologous
- similar sequences
- analysis system
- relative populations
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The method that the present invention discloses similar sequences relative populations in a kind of analysis system, includes the following steps:At least two homologous sequences to be detected are expanded with a primer pair, amplification carries out generation sequencing, obtained peak-data will be sequenced and submit to computer program together as the homologous sequence of reference diversity sequence and be compared, the mean intensity in each sequence all differences site is counted, the ratio of each sequence average signal strength is the quantity ratio for representing the homologous sequence.The present invention is not that can express or there is a situation where expression difference for similar sequences in a system, the relative populations for analyzing the similar sequences that can be simple and convenient can be sequenced by a generation, be of great significance for the research of its corresponding functional gene.
Description
Technical field
The invention belongs to technical field of molecular biology, and in particular to similar sequences relative populations in a kind of analysis system
Method.
Background technology
Functional genomics is a kind of information provided using Structural genomics and product, develops and applies new experiment
Means, by the function of analyzing gene comprehensively on genome or system level so that biological study to term single gene or
The multiple genes of research steering or protein of protein are carried out at the same time systematic research.The function of gene includes:Biological function,
Phosphorylation modification such as is carried out to specific protein as protein kinase;Cytology function such as participates in iuntercellular and intracellular letter
Number pipeline;Developmentally function, such as participate in morphogenesis.These functional genes usually have multiple copies in genome,
Sequence height between these copies is similar, only the SNP differences of the difference of number of base sequence or even only several bases,
However the gene of these differences is not that can express or the difference there are expression quantity under specific circumstances, and therefore, research
The relative populations of the multiple copies of functional gene become extremely difficult in genome.
How these similar homologous gene relative abundance in the cell is understood, for determining that it corresponds to the work(of gene
It can study significant.Round pcr is a kind of Protocols in Molecular Biology that specific DNA fragmentation is expanded for amplification, for
It all expresses between homologous gene or is easily determined the situation not expressing is by regular-PCR, however for homologous gene
The difference condition only measured, there is presently no a kind of simple and convenient analysis methods.
Invention content
In order to solve the problems in the prior art, the purpose of the present invention is to, it is proposed that similar sequences are opposite in a kind of analysis system
The method of quantity is just capable of the relative abundance of simple and quick judgement homologous gene product by generation sequencing approach.
To achieve the above object, the technical solution adopted by the present invention is:
The method that the present invention provides similar sequences relative populations in a kind of analysis system, includes the following steps:It will be to be detected
At least two homologous sequences expanded with a primer pair, amplification carries out generation sequencing, obtained peak value will be sequenced
According to this and the homologous sequence as reference diversity sequence is submitted to computer program and is compared together, counts each sequence
The mean intensity in all differences site, the ratio of each sequence average signal strength are the quantity ratio for representing the homologous sequence
Value.
The generation sequencing is based on Capillary Electrophoresis, fluorescence signal and record on track is scanned after electrophoresis, specifically
, generation sequencing result data are the data software of ABI companies exploitation, and the peak-data span is with a certain distance from electrophoresis starting point
On A, T, C, G corresponds to fluorescence signal intensity.
Preferably, the amplification procedure has same amplification efficiency to all homologous sequences.
Preferably, the primer pair and the homologous sequence are 100% matching, the amplification target that amplification procedure is selected
Ranging from 4~30, difference site in DNA.
Preferably, the reference diversity sequence is all homologous sequences in the amplification target DNA.
Preferably, the homologous sequence compared every time is two kinds.
Preferably, the method that the computer program is compared includes:First calculated according to the reference diversity sequence of submission
All differences site, the specific base for recording these difference sites and position in the sequence;Then according to these location informations
The signal strength of 4 kinds of bases on these positions is found out in sequencing result file data, and is recorded;It counts on all differences site
The signal strength of different bases;The signal strength in each site is uniformed;Count each sequence all differences site
Mean intensity, the ratio of each sequence average signal strength represent the quantity ratio of each homologous sequence in starting system.
Compared with prior art, the beneficial effects of the invention are as follows:Not being for similar sequences in a system can
Express or there is a situation where expression difference, can by a generation be sequenced can be simple and convenient analyze the similar sequences
Relative populations, be of great significance for the research of its corresponding functional gene.
Description of the drawings
Fig. 1 is the peak-data figure that the sequencing result data software in embodiment is opened.
Specific embodiment
Below in conjunction with the attached drawing in the present invention, technical scheme of the present invention is clearly and completely described, it is clear that
Described embodiment is only part of the embodiment of the present invention, instead of all the embodiments.Based on the implementation in the present invention
Example, all other embodiment that those of ordinary skill in the art are obtained under the conditions of creative work is not made belong to
The scope of protection of the invention.
Embodiment 1
(1) promoter Pg1090 and promoter Paction respectively drive gfp and gfp (m) pieces in Transgenic Rice Plants
Section transcription;It extracts plant RNA and carries out reverse transcription, obtain gfp and gfp (m) homologous fragments, respectively such as SEQ ID in sequence table
Shown in NO.1~2.
(2) above-mentioned gfp to be detected and gfp (m) homologous fragments are expanded with a primer pair, amplification carries out one
Generation sequencing;The primer pair is as shown in sequence table SEQ ID NO.3~4:
GFP-F:ttcttcaagg acgacggcaa
GFP-R:aagttggcct ttatcccgtt
The primer pair and the homologous fragment to be detected are 100% matching, the amplification target DNA that amplification procedure is selected
Ranging from 4~30, middle difference site.
(3) sequencing obtains 2017121889hmapz_GFP.ab1 sequencing result files, and sequencing result is opened with data software
Peak-data figure it is as shown in Figure 1;Obtained peak-data and the homologous sequence as reference diversity sequence will be sequenced
SEQ ID submit to NO.1~2 computer program and are compared together, and comparison process is as shown in the table:
Note:Base _ A:The corresponding base in difference site in sequence A (gfp);Base _ B:Difference in sequence B (gfp (m))
The corresponding base of ectopic sites;Peak value _ A:Difference site base signal strength values on sequence A (gfp);Peak value _ B:Sequence B (gfp
(m)) difference site base signal strength values on;Homogenization value _ A:Difference site base signal strength is uniform on sequence A (gfp)
Value after change;Homogenization value _ B:Value in sequence B (gfp (m)) after the homogenization of difference site base signal strength.
All differences sites is calculated according to the reference diversity sequence of submission, record these difference sites specific base and
Position in sequence obtains totally 21 difference sites;Then it is found out in sequencing result file data according to these location informations
The signal strength of 4 kinds of bases on these positions, and record;Count the signal strength of different bases on all differences site;To every
The signal strength in a site is uniformed;Count the mean intensity in each sequence all differences site, each sequence average signal
The ratio of intensity represents the quantity ratio of each homologous sequence in starting system.
Result of calculation shows A (gfp):Relative populations ratio=1.712828 of B (gfp (m)).
According to the record of above-described embodiment and present disclosure, those skilled in the art can be in same system two
Kind or the relative populations of a variety of homologous sequences carry out simple and convenient analysis, are of great significance for the research of functional gene.
It although an embodiment of the present invention has been shown and described, for the ordinary skill in the art, can be with
Understanding without departing from the principles and spirit of the present invention can carry out these embodiments a variety of variations, modification, replace
And modification, the scope of the present invention is defined by the appended.
SEQUENCE LISTING
<110>Wuhan Ai Deshi bio tech ltd
<120>A kind of method of similar sequences relative populations in analysis system
<130>
<160> 4
<170> PatentIn version 3.3
<210> 1
<211> 200
<212> DNA
<213>Rice gfp segments
<400> 1
ttcttcaagg acgacggcaa ctacaagacg cgagctgagg tgaagttcga gggcgacacg 60
ctcgtcaacc gtatcgagct caagggaatc gacttcaagg aggacggaaa cattctcgga 120
cacaagctgg agtacaacta caactcccac aacgtttaca taatggcgga caagcagaag 180
aacggcatta aggccaactt 200
<210> 2
<211> 200
<212> DNA
<213>Rice gfp (m) segments
<400> 2
ttcttcaagg acgacgggaa ctacaagacc cgtgcagagg tcaagttcga gggggacacc 60
ctcgtgaaca gaatcgagct caagggtatc gacttcaagg aggacggtaa catactcggt 120
cacaagctcg agtacaacta caactcgcac aacgtataca ttatggccga caagcagaag 180
aacgggataa aggccaactt 200
<210> 3
<211> 20
<212> DNA
<213>Artificial sequence GFP-F
<400> 3
ttcttcaagg acgacggcaa 20
<210> 4
<211> 20
<212> DNA
<213>Artificial sequence GFP-R
<400> 4
aagttggcct ttatcccgtt 20
Claims (6)
1. a kind of method of similar sequences relative populations in analysis system, which is characterized in that include the following steps:It will be to be detected
At least two homologous sequences are expanded with a primer pair, and amplification carries out generation sequencing, the peak-data that sequencing is obtained
And the homologous sequence as reference diversity sequence is submitted to computer program and is compared together, counts each sequence institute
The mean intensity in variant site, the ratio of each sequence average signal strength are the quantity ratio for representing the homologous sequence.
2. the method for similar sequences relative populations in a kind of analysis system according to claim 1, which is characterized in that described
Amplification procedure has same amplification efficiency to all homologous sequences.
3. the method for similar sequences relative populations in a kind of analysis system according to claim 2, which is characterized in that described
Primer pair and the homologous sequence are 100% matching, difference site ranging from 4 in the amplification target DNA that amplification procedure is selected
~30.
4. the method for similar sequences relative populations in a kind of analysis system according to claim 3, which is characterized in that described
Reference diversity sequence is all homologous sequences in the amplification target DNA.
5. the method for similar sequences relative populations in a kind of analysis system according to claim 1, which is characterized in that every time
The homologous sequence compared is two kinds.
6. according to the method for similar sequences relative populations in a kind of analysis system of Claims 1 to 5 any one of them, feature
It is, the method that the computer program is compared includes:All differences position is first calculated according to the reference diversity sequence of submission
Point, the specific base for recording these difference sites and position in the sequence;Then according to these location informations in sequencing result
The signal strength of 4 kinds of bases on these positions is found out in file data, and is recorded;Count different bases on all differences site
Signal strength;The signal strength in each site is uniformed;The mean intensity in each sequence all differences site is counted, respectively
The ratio of sequence average signal strength represents the quantity ratio of each homologous sequence in starting system.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711477562.5A CN108251517A (en) | 2017-12-29 | 2017-12-29 | A kind of method of similar sequences relative populations in analysis system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711477562.5A CN108251517A (en) | 2017-12-29 | 2017-12-29 | A kind of method of similar sequences relative populations in analysis system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108251517A true CN108251517A (en) | 2018-07-06 |
Family
ID=62724674
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711477562.5A Pending CN108251517A (en) | 2017-12-29 | 2017-12-29 | A kind of method of similar sequences relative populations in analysis system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108251517A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1261734A1 (en) * | 2000-02-25 | 2002-12-04 | Montclair Group | GENOMIC ANALYSIS OF tRNA GENE SETS |
CN105925691A (en) * | 2016-05-16 | 2016-09-07 | 孙雷 | Kit for rapidly detecting the numbers of human No.13, No.18, No.21, X and Y chromosomes |
CN107111693A (en) * | 2014-12-29 | 2017-08-29 | 考希尔股份有限公司 | Method for determining the genotype in high homology region |
-
2017
- 2017-12-29 CN CN201711477562.5A patent/CN108251517A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1261734A1 (en) * | 2000-02-25 | 2002-12-04 | Montclair Group | GENOMIC ANALYSIS OF tRNA GENE SETS |
CN107111693A (en) * | 2014-12-29 | 2017-08-29 | 考希尔股份有限公司 | Method for determining the genotype in high homology region |
CN105925691A (en) * | 2016-05-16 | 2016-09-07 | 孙雷 | Kit for rapidly detecting the numbers of human No.13, No.18, No.21, X and Y chromosomes |
Non-Patent Citations (1)
Title |
---|
赵耀等: "核苷酸序列测定法定量评价病毒相对适合度", 《重庆医科大学学报》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Bentolila et al. | Comprehensive high-resolution analysis of the role of an Arabidopsis gene family in RNA editing | |
AU2020201280B2 (en) | Methods for single-molecule analysis | |
Duhaime et al. | Towards quantitative metagenomics of wild viruses and other ultra‐low concentration DNA samples: a rigorous assessment and optimization of the linker amplification method | |
US9469874B2 (en) | Long-range barcode labeling-sequencing | |
CN110520542A (en) | Method for targeting nucleic acid sequence enrichment and the application in the nucleic acid sequencing of error correcting | |
WO2018005691A1 (en) | Efficient genetic screening method | |
JP2018535481A5 (en) | ||
WO2017214461A1 (en) | Linear genome assembly from three dimensional genome structure | |
US20190185932A1 (en) | Method for preparing libraries for massively parallel sequencing based on molecular barcoding and use of libraries prepared by the method | |
CN104232627A (en) | 2b-RAD pooling technology | |
Zhao et al. | Bioinformatics analysis of alternative polyadenylation in green alga Chlamydomonas reinhardtii using transcriptome sequences from three different sequencing platforms | |
CN110669834A (en) | Method for developing polymorphic SSR (simple sequence repeat) marker based on transcriptome sequence | |
Dames et al. | Comparison of the Illumina Genome Analyzer and Roche 454 GS FLX for resequencing of hypertrophic cardiomyopathy-associated genes | |
Gómez-Lozano et al. | Identification of bacterial small RNAs by RNA sequencing | |
Khattra et al. | Large-scale production of SAGE libraries from microdissected tissues, flow-sorted cells, and cell lines | |
Farias-Hesson et al. | Semi‐automated library preparation for high‐throughput DNA sequencing platforms | |
US8829172B2 (en) | Multiplex barcoded paired-end diTag (mbPED) sequencing approach and ITS application in fusion gene identification | |
Wadley et al. | Nanopore sequencing for detection and characterization of phosphorothioate modifications in native DNA sequences | |
Wu | Mouse oocytes, a complex single cell transcriptome | |
WO2012096016A1 (en) | Nucleic acid information processing device and processing method thereof | |
CN108251517A (en) | A kind of method of similar sequences relative populations in analysis system | |
US11959131B2 (en) | Method for measuring mutation rate | |
KR20230085149A (en) | Multiplexed target-binding candidate screening analysis | |
Chen et al. | scCircle-seq unveils the diversity and complexity of circular DNAs in single cells | |
US20130143746A1 (en) | Method for detecting gene region features based on inter-alu polymerase chain reaction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180706 |