CN108165564A - Mycobacterium tuberculosis H37Rv encoding gene and its application - Google Patents
Mycobacterium tuberculosis H37Rv encoding gene and its application Download PDFInfo
- Publication number
- CN108165564A CN108165564A CN201711251274.8A CN201711251274A CN108165564A CN 108165564 A CN108165564 A CN 108165564A CN 201711251274 A CN201711251274 A CN 201711251274A CN 108165564 A CN108165564 A CN 108165564A
- Authority
- CN
- China
- Prior art keywords
- mycobacterium tuberculosis
- encoding gene
- gene
- sequence
- tuberculosis h37rv
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 81
- 241001646725 Mycobacterium tuberculosis H37Rv Species 0.000 title claims abstract description 15
- 108700035964 Mycobacterium tuberculosis HsaD Proteins 0.000 title claims abstract description 14
- 241001302239 Mycobacterium tuberculosis complex Species 0.000 claims abstract description 37
- 238000001514 detection method Methods 0.000 claims abstract description 17
- 201000008827 tuberculosis Diseases 0.000 claims description 13
- 150000001413 amino acids Chemical class 0.000 claims description 9
- 230000003321 amplification Effects 0.000 claims description 8
- 238000004458 analytical method Methods 0.000 claims description 7
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 7
- 241000894007 species Species 0.000 claims description 5
- 238000003745 diagnosis Methods 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 3
- 238000000926 separation method Methods 0.000 claims description 3
- 238000001502 gel electrophoresis Methods 0.000 claims description 2
- 238000012360 testing method Methods 0.000 claims description 2
- 101150030367 cls gene Proteins 0.000 claims 1
- 238000002372 labelling Methods 0.000 claims 1
- 239000002773 nucleotide Substances 0.000 claims 1
- 125000003729 nucleotide group Chemical group 0.000 claims 1
- 230000020509 sex determination Effects 0.000 claims 1
- 108010033276 Peptide Fragments Proteins 0.000 description 31
- 102000007079 Peptide Fragments Human genes 0.000 description 31
- 108020004414 DNA Proteins 0.000 description 14
- 230000001580 bacterial effect Effects 0.000 description 14
- 238000000034 method Methods 0.000 description 14
- 238000012795 verification Methods 0.000 description 10
- 102000004169 proteins and genes Human genes 0.000 description 9
- 241000894006 Bacteria Species 0.000 description 7
- 108700026244 Open Reading Frames Proteins 0.000 description 7
- 108090000765 processed proteins & peptides Proteins 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 5
- 238000001962 electrophoresis Methods 0.000 description 4
- 150000002500 ions Chemical class 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 210000002345 respiratory system Anatomy 0.000 description 4
- 241001312372 Mycobacterium canettii Species 0.000 description 3
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 244000052769 pathogen Species 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- FRXSZNDVFUDTIR-UHFFFAOYSA-N 6-methoxy-1,2,3,4-tetrahydroquinoline Chemical compound N1CCCC2=CC(OC)=CC=C21 FRXSZNDVFUDTIR-UHFFFAOYSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241000186359 Mycobacterium Species 0.000 description 2
- 206010057190 Respiratory tract infections Diseases 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 244000052616 bacterial pathogen Species 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 150000001875 compounds Chemical group 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 238000004321 preservation Methods 0.000 description 2
- 238000012797 qualification Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 206010011409 Cross infection Diseases 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- 241001467553 Mycobacterium africanum Species 0.000 description 1
- 241000186366 Mycobacterium bovis Species 0.000 description 1
- 241000992249 Mycobacterium canettii CIPT 140070010 Species 0.000 description 1
- 241000211133 Mycobacterium caprae Species 0.000 description 1
- 241000187919 Mycobacterium microti Species 0.000 description 1
- 241000699502 Mycobacterium mungi Species 0.000 description 1
- 241000656726 Mycobacterium orygis Species 0.000 description 1
- 241001457456 Mycobacterium pinnipedii Species 0.000 description 1
- 241001093674 Mycobacterium suricattae Species 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 238000012550 audit Methods 0.000 description 1
- 230000007321 biological mechanism Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000000151 deposition Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 208000008128 pulmonary tuberculosis Diseases 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 230000029058 respiratory gaseous exchange Effects 0.000 description 1
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/35—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Mycobacteriaceae (F)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/689—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Wood Science & Technology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present invention relates to a kind of Mycobacterium tuberculosis H37Rv encoding genes, can be used as the standard gene of mycobacterium tuberculosis complex Molecular Identification, for the Molecular Identification and clinical detection of mycobacterium tuberculosis complex.
Description
Technical field
The present invention relates to genetic test fields, and in particular to pathogen species are identified.
Background technology
Mycobacterium tuberculosis (Mycobacterium tuberculosis, MTB) is to cause mankind pathogen lungy.
Each organ of whole body can be invaded, but using pulmonary tuberculosis to be most common.Tuberculosis is infectious disease particularly important so far, the serious threat mankind
Life and health.According to WHO, every year there are about 8,000,000 new cases generation, at least 3,000,000 people die of the disease.The clinical bacterium of MTB
Strain hardly possible cultivates, is slow-growing and other mycobacteria energy cross-infections, tuberculosis and other respiratory tract infection symptom difficulties are distinguished etc.
Feature brings great difficulty to clinical quick diagnosis and treatment.Therefore establish quick, accurate, special, sensitive, cheap knot
Core disease detection method is the prerequisite and clinical labororatory's mycobacteria detection faces of effective treatment, control tuberculosis sprawling
The new challenge faced and new task.
Mycobacterium tuberculosis complex (Mycobacterium tuberculosis complex, MTBC), including
M.tuberculosis、M.africanum、M.orygis、M.bovis、M.microti、M.canettii、M.caprae、
The mycobacterias monoid such as M.pinnipedii, M.suricattae, M.mungi, these species can cause people and other life
Body tuberculosis.Following three classes are broadly divided into the identification method of MTBC both at home and abroad at present:Traditional isolated culture;Molecular level is examined
It surveys (IS6110, restriction fragment length polymorphism analysis, multidigit point variable number repeated fragment polymorphism analysis etc.);Thalline into
Divide (aliphatic acid, mycolic acid) chromatogram analysis method.Though three classes method has the advantages of respective, also there is shortcoming, such as pass
System is separately cultured period length, and with thalline can to cultivate rate low;Molecular level detection at present is in terms of specificity, sensitivity and simplicity
It is still poorer;Drive member specificity analysis cost is higher, complicated for operation.
MTB H37Rv completed genome sequencing in 1998, were the earliest MTB bacterial strains for completing full figure sequencing.Since then, respectively
State researchers are tactful based on algorithm optimization, annotating software update, transcription group and proteomics etc., always in perfect, benefit
Fill H37Rv gene annotation databases.However, since MTB belongs to prokaryotes, due to prokaryotic gene group remarking technology in itself
Deficiency, in genome annotation still there may be annotation mistake (excessively annotation, genetic borders mistake and ORF starting, stop bit
Point mistake, alternative splicing, ribosomes displacement, leakage annotation), to deeply, accurately parse biological mechanism bring puzzlement.To solve
This problem, protein gene group (Proteogenomics) though have been used for the correction that H37Rv has annotated gene, however, high
Ratio false positive, routine techniques are difficult to annotation predictive genes, new gene verification, new gene functional analysis and its application etc.
The problem that the field is faced.
Generally speaking, traditional mycobacterium tuberculosis complex (MTBC) identification strategy has period length, complex steps, specifically
The defects of property and sensitivity is not high.H37Rv full-length genomes are annotated again further to improve, find to omit annotation in H37Rv
Gene, it is ensured that H37Rv full-length genomes omit annotation gene and its application technology in MTBC Molecular Identifications is effectively protected,
It is imperative to develop and use H37Rv new genes fast accurate mirror method for distinguishing in MTBC monoids.
Invention content
It is an object of the present invention to provide a kind of new encoding gene of Mycobacterium tuberculosis H37Rv, which is H37Rv
Leakage annotation encoding gene Rv3108c (- | 3476972-3477175 |), it can be used as mycobacterium tuberculosis complex bar code point
Son label, for detecting mycobacterium tuberculosis complex, sequence is as shown in SEQ ID NO.1.
Other objects of the present invention include providing Specific PCR primers and the offer that can be used for expanding above-mentioned encoding gene
It is a kind of to detect or identify the detection method that whether there is mycobacterium tuberculosis complex in sample;The present invention also provides with above-mentioned volume
The application of the detection kit and said gene of code gene-correlation.
According to an aspect of the present invention, by comparing protein gene group investigative technique, it was found that one in H37Rv
It is difficult to by the albumen coded sequence of predictive genes software discovery, which can be effectively by MTBC and the other species differentiations belonged to
It comes.The gene is the omission annotation base of a mycobacterium tuberculosis (Mycobacterium tuberculosis H37Rv)
Cause, i.e. Rv3108c (- | 3476972-3477175 |), after NCBI-BLASTP, with M.tuberculosis 1825K,
A70645,140070010 similitudes of M.canettii CIPT are 99%, are below 76% with the similitude of other bacterial strains, belong to
In Unknown Function albumen.It is studied through comparative genomics and finds that the gene order can be by mycobacterium tuberculosis complex (MTBC) bacterium
Strain is distinguished with other kinds of Mycobacterium.
Specifically, design can realize that specific amplification draws to the Rv3108c of MTBC (- | 3476972-3477175 |) gene
Object, primer as proposed by the invention, primer sequence are:
F:5’-GACCAGTGCCCTCGCAGT-3’;
R:5’-AGGACGATCATGGCTCCG-3’。
The difference of the presence or absence of gene DNA sequence PCR product in sample to be tested or DNA sequence dna, can be quickly accurate
True identification of M TBC.
According to another aspect of the present invention, based on the new standard code gene of above-mentioned Mycobacterium tuberculosis H37Rv, this hair
The bright method for specifically establishing detection or identifying mycobacterium tuberculosis complex, step are as follows:
(1) the separation and Extraction genomic DNA from sample to be tested;
(2) DNA obtained using step (1) carries out PCR amplification as template using following primer:
F:5’-GACCAGTGCCCTCGCAGT-3’(SEQ ID NO.4);
R:5’-AGGACGATCATGGCTCCG-3’(SEQ ID NO.5)。
(3) DNA product obtained to step (2) amplification carries out gel electrophoresis analysis or is sequenced;
(4) result of step (3) and bar code gene Rv3108c (- | 3476972-3477175 |) are compared, such as
Fruit homology is more than 99%, and judgement sample to be tested contains mycobacterium tuberculosis complex.
Further, above-mentioned detection method according to DNA bar code principle, tentatively carries out electrophoretic analysis, such as to PCR product
Fruit strain to be tested does not have target stripe, and it is not MTBC to illustrate the bacterial strain;If there is band, then can further sequence verification, will survey
The standard sequence that sequence obtains series and the Rv3108c of H37Rv (- | 3476972-3477175 |) carries out Homology search and comparison, obtains
The similitude between sequence is obtained, if sequence homology is more than 99%, you can judgement bacterial strain may be MTBC;According to identification bacterium to be identified
The DNA bar code sequence of strain and standard sequence cluster situation are common to distinguish MTBC families and non-tuberculous mycobacteria, respiratory tract
Pathogen and common respiratory tract virus.
The detection method can be used to the strain idenfication research to mycobacterium tuberculosis complex, it can also be used to clinical quick
It examines.Sample to be tested can be from H37Rv bacterial strains, other MTBC, non-tuberculous mycobacteria, respiratory tract encountered pathogenic bacteria, breathing
Road common virus bacterial strain;Either directly using tuberculosis and other respiratory tract patient sputums, saliva or blood.
Based on the above method, the present invention also provides detection kit, and kit containers are built-in to detect knot
The reagent of standard code gene new core mycobacteria H37Rv, what is concurrently provided can be through governmental drug management organization
Manufacture, use and sales informations audit, in relation to drug or biological products.For example, after using PCR amplification, sample is directly detected
The reagent of Rv3108c in product (- | 3476972-3477175 |) gene, such as amplimer, dNTP can be contained, reacted for PCR
Archaeal dna polymerase and its buffer solution, endonuclease reaction and/or reagent needed for sequencing reaction etc. it is one or more.Art technology
Personnel are it is known that more than component is only illustrative, for example, above-mentioned Specific PCR primers, the use may be used in the primer
The enzyme of PCR amplification is can be used in the archaeal dna polymerase of PCR reactions.The detection of the encoding gene of the present invention can also be with integrated
The mode of such as genetic chip provide.
Advantageous effect:The present invention provides one kind to be used as mycobacterium tuberculosis complex (Mycobacterium
Tuberculosis complex, MTBC) Molecular Identification standard gene and method for identifying molecules, the gene can effectively by
MTBC comes with the other species differentiations belonged to, and existing mycobacterium tuberculosis complex is overcome using the identification method of the gene
The shortcomings of design of primers multiplicity in qualification process, result poor repeatability, has the characteristics that general, easy amplification, easily compares, can
Accurately to identify to come from the close other mycobacterias of affiliation or other respiratory tract infection germs by the monoid, it is
Tuberculosis epidemiological survey and clinical tuberculosis patient quick diagnosis differentiate the strong technological means of offer and research tool.
Description of the drawings
Fig. 1:Support the peptide spectrum matching evidence of the newly encoded gene found;
Fig. 2:Section of synthesized peptide mass spectrogram is compared with former identification peptide fragment mass spectrogram;
Fig. 3:Peptide fragment is located the protein sequence corresponding diagram of region ORF codings;It is identified for proteomics underscore part
And it is synthesized the peptide fragment of peptide fragment verification;
Fig. 4:Rv3108c (- | 3476972-3477175 |) standard gene sequence homology compares;
Fig. 5:Protein sequence BLASTP knots corresponding to H37Rv bacterial strains Rv3108c (- | 3476972-3477175 |) gene
Fruit;
Fig. 6:Rv3108c (- | 3476972-3477175 |) specific primer PCR amplified production agarose gel electrophoresis result;
Wherein, each Lane Sample specifying information is shown in Table 1;
Fig. 7:Rv3108c (- | 3476972-3477175 |) gene PCR expands sequencing result and standard sequence compares.
Specific embodiment
The present invention will be further described With reference to embodiment, but does not limit scope of the invention as claimed.
Agents useful for same of the present invention is commercially available.
Embodiment 1:Find the leakage annotation encoding gene of H37Rv strain gene groups
The high covering protein group verification of 1.1 pairs of H37Rv strain gene groups
The depth that protein group has been carried out to H37Rv bacterial strains using high covering protein technique covers research.It is based on
Tuberculosis (20160307) database has carried out its genome using 3 engines of pFind the verification of annotation encoding gene.
In order to find new protein-coding region, we are based on protein gene omics technology, and H37Rv is sent out in NCBI with pAnno softwares
Full-length genome (NC_000962.3) file of table carries out six reading frame data base interpretations, and using this database to spectra count
According to the identification for having carried out new peptide fragment and novel protein.In order to reduce false positive rate, we have used 3 during data filtering
Kind separately estimates the filter method of classification FDR to having annotated peptide fragment and new peptide fragment, is S-FDR, T-FDR I and T-FDR respectively
II。
Through data analysis, we identify 3238 H37Rv and have annotated gene altogether, coverage be up to the 80% of the bacterial strain with
On, this is to report maximum H37Rv protein spectrum data so far.In addition, after being filtered through 3 kinds of FDR≤1, we obtain new peptide fragment.
In order to further ensure that new peptide fragment quality, the spectrogram corresponding to our new peptide fragments remaining to above-mentioned filtering has carried out spectrogram quality
Screening, finally remains the high-quality peptide fragment of some spectrograms.For further investigate the higher peptide fragment of these spectrogram quality not by
Caused by having annotated peptide fragment and single amino acids mutation occurs, We conducted amino acid mutation verifications, it is ensured that these new peptide fragments are
H37Rv newly identifies peptide fragment.
The coding albumen and database authentication of 1.2 couples of Rv3108c (- | 3476972-3477175 |) gene
After excessively high covering protein group verification, it has been found that some doubtful new leakage annotation peptide fragments, it can to above-mentioned height
Doubtful new peptide fragment progress peptide fragment synthesis verification is believed to obtain, according to new peptide fragment original spectrum and peptide fragment synthesis spectrum similarity >=0.8 conduct of marking
Similarity threshold after marking is screened, has several peptide fragments by verification, corresponding to new open reading frame (Open Reading
Frame, ORF), i.e., the potential leakage annotation gene of current H37Rv bacterial strains.
Wherein, it has been found that new leakage annotation gene Rv3108c (- | 3476972-3477175 |), compare through BLASTP,
It is 99% with M.tuberculosis 1825K, A70645,140070010 similitudes of M.canettii CIPT, with other bacterium
The similitude of strain is below 76%, belongs to Unknown Function albumen.We detect peptide fragment ATSALAVIR (SEQ ID NO.6), and
Corresponding to new gene Rv3108c (- | 3476972-3477175 |), as shown in Figure 1, spectrogram quality is fine, continuous of b/y ions
Match, miscellaneous peak signal is relatively low, as a result very credible.
Further to confirm this qualification result, we newly identify that the amino acid sequence chemistry of peptide fragment is synthesized according to us
The peptide fragment, and produce using above-mentioned mass spectral analysis condition the two level spectrogram of the section of synthesized peptide.
The high energy collision MS that we generate section of synthesized peptide2It is verified, level-one parent ion and two level daughter ion are equal
Meet theoretical value, show that the peptide section sequence that we synthesize is correct;On this basis, we are had checked according to large-scale protein by hand
The MS of the section of synthesized peptide of new peptide section sequence that matter group data authentication arrives2With the new peptide fragment spectrogram of Large scale identification, the two is almost
The one cosin values for showing the acquisition of daughter ion similitude are 0.98, it was demonstrated that the correct nothing of new peptide fragment that we identify from H37Rv
Accidentally.(Fig. 2).
After the sequence for confirming above-mentioned leakage annotation peptide fragment, according to the gene location where above-mentioned peptide fragment, with previous termination
The region that codon and the latter terminator codon include is boundary, obtains including the open reading frame of above-mentioned new leakage annotation peptide fragment
(ORF) DNA sequence dna, as shown in SEQ ID NO.2.
TAGTCAGCTGGCATCCTGAAGGGCATGCCAGGCAAGGAAATCGATCGAGTCCGGGCGACCAGTGCCCTCGCAGTGAT
TAGGCAGCACCCGGTAATGGTGTTCTTCGCGCTGTCGCCGGTACTCGCCGCATTGGGTGTCATGTGGTGGCTAGCCG
GTGCTGGATGGGCTATCGTCGCGGCCCTGGTGCTGGTGGTCGTCGGCGGAGCCATGATCGTCCTCAAACGCTGA
(SEQ ID NO.2)
Open reading frame coding is as shown in Figure 3 with the correspondence of amino acid sequence.
Further translation verification, finds true gene order (SEQ ID NO.1) from above-mentioned open reading frame DNA (SEQ
ID NO.2) inATGStart, common 204bp, 67 amino acid of coding, theoretical molecular weight 7.10kDa, as Rv3108c (- |
3476972-3477175 |) gene.
ATGCCAGGCAAGGAAATCGATCGAGTCCGGGCGACCAGTGCCCTCGCAGTGATTAGGCAGCACCCGGTAATGGTGTT
CTTCGCGCTGTCGCCGGTACTCGCCGCATTGGGTGTCATGTGGTGGCTAGCCGGTGCTGGATGGGCTATCGTCGCGG
CCCTGGTGCTGGTGGTCGTCGGCGGAGCCATGATCGTCCTCAAACGCTGA(SEQ ID NO.1)
The gene theory coded product amino acid sequence is as shown in SEQ ID NO.3:
MPGKEIDRVRATSALAVIRQHPVMVFFALSPVLAALGVMWWLAGAGWAIVAALVLVVVGGAMIVLKR
(SEQ ID NO.3)
NCBI-BLASTP analyses are carried out to the amino acid sequence of gene encoding production theoretical shown in the SEQ ID NO.3, with
M.tuberculosis 1825K, A70645, M.canettii CIPT140070010 similitudes are 99%, with other bacterial strains
Similitude is less than 76%, and belongs to Unknown Function albumen.(see Fig. 5).Show Rv3108c that we detect (- | 3476972-
3477175 |) gene outcome is missed annotation in H37Rv bacterial strain databases.
The DNA sequence dna of the Rv3108c (- | 3476972-3477175 |) gene is compared genome local by us
BLAST is analyzed, as shown in Figure 5, the results showed that Rv3108c (- | 3476972-3477175 |) gene order belongs to MTBC families spy
Specific gene, the sequence for not having homology higher in other species, this shows what we had found in H37Rv bacterial strains
Rv3108c (- | 3476972-3477175 |) gene order has preferable sequence-specific, can be interior other with belonging to by MTBC
Mycobacteria and other Bacteria infecting respiratories distinguish.
Embodiment 2:The method for establishing the compound groups of identification of M TBC
(1) primer is designed:
Based on the CDS sequences of Rv3108c (- | 3476972-3477175 |) gene as shown in SEQ ID NO.1, use
Oligo7.0 devises PCR primer, and primer sequence is as follows:
F:5’-GACCAGTGCCCTCGCAGT-3’(SEQ ID NO.4);
R:5’-AGGACGATCATGGCTCCG-3’(SEQ ID NO.5)
Above-mentioned primer is as follows with the position relationship of Rv3108c (- | 3476972-3477175 |) gene, wherein drawing
Object corresponding position subscript list is crossed.
ATGCCAGGCAAGGAAATCGATCGAGTCCGGGCGACCAGTGCCCTCGCAGTGATTAGGCAGCACCCGGTAATGGTGTT
CTTCGCGCTGTCGCCGGTACTCGCCGCATTGGGTGTCATGTGGTGGCTAGCCGGTGCTGGATGGGCTATCGTCGCGG
CCCTGGTGCTGGTGGTCGTCGGCGGAGCCATGATCGTCCTCAAACGCTGA(SEQ ID NO.1)
(2) total DNA of strain to be tested of the extraction including M.tuberculosis H37Rv, 40 plants of Mycobacterium marks
For quasi- bacterial strain by Chinese medicine bacteria culture preservation administrative center (CMCC) preservation, remaining 16 plants of non-tuberculous mycobacteria is Chinese
309 hospital clinical separation strains of people's liberation army have completed the sequencing of strain 16S rna genes, comparison and NCBI sequences and have submitted work,
Strain to be tested is as shown in table 1:
The related strain that table 1. is selected
(3) amplification of DNA fragments carries out polymerase chain (PCR) and reacts, and above-mentioned F/R primers used are expanded.
PCR system (25 μ L) is ddH2O (9.5 μ L), 2XTaq PCR MasterMix (TIANGEN, 12.5 μ L) primers F
(10 μM, 1 μ L), primer R (10 μM, 1 μ L), DNA profiling (1 μ L);
Amplification program:94 DEG C of pre-degeneration 3min, 94 DEG C denaturation 30s, 58 DEG C annealing 30s, 72 DEG C extension 1min, 35 follow
Ring, 72 DEG C of extension 5min.
(4) amplified production electrophoresis detection, the electrophoresis detection in Ago-Gel, 1 × TBE electrophoresis liquids.As a result such as Fig. 6 institutes
Show, MTBC and positive controls amplified band occur, and amplification is consistent with expection at 162bp, and specificity is
98.3%.
(5) in order to further verify the sequence of the DNA of amplification, we have carried out extension increasing sequence sequencing and and former sequence ratio
Compared with as shown in fig. 7, result is consistent completely with expection, sequence is correct, this further demonstrates depositing for new leakage annotation gene
.
This shows that the method that the compound group's identifications of MTBC are carried out based on Rv3108c (- | 3476972-3477175 |) gene is true
Reliably.
SEQUENCE LISTING
<110>Beijing Proteome Research Center
<120>Mycobacterium tuberculosis H37Rv encoding gene and its application
<130> BJ1936-17P121794
<160> 6
<170> PatentIn version 3.3
<210> 1
<211> 204
<212> DNA
<213> Artificial
<220>
<223>Mycobacterium tuberculosis H37Rv encoding gene Rv3108c (- | 3476972-3477175 |)
<400> 1
atgccaggca aggaaatcga tcgagtccgg gcgaccagtg ccctcgcagt gattaggcag 60
cacccggtaa tggtgttctt cgcgctgtcg ccggtactcg ccgcattggg tgtcatgtgg 120
tggctagccg gtgctggatg ggctatcgtc gcggccctgg tgctggtggt cgtcggcgga 180
gccatgatcg tcctcaaacg ctga 204
<210> 2
<211> 228
<212> DNA
<213> Artificial
<220>
<223>Include the open reading frame DNA sequence dna of leakage annotation peptide fragment
<400> 2
tagtcagctg gcatcctgaa gggcatgcca ggcaaggaaa tcgatcgagt ccgggcgacc 60
agtgccctcg cagtgattag gcagcacccg gtaatggtgt tcttcgcgct gtcgccggta 120
ctcgccgcat tgggtgtcat gtggtggcta gccggtgctg gatgggctat cgtcgcggcc 180
ctggtgctgg tggtcgtcgg cggagccatg atcgtcctca aacgctga 228
<210> 3
<211> 67
<212> PRT
<213> Artificial
<220>
<223>The theoretical code product amino acid sequence of Rv3108c (- | 3476972-3477175 |) gene
<400> 3
Met Pro Gly Lys Glu Ile Asp Arg Val Arg Ala Thr Ser Ala Leu Ala
1 5 10 15
Val Ile Arg Gln His Pro Val Met Val Phe Phe Ala Leu Ser Pro Val
20 25 30
Leu Ala Ala Leu Gly Val Met Trp Trp Leu Ala Gly Ala Gly Trp Ala
35 40 45
Ile Val Ala Ala Leu Val Leu Val Val Val Gly Gly Ala Met Ile Val
50 55 60
Leu Lys Arg
65
<210> 4
<211> 18
<212> DNA
<213> Artificial
<220>
<223>F primer sequences
<400> 4
gaccagtgcc ctcgcagt 18
<210> 5
<211> 18
<212> DNA
<213> Artificial
<220>
<223>R primer sequences
<400> 5
aggacgatca tggctccg 18
<210> 6
<211> 9
<212> PRT
<213> Artificial
<220>
<223>Leakage annotation peptide fragment
<400> 6
Ala Thr Ser Ala Leu Ala Val Ile Arg
1 5
Claims (10)
1. a kind of Mycobacterium tuberculosis H37Rv encoding gene, the nucleotide sequence such as SEQ ID NO.1 institutes of the encoding gene
Show.
2. Mycobacterium tuberculosis H37Rv encoding gene described in claim 1, it is characterised in that the gene code such as SEQ ID
Amino acid shown in NO.3 sequences.
3. a kind of bar code molecular labeling, as detecting and/or identifying mycobacterium tuberculosis complex, it includes examined as standard
The Mycobacterium tuberculosis H37Rv encoding gene described in claim 1 of cls gene.
A 4. species-specific PCR primers, for expanding Mycobacterium tuberculosis H37Rv encoding gene described in claim 1.
5. the PCT primers described in claim 4, which is characterized in that the sequence of the primer is:
F:5’-GACCAGTGCCCTCGCAGT-3’;
R:5’-AGGACGATCATGGCTCCG-3’。
6. a kind of detection method of mycobacterium tuberculosis complex identification, includes the following steps:
(1) the separation and Extraction genomic DNA from sample to be tested;
(2) DNA obtained using step (1) adds in amplimer as template, carries out PCR;
(3) DNA product obtained to step (2) amplification carries out gel electrophoresis analysis or is sequenced;
(4) result of step (3) is compared with encoding gene described in claim 1, it is to be measured according to its homologous sex determination
It whether there is mycobacterium tuberculosis complex in sample.
7. the amplimer sequence described in the detection method described in claim 6, wherein step (2) is:
F:5’-GACCAGTGCCCTCGCAGT-3’;
R:5’-AGGACGATCATGGCTCCG-3’。
8. the detection method described in claim 6, wherein in step (4), if homology is more than 99%, test sample is treated in judgement
Contain mycobacterium tuberculosis complex in product.
9. a kind of detection kit, will comprising Mycobacterium tuberculosis H37Rv encoding gene described in claim 1 and/or right
Seek the Specific PCR primers described in 4.
10. Mycobacterium tuberculosis H37Rv encoding gene described in claim 1 is in tuberculosis epidemiological survey and/or clinical knot
Application in core patient quick diagnosis and discriminating.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711251274.8A CN108165564B (en) | 2017-12-01 | 2017-12-01 | Mycobacterium tuberculosis H37Rv encoding gene and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711251274.8A CN108165564B (en) | 2017-12-01 | 2017-12-01 | Mycobacterium tuberculosis H37Rv encoding gene and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108165564A true CN108165564A (en) | 2018-06-15 |
CN108165564B CN108165564B (en) | 2021-06-08 |
Family
ID=62525068
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711251274.8A Active CN108165564B (en) | 2017-12-01 | 2017-12-01 | Mycobacterium tuberculosis H37Rv encoding gene and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108165564B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110343706A (en) * | 2019-01-25 | 2019-10-18 | 北京蛋白质组研究中心 | Mycobacterium tuberculosis H37Rv encoding gene and its application |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060182685A1 (en) * | 2004-09-04 | 2006-08-17 | Bishai William R | Hollow fiber technique for in vivo study of cell populations |
WO2012031752A2 (en) * | 2010-09-07 | 2012-03-15 | Institut Pasteur Korea | Tuberculosis mutants |
CN104212890A (en) * | 2010-02-24 | 2014-12-17 | 布罗德研究所有限公司 | Methods of diagnosing infectious disease pathogens and their drug sensitivity |
-
2017
- 2017-12-01 CN CN201711251274.8A patent/CN108165564B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060182685A1 (en) * | 2004-09-04 | 2006-08-17 | Bishai William R | Hollow fiber technique for in vivo study of cell populations |
CN104212890A (en) * | 2010-02-24 | 2014-12-17 | 布罗德研究所有限公司 | Methods of diagnosing infectious disease pathogens and their drug sensitivity |
WO2012031752A2 (en) * | 2010-09-07 | 2012-03-15 | Institut Pasteur Korea | Tuberculosis mutants |
Non-Patent Citations (2)
Title |
---|
COLE,S.T. ET AL.: ""Mycobacterium tuberculosis H37Rv complete genome,ACCESSION:AL123456.3 "", 《GENBANK》 * |
EILERTSON,B. ET AL.: ""membrane protein [Mycobacterium tuberculosis TRS4],ACCESSION:AQO70035.1"", 《GENBANK》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110343706A (en) * | 2019-01-25 | 2019-10-18 | 北京蛋白质组研究中心 | Mycobacterium tuberculosis H37Rv encoding gene and its application |
Also Published As
Publication number | Publication date |
---|---|
CN108165564B (en) | 2021-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ikryannikova et al. | Discrimination between Streptococcus pneumoniae and Streptococcus mitis based on sorting of their MALDI mass spectra | |
JP5317430B2 (en) | Probe set, probe carrier, and fungal discrimination identification method | |
Ikryannikova et al. | Misidentification of alpha-hemolytic streptococci by routine tests in clinical practice | |
CN109182517B (en) | Gene for molecular typing of medulloblastoma and application thereof | |
KR101954392B1 (en) | Molecular marker for detecting Erwinia amylovora and uses thereof | |
CN113881789B (en) | Probe and primer pair composition for detecting cryptococcus and detection method and application | |
CN110408629A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN110408630A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN115976235B (en) | Identification method of Lactobacillus delbrueckii CICC6047 strain, and primer, kit and application thereof | |
CN108165562A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN108165561A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN108004253A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN105420392B (en) | One group of gene new mutation relevant to newborn's Tendon defection phenotype and detection kit | |
CN108165564A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN108165565A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN108165560A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN110408632A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN108165563A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN110408631A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
CN110343706A (en) | Mycobacterium tuberculosis H37Rv encoding gene and its application | |
Salehi et al. | Internal Transcribed Spacer rDNA and TEF-1α Gene Sequencing of Pathogenic Dermatophyte Species and Differentiation of Closely Related Species Using PCR-RFLP of The Topoisomerase II | |
KR100568702B1 (en) | DNA microarray for species identification of microorganism and its manufacture method | |
CN110106276A (en) | A kind of kit and discrimination method identifying Radix Angelicae Sinensis | |
JP4379308B2 (en) | Genotyping method of methicillin-resistant Staphylococcus aureus and primer set used therefor | |
Savelkoul et al. | High density whole genome fingerprinting of methicillin-resistant and-susceptible strains of Staphylococcus aureus in search of phenotype-specific molecular determinants |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240131 Address after: 100850 No. 27 Taiping Road, Beijing, Haidian District Patentee after: ACADEMY OF MILITARY MEDICAL SCIENCES Country or region after: China Address before: Building 1, No.33, kekeyuan Road, Changping District, Beijing Patentee before: BEIJING PROTEOME RESEARCH CENTER Country or region before: China |