CN108165560B - Mycobacterium tuberculosis H37Rv encoding gene and application thereof - Google Patents

Mycobacterium tuberculosis H37Rv encoding gene and application thereof Download PDF

Info

Publication number
CN108165560B
CN108165560B CN201711250169.2A CN201711250169A CN108165560B CN 108165560 B CN108165560 B CN 108165560B CN 201711250169 A CN201711250169 A CN 201711250169A CN 108165560 B CN108165560 B CN 108165560B
Authority
CN
China
Prior art keywords
gene
mycobacterium tuberculosis
h37rv
rv0572a
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711250169.2A
Other languages
Chinese (zh)
Other versions
CN108165560A (en
Inventor
徐平
张瑶
王富强
孙金帅
武舒佳
常蕾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Academy of Military Medical Sciences AMMS of PLA
Original Assignee
BEIJING PROTEOME RESEARCH CENTER
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING PROTEOME RESEARCH CENTER filed Critical BEIJING PROTEOME RESEARCH CENTER
Priority to CN201711250169.2A priority Critical patent/CN108165560B/en
Publication of CN108165560A publication Critical patent/CN108165560A/en
Application granted granted Critical
Publication of CN108165560B publication Critical patent/CN108165560B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/35Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Mycobacteriaceae (F)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • C12Q1/689Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Zoology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Wood Science & Technology (AREA)
  • Microbiology (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Medicinal Chemistry (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention relates to a coding gene of mycobacterium tuberculosis H37Rv, which can be used as a standard gene for molecular identification of mycobacterium tuberculosis complex and is used for molecular identification and clinical detection of the mycobacterium tuberculosis complex.

Description

Mycobacterium tuberculosis H37Rv encoding gene and application thereof
Technical Field
The invention relates to the field of gene detection, in particular to identification of pathogenic bacteria species.
Background
Mycobacterium Tuberculosis (MTB) is a pathogenic bacterium that causes tuberculosis in humans. It can invade all organs of the body, but pulmonary tuberculosis is the most common. Tuberculosis is an extremely important infectious disease so far and seriously threatens the life health of human beings. It is reported by WHO that about 800 new cases occur each year, and at least 300 million people die from the disease. The clinical bacterial strain of MTB is difficult to culture, slow in growth, capable of cross-infecting with other mycobacteria, difficult to distinguish between tuberculosis and other respiratory tract infection symptoms and the like, and brings great difficulty to clinical rapid diagnosis and treatment. Therefore, the establishment of a quick, accurate, specific, sensitive and cheap tuberculosis detection method is a necessary premise for effectively treating and controlling tuberculosis spreading, and is a new challenge and a new task for detecting mycobacterium in clinical laboratories.
Mycobacterium tuberculosis complex (MTBC) includes the Mycobacterium groups m.tuberculosis, m.africanum, m.orygis, m.bovis, m.microti, m.canettii, m.caprae, m.pinnipedii, m.subcatetate, m.mungi, which all cause tuberculosis in humans and other life forms. At present, the domestic and foreign MTBC identification method is mainly divided into the following three categories: traditional separation culture method; molecular level detection (IS6110, restriction fragment length polymorphism analysis, multi-site variable number repeat polymorphism analysis, etc.); a method for analyzing the components of a microorganism (fatty acid, mycolic acid) by chromatography. The three methods have respective advantages, but have disadvantages, such as long separation culture period and low thallus culturable rate; at present, the molecular level detection is poor in specificity, sensitivity and simplicity; the analysis cost of the thallus component characteristics is high, and the operation is complex.
MTB H37Rv completed whole genome sequencing in 1998, the MTB strain that completed whole genome sequencing the earliest. From this point on, researchers in various countries are perfecting and supplementing H37Rv gene annotation databases based on strategies such as algorithm optimization, annotation software updating, transcriptomics and proteomics. However, since MTB belongs to prokaryotes, annotation errors (over-annotation, gene boundary error, ORF initiation, termination site error, alternative splicing, ribosome translocation, missing annotation) may still exist in genome annotation due to the inherent shortcomings of the prokaryote genome annotation technology, which brings trouble to deep and accurate analysis of biological mechanisms. In order to solve the problem, proteomics (proteomics) has been used for correcting the annotated gene of H37Rv, however, high-proportion false positive, difficulty in annotated gene prediction, new gene verification, new gene function analysis and application thereof, and the like, are problems faced in the field.
In general, the traditional mycobacterium tuberculosis complex (MTBC) identification strategy has the defects of long period, tedious steps, low specificity and sensitivity and the like. In order to further perfect re-annotation of the H37Rv whole genome, missing annotation genes in H37Rv are found, the H37Rv whole genome missing annotation genes and application technologies thereof in MTBC molecular identification are effectively protected, and a method for quickly and accurately identifying the MTBC group by using the H37Rv new genes is imperatively developed.
Disclosure of Invention
An object of the present invention is to provide a new encoding gene of mycobacterium tuberculosis H37Rv, which is H37Rv minus annotation encoding gene Rv0572A (-665434-665601 |), which can be used as a barcode molecular marker of mycobacterium tuberculosis complex for detecting mycobacterium tuberculosis complex, and the sequence of which is shown in SEQ ID No. 1.
Other objects of the present invention include providing specific PCR primers useful for amplifying the above-described encoding genes and providing a method of detecting or identifying the presence of a binding Mycobacterium complex in a sample; the invention also provides a detection kit related to the coding gene and application of the gene.
According to one aspect of the invention, by comparing proteomic research techniques, a protein coding sequence of H37Rv that is difficult to find by genetic prediction software was discovered that effectively distinguishes MTBC from other species of the same genus. The gene is a missing annotation gene of Mycobacterium tuberculosis (Mycobacterium tuberculosis H37Rv), namely Rv0572A (- | 665434-. Comparative genomics studies show that the gene sequence can distinguish the Mycobacterium tuberculosis complex (MTBC) strain from other species of Mycobacterium.
Specifically, a primer capable of realizing specific amplification on the Rv0572A (-665434-665601) gene of MTBC is designed, namely the primer provided by the invention, and the sequence of the primer is as follows:
F:5’-ACGCTGCAAGCAAACCATCGAAC-3’;
R:5’-TAGCTTCGCGGAACGTGCTCCC-3’。
according to the existence of the gene DNA sequence PCR product in the sample to be detected or the difference of the DNA sequence, the MTBC can be quickly and accurately identified.
According to another aspect of the present invention, based on the above-mentioned new standard encoding gene of Mycobacterium tuberculosis H37Rv, the present invention specifically establishes a method for detecting or identifying Mycobacterium tuberculosis complex, comprising the following steps:
(1) separating and extracting genome DNA from a sample to be detected;
(2) and (2) performing PCR amplification by using the DNA obtained in the step (1) as a template and adopting the following primers:
F:5’-ACGCTGCAAGCAAACCATCGAAC-3’(SEQ ID NO.4);
R:5’-TAGCTTCGCGGAACGTGCTCCC-3’(SEQ ID NO.5)。
(3) performing gel electrophoresis analysis or sequencing on the DNA product obtained by amplification in the step (2);
(4) and (3) comparing the result of the step (3) with the barcode gene Rv0572A (-665434-665601), and if the homology is more than 98%, judging that the sample to be detected contains the Mycobacterium tuberculosis complex.
Further, the detection method is characterized in that electrophoresis analysis is performed on the PCR product primarily according to the DNA bar code principle, and if the strain to be detected does not have a target band, the strain is not MTBC; if the band exists, further sequencing verification can be carried out, the sequence obtained by sequencing and the standard sequence of Rv0572A (-665434-665601) of H37Rv are subjected to homologous comparison and alignment to obtain the similarity between the sequences, and if the sequence homology is more than 99 percent, the strain can be judged to be MTBC; and (3) distinguishing the MTBC family from nontuberculous mycobacteria, common respiratory pathogenic bacteria and common respiratory viruses according to the clustering condition of the DNA barcode sequence of the strain to be identified and the standard sequence.
The detection method can be used for strain identification research of the mycobacterium tuberculosis complex and can also be used for clinical rapid inspection. The sample to be detected can be H37Rv strain, other MTBC, nontuberculous mycobacteria, respiratory tract common pathogenic bacteria and respiratory tract common virus strain; or may use sputum, saliva or blood directly from tuberculosis and other respiratory patients.
Based on the above method, the present invention also provides a detection kit, wherein the kit contains a reagent for detecting the novel standard encoding gene of Mycobacterium tuberculosis H37Rv in a container, and simultaneously provides manufacturing, using and marketing information about the medicine or biological product, which can be approved by a government drug administration. For example, the reagent for directly detecting the Rv0572A (- |665434-665601|) gene in the sample after PCR amplification may contain one or more of amplification primers, DNA polymerase used for PCR reaction and its buffer, reagents required for enzyme digestion reaction and/or sequencing reaction, etc. It is known to those skilled in the art that the above components are merely illustrative, and for example, the primers may employ the specific PCR primers described above, and the DNA polymerase used for the PCR reaction is an enzyme capable of being used for PCR amplification. The detection of the encoding gene of the present invention can also be provided in the form of an integrated, e.g., gene chip.
Has the advantages that: the invention provides a standard gene and a molecular identification method for molecular identification of Mycobacterium tuberculosis complex (MTBC), wherein the gene can effectively distinguish MTBC from other species of the same genus, the identification method using the gene overcomes the defects of primer design multiplicity, poor result repeatability and the like in the existing identification process of the Mycobacterium tuberculosis complex, has the characteristics of universality, easy amplification and easy comparison, can accurately identify the class from other mycobacteria with close relativity or other respiratory tract infectious germs, and provides powerful technical means and research tools for the epidemiological investigation and the rapid diagnosis and identification of clinical tuberculosis patients.
Drawings
FIG. 1: evidence of peptide profile matching supporting the discovery of new coding genes;
FIG. 2: comparing the mass spectrogram of the synthesized peptide fragment with the mass spectrogram of the original identified peptide fragment;
FIG. 3: a corresponding diagram of a protein sequence coded by ORF of the peptide fragment locus region; the underlined part is the peptide identified in proteomics and verified by the synthetic peptide;
FIG. 4: comparing the homology of the Rv0572A (- |665434-665601|) standard gene sequence;
FIG. 5: the result of BLASTP of the protein sequence corresponding to the Rv0572A (- |665434-665601|) gene of the H37Rv strain;
FIG. 6: the result of agarose gel electrophoresis of the PCR amplification product of the Rv0572A (- |665434-665601 |);
the specific information of each lane sample is shown in Table 1.
FIG. 7: the PCR amplification sequencing result of the Rv0572A (- | 665434-containing 665601|) gene is compared with a standard sequence.
Detailed Description
The invention is further described with reference to specific embodiments, but the scope of the claims is not limited thereto. The reagents used in the present invention are all commercially available.
Example 1: search for genes encoding missing release of the genome of strain H37Rv
1.1 high coverage proteomic validation of the genome of the H37Rv strain
The deep coverage study of proteome was performed on the H37Rv strain using the high coverage proteome technique. Annotated encoding gene validation was performed on its genome using the pFind 3 engine based on the Tuberculosis (20160307) database. To find new protein coding regions, we performed six-reading-frame database translation of H37Rv in the genome-wide (NC _000962.3) file published at NCBI using pAnno software based on proteomic technology, and identified new peptide fragments and new proteins using this database for mass spectrometry data. To reduce the false positive rate, we used 3 filtering methods to separately estimate class FDR for the annotated and new peptide fragments, S-FDR, T-FDR I and T-FDR II, respectively, during the data filtering.
Through data analysis, a total of 3238H 37Rv annotated genes are identified, and the coverage is as high as more than 80% of the strain, which is the largest mass spectrum data of the H37Rv protein reported so far. In addition, we obtained new peptide fragments after 3 FDRs ≤ 1 filtration. In order to further ensure the quality of the new peptide fragments, spectrogram quality screening is carried out on spectrograms corresponding to the new peptide fragments left after filtration, and finally some peptide fragments with good spectrogram quality are reserved. To further investigate that these peptides with higher spectral quality were not due to single amino acid mutations in the annotated peptide, we performed amino acid mutation checks to ensure that these new peptides were newly identified peptides of H37 Rv.
1.2 verification of the encoded protein and database of the Rv0572A (- |665434-665601|) Gene
After high coverage proteome verification, we find some suspected new peptide fragments which are leaked to release, and perform peptide fragment synthesis verification on the suspected new peptide fragments with high reliability, and score more than or equal to 0.8 according to the similarity between the original spectrum and the synthesized spectrum of the new peptide fragments as a similarity threshold, and after scoring and screening, a plurality of peptide fragments pass through verification and correspond to a new Open Reading Frame (ORF), namely the potential leaked to release genes of the current H37Rv strain.
Among them, we found a new missing-release gene Rv0572A (-665434-665601. after BLASTP comparison, we found that the protein sequences in M.bovis RefSeq were perfectly matched, and there were higher similarity protein annotation information in the M.tuboculosis 2382Q, M.tuboculosis Bir 84, M.tuboculosis 2358P, M.tuboculosis 1814Y and M.tuboculosis 0018W databases. Six encoded peptides were identified, as shown in FIG. 1, and peptide fragment "GQAGIVDDGAVLIHVPGECPHPGEHVPR (SEQ ID NO: 6)" was identified from each of the three large Pandey, Xu, and Aebresold data sets; peptide fragment "GQAGIVDDGAVLIHVPGECPHPGEHVPRS (SEQ NO. 7)" was identified from the Pandey dataset; "RGQAGIVDDGAVLIHVPGECPHPGEHVPRS (SEQ NO. 8)" was identified from the Pandey dataset; peptide fragment "QTIEPGWLYITAHR (SEQ NO. 9)" was identified from both Pandey and Xu datasets; peptide fragment "CKQTIEPGWLYITAHR (SEQ NO. 10)" was identified from the Pandey dataset; peptide fragment "RGQAGIVDDGAVLIHVPGECPHPGEHVPR (SEQ ID NO: 11)" was identified from both Pandey and Xu datasets. The corresponding spectrogram quality of the peptide fragments is good, continuous b/y ion matching exists, the signal of a stray peak is low, and the result is very credible.
To further confirm this identification, we chemically synthesized the peptide according to the amino acid sequence of our newly identified peptide and generated a secondary spectrum of the synthesized peptide using the mass spectrometry conditions described above.
Our high energy collision MS on synthetic peptide fragments2Verification is carried out, and the primary parent ions and the secondary daughter ions both accord with theoretical values, so that the sequence of the synthesized peptide fragment is correct; on this basis, we manually examined MS of synthetic peptides of novel peptide sequences identified from large-scale proteomic data2And large-scale identification of new peptide fragment spectra, both of which are almost identical. Peptide fragment "GQAGIVDDGAVLIHVPGECPHPGEHVPR (SEQ NO.6) "shows a cosin value of 0.80, a cosin value of 0.95 for peptide fragment" GQAGIVDDGAVLIHVPGECPHPGEHVPRS (SEQ NO.7) ", a cosin value of 0.95 for peptide fragment" RGQAGIVDDGAVLIHVPGECPHPGEHVPRS (SEQ NO.8) ", a cosin value of 0.98 for peptide fragment" QTIEPGWLYITAHR (SEQ NO.9) ", a cosin value of 0.96 for peptide fragment" CKQTIEPGWLYITAHR (SEQ NO.10) ", and a cosin value of 0.98 for peptide fragment" RGQAGIVDDGAVLIHVPGECPHPGEHVPR (SEQ NO.11) ", which proved that these new peptide fragments identified from H37Rv were correct. (FIG. 2).
After confirming the sequence of the peptide fragment to be released, according to the gene position of the peptide fragment, taking the region included by the former stop codon and the latter stop codon as a boundary, obtaining the Open Reading Frame (ORF) DNA sequence containing the new peptide fragment to be released, as shown in SEQ ID NO. 2.
TAGGGAAGCGAGGAGCACATGGCCGCCGACCCGCAATGTACACGCTGCAAGCAAACCATCGAACCCGGATGGCTATACATCACCGCCCATCGCCGCGGTCAAGCCGGGATCGTCGATGACGGCGCAGTACTGATTCACGTGCCCGGTGAATGCCCGCACCCCGGGGAGCACGTTCCGCGAAGCTAG(SEQ ID NO.2)
The correspondence between the open reading frame code and the amino acid sequence is shown in FIG. 3.
Further translation verification revealed that the authentic gene sequence (SEQ ID NO.1) was found from the above-mentioned open reading frame DNA (SEQ ID NO.2)ATGAt the beginning, 168bp in total encodes 56 amino acids, the theoretical molecular weight of which is 45.97kDa, namely the Rv0572A (- |665434 |) gene 665601 |.
ATGGCCGCCGACCCGCAATGTACACGCTGCAAGCAAACCATCGAACCCGGATGGCTATACATCACCGCCCATCGCCGCGGTCAAGCCGGGATCGTCGATGACGGCGCAGTACTGATTCACGTGCCCGGTGAATGCCCGCACCCCGGGGAGCACGTTCCGCGAAGCTAG(SEQ ID NO.1)
The theoretical coding product amino acid sequence of the gene is shown as SEQ ID NO. 3:
MAADPQCTRCKQTIEPGWLYITAHRRGQAGIVDDGAVLIHVPGECPHPGEHVPRS(SEQ ID NO.3)
the amino acid sequence of the theoretical gene-encoded product shown in SEQ ID NO.3 was subjected to NCBI-BLASTP analysis, and found that the protein sequences in M.bovis RefSeq were completely matched, and the protein annotation information was highly similar in the M.tubocussios 2382Q, M.tubocussios Bir 84, M.tubocussios 2358P, M.tubocussios 1814Y and M.tubocussios 0018W databases (see FIG. 4). It was shown that our detected Rv0572A (- |665434-665601|) gene product was missed in the H37Rv strain database for annotation.
We carried out comparative genome local BLAST analysis on the DNA sequence of the Rv0572A (- |665434-665601|) gene, as shown in FIG. 5, and the result showed that the Rv0572A (- |665434-665601|) gene sequence belongs to MTBC family specific gene and has no more homologous sequence in other species, which indicates that the Rv0572A (- |665434-665601|) gene sequence found in the H37Rv strain has better sequence specificity and can distinguish MTBC from other mycobacteria and other respiratory tract infectious bacteria in the same genus.
Example 2: method for establishing and identifying MTBC complex group
(1) Designing a primer:
based on the CDS sequence of the Rv0572A (- |665434-665601|) gene shown in SEQ ID NO.1, a PCR primer was designed by using Oligo7.0, and the primer sequence was as follows:
F:5’-ACGCTGCAAGCAAACCATCGAAC-3’(SEQ ID NO.4);
R:5’-TAGCTTCGCGGAACGTGCTCCC-3’(SEQ ID NO.5)
the positional relationship between the above primers and the Rv0572A (- |665434-665601|) gene is shown below, wherein the primers are underlined.
ATGGCCGCCGACCCGCAATGTACACGCTGCAAGCAAACCATCGAACCCGGATGGCTATACATCACCGCCCATCGCCGCGGTCAAGCCGGGATCGTCGATGACGGCGCAGTACTGATTCACGTGCCCGGTGAATGCCCGCACCCCGGGGAGCACGTTCCGCGAAGCTAG(SEQ ID NO.1)
(2) Extracting total DNA of strains to be detected including M.tuberculosis H37Rv, wherein 40 mycobacterium standard strains are preserved by China medical bacterial strain preservation management center (CMCC), the other 16 non-tuberculous mycobacteria are clinical isolates of 309 hospital of China people's liberation military, and the work of sequencing and comparing strains 16SRNA genes and submitting NCBI sequences is completed, wherein the strains to be detected are shown in Table 1:
TABLE 1 related strains selected
Figure BDA0001491567760000081
Figure BDA0001491567760000091
(3) The DNA fragment was amplified and subjected to Polymerase Chain Reaction (PCR) using the above F/R primer.
PCR System (25. mu.L) as ddH2O (9.5. mu.L), 2XTaq PCR MasterMix (TIANGEN, 12.5. mu.L), primer F (10. mu.M, 1. mu.L), primer R (10. mu.M, 1. mu.L), DNA template (1. mu.L);
and (3) amplification procedure: pre-denaturation at 94 ℃ for 3min, denaturation at 94 ℃ for 30s, annealing at 58 ℃ for 30s, extension at 72 ℃ for 1min, 35 cycles, and extension at 72 ℃ for 5 min.
(4) And (4) detecting the amplified product by electrophoresis in agarose gel and 1 xTBE electrophoresis solution. As a result, as shown in FIG. 6, an amplification band appeared at 144bp in MTBC and positive control group, and the actual amplification result agreed with the expected one.
(5) To further verify the sequence of the amplified DNA, we sequenced the amplified sequence and compared it with the original sequence, as shown in FIG. 7, which is a perfect match to the expected sequence without errors, further verifying the presence of a new missing annotated gene.
This indicates that the method for identifying MTBC complex group based on Rv0572A (- |665434-665601|) gene is truly reliable.
SEQUENCE LISTING
<110> Peking proteome research center
<120> Mycobacterium tuberculosis H37Rv encoding gene and application thereof
<130> BJ1936-17P121791
<160> 11
<170> PatentIn version 3.3
<210> 1
<211> 168
<212> DNA
<213> Artificial
<220>
<223> Mycobacterium tuberculosis H37Rv encoding gene Rv0572A (- |665434-
<400> 1
atggccgccg acccgcaatg tacacgctgc aagcaaacca tcgaacccgg atggctatac 60
atcaccgccc atcgccgcgg tcaagccggg atcgtcgatg acggcgcagt actgattcac 120
gtgcccggtg aatgcccgca ccccggggag cacgttccgc gaagctag 168
<210> 2
<211> 186
<212> DNA
<213> Artificial
<220>
<223> open reading frame DNA sequence comprising peptide fragment with missing annotation
<400> 2
tagggaagcg aggagcacat ggccgccgac ccgcaatgta cacgctgcaa gcaaaccatc 60
gaacccggat ggctatacat caccgcccat cgccgcggtc aagccgggat cgtcgatgac 120
ggcgcagtac tgattcacgt gcccggtgaa tgcccgcacc ccggggagca cgttccgcga 180
agctag 186
<210> 3
<211> 55
<212> PRT
<213> Artificial
<220>
<223> the amino acid sequence of the theoretical coding product of Rv0572A (-665434-665601 |)
<400> 3
Met Ala Ala Asp Pro Gln Cys Thr Arg Cys Lys Gln Thr Ile Glu Pro
1 5 10 15
Gly Trp Leu Tyr Ile Thr Ala His Arg Arg Gly Gln Ala Gly Ile Val
20 25 30
Asp Asp Gly Ala Val Leu Ile His Val Pro Gly Glu Cys Pro His Pro
35 40 45
Gly Glu His Val Pro Arg Ser
50 55
<210> 4
<211> 23
<212> DNA
<213> Artificial
<220>
<223> F primer sequences
<400> 4
acgctgcaag caaaccatcg aac 23
<210> 5
<211> 22
<212> DNA
<213> Artificial
<220>
<223> R primer sequences
<400> 5
tagcttcgcg gaacgtgctc cc 22
<210> 6
<211> 28
<212> PRT
<213> Artificial
<220>
<223> first missing annotation peptide fragment
<400> 6
Gly Gln Ala Gly Ile Val Asp Asp Gly Ala Val Leu Ile His Val Pro
1 5 10 15
Gly Glu Cys Pro His Pro Gly Glu His Val Pro Arg
20 25
<210> 7
<211> 29
<212> PRT
<213> Artificial
<220>
<223> second omission annotation of peptide fragments
<400> 7
Gly Gln Ala Gly Ile Val Asp Asp Gly Ala Val Leu Ile His Val Pro
1 5 10 15
Gly Glu Cys Pro His Pro Gly Glu His Val Pro Arg Ser
20 25
<210> 8
<211> 30
<212> PRT
<213> Artificial
<220>
<223> third missing annotation peptide fragment
<400> 8
Arg Gly Gln Ala Gly Ile Val Asp Asp Gly Ala Val Leu Ile His Val
1 5 10 15
Pro Gly Glu Cys Pro His Pro Gly Glu His Val Pro Arg Ser
20 25 30
<210> 9
<211> 14
<212> PRT
<213> Artificial
<220>
<223> fourth missing annotation peptide fragment
<400> 9
Gln Thr Ile Glu Pro Gly Trp Leu Tyr Ile Thr Ala His Arg
1 5 10
<210> 10
<211> 16
<212> PRT
<213> Artificial
<220>
<223> fifth bar missing annotation peptide fragment
<400> 10
Cys Lys Gln Thr Ile Glu Pro Gly Trp Leu Tyr Ile Thr Ala His Arg
1 5 10 15
<210> 11
<211> 29
<212> PRT
<213> Artificial
<220>
<223> sixth missing annotation peptide fragment
<400> 11
Arg Gly Gln Ala Gly Ile Val Asp Asp Gly Ala Val Leu Ile His Val
1 5 10 15
Pro Gly Glu Cys Pro His Pro Gly Glu His Val Pro Arg
20 25

Claims (5)

1. An identification method for distinguishing a Mycobacterium tuberculosis complex strain from other strains of Mycobacterium which is not used for the diagnosis and treatment of diseases, characterized in that whether a Mycobacterium tuberculosis complex exists in a sample to be detected is determined by detecting whether a gene Rv0572A (- |665434 |) encoding Mycobacterium tuberculosis H37Rv exists in the sample to be detected, and the nucleotide sequence of the gene Rv0572A (- |665434 |) encoding Mycobacterium tuberculosis H37Rv is shown as SEQ ID No. 1.
2. The method as claimed in claim 1, wherein the gene Rv0572A (-665434-665601 |) encoding H37Rv encodes the amino acids shown in the sequence of SEQ ID No. 3.
3. The method of claim 1, comprising the steps of:
(1) separating and extracting genome DNA from a sample to be detected;
(2) adding an amplification primer by taking the DNA obtained in the step (1) as a template to perform polymerase chain reaction;
(3) carrying out gel electrophoresis analysis and sequencing on the DNA product obtained by amplification in the step (2);
(4) and (3) comparing the result of the step (3) with the encoding gene Rv0572A (- |665434-665601|) of H37Rv, and judging whether the type of the Mycobacterium tuberculosis complex exists in the sample to be detected according to the homology of the result.
4. The method of claim 3, wherein the amplification primer sequence of step (2) is:
F: 5’- ACGCTGCAAGCAAACCATCGAAC -3’;
R: 5’- TAGCTTCGCGGAACGTGCTCCC -3’。
5. the method according to claim 3, wherein in the step (4), if the homology is more than 99%, it is judged that the Mycobacterium tuberculosis complex of the class is present in the sample to be tested.
CN201711250169.2A 2017-12-01 2017-12-01 Mycobacterium tuberculosis H37Rv encoding gene and application thereof Active CN108165560B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711250169.2A CN108165560B (en) 2017-12-01 2017-12-01 Mycobacterium tuberculosis H37Rv encoding gene and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711250169.2A CN108165560B (en) 2017-12-01 2017-12-01 Mycobacterium tuberculosis H37Rv encoding gene and application thereof

Publications (2)

Publication Number Publication Date
CN108165560A CN108165560A (en) 2018-06-15
CN108165560B true CN108165560B (en) 2021-06-08

Family

ID=62524983

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711250169.2A Active CN108165560B (en) 2017-12-01 2017-12-01 Mycobacterium tuberculosis H37Rv encoding gene and application thereof

Country Status (1)

Country Link
CN (1) CN108165560B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110343706A (en) * 2019-01-25 2019-10-18 北京蛋白质组研究中心 Mycobacterium tuberculosis H37Rv encoding gene and its application

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006035317A2 (en) * 2004-09-30 2006-04-06 Institut Pasteur Immunogenic glycopeptides for diagnosing pathogenic microorganisms infections
WO2009017902A2 (en) * 2007-06-22 2009-02-05 Ibis Biosciences, Inc. Compositions and methods for identification of subspecies characteristics of mycobacterium tuberculosis
CN101421415A (en) * 2004-07-12 2009-04-29 阿维斯塔金格兰技术有限公司 Construction of a comparative database and identification of virulence factors through comparison of polymorphic regions in clinical isolates of infectious organisms
WO2012135815A2 (en) * 2011-04-01 2012-10-04 Occam Biolabs, Inc. Methods and kits for detecting cell-free pathogen-specific nucleic acids

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2003242504A1 (en) * 2002-07-13 2004-02-02 Statens Serum Institut Therapeutic tuberculosis vaccines

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101421415A (en) * 2004-07-12 2009-04-29 阿维斯塔金格兰技术有限公司 Construction of a comparative database and identification of virulence factors through comparison of polymorphic regions in clinical isolates of infectious organisms
WO2006035317A2 (en) * 2004-09-30 2006-04-06 Institut Pasteur Immunogenic glycopeptides for diagnosing pathogenic microorganisms infections
WO2009017902A2 (en) * 2007-06-22 2009-02-05 Ibis Biosciences, Inc. Compositions and methods for identification of subspecies characteristics of mycobacterium tuberculosis
WO2012135815A2 (en) * 2011-04-01 2012-10-04 Occam Biolabs, Inc. Methods and kits for detecting cell-free pathogen-specific nucleic acids

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"hypothetical protein BTB1458_0626 [Mycobacterium tuberculosis],ACCESSION:AOZ41636.1";Guo,A.;《Genpept》;20161027;第1-2页 *
"Mycobacterium tuberculosis H37Rv complete genome,ACCESSION:AL123456.3";Cole,S.T. et al.;《GenBank》;20150227;第1页 *

Also Published As

Publication number Publication date
CN108165560A (en) 2018-06-15

Similar Documents

Publication Publication Date Title
CN110408630B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN110408629B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
Jiang et al. Comparison of the proteome of isoniazid-resistant and-susceptible strains of Mycobacterium tuberculosis
CN108913768B (en) Multiplex liquid phase gene chip primer, kit and analysis method for simultaneously detecting seven glucosamine drug resistance genes
CN108165561B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN108004253B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN108165562B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
Imai et al. Rapid and accurate species identification of mitis group streptococci using the MinION nanopore sequencer
CN108165560B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN108165564B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN108165565B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN110408632B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN110511995B (en) Tuberculosis markers and application thereof
CN110408631B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN108165563B (en) Mycobacterium tuberculosis H37Rv encoding gene and application thereof
CN114041188A (en) Method for detecting helicobacter pylori levels in fecal samples
RAO Infection of Mycobacterium mageritense at surgical site: a first case report of India
CN114196779A (en) Pathogenic microorganism detection method and kit based on targeted sequencing
CN110527721B (en) Old tuberculosis marker and application thereof
JP2014113079A (en) Highly accurate detection method of trichophyton
Sharba et al. SEQUENCE AND STRUCTURE ANALYSIS OF HLA-G IN BREAST CANCER PATIENTS USING BIOINFORMATICS TOOLS AND TECHNIQUES.
CN114107454A (en) Respiratory tract infection pathogen detection method based on macrogene/macrotranscriptome sequencing
Mironov et al. Multilocus sequence-typing scheme for Borrelia miyamotoi—the erythema-free ixodid tick-borne borreliosis pathogens
CN110343706A (en) Mycobacterium tuberculosis H37Rv encoding gene and its application
CN112048552B (en) Intestinal flora for diagnosing myasthenia gravis and application thereof

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20240201

Address after: 100850 No. 27 Taiping Road, Beijing, Haidian District

Patentee after: ACADEMY OF MILITARY MEDICAL SCIENCES

Country or region after: China

Address before: Building 1, No.33, kekeyuan Road, Changping District, Beijing

Patentee before: BEIJING PROTEOME RESEARCH CENTER

Country or region before: China

TR01 Transfer of patent right